Science.gov

Sample records for assigning large proteins

  1. NMR Backbone Assignment of Large Proteins by Using (13) Cα -Only Triple-Resonance Experiments.

    PubMed

    Wei, Qingtao; Chen, Jiajing; Mi, Juan; Zhang, Jiahai; Ruan, Ke; Wu, Jihui

    2016-07-01

    Nuclear magnetic resonance (NMR) is a powerful tool to interrogate protein structure and dynamics residue by residue. However, the prerequisite chemical-shift assignment remains a bottleneck for large proteins due to the fast relaxation and the frequency degeneracy of the (13) Cα nuclei. Herein, we present a covariance NMR strategy to assign the backbone chemical shifts by using only HN(CO)CA and HNCA spectra that has a high sensitivity even for large proteins. By using the peak linear correlation coefficient (LCC), which is a sensitive probe even for tiny chemical-shift displacements, we correctly identify the fidelity of approximately 92 % cross-peaks in the covariance spectrum, which is thus a significant improvement on the approach developed by Snyder and Brüschweiler (66 %) and the use of spectral derivatives (50 %). Thus, we calculate the 4D covariance spectrum from HN(CO)CA and HNCA experiments, in which cross-peaks with LCCs above a universal threshold are considered as true correlations. This 4D covariance spectrum enables the sequential assignment of a 42 kDa maltose binding protein (MBP), in which about 95 % residues are successfully assigned with a high accuracy of 98 %. Our LCC approach, therefore, paves the way for a residue-by-residue study of the backbone structure and dynamics of large proteins. PMID:27276173

  2. EZ-ASSIGN, a program for exhaustive NMR chemical shift assignments of large proteins from complete or incomplete triple-resonance data

    PubMed Central

    Zuiderweg, Erik R.P.; Bagai, Ireena; Rossi, Paolo; Bertelsen, Eric B.

    2013-01-01

    For several of the proteins in the BioMagResBank larger than 200 residues, 60% or fewer of the backbone resonances were assigned. But how reliable are those assignments? In contrast to complete assignments, where it is possible to check whether every triple-resonance Generalized Spin System (GSS) is assigned once and only once, with incomplete data one should compare all possible assignments and pick the best one. But that is not feasible: For example, for 200 residues and an incomplete set of 100 GSS, there are 1.6*10260 possible assignments. In “EZ-ASSIGN”, the protein sequence is divided in smaller unique fragments. Combined with intelligent search approaches, an exhaustive comparison of all possible assignments is now feasible using a laptop computer. The program was tested with experimental data of a 388-residue domain of the Hsp70 chaperone protein DnaK and for a 351-residue domain of a type III secretion ATPase. EZ-ASSIGN reproduced the hand assignments. It did slightly better than the computer program PINE (Bahrami et al., PLoS Comput Biol. 2009 5 (3): e1000307) and significantly outperformed SAGA (Crippen et al, (2010) J Biomol NMR 46, 281–298), AUTOASSIGN (Zimmerman et al., (1997) J Mol Biol 269:592–610), and IBIS (Hyberts and Wagner (2003) J Biomol NMR 26:335–344). Next, EZ-ASSIGN was used to investigate how well NMR data of decreasing completeness can be assigned. We found that the program could confidently assign fragments in very incomplete data. Here, EZ-ASSIGN dramatically outperformed all the other assignment programs tested. PMID:24022834

  3. Fast and accurate resonance assignment of small-to-large proteins by combining automated and manual approaches.

    PubMed

    Niklasson, Markus; Ahlner, Alexandra; Andresen, Cecilia; Marsh, Joseph A; Lundström, Patrik

    2015-01-01

    The process of resonance assignment is fundamental to most NMR studies of protein structure and dynamics. Unfortunately, the manual assignment of residues is tedious and time-consuming, and can represent a significant bottleneck for further characterization. Furthermore, while automated approaches have been developed, they are often limited in their accuracy, particularly for larger proteins. Here, we address this by introducing the software COMPASS, which, by combining automated resonance assignment with manual intervention, is able to achieve accuracy approaching that from manual assignments at greatly accelerated speeds. Moreover, by including the option to compensate for isotope shift effects in deuterated proteins, COMPASS is far more accurate for larger proteins than existing automated methods. COMPASS is an open-source project licensed under GNU General Public License and is available for download from http://www.liu.se/forskning/foass/tidigare-foass/patrik-lundstrom/software?l=en. Source code and binaries for Linux, Mac OS X and Microsoft Windows are available. PMID:25569628

  4. Automated protein NMR resonance assignments.

    PubMed

    Wan, Xiang; Xu, Dong; Slupsky, Carolyn M; Lin, Guohui

    2003-01-01

    NMR resonance peak assignment is one of the key steps in solving an NMR protein structure. The assignment process links resonance peaks to individual residues of the target protein sequence, providing the prerequisite for establishing intra- and inter-residue spatial relationships between atoms. The assignment process is tedious and time-consuming, which could take many weeks. Though there exist a number of computer programs to assist the assignment process, many NMR labs are still doing the assignments manually to ensure quality. This paper presents (1) a new scoring system for mapping spin systems to residues, (2) an automated adjacency information extraction procedure from NMR spectra, and (3) a very fast assignment algorithm based on our previous proposed greedy filtering method and a maximum matching algorithm to automate the assignment process. The computational tests on 70 instances of (pseudo) experimental NMR data of 14 proteins demonstrate that the new score scheme has much better discerning power with the aid of adjacency information between spin systems simulated across various NMR spectra. Typically, with automated extraction of adjacency information, our method achieves nearly complete assignments for most of the proteins. The experiment shows very promising perspective that the fast automated assignment algorithm together with the new score scheme and automated adjacency extraction may be ready for practical use. PMID:16452794

  5. Managing Large Volumes of Assignments

    ERIC Educational Resources Information Center

    Park, James; Hagen, John, Jr.

    2005-01-01

    In spring 2003, the Distance Education Network (DEN), Viterbi School of Engineering at the University of Southern California (USC), had 860 students and more than 1,000 enrollments in 70 courses toward 10 different degrees. Typically, for assignments in engineering courses, professors require students to show how their answers are derived so that…

  6. A 3D Time-Shared NOESY Experiment Designed to Provide Optimal Resolution for Accurate Assignment of NMR Distance Restraints in Large Proteins

    PubMed Central

    Mishra, Subrata H; Harden, Bradley J

    2014-01-01

    Structure determination of proteins by solution NMR has become an established method, but challenges increase steeply with the size of proteins. Notably spectral crowding and signal overlap impair the analysis of cross-peaks in NOESY spectra that provide distance restraints for structural models. An optimal spectral resolution can alleviate overlap but requires prohibitively long experimental time with existing methods. Here we present a time-shared 3D experiment optimized for large proteins that provides 15N and 13C dispersed NOESY spectra in a single measurement. NOESY correlations appear in the detected dimension and hence benefit from the highest resolution achievable of all dimensions without increase in experimental time. By design, this experiment is inherently optimal for non-uniform sampling acquisition when compared to current alternatives. Thus, 15N and 13C dispersed NOESY spectra with ultra-high resolution in all dimensions were acquired in parallel within about 4 days instead of 80 days for a 52 kDa monomeric protein at a concentration of 350 μM. PMID:25381567

  7. Graphical interpretation of Boolean operators for protein NMR assignments.

    PubMed

    Verdegem, Dries; Dijkstra, Klaas; Hanoulle, Xavier; Lippens, Guy

    2008-09-01

    We have developed a graphics based algorithm for semi-automated protein NMR assignments. Using the basic sequential triple resonance assignment strategy, the method is inspired by the Boolean operators as it applies "AND"-, "OR"- and "NOT"-like operations on planes pulled out of the classical three-dimensional spectra to obtain its functionality. The method's strength lies in the continuous graphical presentation of the spectra, allowing both a semi-automatic peaklist construction and sequential assignment. We demonstrate here its general use for the case of a folded protein with a well-dispersed spectrum, but equally for a natively unfolded protein where spectral resolution is minimal. PMID:18762868

  8. Practical aspects of NMR signal assignment in larger and challenging proteins

    PubMed Central

    Frueh, Dominique P.

    2014-01-01

    NMR has matured into a technique routinely employed for studying proteins in near physiological conditions. However, applications to larger proteins are impeded by the complexity of the various correlation maps necessary to assign NMR signals. This article reviews the data analysis techniques traditionally employed for resonance assignment and describes alternative protocols necessary for overcoming challenges in large protein spectra. In particular, simultaneous analysis of multiple spectra may help overcome ambiguities or may reveal correlations in an indirect manner. Similarly, visualization of orthogonal planes in a multidimensional spectrum can provide alternative assignment procedures. We describe examples of such strategies for assignment of backbone, methyl, and nOe resonances. We describe experimental aspects of data acquisition for the related experiments and provide guidelines for preliminary studies. Focus is placed on large folded monomeric proteins and examples are provided for 37, 48, 53, and 81 kDa proteins. PMID:24534088

  9. Probabilistic validation of protein NMR chemical shift assignments.

    PubMed

    Dashti, Hesam; Tonelli, Marco; Lee, Woonghee; Westler, William M; Cornilescu, Gabriel; Ulrich, Eldon L; Markley, John L

    2016-01-01

    Data validation plays an important role in ensuring the reliability and reproducibility of studies. NMR investigations of the functional properties, dynamics, chemical kinetics, and structures of proteins depend critically on the correctness of chemical shift assignments. We present a novel probabilistic method named ARECA for validating chemical shift assignments that relies on the nuclear Overhauser effect data . ARECA has been evaluated through its application to 26 case studies and has been shown to be complementary to, and usually more reliable than, approaches based on chemical shift databases. ARECA is available online at http://areca.nmrfam.wisc.edu/. PMID:26724815

  10. SCOPmap: Automated assignment of protein structures to evolutionary superfamilies

    PubMed Central

    Cheek, Sara; Qi, Yuan; Krishna, S Sri; Kinch, Lisa N; Grishin, Nick V

    2004-01-01

    Background Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database. Results The general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ~95% accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download. Conclusion The SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies. PMID:15598351

  11. Assigning protein functions by comparative genome analysis protein phylogenetic profiles

    DOEpatents

    Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

    2003-05-13

    A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

  12. SIR: Deterministic protein inference from peptides assigned to MS data.

    PubMed

    Matthiesen, Rune; Prieto, Gorka; Amorim, Antonio; Aloria, Kerman; Fullaondo, Asier; Carvalho, Ana S; Arizmendi, Jesus M

    2012-07-16

    Currently the bottom up approach is the most popular for characterizing protein samples by mass spectrometry. This is mainly attributed to the fact that the bottom up approach has been successfully optimized for high throughput studies. However, the bottom up approach is associated with a number of challenges such as loss of linkage information between peptides. Previous publications have addressed some of these problems which are commonly referred to as protein inference. Nevertheless, all previous publications on the subject are oversimplified and do not represent the full complexity of the proteins identified. To this end we present here SIR (spectra based isoform resolver) that uses a novel transparent and systematic approach for organizing and presenting identified proteins based on peptide spectra assignments. The algorithm groups peptides and proteins into five evidence groups and calculates sixteen parameters for each identified protein that are useful for cases where deterministic protein inference is the goal. The novel approach has been incorporated into SIR which is a user-friendly tool only concerned with protein inference based on imports of Mascot search results. SIR has in addition two visualization tools that facilitate further exploration of the protein inference problem. PMID:22626983

  13. A Markov Random Field Framework for Protein Side-Chain Resonance Assignment

    NASA Astrophysics Data System (ADS)

    Zeng, Jianyang; Zhou, Pei; Donald, Bruce Randall

    Nuclear magnetic resonance (NMR) spectroscopy plays a critical role in structural genomics, and serves as a primary tool for determining protein structures, dynamics and interactions in physiologically-relevant solution conditions. The current speed of protein structure determination via NMR is limited by the lengthy time required in resonance assignment, which maps spectral peaks to specific atoms and residues in the primary sequence. Although numerous algorithms have been developed to address the backbone resonance assignment problem [68,2,10,37,14,64,1,31,60], little work has been done to automate side-chain resonance assignment [43, 48, 5]. Most previous attempts in assigning side-chain resonances depend on a set of NMR experiments that record through-bond interactions with side-chain protons for each residue. Unfortunately, these NMR experiments have low sensitivity and limited performance on large proteins, which makes it difficult to obtain enough side-chain resonance assignments. On the other hand, it is essential to obtain almost all of the side-chain resonance assignments as a prerequisite for high-resolution structure determination. To overcome this deficiency, we present a novel side-chain resonance assignment algorithm based on alternative NMR experiments measuring through-space interactions between protons in the protein, which also provide crucial distance restraints and are normally required in high-resolution structure determination. We cast the side-chain resonance assignment problem into a Markov Random Field (MRF) framework, and extend and apply combinatorial protein design algorithms to compute the optimal solution that best interprets the NMR data. Our MRF framework captures the contact map information of the protein derived from NMR spectra, and exploits the structural information available from the backbone conformations determined by orientational restraints and a set of discretized side-chain conformations (i.e., rotamers). A Hausdorff

  14. Normalized Cut Algorithm for Automated Assignment of Protein Domains

    NASA Technical Reports Server (NTRS)

    Samanta, M. P.; Liang, S.; Zha, H.; Biegel, Bryan A. (Technical Monitor)

    2002-01-01

    We present a novel computational method for automatic assignment of protein domains from structural data. At the core of our algorithm lies a recently proposed clustering technique that has been very successful for image-partitioning applications. This grap.,l-theory based clustering method uses the notion of a normalized cut to partition. an undirected graph into its strongly-connected components. Computer implementation of our method tested on the standard comparison set of proteins from the literature shows a high success rate (84%), better than most existing alternative In addition, several other features of our algorithm, such as reliance on few adjustable parameters, linear run-time with respect to the size of the protein and reduced complexity compared to other graph-theory based algorithms, would make it an attractive tool for structural biologists.

  15. Automating unambiguous NOE data usage in NVR for NMR protein structure-based assignments.

    PubMed

    Akhmedov, Murodzhon; Çatay, Bülent; Apaydın, Mehmet Serkan

    2015-12-01

    Nuclear Magnetic Resonance (NMR) Spectroscopy is an important technique that allows determining protein structure in solution. An important problem in protein structure determination using NMR spectroscopy is the mapping of peaks to corresponding amino acids, also known as the assignment problem. Structure-Based Assignment (SBA) is an approach to solve this problem using a template structure that is homologous to the target. Our previously developed approach Nuclear Vector Replacement-Binary Integer Programming (NVR-BIP) computed the optimal solution for small proteins, but was unable to solve the assignments of large proteins. NVR-Ant Colony Optimization (ACO) extended the applicability of the NVR approach for such proteins. One of the input data utilized in these approaches is the Nuclear Overhauser Effect (NOE) data. NOE is an interaction observed between two protons if the protons are located close in space. These protons could be amide protons, protons attached to the alpha-carbon atom in the backbone of the protein, or side chain protons. NVR only uses backbone protons. In this paper, we reformulate the NVR-BIP model to distinguish the type of proton in NOE data and use the corresponding proton coordinates in the extended formulation. In addition, the threshold value over interproton distances is set in a standard manner for all proteins by extracting the NOE upper bound distance information from the data. We also convert NOE intensities into distance thresholds. Our new approach thus handles the NOE data correctly and without manually determined parameters. We accordingly adapt NVR-ACO solution methodology to these changes. Computational results show that our approaches obtain optimal solutions for small proteins. For the large proteins our ant colony optimization-based approach obtains promising results. PMID:26260854

  16. Quantification of protein group coherence and pathway assignment using functional association

    PubMed Central

    2011-01-01

    Background Genomics and proteomics experiments produce a large amount of data that are awaiting functional elucidation. An important step in analyzing such data is to identify functional units, which consist of proteins that play coherent roles to carry out the function. Importantly, functional coherence is not identical with functional similarity. For example, proteins in the same pathway may not share the same Gene Ontology (GO) terms, but they work in a coordinated fashion so that the aimed function can be performed. Thus, simply applying existing functional similarity measures might not be the best solution to identify functional units in omics data. Results We have designed two scores for quantifying the functional coherence by considering association of GO terms observed in two biological contexts, co-occurrences in protein annotations and co-mentions in literature in the PubMed database. The counted co-occurrences of GO terms were normalized in a similar fashion as the statistical amino acid contact potential is computed in the protein structure prediction field. We demonstrate that the developed scores can identify functionally coherent protein sets, i.e. proteins in the same pathways, co-localized proteins, and protein complexes, with statistically significant score values showing a better accuracy than existing functional similarity scores. The scores are also capable of detecting protein pairs that interact with each other. It is further shown that the functional coherence scores can accurately assign proteins to their respective pathways. Conclusion We have developed two scores which quantify the functional coherence of sets of proteins. The scores reflect the actual associations of GO terms observed either in protein annotations or in literature. It has been shown that they have the ability to accurately distinguish biologically relevant groups of proteins from random ones as well as a good discriminative power for detecting interacting pairs of

  17. A tabu search approach for the NMR protein structure-based assignment problem.

    PubMed

    Cavuşlar, Gizem; Çatay, Bülent; Apaydın, Mehmet Serkan

    2012-01-01

    Spectroscopy is an experimental technique which exploits the magnetic properties of specific nuclei and enables the study of proteins in solution. The key bottleneck of NMR studies is to map the NMR peaks to corresponding nuclei, also known as the assignment problem. Structure-Based Assignment (SBA) is an approach to solve this computationally challenging problem by using prior information about the protein obtained from a homologous structure. NVR-BIP used the Nuclear Vector Replacement (NVR) framework to model SBA as a binary integer programming problem. In this paper, we prove that this problem is NP-hard and propose a tabu search (TS) algorithm (NVR-TS) equipped with a guided perturbation mechanism to efficiently solve it. NVR-TS uses a quadratic penalty relaxation of NVR-BIP where the violations in the Nuclear Overhauser Effect constraints are penalized in the objective function. Experimental results indicate that our algorithm finds the optimal solution on NVRBIP’s data set which consists of seven proteins with 25 templates (31 to 126 residues). Furthermore, it achieves relatively high assignment accuracies on two additional large proteins, MBP and EIN (348 and 243 residues, respectively), which NVR-BIP failed to solve. The executable and the input files are available for download at http://people.sabanciuniv.edu/catay/NVR-TS/NVR-TS.html. PMID:23221084

  18. Organic Chemistry YouTube Writing Assignment for Large Lecture Classes

    ERIC Educational Resources Information Center

    Franz, Annaliese K.

    2012-01-01

    This work describes efforts to incorporate and evaluate the use of a YouTube writing assignment in large lecture classes to personalize learning and improve conceptual understanding of chemistry through peer- and self-explanation strategies. Although writing assignments can be a method to incorporate peer- and self-explanation strategies, this…

  19. Efficient eigenvalue assignment by state and output feedback with applications for large space structures

    NASA Technical Reports Server (NTRS)

    Vannell, Eric C.; Kenny, Sean P.; Maghami, Peiman G.

    1995-01-01

    The erection and deployment of large flexible structures having thousands of degrees of freedom requires controllers based on new techniques of eigenvalue assignment that are computationally stable and more efficient. Scientists at NASA Langley Research Center have developed a novel and efficient algorithm for the eigenvalue assignment of large, time-invariant systems using full-state and output feedback. The objectives of this research were to improve upon the output feedback version of this algorithm, to produce a toolbox of MATLAB functions based on the efficient eigenvalue assignment algorithm, and to experimentally verify the algorithm and software by implementing controllers designed using the MATLAB toolbox on the phase 2 configuration of NASA Langley's controls-structures interaction evolutionary model, a laboratory model used to study space structures. Results from laboratory tests and computer simulations show that effective controllers can be designed using software based on the efficient eigenvalue assignment algorithm.

  20. Towards an intelligent system for the automatic assignment of domains in globular proteins.

    PubMed

    Sternberg, M J; Hegyi, H; Islam, S A; Luo, J; Russell, R B

    1995-01-01

    The automatic identification of protein domains from coordinates is the first step in the classification of protein folds and hence is required for databases to guide structure prediction. Most algorithms encode a single concept based and sometimes do not yield assignments that are consistent with the generally accepted perception. Our development of an automatic approach to identify reliably domains from protein coordinates is described. The algorithm is benchmarked against a manual identification of the domains in 284 representative protein chains. The first step is the domain assignment by distance (DAD) algorithm that considers the density of inter-residue contacts represented in a contact matrix. The algorithm yields 85% agreement with the manual assignment. The paper then considers how the reliability of these assignments could be evaluated. Finally the use of structural comparisons using the STAMP algorithm to validate domain assignment is reported on a test case. PMID:7584461

  1. Assignment of Oriented Sample NMR Resonances from a Three Transmembrane Helix Protein

    PubMed Central

    Murray, D. T.; Hung, I.; Cross, T. A.

    2014-01-01

    Oriented sample solid state NMR techniques have been routinely employed to determine the structures of membrane proteins with one or two transmembrane helices. For larger proteins the technique has been limited by spectral resolution and lack of assignment strategies. Here, a strategy for resonance assignment is devised and applied to a three transmembrane helix protein. Sequence specific assignments for all labeled transmembrane amino acid sites are obtained, which provide a set of orientational restraints and helix orientation in the bilayer. Our experiments expand the utility of solid state NMR in membrane protein structure characterization to three transmembrane helix proteins and represent a straightforward strategy for routinely characterizing multiple transmembrane helix protein structures. PMID:24509383

  2. Assignment of oriented sample NMR resonances from a three transmembrane helix protein.

    PubMed

    Murray, D T; Hung, I; Cross, T A

    2014-03-01

    Oriented sample solid state NMR techniques have been routinely employed to determine the structures of membrane proteins with one or two transmembrane helices. For larger proteins the technique has been limited by spectral resolution and lack of assignment strategies. Here, a strategy for resonance assignment is devised and applied to a three transmembrane helix protein. Sequence specific assignments for all labeled transmembrane amino acid sites are obtained, which provide a set of orientational restraints and helix orientations in the bilayer. Our experiments expand the utility of solid state NMR in membrane protein structure characterization to three transmembrane helix proteins and represent a straightforward strategy for routinely characterizing multiple transmembrane helix protein structures. PMID:24509383

  3. CONSERVATION. Genetic assignment of large seizures of elephant ivory reveals Africa's major poaching hotspots.

    PubMed

    Wasser, S K; Brown, L; Mailand, C; Mondol, S; Clark, W; Laurie, C; Weir, B S

    2015-07-01

    Poaching of elephants is now occurring at rates that threaten African populations with extinction. Identifying the number and location of Africa's major poaching hotspots may assist efforts to end poaching and facilitate recovery of elephant populations. We genetically assign origin to 28 large ivory seizures (≥0.5 metric tons) made between 1996 and 2014, also testing assignment accuracy. Results suggest that the major poaching hotspots in Africa may be currently concentrated in as few as two areas. Increasing law enforcement in these two hotspots could help curtail future elephant losses across Africa and disrupt this organized transnational crime. PMID:26089357

  4. Stereospecific assignments of glycine in proteins by stereospecific deuteration and {sup 15}N labeling

    SciTech Connect

    Hansen, A.P.; Curley, R.W. Jr.; Panigot, M.J.; Fesik, S.W.

    1994-12-01

    Stereospecific assignments are important for accurately determining the three-dimensional structures of proteins through the use of multidimensional NMR techniques. It is especially important to stereospecifically assign the glycine {alpha}-protons in proteins because of the potential for different backbone conformations of this residue. These stereospecific assignments are critical for interpreting the {sup 3}J{sub NH,{alpha}H} coupling constants and NOEs involving the glycine {alpha}-protons that determine the conformation of this part of the protein. However, it is often difficult to unambiguously obtain the stereospecific assignments for glycine residues by using only NOE data. In this poster, we present a method for unambiguous, stereospecific assignment of the {alpha}-protons of glycine residues. This method involves synthesis of stereo-specifically deuterated and {sup 15}N-labeled Gly using a slightly modified procedure originally described by Woodard and coworkers for the stereoselective deuteration of glycine. The stereospecifically deuterated and {sup 15}N-labeled Gy has been incorporated into recombinant proteins expressed in both bacterial systems (FKBP) and mammalian cells (u-PA). Two- and three-dimensional isotope-filtered and isotope-edited NMR experiments were used to obtain the stereospecific assignments of the glycine {alpha}-protons for these proteins.

  5. NMR Assignments for a Helical 40 kDa Membrane Protein

    SciTech Connect

    Oxenoid, Kirill; Kim, Hak J.; Jacob, Jaison; Soennichsen, Frank D.; Sanders, Charles R.

    2004-04-28

    Backbone nuclear magnetic resonance (NMR) assignments were achieved for diacylglycerol kinase (DAGK) in detergent micelles. DAGK is a homotrimeric integral membrane protein comprised of 121 residue subunits, each having three transmembrane segments. Assignments were made using TROSY-based pulse sequences. DAGK was found to be an almost exclusively helical protein. This work points to the feasibility of both solving the structure of DAGK using solution NMR methods and using NMR as a primary tool in structural studies of other helical integral membrane proteins of similar size and complexity.

  6. NMR assignment of the arenaviral protein Z from Lassa fever virus.

    PubMed

    Volpon, Laurent; Osborne, Michael J; Borden, Katherine L B

    2008-06-01

    The arenavirus protein Z from Lassa fever virus was recently found to inhibit mRNA translation through direct interaction with eIF4E. Here, we report the NMR assignment of this RING-containing protein that was determined by triple resonance NMR techniques. PMID:18958179

  7. NMR assignment of the arenaviral protein Z from Lassa fever virus

    PubMed Central

    Osborne, Michael J.; Borden, Katherine L.B.

    2008-01-01

    The arenavirus protein Z from Lassa fever virus was recently found to inhibit mRNA translation through direct interaction with eIF4E. Here, we report the NMR assignment of this RING-containing protein that was determined by triple resonance NMR techniques. PMID:18958179

  8. 1H, 13C and 15N resonance assignments of URNdesign, a computationally redesigned RRM protein

    SciTech Connect

    Dobson, Neil; Dantas, Gautam; Varani, Gabriele

    2005-10-01

    Protein design represents one of the great challenges of computational structural biology. The ability to successfully design new proteins would allow us to generate new reagents and enzymes, while at the same time providing us with an understanding of the principles of protein stability. Here we report 1H, 15N and 13C resonance assignments of a redesigned U1A protein, URNdesign. U1A has been studied extensively by our group and hence was chosen as a design target. For the assignments we sued 2D and 3D heteronuclearNMR experiments with uniformly 13C, 15N-labeled URNdesign. The assignments for the backbone NH, CO,Ca and Cb nuclei are 94%complete. Sidechain 1Hand13C, aromatic andQ/NNH2 resonances are essentially complete with guanidinium and K NH3 residues unassigned. BMRB deposit with accession number 6493

  9. NMR assignment of the amylase-binding protein A from Streptococcus parasanguinis.

    PubMed

    Liu, Bing; Zhu, Fan; Wu, Hui; Matthews, Stephen

    2015-04-01

    Streptococcus parasanguinis is a primary colonizer of tooth surfaces in the oral cavity. Amylase-binding protein A (AbpA) from S. parasanguinis is responsible for the recruitment of salivary amylase to bacterial surface, which plays an important role in the development of oral biofilms. Here, we describe the essentially complete NMR assignments for AbpA. PMID:25016927

  10. A mutagenesis-free approach to assignment of (19)F NMR resonances in biosynthetically labeled proteins.

    PubMed

    Kitevski-LeBlanc, Julianne L; Al-Abdul-Wahid, M Sameer; Prosser, R Scott

    2009-02-18

    Solution NMR studies of protein structure and dynamics using fluorinated amino acid probes are a valuable addition to the repertoire of existing (13)C, (15)N, and (1)H experiments. Despite the numerous advantages of the (19)F nucleus in NMR, protein studies are complicated by the dependence of resonance assignments on site-directed mutagenesis methods which are laborious and often problematic. Here we report an NMR-based route to the assignment of fluorine resonances in (13)C,(15)N-3-fluoro-l-tyrosine labeled calmodulin. The assignment begins with the correlation of the fluorine nucleus to the delta proton in the novel (13)C,(15)N-enriched probe which is achieved using a CT-HCCF-COSY experiment. Connection to the backbone is made through two additional solution NMR experiments, namely the (H(beta))C(beta)(C(gamma)C(delta))H(delta) and HNCACB. Assignments are completed using either previously published backbone chemical shift data or obtained experimentally provided uniform (13)C,(15)N labeling procedures are employed during protein expression. Additional benefits of the (13)C,(15)N-3-fluoro-l-tyrosine probe include the reduction of spectral overlap through ((13)C(19)F) CT-HSQCs, as well as the ability to monitor side chain dynamics using (19)F T(1), T(2), and the (13)C-(19)F NOE. PMID:19173647

  11. SAS-Pro: Simultaneous Residue Assignment and Structure Superposition for Protein Structure Alignment

    PubMed Central

    Shah, Shweta B.; Sahinidis, Nikolaos V.

    2012-01-01

    Protein structure alignment is the problem of determining an assignment between the amino-acid residues of two given proteins in a way that maximizes a measure of similarity between the two superimposed protein structures. By identifying geometric similarities, structure alignment algorithms provide critical insights into protein functional similarities. Existing structure alignment tools adopt a two-stage approach to structure alignment by decoupling and iterating between the assignment evaluation and structure superposition problems. We introduce a novel approach, SAS-Pro, which addresses the assignment evaluation and structure superposition simultaneously by formulating the alignment problem as a single bilevel optimization problem. The new formulation does not require the sequentiality constraints, thus generalizing the scope of the alignment methodology to include non-sequential protein alignments. We employ derivative-free optimization methodologies for searching for the global optimum of the highly nonlinear and non-differentiable RMSD function encountered in the proposed model. Alignments obtained with SAS-Pro have better RMSD values and larger lengths than those obtained from other alignment tools. For non-sequential alignment problems, SAS-Pro leads to alignments with high degree of similarity with known reference alignments. The source code of SAS-Pro is available for download at http://eudoxus.cheme.cmu.edu/saspro/SAS-Pro.html. PMID:22662161

  12. Ner protein of phage Mu: Assignments using {sup 13}C/{sup 15}N-labeled protein

    SciTech Connect

    Strzelecka, T.; Gronenborn, A.M.; Clore, G.M.

    1994-12-01

    The Ner protein is a small (74-amino acid) DNA-binding protein that regulates a switch between the lysogenic and lytic stages of phage Mu. It inhibits expression of the C repressor gene and down-regulates its own expression. Two-dimensional NMR experiments on uniformly {sup 15}N-labeled protein provided most of the backbone and some of the sidechain proton assignments. The secondary structure determination using two-dimensional NOESY experiments showed that Ner consists of five {alpha}-helices. However, because most of the sidechain protons could not be assigned, the full structure was not determined. Using uniformly {sup 13}C/{sup 15}N-labeled Ner and a set of three-dimensional experiments, we were able to assign all of the backbone and 98% of the sidechain protons. In particular, the CBCANH and CBCA(CO)NH experiments were used to sequentially assign the C{alpha} and C{beta} resonances; the HCCH-CTOCSY and HCCH-COSY were used to assign sidechain carbon and proton resonances.

  13. Rapid proton-detected NMR assignment for proteins with fast magic angle spinning.

    PubMed

    Barbet-Massin, Emeline; Pell, Andrew J; Retel, Joren S; Andreas, Loren B; Jaudzems, Kristaps; Franks, W Trent; Nieuwkoop, Andrew J; Hiller, Matthias; Higman, Victoria; Guerry, Paul; Bertarello, Andrea; Knight, Michael J; Felletti, Michele; Le Marchand, Tanguy; Kotelovica, Svetlana; Akopjana, Inara; Tars, Kaspars; Stoppini, Monica; Bellotti, Vittorio; Bolognesi, Martino; Ricagno, Stefano; Chou, James J; Griffin, Robert G; Oschkinat, Hartmut; Lesage, Anne; Emsley, Lyndon; Herrmann, Torsten; Pintacuda, Guido

    2014-09-01

    Using a set of six (1)H-detected triple-resonance NMR experiments, we establish a method for sequence-specific backbone resonance assignment of magic angle spinning (MAS) nuclear magnetic resonance (NMR) spectra of 5-30 kDa proteins. The approach relies on perdeuteration, amide (2)H/(1)H exchange, high magnetic fields, and high-spinning frequencies (ωr/2π ≥ 60 kHz) and yields high-quality NMR data, enabling the use of automated analysis. The method is validated with five examples of proteins in different condensed states, including two microcrystalline proteins, a sedimented virus capsid, and two membrane-embedded systems. In comparison to contemporary (13)C/(15)N-based methods, this approach facilitates and accelerates the MAS NMR assignment process, shortening the spectral acquisition times and enabling the use of unsupervised state-of-the-art computational data analysis protocols originally developed for solution NMR. PMID:25102442

  14. Homology modeling and assigned functional annotation of an uncharacterized antitoxin protein from Streptomyces xinghaiensis.

    PubMed

    Oany, Arafat Rahman; Ahmed, Md Shahabuddin; Jahan, Nasreen; Latif, Md Abdul; Mahmud, Shahin; Hossain, Md Ahmed; Akter, Fatema; Rakib, Hasibul Haque; Islam, Md Shariful

    2015-01-01

    Streptomyces xinghaiensis is a Gram-positive, aerobic and non-motile bacterium. The bacterial genome is known. Therefore, it is of interest to study the uncharacterized proteins in the genome. An uncharacterized protein (gi|518540893|86 residues) in the genome was selected for a comprehensive computational sequence-structure-function analysis using available data and tools. Subcellular localization of the targeted protein with conserved residues and assigned secondary structures is documented. Sequence homology search against the protein data bank (PDB) and non-redundant GenBank proteins using BLASTp showed different homologous proteins with known antitoxin function. A homology model of the target protein was developed using a known template (PDB ID: 3CTO:A) with 62% sequence similarity in HHpred after assessment using programs PROCHECK and QMEAN6. The predicted active site using CASTp is analyzed for assigned anti-toxin function. This information finds specific utility in annotating the said uncharacterized protein in the bacterial genome. PMID:26912949

  15. Homology modeling and assigned functional annotation of an uncharacterized antitoxin protein from Streptomyces xinghaiensis

    PubMed Central

    Oany, Arafat Rahman; Ahmed, Md Shahabuddin; Jahan, Nasreen; Latif, Md Abdul; Mahmud, Shahin; Hossain, Md. Ahmed; Akter, Fatema; Rakib, Hasibul Haque; Islam, Md. Shariful

    2015-01-01

    Streptomyces xinghaiensis is a Gram-positive, aerobic and non-motile bacterium. The bacterial genome is known. Therefore, it is of interest to study the uncharacterized proteins in the genome. An uncharacterized protein (gi|518540893|86 residues) in the genome was selected for a comprehensive computational sequence-structure-function analysis using available data and tools. Subcellular localization of the targeted protein with conserved residues and assigned secondary structures is documented. Sequence homology search against the protein data bank (PDB) and non-redundant GenBank proteins using BLASTp showed different homologous proteins with known antitoxin function. A homology model of the target protein was developed using a known template (PDB ID: 3CTO:A) with 62% sequence similarity in HHpred after assessment using programs PROCHECK and QMEAN6. The predicted active site using CASTp is analyzed for assigned anti-toxin function. This information finds specific utility in annotating the said uncharacterized protein in the bacterial genome. PMID:26912949

  16. Fast automated protein NMR data collection and assignment by ADAPT-NMR on Bruker spectrometers.

    PubMed

    Lee, Woonghee; Hu, Kaifeng; Tonelli, Marco; Bahrami, Arash; Neuhardt, Elizabeth; Glass, Karen C; Markley, John L

    2013-11-01

    ADAPT-NMR (Assignment-directed Data collection Algorithm utilizing a Probabilistic Toolkit in NMR) supports automated NMR data collection and backbone and side chain assignment for [U-(13)C, U-(15)N]-labeled proteins. Given the sequence of the protein and data for the orthogonal 2D (1)H-(15)N and (1)H-(13)C planes, the algorithm automatically directs the collection of tilted plane data from a variety of triple-resonance experiments so as to follow an efficient pathway toward the probabilistic assignment of (1)H, (13)C, and (15)N signals to specific atoms in the covalent structure of the protein. Data collection and assignment calculations continue until the addition of new data no longer improves the assignment score. ADAPT-NMR was first implemented on Varian (Agilent) spectrometers [A. Bahrami, M. Tonelli, S.C. Sahu, K.K. Singarapu, H.R. Eghbalnia, J.L. Markley, PLoS One 7 (2012) e33173]. Because of broader interest in the approach, we present here a version of ADAPT-NMR for Bruker spectrometers. We have developed two AU console programs (ADAPT_ORTHO_run and ADAPT_NMR_run) that run under TOPSPIN Versions 3.0 and higher. To illustrate the performance of the algorithm on a Bruker spectrometer, we tested one protein, chlorella ubiquitin (76 amino acid residues), that had been used with the Varian version: the Bruker and Varian versions achieved the same level of assignment completeness (98% in 20 h). As a more rigorous evaluation of the Bruker version, we tested a larger protein, BRPF1 bromodomain (114 amino acid residues), which yielded an automated assignment completeness of 86% in 55 h. Both experiments were carried out on a 500 MHz Bruker AVANCE III spectrometer equipped with a z-gradient 5 mm TCI probe. ADAPT-NMR is available at http://pine.nmrfam.wisc.edu/ADAPT-NMR in the form of pulse programs, the two AU programs, and instructions for installation and use. PMID:24091140

  17. Fast automated protein NMR data collection and assignment by ADAPT-NMR on Bruker spectrometers

    NASA Astrophysics Data System (ADS)

    Lee, Woonghee; Hu, Kaifeng; Tonelli, Marco; Bahrami, Arash; Neuhardt, Elizabeth; Glass, Karen C.; Markley, John L.

    2013-11-01

    ADAPT-NMR (Assignment-directed Data collection Algorithm utilizing a Probabilistic Toolkit in NMR) supports automated NMR data collection and backbone and side chain assignment for [U-13C, U-15N]-labeled proteins. Given the sequence of the protein and data for the orthogonal 2D 1H-15N and 1H-13C planes, the algorithm automatically directs the collection of tilted plane data from a variety of triple-resonance experiments so as to follow an efficient pathway toward the probabilistic assignment of 1H, 13C, and 15N signals to specific atoms in the covalent structure of the protein. Data collection and assignment calculations continue until the addition of new data no longer improves the assignment score. ADAPT-NMR was first implemented on Varian (Agilent) spectrometers [A. Bahrami, M. Tonelli, S.C. Sahu, K.K. Singarapu, H.R. Eghbalnia, J.L. Markley, PLoS One 7 (2012) e33173]. Because of broader interest in the approach, we present here a version of ADAPT-NMR for Bruker spectrometers. We have developed two AU console programs (ADAPT_ORTHO_run and ADAPT_NMR_run) that run under TOPSPIN Versions 3.0 and higher. To illustrate the performance of the algorithm on a Bruker spectrometer, we tested one protein, chlorella ubiquitin (76 amino acid residues), that had been used with the Varian version: the Bruker and Varian versions achieved the same level of assignment completeness (98% in 20 h). As a more rigorous evaluation of the Bruker version, we tested a larger protein, BRPF1 bromodomain (114 amino acid residues), which yielded an automated assignment completeness of 86% in 55 h. Both experiments were carried out on a 500 MHz Bruker AVANCE III spectrometer equipped with a z-gradient 5 mm TCI probe. ADAPT-NMR is available at http://pine.nmrfam.wisc.edu/ADAPT-NMR in the form of pulse programs, the two AU programs, and instructions for installation and use.

  18. Large-scale air traffic surveillance using an IMM estimator with assignment

    NASA Astrophysics Data System (ADS)

    Wang, Hui; Kirubarajan, Thiagalingam; Li, Yicong; Bar-Shalom, Yaakov

    1997-10-01

    In this paper we present the development and implementation of a multisensor-multitarget tracking algorithm for large scale air traffic surveillance based on the IMM state estimator combined with a 2-dimensional assignment for data association. The algorithm can be used to track a large umber of targets from measurements obtained with a large number of radars. The use of the algorithm is illustrated on measurements obtained from 5 FAA radars, which are asynchronous, heterogeneous and geographically distributed over a large area. Both secondary radar data (beacon returns from cooperative targets) as well as primary radar data (skin returns from non-cooperative targets) are used. The target IDs from the beacon returns are not used in the data association. The surveillance region includes about 800 targets that exhibit different types of motion. The performance of the IMM estimator is compared with that of the Kalman filter. A number of performance measures that can be used on real data without knowledge of the ground truth are presented for this purpose. It is shown that the IMM estimator performs better than the Kalman filter. The advantage of fusing multisensor data is quantified. It is also shown that the computational requirements in the multisensor case are lower than in single sensor case.

  19. Using Computer-Based Writing Software to Facilitate Writing Assignments in Large Political Science Classes

    ERIC Educational Resources Information Center

    Ishiyama, John; Watson, Wendy L.

    2014-01-01

    It is generally accepted in the literature that writing assignments, even short ones, increase both student writing ability and comprehension of the material covered in the assignments. As class enrollments increase, particularly at the introductory level, however, instructors often sacrifice writing assignments because of the difficulty in…

  20. Reduced dimensionality 3D HNCAN for unambiguous HN, CA and N assignment in proteins

    NASA Astrophysics Data System (ADS)

    Rout, Manoj Kumar; Mishra, Pushpa; Atreya, Hanudatta S.; Hosur, Ramakrishna V.

    2012-03-01

    We present here an improvisation of HNN (Panchal, Bhavesh et al., 2001) called RD 3D HNCAN for backbone (HN, CA and 15N) assignment in both folded and unfolded proteins. This is a reduced dimensionality experiment which employs CA chemical shifts to improve dispersion. Distinct positive and negative peak patterns of various triplet segments along the polypeptide chain observed in HNN are retained and these provide start and check points for the sequential walk. Because of co-incrementing of CA and 15N, peaks along one of the dimensions appear at sums and differences of the CA and 15N chemical shifts. This changes the backbone assignment protocol slightly and we present this in explicit detail. The performance of the experiment has been demonstrated using Ubiquitin and Plasmodium falciparum P2 proteins. The experiment is particularly valuable when two neighboring amino acid residues have nearly identical backbone 15N chemical shifts.

  1. Main-chain-directed strategy for the assignment of /sup 1/H NMR spectra of proteins

    SciTech Connect

    Englander, S.W.; Wand, A.J.

    1987-09-22

    A strategy for assigning the resonances in two-dimensional (2D) NMR spectra of proteins is described. The method emphasizes the analysis of through-space relationships between protons by use of the two-dimensional nuclear Overhauser effect (NOE) experiment. NOE patterns used in the algorithm were derived from a statistical analysis of the combinations of short proton-proton distances observed in the high-resolution crystal structures of 21 proteins. One starts with a search for authentic main-chain NH-C/sub ..cap alpha../H-C/sub ..beta../H J-coupled units, which can be found with high reliability. The many main-chain units of a protein are then placed in their proper juxtaposition by recognition of predefined NOE connectivity patterns. To discover these connectivities, the 2D NOE spectrum is examined, in a prescribed order, for the distinct NOE patterns characteristic of helices, sheets, turns, and extended chain. Finally, the recognition of a few amino acid side-chain types places the discovered secondary structure elements within the polypeptide sequences. Unlike the sequential assignment approach, the main-chain-directed strategy does not rely on the difficult task of recognizing many side-chain spin systems in J-correlated spectra, the assignment process is not in general sequential with the polypeptide chain, and the prescribed connectivity patterns are cyclic rather than linear. The latter characteristic avoids ambiguous branch points in the analysis and imposed an internally confirmatory property on each forward step.

  2. iHADAMAC: A complementary tool for sequential resonance assignment of globular and highly disordered proteins

    NASA Astrophysics Data System (ADS)

    Feuerstein, Sophie; Plevin, Michael J.; Willbold, Dieter; Brutscher, Bernhard

    2012-01-01

    An experiment, iHADAMAC, is presented that yields information on the amino-acid type of individual residues in a protein by editing the 1H- 15N correlations into seven different 2D spectra, each corresponding to a different class of amino-acid types. Amino-acid type discrimination is realized via a Hadamard encoding scheme based on four different spin manipulations as recently introduced in the context of the sequential HADAMAC experiment. Both sequential and intra-residue HADAMAC experiments yield highly complementary information that greatly facilitate resonance assignment of proteins with high frequency degeneracy, as demonstrated here for a 188-residue intrinsically disordered protein fragment of the hepatitis C virus protein NS5A.

  3. Novel 2D Triple-Resonance NMR Experiments for Sequential Resonance Assignments of Proteins

    NASA Astrophysics Data System (ADS)

    Ding, Keyang; Gronenborn, Angela M.

    2002-06-01

    We present 2D versions of the popular triple resonance HN(CO) CACB, HN(COCA)CACB, HN(CO)CAHA, and HN(COCA) CAHA experiments, commonly used for sequential resonance assignments of proteins. These experiments provide information about correlations between amino proton and nitrogen chemical shifts and the α- and β-carbon and α-proton chemical shifts within and between amino acid residues. Using these 2D spectra, sequential resonance assignments of H N, N, C α, C β, and H α nuclei are easily achieved. The resolution of these spectra is identical to the well-resolved 2D 15N- 1H HSQC and H(NCO)CA spectra, with slightly reduced sensitivity compared to their 3D and 4D versions. These types of spectra are ideally suited for exploitation in automated assignment procedures and thereby constitute a fast and efficient means for NMR structural determination of small and medium-sized proteins in solution in structural genomics programs.

  4. Mass spectrometry-based protein identification with accurate statistical significance assignment

    PubMed Central

    Alves, Gelio; Yu, Yi-Kuo

    2015-01-01

    Motivation: Assigning statistical significance accurately has become increasingly important as metadata of many types, often assembled in hierarchies, are constructed and combined for further biological analyses. Statistical inaccuracy of metadata at any level may propagate to downstream analyses, undermining the validity of scientific conclusions thus drawn. From the perspective of mass spectrometry-based proteomics, even though accurate statistics for peptide identification can now be achieved, accurate protein level statistics remain challenging. Results: We have constructed a protein ID method that combines peptide evidences of a candidate protein based on a rigorous formula derived earlier; in this formula the database P-value of every peptide is weighted, prior to the final combination, according to the number of proteins it maps to. We have also shown that this protein ID method provides accurate protein level E-value, eliminating the need of using empirical post-processing methods for type-I error control. Using a known protein mixture, we find that this protein ID method, when combined with the Sorić formula, yields accurate values for the proportion of false discoveries. In terms of retrieval efficacy, the results from our method are comparable with other methods tested. Availability and implementation: The source code, implemented in C++ on a linux system, is available for download at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbp/qmbp_ms/RAId/RAId_Linux_64Bit. Contact: yyu@ncbi.nlm.nih.gov Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25362092

  5. Assignment of protein backbone resonances using connectivity, torsion angles and 13Calpha chemical shifts.

    PubMed

    Morris, Laura C; Valafar, Homayoun; Prestegard, James H

    2004-05-01

    A program is presented which will return the most probable sequence location for a short connected set of residues in a protein given just (13)C(alpha) chemical shifts (delta((13)C(alpha))) and data restricting the phi and psi backbone angles. Data taken from both the BioMagResBank and the Protein Data Bank were used to create a probability density function (PDF) using a multivariate normal distribution in delta((13)C(alpha)), phi, and psi space for each amino acid residue. Extracting and combining probabilities for particular amino acid residues in a short proposed sequence yields a score indicative of the correctness of the proposed assignment. The program is illustrated using several proteins for which structure and (13)C(alpha) chemical shift data are available. PMID:15017135

  6. Two-dimensional sup 1 H NMR studies on HPr protein from Staphylococcus aureus: Complete sequential assignments and secondary structure

    SciTech Connect

    Kalbitzer, H.R.; Neidig, K.P. ); Hengstenberg, W. )

    1991-11-19

    Complete sequence-specific assignments of the {sup 1}H NMR spectrum of HPr protein from Staphylococcus aureus were obtained by two-dimensional NMR methods. Important secondary structure elements that can be derived from the observed nuclear Overhauser effects are a large antiparallel {beta}-pleated sheet consisting of four strands, A, B, C, D, a segment S{sub AB} consisting of an extended region around the active-center histidine (His-15) and an {alpha}-helix, a half-turn between strands B and C, a segment S{sub CD} which shows no typical secondary structure, and the {alpha}-helical, C-terminal segment S{sub term}. These general structural features are similar to those found earlier in HPr proteins from different microorganisms such as Escherichia coli, Bacillus subtilis, and Streptococcus faecalis.

  7. Resonance assignment of PsbP: an extrinsic protein from photosystem II of Spinacia oleracea.

    PubMed

    Rathner, Adriana; Chandra, Kousik; Rathner, Petr; Horničáková, Michaela; Schlagnitweit, Judith; Kohoutová, Jaroslava; Ettrich, Rüdiger; Müller, Norbert

    2015-10-01

    PsbP (23 kDa) is an extrinsic eukaryotic protein of photosystem II found in the thylakoid membrane of higher plants and green algae. It has been proven to be indispensable for proper functioning of the oxygen evolving complex. By interaction with other extrinsic proteins (PsbQ, PsbO and PsbR), it modulates the concentration of two cofactors of the water splitting reaction, Ca(2+) and Cl(-). The crystallographic structure of PsbP from Spinacia oleracea lacks the N-terminal part as well as two inner regions which were modelled as loops. Those unresolved parts are believed to be functionally crucial for the binding of PsbP to the thylakoid membrane. In this NMR study we report (1)H, (15)N and (13)C resonance assignments of the backbone and side chain atoms of the PsbP protein. Based on these data, an estimate of the secondary structure has been made. The structural motifs found fit the resolved parts of the crystallographic structure very well. In addition, the complete assignment set provides preliminary insight into the dynamic regions. PMID:25903141

  8. Resonance assignments for latherin, a natural surfactant protein from horse sweat.

    PubMed

    Vance, Steven J; McDonald, Rhona E; Cooper, Alan; Kennedy, Malcolm W; Smith, Brian O

    2014-04-01

    Latherin is an intrinsically surfactant protein of ~23 kDa found in the sweat and saliva of horses. Its function is probably to enhance the translocation of sweat water from the skin to the surface of the pelt for evaporative cooling. Its role in saliva may be to enhance the wetting, softening and maceration of the dry, fibrous food for which equines are adapted. Latherin is unusual in its relatively high content of aliphatic amino acids (~25% leucines) that might contribute to its surfactant properties. Latherin is related to the palate, lung, and nasal epithelium carcinoma-associated proteins (PLUNCs) of mammals, at least one of which is now known to exhibit similar surfactant activity to latherin. No structures of any PLUNC protein are currently available. (15)N,(13)C-labelled recombinant latherin was produced in Escherichia coli, and essentially all of the resonances were assigned despite the signal overlap due to the preponderance of leucines. The most notable exceptions include a number of residues located in an apparently dynamic loop region between residues 145 and 154. The assignments have been deposited with BMRB accession number 19067. PMID:23708874

  9. Short communication : a linear assignment approach for the least-squares protein morphing problem.

    SciTech Connect

    Anitescu, M.; Park, S.; Mathematics and Computer Science

    2009-02-01

    This work addresses the computation of free-energy differences between protein conformations by using morphing (i.e., transformation) of a source conformation into a target conformation. To enhance the morphing procedure, we employ permutations of atoms: we seek to find the permutation s that minimizes the mean-square distance traveled by the atoms. Instead of performing this combinatorial search in the space of permutations, we show that the best permutation can be found by solving a linear assignment problem. We demonstrate that the use of such optimal permutations significantly improves the efficiency of the free-energy computation.

  10. Influence of assignment on the prediction of transmembrane helices in protein structures.

    PubMed

    Pylouster, Jean; Bornot, Aurélie; Etchebest, Catherine; de Brevern, Alexandre G

    2010-11-01

    α-Helical transmembrane proteins (TMPα) are composed of a series of helices embedded in the lipid bilayer. Due to technical difficulties, few 3D structures are available. Therefore, the design of structural models of TMPα is of major interest. We study the secondary structures of TMPα by analyzing the influence of secondary structures assignment methods (SSAMs). For this purpose, a published and updated benchmark databank of TMPα is used and several SSAMs (9) are evaluated. The analysis of the results points to significant differences in SSA depending on the methods used. Pairwise comparisons between SSAMs led to more than 10% of disagreement. Helical regions corresponding to transmembrane zones are often correctly characterized. The study of the sequence-structure relationship shows very limited differences with regard to the structural disagreement. Secondary structure prediction based on Bayes' rule and using only a single sequence give correct prediction rates ranging from 78 to 81%. A structural alphabet approach gives a slightly better prediction, i.e., only 2% less than the best equivalent approach, whereas the prediction rate with a very different assignment bypasses 86%. This last result highlights the importance of the correct assignment choice to evaluate the prediction assessment. PMID:20349322

  11. Development and Application of ANN Model for Worker Assignment into Virtual Cells of Large Sized Configurations

    NASA Astrophysics Data System (ADS)

    Murali, R. V.; Puri, A. B.; Fathi, Khalid

    2010-10-01

    This paper presents an extended version of study already undertaken on development of an artificial neural networks (ANNs) model for assigning workforce into virtual cells under virtual cellular manufacturing systems (VCMS) environments. Previously, the same authors have introduced this concept and applied it to virtual cells of two-cell configuration and the results demonstrated that ANNs could be a worth applying tool for carrying out workforce assignments. In this attempt, three-cell configurations problems are considered for worker assignment task. Virtual cells are formed under dual resource constraint (DRC) context in which the number of available workers is less than the total number of machines available. Since worker assignment tasks are quite non-linear and highly dynamic in nature under varying inputs & conditions and, in parallel, ANNs have the ability to model complex relationships between inputs and outputs and find similar patterns effectively, an attempt was earlier made to employ ANNs into the above task. In this paper, the multilayered perceptron with feed forward (MLP-FF) neural network model has been reused for worker assignment tasks of three-cell configurations under DRC context and its performance at different time periods has been analyzed. The previously proposed worker assignment model has been reconfigured and cell formation solutions available for three-cell configuration in the literature are used in combination to generate datasets for training ANNs framework. Finally, results of the study have been presented and discussed.

  12. CONNJUR R: an annotation strategy for fostering reproducibility in bio-NMR-protein spectral assignment.

    PubMed

    Fenwick, Matthew; Hoch, Jeffrey C; Ulrich, Eldon; Gryk, Michael R

    2015-10-01

    Reproducibility is a cornerstone of the scientific method, essential for validation of results by independent laboratories and the sine qua non of scientific progress. A key step toward reproducibility of biomolecular NMR studies was the establishment of public data repositories (PDB and BMRB). Nevertheless, bio-NMR studies routinely fall short of the requirement for reproducibility that all the data needed to reproduce the results are published. A key limitation is that considerable metadata goes unpublished, notably manual interventions that are typically applied during the assignment of multidimensional NMR spectra. A general solution to this problem has been elusive, in part because of the wide range of approaches and software packages employed in the analysis of protein NMR spectra. Here we describe an approach for capturing missing metadata during the assignment of protein NMR spectra that can be generalized to arbitrary workflows, different software packages, other biomolecules, or other stages of data analysis in bio-NMR. We also present extensions to the NMR-STAR data dictionary that enable machine archival and retrieval of the "missing" metadata. PMID:26253947

  13. ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process

    PubMed Central

    2011-01-01

    Background Phylogenetic profiling is a technique of scoring co-occurrence between a protein family and some other trait, usually another protein family, across a set of taxonomic groups. In spite of several refinements in recent years, the technique still invites significant improvement. To be its most effective, a phylogenetic profiling algorithm must be able to examine co-occurrences among protein families whose boundaries are uncertain within large homologous protein superfamilies. Results Partial Phylogenetic Profiling (PPP) is an iterative algorithm that scores a given taxonomic profile against the taxonomic distribution of families for all proteins in a genome. The method works through optimizing the boundary of each protein family, rather than by relying on prebuilt protein families or fixed sequence similarity thresholds. Double Partial Phylogenetic Profiling (DPPP) is a related procedure that begins with a single sequence and searches for optimal granularities for its surrounding protein family in order to generate the best query profiles for PPP. We present ProPhylo, a high-performance software package for phylogenetic profiling studies through creating individually optimized protein family boundaries. ProPhylo provides precomputed databases for immediate use and tools for manipulating the taxonomic profiles used as queries. Conclusion ProPhylo results show universal markers of methanogenesis, a new DNA phosphorothioation-dependent restriction enzyme, and efficacy in guiding protein family construction. The software and the associated databases are freely available under the open source Perl Artistic License from ftp://ftp.jcvi.org/pub/data/ppp/. PMID:22070167

  14. Reliable resonance assignments of selected residues of proteins with known structure based on empirical NMR chemical shift prediction

    NASA Astrophysics Data System (ADS)

    Li, Da-Wei; Meng, Dan; Brüschweiler, Rafael

    2015-05-01

    A robust NMR resonance assignment method is introduced for proteins whose 3D structure has previously been determined by X-ray crystallography. The goal of the method is to obtain a subset of correct assignments from a parsimonious set of 3D NMR experiments of 15N, 13C labeled proteins. Chemical shifts of sequential residue pairs are predicted from static protein structures using PPM_One, which are then compared with the corresponding experimental shifts. Globally optimized weighted matching identifies the assignments that are robust with respect to small changes in NMR cross-peak positions. The method, termed PASSPORT, is demonstrated for 4 proteins with 100-250 amino acids using 3D NHCA and a 3D CBCA(CO)NH experiments as input producing correct assignments with high reliability for 22% of the residues. The method, which works best for Gly, Ala, Ser, and Thr residues, provides assignments that serve as anchor points for additional assignments by both manual and semi-automated methods or they can be directly used for further studies, e.g. on ligand binding, protein dynamics, or post-translational modification, such as phosphorylation.

  15. Resonance assignment of DVU2108 that is part of the Orange Protein complex in Desulfovibrio vulgaris Hildenborough.

    PubMed

    Neca, António J; Soares, Rui; Carepo, Marta S P; Pauleta, Sofia R

    2016-04-01

    We report the 94 % assignment of DVU2108, a protein belonging to the Orange Protein family, that in Desulfovibrio vulgaris Hildenborough forms a protein complex named the Orange Protein complex. This complex has been shown to be implicated in the cell division of this organism. DVU2108 is a conserved protein in anaerobic microorganisms and in Desulfovibrio gigas the homologous protein was isolated with a novel Mo-Cu cluster non-covalently attached to the polypeptide chain. However, the heterologously produced DVU2108 did not contain any bound metal. These assignments provide the means to characterize the interaction of DVU2108 with the proteins that form the Orange Protein complex using NMR methods. PMID:26373427

  16. Writing Assignments with a Metacognitive Component Enhance Learning in a Large Introductory Biology Course

    ERIC Educational Resources Information Center

    Mynlieff, Michelle; Manogaran, Anita L.; St. Maurice, Martin; Eddinger, Thomas J.

    2014-01-01

    Writing assignments, including note taking and written recall, should enhance retention of knowledge, whereas analytical writing tasks with metacognitive aspects should enhance higher-order thinking. In this study, we assessed how certain writing-intensive "interventions," such as written exam corrections and peer-reviewed writing…

  17. Large-scale extraction of proteins.

    PubMed

    Cunha, Teresa; Aires-Barros, Raquel

    2002-01-01

    The production of foreign proteins using selected host with the necessary posttranslational modifications is one of the key successes in modern biotechnology. This methodology allows the industrial production of proteins that otherwise are produced in small quantities. However, the separation and purification of these proteins from the fermentation media constitutes a major bottleneck for the widespread commercialization of recombinant proteins. The major production costs (50-90%) for typical biological product resides in the purification strategy. There is a need for efficient, effective, and economic large-scale bioseparation techniques, to achieve high purity and high recovery, while maintaining the biological activity of the molecule. Aqueous two-phase systems (ATPS) allow process integration as simultaneously separation and concentration of the target protein is achieved, with posterior removal and recycle of the polymer. The ease of scale-up combined with the high partition coefficients obtained allow its potential application in large-scale downstream processing of proteins produced by fermentation. The equipment and the methodology for aqueous two-phase extraction of proteins on a large scale using mixer-settlerand column contractors are described. The operation of the columns, either stagewise or differential, are summarized. A brief description of the methods used to account for mass transfer coefficients, hydrodynamics parameters of hold-up, drop size, and velocity, back mixing in the phases, and flooding performance, required for column design, is also provided. PMID:11876297

  18. Genome analysis: Assigning protein coding regions to three-dimensional structures.

    PubMed Central

    Salamov, A. A.; Suwa, M.; Orengo, C. A.; Swindells, M. B.

    1999-01-01

    We describe the results of a procedure for maximizing the number of sequences that can be reliably linked to a protein of known three-dimensional structure. Unlike other methods, which try to increase sensitivity through the use of fold recognition software, we only use conventional sequence alignment tools, but apply them in a manner that significantly increases the number of relationships detected. We analyzed 11 genomes and found that, depending on the genome, between 23 and 32% of the ORFs had significant matches to proteins of known structure. In all cases, the aligned region consisted of either >100 residues or >50% of the smaller sequence. Slightly higher percentages could be attained if smaller motifs were also included. This is significantly higher than most previously reported methods, even those that have a fold-recognition component. We survey the biochemical and structural characteristics of the most frequently occurring proteins, and discuss the extent to which alignment methods can realistically assign function to gene products. PMID:10211823

  19. Time-shared experiments for efficient assignment of triple-selectively labeled proteins

    PubMed Central

    Löhr, Frank; Laguerre, Aisha; Bock, Christoph; Reckel, Sina; Connolly, Peter J.; Abdul-Manan, Norzehan; Tumulka, Franz; Abele, Rupert; Moore, Jonathan M.; Dötsch, Volker

    2014-01-01

    Combinatorial triple-selective labeling facilitates the NMR assignment process for proteins that are subject to signal overlap and insufficient signal-to-noise in standard triple-resonance experiments. Aiming at maximum amino-acid type and sequence-specific information, the method represents a trade-off between the number of selectively labeled samples that have to be prepared and the number of spectra to be recorded per sample. In order to address the demand of long measurement times, we here propose pulse sequences in which individual phase-shifted transients are stored separately and recombined later to produce several 2D HN(CX) type spectra that are usually acquired sequentially. Sign encoding by the phases of 13C 90° pulses allows to either select or discriminate against 13C’ or 13Cα spins coupled to 15N. As a result, 1H-15N correlation maps of the various isotopomeric species present in triple-selectively labeled proteins are deconvoluted which in turn reduces problems due to spectral overlap. The new methods are demonstrated with four different membrane proteins with rotational correlation times ranging from 18 to 52 ns. PMID:25442777

  20. Writing Assignments with a Metacognitive Component Enhance Learning in a Large Introductory Biology Course

    PubMed Central

    Mynlieff, Michelle; Manogaran, Anita L.; St. Maurice, Martin

    2014-01-01

    Writing assignments, including note taking and written recall, should enhance retention of knowledge, whereas analytical writing tasks with metacognitive aspects should enhance higher-order thinking. In this study, we assessed how certain writing-intensive “interventions,” such as written exam corrections and peer-reviewed writing assignments using Calibrated Peer Review and including a metacognitive component, improve student learning. We designed and tested the possible benefits of these approaches using control and experimental variables across and between our three-section introductory biology course. Based on assessment, students who corrected exam questions showed significant improvement on postexam assessment compared with their nonparticipating peers. Differences were also observed between students participating in written and discussion-based exercises. Students with low ACT scores benefited equally from written and discussion-based exam corrections, whereas students with midrange to high ACT scores benefited more from written than discussion-based exam corrections. Students scored higher on topics learned via peer-reviewed writing assignments relative to learning in an active classroom discussion or traditional lecture. However, students with low ACT scores (17–23) did not show the same benefit from peer-reviewed written essays as the other students. These changes offer significant student learning benefits with minimal additional effort by the instructors. PMID:26086661

  1. Writing Assignments with a Metacognitive Component Enhance Learning in a Large Introductory Biology Course.

    PubMed

    Mynlieff, Michelle; Manogaran, Anita L; St Maurice, Martin; Eddinger, Thomas J

    2014-01-01

    Writing assignments, including note taking and written recall, should enhance retention of knowledge, whereas analytical writing tasks with metacognitive aspects should enhance higher-order thinking. In this study, we assessed how certain writing-intensive "interventions," such as written exam corrections and peer-reviewed writing assignments using Calibrated Peer Review and including a metacognitive component, improve student learning. We designed and tested the possible benefits of these approaches using control and experimental variables across and between our three-section introductory biology course. Based on assessment, students who corrected exam questions showed significant improvement on postexam assessment compared with their nonparticipating peers. Differences were also observed between students participating in written and discussion-based exercises. Students with low ACT scores benefited equally from written and discussion-based exam corrections, whereas students with midrange to high ACT scores benefited more from written than discussion-based exam corrections. Students scored higher on topics learned via peer-reviewed writing assignments relative to learning in an active classroom discussion or traditional lecture. However, students with low ACT scores (17-23) did not show the same benefit from peer-reviewed written essays as the other students. These changes offer significant student learning benefits with minimal additional effort by the instructors. PMID:26086661

  2. A HAUSDORFF-BASED NOE ASSIGNMENT ALGORITHM USING PROTEIN BACKBONE DETERMINED FROM RESIDUAL DIPOLAR COUPLINGS AND ROTAMER PATTERNS

    PubMed Central

    Zeng, Jianyang (Michael); Tripathy, Chittaranjan; Zhou, Pei; Donald, Bruce R.

    2008-01-01

    High-throughput structure determination based on solution Nuclear Magnetic Resonance (NMR) spectroscopy plays an important role in structural genomics. One of the main bottlenecks in NMR structure determination is the interpretation of NMR data to obtain a sufficient number of accurate distance restraints by assigning nuclear Overhauser effect (NOE) spectral peaks to pairs of protons. The difficulty in automated NOE assignment mainly lies in the ambiguities arising both from the resonance degeneracy of chemical shifts and from the uncertainty due to experimental errors in NOE peak positions. In this paper we present a novel NOE assignment algorithm, called HAusdorff-based NOE Assignment (HANA), that starts with a high-resolution protein backbone computed using only two residual dipolar couplings (RDCs) per residue37, 39, employs a Hausdorff-based pattern matching technique to deduce similarity between experimental and back-computed NOE spectra for each rotamer from a statistically diverse library, and drives the selection of optimal position-specific rotamers for filtering ambiguous NOE assignments. Our algorithm runs in time O(tn3 +tn log t), where t is the maximum number of rotamers per residue and n is the size of the protein. Application of our algorithm on biological NMR data for three proteins, namely, human ubiquitin, the zinc finger domain of the human DNA Y-polymerase Eta (pol η) and the human Set2-Rpb1 interacting domain (hSRI) demonstrates that our algorithm overcomes spectral noise to achieve more than 90% assignment accuracy. Additionally, the final structures calculated using our automated NOE assignments have backbone RMSD < 1.7 Å and all-heavy-atom RMSD < 2.5 Å from reference structures that were determined either by X-ray crystallography or traditional NMR approaches. These results show that our NOE assignment algorithm can be successfully applied to protein NMR spectra to obtain high-quality structures. PMID:19122773

  3. A new strategy for sequential assignment of intrinsically unstructured proteins based on 15N single isotope labelling

    NASA Astrophysics Data System (ADS)

    Lopez, Juan; Ahuja, Puneet; Gerard, Melanie; Wieruszeski, Jean-Michel; Lippens, Guy

    2013-11-01

    We describe a new efficient strategy for the sequential assignment of amide resonances of a conventional 15N-1H HSQC spectrum of intrinsically unfolded proteins, based on composite NOESY-TOCSY and TOCSY-NOESY mixing times. These composite mixing times lead to a Hα-proton mediated unidirectional transfer of amide to amide proton. We have implemented the composite mixing times in an HSQC-NOESY-HSQC manner to obtain directional connectivity between amides of neighbouring residues. We experimentally determine the optimal mixing times for both transfer schemes, and demonstrate its use in the assignment for both a fragment of the neuronal tau protein and for α-synuclein.

  4. Proton and nitrogen sequential assignments and secondary structure determination of the human FK506 and rapamycin binding protein

    SciTech Connect

    Rosen, M.K.; Michnick, S.W.; Karplus, M.; Schreiber, S.L. )

    1991-05-14

    Sequential {sup 1}H and {sup 15}N assignments of human FKBP, a cytosolic binding protein for the immunosuppressive agents FK506 and rapamycin, are reported. A combination of homonuclear and relayed heteronuclear experiments has enabled assignment of 98 of 99 backbone amide NHs, 119 of 120 C{sup {alpha}}Hs, 97 of 99 non-proline amide {sup 15}Ns, and 375 of 412 side-chain resonances of this 107-residue protein. Long-range NOEs are used to demonstrate that FKBP has a novel folding topology consisting of a five-stranded antiparallel {beta} sheet with +3, +1, {minus}3, +1 loop connectivity.

  5. Quadratic partial eigenvalue assignment in large-scale stochastic dynamic systems for resilient and economic design

    SciTech Connect

    Das, Sonjoy; Goswami, Kundan; Datta, Biswa N.

    2014-12-10

    Failure of structural systems under dynamic loading can be prevented via active vibration control which shifts the damped natural frequencies of the systems away from the dominant range of loading spectrum. The damped natural frequencies and the dynamic load typically show significant variations in practice. A computationally efficient methodology based on quadratic partial eigenvalue assignment technique and optimization under uncertainty has been formulated in the present work that will rigorously account for these variations and result in an economic and resilient design of structures. A novel scheme based on hierarchical clustering and importance sampling is also developed in this work for accurate and efficient estimation of probability of failure to guarantee the desired resilience level of the designed system. Numerical examples are presented to illustrate the proposed methodology.

  6. Quadratic partial eigenvalue assignment in large-scale stochastic dynamic systems for resilient and economic design

    NASA Astrophysics Data System (ADS)

    Das, Sonjoy; Goswami, Kundan; Datta, Biswa N.

    2014-12-01

    Failure of structural systems under dynamic loading can be prevented via active vibration control which shifts the damped natural frequencies of the systems away from the dominant range of loading spectrum. The damped natural frequencies and the dynamic load typically show significant variations in practice. A computationally efficient methodology based on quadratic partial eigenvalue assignment technique and optimization under uncertainty has been formulated in the present work that will rigorously account for these variations and result in an economic and resilient design of structures. A novel scheme based on hierarchical clustering and importance sampling is also developed in this work for accurate and efficient estimation of probability of failure to guarantee the desired resilience level of the designed system. Numerical examples are presented to illustrate the proposed methodology.

  7. Quadratic partial eigenvalue assignment in large-scale stochastic dynamic systems for resilient and economic design

    NASA Astrophysics Data System (ADS)

    Das, S.; Goswami, K.; Datta, B. N.

    2016-05-01

    Failure of structural systems under dynamic loading can be prevented via active vibration control which shifts the damped natural frequencies of the systems away from the dominant range of a loading spectrum. The damped natural frequencies and the dynamic load typically show significant variations in practice. A computationally efficient methodology based on quadratic partial eigenvalue assignment technique and optimization under uncertainty has been formulated in the present work that will rigorously account for these variations and result in economic and resilient design of structures. A novel scheme based on hierarchical clustering and importance sampling is also developed in this work for accurate and efficient estimation of probability of failure to guarantee the desired resilience level of the designed system. Finally the most robust set of feedback matrices is selected from the set of probabilistically characterized optimal closed-loop system to implement the new methodology for design of active controlled structures. Numerical examples are presented to illustrate the proposed methodology.

  8. Strategy for complete NMR assignment of disordered proteins with highly repetitive sequences based on resolution-enhanced 5D experiments

    PubMed Central

    Motáčková, Veronika; Nováček, Jiří; Zawadzka-Kazimierczuk, Anna; Kazimierczuk, Krzysztof; Šanderová, Hana; Krásný, Libor; Koźmiński, Wiktor; Sklenář, Vladimír

    2010-01-01

    A strategy for complete backbone and side-chain resonance assignment of disordered proteins with highly repetitive sequence is presented. The protocol is based on three resolution-enhanced NMR experiments: 5D HN(CA)CONH provides sequential connectivity, 5D HabCabCONH is utilized to identify amino acid types, and 5D HC(CC-TOCSY)CONH is used to assign the side-chain resonances. The improved resolution was achieved by a combination of high dimensionality and long evolution times, allowed by non-uniform sampling in the indirect dimensions. Random distribution of the data points and Sparse Multidimensional Fourier Transform processing were used. Successful application of the assignment procedure to a particularly difficult protein, δ subunit of RNA polymerase from Bacillus subtilis, is shown to prove the efficiency of the strategy. The studied protein contains a disordered C-terminal region of 81 amino acids with a highly repetitive sequence. While the conventional assignment methods completely failed due to a very small differences in chemical shifts, the presented strategy provided a complete backbone and side-chain assignment. Electronic supplementary material The online version of this article (doi:10.1007/s10858-010-9447-3) contains supplementary material, which is available to authorized users. PMID:20890634

  9. A Set of Efficient nD NMR Protocols for Resonance Assignments of Intrinsically Disordered Proteins.

    PubMed

    Wiedemann, Christoph; Bellstedt, Peter; Häfner, Sabine; Herbst, Christian; Bordusa, Frank; Görlach, Matthias; Ohlenschläger, Oliver; Ramachandran, Ramadurai

    2016-07-01

    The RF pulse scheme RN[N-CA HEHAHA]NH, which provides a convenient approach to the acquisition of different multidimensional chemical shift correlation NMR spectra leading to backbone resonance assignments, including those of the proline residues of intrinsically disordered proteins (IDPs), is experimentally demonstrated. Depending on the type of correlation data required, the method involves the generation of in-phase ((15) N)(x) magnetisation via different magnetisation transfer pathways such as H→N→CO→N, HA→CA→CO→N, H→N→CA→N and H→CA→N, the subsequent application of (15) N-(13) C(α) heteronuclear Hartmann-Hahn mixing over a period of ≈100 ms, chemical-shift labelling of relevant nuclei before and after the heteronuclear mixing step and amide proton detection in the acquisition dimension. It makes use of the favourable relaxation properties of IDPs and the presence of (1) JCαN and (2) JCαN couplings to achieve efficient correlation of the backbone resonances of each amino acid residue "i" with the backbone amide resonances of residues "i-1" and "i+1". It can be implemented in a straightforward way through simple modifications of the RF pulse schemes commonly employed in protein NMR studies. The efficacy of the approach is demonstrated using a uniformly ((15) N,(13) C) labelled sample of α-synuclein. The different possibilities for obtaining the amino-acid-type information, simultaneously with the connectivity data between the backbone resonances of sequentially neighbouring residues, have also been outlined. PMID:27061973

  10. 4D APSY-HBCB(CG)CDHD experiment for automated assignment of aromatic amino acid side chains in proteins.

    PubMed

    Krähenbühl, Barbara; Hiller, Sebastian; Wider, Gerhard

    2011-11-01

    A four-dimensional (4D) APSY (automated projection spectroscopy)-HBCB(CG)CDHD experiment is presented. This 4D experiment correlates aromatic with aliphatic carbon and proton resonances from the same amino acid side chain of proteins in aqueous solution. It thus allows unambiguous sequence-specific assignment of aromatic amino acid ring signals based on backbone assignments. Compared to conventional 2D approaches, the inclusion of evolution periods on (1)H(β) and (13)C(δ) efficiently removes overlaps, and provides two additional frequencies for consequent automated or manual matching. The experiment was successfully applied to three proteins with molecular weights from 6 to 13 kDa. For the complementation of the assignment of the aromatic resonances, TOCSY- or COSY-based versions of a 4D APSY-HCCH(aro) sequence are proposed. PMID:21947871

  11. (1)H, (13)C and (15)N NMR assignments of a calcium-binding protein from Entamoeba histolytica.

    PubMed

    Verma, Deepshikha; Bhattacharya, Alok; Chary, Kandala V R

    2016-04-01

    We report almost complete sequence specific (1)H, (13)C and (15)N NMR assignments of a 150-residue long calmodulin-like calcium-binding protein from Entamoeba histolytica (EhCaBP6), as a prelude to its structural and functional characterization. PMID:26377206

  12. Investigations of Protein Structure and Function Using the Scientific Literature: An Assignment for an Undergraduate Cell Physiology Course

    ERIC Educational Resources Information Center

    Mulnix, Amy B.

    2003-01-01

    Undergraduate biology curricula are being modified to model and teach the activities of scientists better. The assignment described here, one that investigates protein structure and function, was designed for use in a sophomore-level cell physiology course at Earlham College. Students work in small groups to read and present in poster format on…

  13. Recruiting Participants for Large-Scale Random Assignment Experiments in School Settings

    ERIC Educational Resources Information Center

    Roschelle, Jeremy; Feng, Mingyu; Gallagher, H. Alix; Murphy, Robert; Harris, Christopher; Kamdar, Danae; Trinidad, Gucci

    2014-01-01

    Recruitment is a key challenge for researchers conducting any large school-based study. Control is needed not only over the condition participants receive, but also over how the intervention is implemented, and may include restrictions in other areas of school and classroom functioning. We report here on our experiences in recruiting participants…

  14. Computer-assisted assignment of 2D 1H NMR spectra of proteins: basic algorithms and application to phoratoxin B.

    PubMed

    Kleywegt, G J; Boelens, R; Cox, M; Llinás, M; Kaptein, R

    1991-05-01

    A suite of computer programs (CLAIRE) is described which can be of assistance in the process of assigning 2D 1H NMR spectra of proteins. The programs embody a software implementation of the sequential assignment approach first developed by Wüthrich and co-workers (K. Wüthrich, G. Wider, G. Wagner and W. Braun (1982) J. Mol. Biol. 155, 311). After data-abstraction (peakpicking), the software can be used to detect patterns (spin systems), to find cross peaks between patterns in 2D NOE data sets and to generate assignments that are consistent with all available data and which satisfy a number of constraints imposed by the user. An interactive graphics program called CONPAT is used to control the entire assignment process as well as to provide the essential feedback from the experimental NMR spectra. The algorithms are described in detail and the approach is demonstrated on a set of spectra from the mistletoe protein phoratoxin B, a homolog of crambin. The results obtained compare well with those reported earlier based entirely on a manual assignment process. PMID:1841687

  15. Reduced dimensionality (4,3)D-hnCOCANH experiment: an efficient backbone assignment tool for NMR studies of proteins.

    PubMed

    Kumar, Dinesh

    2013-09-01

    Sequence specific resonance assignment of proteins forms the basis for variety of structural and functional proteomics studies by NMR. In this context, an efficient standalone method for rapid assignment of backbone ((1)H, (15)N, (13)C(α) and (13)C') resonances of proteins has been presented here. Compared to currently available strategies used for the purpose, the method employs only a single reduced dimensionality experiment--(4,3)D-hnCOCANH and exploits the linear combinations of backbone ((13)C(α) and (13)C') chemical shifts to achieve a dispersion relatively better compared to those of individual chemical shifts (see the text). The resulted increased dispersion of peaks--which is different in sum (CA + CO) and difference (CA - CO) frequency regions--greatly facilitates the analysis of the spectrum by resolving the problems (associated with routine assignment strategies) arising because of degenerate amide (15)N and backbone (13)C chemical shifts. Further, the spectrum provides direct distinction between intra- and inter-residue correlations because of their opposite peak signs. The other beneficial feature of the spectrum is that it provides: (a) multiple unidirectional sequential (i→i + 1) (15)N and (13)C correlations and (b) facile identification of certain specific triplet sequences which serve as check points for mapping the stretches of sequentially connected HSQC cross peaks on to the primary sequence for assigning the resonances sequence specifically. On top of all this, the F₂-F₃ planes of the spectrum corresponding to sum (CA + CO) and difference (CA - CO) chemical shifts enable rapid and unambiguous identification of sequential HSQC peaks through matching their coordinates in these two planes (see the text). Overall, the experiment presented here will serve as an important backbone assignment tool for variety of structural and functional proteomics and drug discovery research programs by NMR involving well behaved small folded proteins (MW

  16. GroEL assisted folding of large polypeptide substrates in Escherichia coli: Present scenario and assignments for the future.

    PubMed

    Chaudhuri, Tapan K; Verma, Vikash K; Maheshwari, Aditi

    2009-01-01

    Escherichia coli chaperonins GroEL and GroES are indispensable for survival and growth of the cell since they provide essential assistance to the folding of many newly translated proteins in the cell. Recent studies indicate that a substantial portion of the proteins involved in the host pathways are completely dependent on GroEL-GroES for their folding and hence providing some explanation for why GroEL is essential for cell growth. Many proteins either small-single domain or large multidomains require assistance from GroEL-ES during their lifetime. Proteins of size up to approximately 70kDa can fold via the cis mechanism during GroEL-ES assisted pathway, but other proteins (>70kDa) that cannot be pushed inside the cavity of GroEL-ATP complex upon binding of GroES fold by an evolved mechanism called trans. In recent years, much work has been done on revealing facts about the cis mechanism involving the GroEL assisted folding of small proteins whereas the trans mechanism with larger polypeptide substrates still remains under cover. In order to disentangle the role of chaperonin GroEL-GroES in the folding of large E. coli proteins, this review discusses a number of issues like the range of large polypeptide substrates acted on by GroEL. Do all these substrates need the complete chaperonin system along with ATP for their folding? Does GroEL act as foldase or holdase during the process? We conclude with a discussion of the various queries that need to be resolved in the future for an extensive understanding of the mechanism of GroEL mediated folding of large substrate proteins in E. coli cytosol. PMID:19027782

  17. A Monte Carlo/Simulated Annealing Algorithm for Sequential Resonance Assignment in Solid State NMR of Uniformly Labeled Proteins with Magic-Angle Spinning

    PubMed Central

    Tycko, Robert; Hu, Kan-Nian

    2010-01-01

    We describe a computational approach to sequential resonance assignment in solid state NMR studies of uniformly 15N,13C-labeled proteins with magic-angle spinning. As input, the algorithm uses only the protein sequence and lists of 15N/13Cα crosspeaks from 2D NCACX and NCOCX spectra that include possible residue-type assignments of each crosspeak. Assignment of crosspeaks to specific residues is carried out by a Monte Carlo/simulated annealing algorithm, implemented in the program MC_ASSIGN1. The algorithm tolerates substantial ambiguity in residue-type assignments and coexistence of visible and invisible segments in the protein sequence. We use MC_ASSIGN1 and our own 2D spectra to replicate and extend the sequential assignments for uniformly labeled HET-s(218-289) fibrils previously determined manually by Siemer et al. (J. Biomolec. NMR, vol. 34, pp. 75-87, 2006) from a more extensive set of 2D and 3D spectra. Accurate assignments by MC_ASSIGN1 do not require data that are of exceptionally high quality. Use of MC_ASSIGN1 (and its extensions to other types of 2D and 3D data) is likely to alleviate many of the difficulties and uncertainties associated with manual resonance assignments in solid state NMR studies of uniformly labeled proteins, where spectral resolution and signal-to-noise are often sub-optimal. PMID:20547467

  18. Fast structure-based assignment of 15N HSQC spectra of selectively 15N-labeled paramagnetic proteins.

    PubMed

    Pintacuda, Guido; Keniry, Max A; Huber, Thomas; Park, Ah Young; Dixon, Nicholas E; Otting, Gottfried

    2004-03-10

    A novel strategy for fast NMR resonance assignment of (15)N HSQC spectra of proteins is presented. It requires the structure coordinates of the protein, a paramagnetic center, and one or more residue-selectively (15)N-labeled samples. Comparison of sensitive undecoupled (15)N HSQC spectra recorded of paramagnetic and diamagnetic samples yields data for every cross-peak on pseudocontact shift, paramagnetic relaxation enhancement, cross-correlation between Curie-spin and dipole-dipole relaxation, and residual dipolar coupling. Comparison of these four different paramagnetic quantities with predictions from the three-dimensional structure simultaneously yields the resonance assignment and the anisotropy of the susceptibility tensor of the paramagnetic center. The method is demonstrated with the 30 kDa complex between the N-terminal domain of the epsilon subunit and the theta subunit of Escherichia coli DNA polymerase III. The program PLATYPUS was developed to perform the assignment, provide a measure of reliability of the assignment, and determine the susceptibility tensor anisotropy. PMID:14995214

  19. METHODOLOGY AND CALCULATIONS FOR THE ASSIGNMENT OF WASTE GROUPS FOR THE LARGE UNDERGROUND WASTE STORAGE TANKS AT THE HANFORD SITE

    SciTech Connect

    WEBER RA

    2009-01-16

    The Hanford Site contains 177 large underground radioactive waste storage tanks (28 double-shell tanks and 149 single-shell tanks). These tanks are categorized into one of three waste groups (A, B, and C) based on their waste and tank characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement gas release event. Assignments of waste groups to the 177 double-shell tanks and single-shell tanks, as reported in this document, are based on a Monte Carlo analysis of three criteria. The first criterion is the headspace flammable gas concentration following release of retained gas. This criterion determines whether the tank contains sufficient retained gas such that the well-mixed headspace flammable gas concentration would reach 100% of the lower flammability limit if the entire tank's retained gas were released. If the volume of retained gas is not sufficient to reach 100% of the lower flammability limit, then flammable conditions cannot be reached and the tank is classified as a waste group C tank independent of the method the gas is released. The second criterion is the energy ratio and considers whether there is sufficient supernatant on top of the saturated solids such that gas-bearing solids have the potential energy required to break up the material and release gas. Tanks that are not waste group C tanks and that have an energy ratio < 3.0 do not have sufficient potential energy to break up material and release gas and are assigned to waste group B. These tanks are considered to represent a potential induced flammable gas release hazard, but no spontaneous buoyant displacement flammable gas release hazard. Tanks that are not waste group C tanks and have an energy ratio {ge} 3.0, but that pass the third criterion (buoyancy ratio < 1.0, see below) are also assigned to waste group B. Even though the designation as a waste

  20. METHODOLOGY AND CALCULATIONS FOR THE ASSIGNMENT OF WASTE GROUPS FOR THE LARGE UNDERGROUND WASTE STORAGE TANKS AT THE HANFORD SITE

    SciTech Connect

    FOWLER KD

    2007-12-27

    This document categorizes each of the large waste storage tanks into one of several categories based on each tank's waste characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement event. Revision 7 is the annual update of the calculations of the flammable gas Waste Groups for DSTs and SSTs. The Hanford Site contains 177 large underground radioactive waste storage tanks (28 double-shell tanks and 149 single-shell tanks). These tanks are categorized into one of three waste groups (A, B, and C) based on their waste and tank characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement gas release event. Assignments of waste groups to the 177 double-shell tanks and single-shell tanks, as reported in this document, are based on a Monte Carlo analysis of three criteria. The first criterion is the headspace flammable gas concentration following release of retained gas. This criterion determines whether the tank contains sufficient retained gas such that the well-mixed headspace flammable gas concentration would reach 100% of the lower flammability limit if the entire tank's retained gas were released. If the volume of retained gas is not sufficient to reach 100% of the lower flammability limit, then flammable conditions cannot be reached and the tank is classified as a waste group C tank independent of the method the gas is released. The second criterion is the energy ratio and considers whether there is sufficient supernatant on top of the saturated solids such that gas-bearing solids have the potential energy required to break up the material and release gas. Tanks that are not waste group C tanks and that have an energy ratio < 3.0 do not have sufficient potential energy to break up

  1. Secondary structure and (1)H, (13)C, (15)N resonance assignments of the endosomal sorting protein sorting nexin 3.

    PubMed

    Overduin, Michael; Rajesh, Sandya; Gruenberg, Jean; Lenoir, Marc

    2015-10-01

    Sorting nexin 3 (SNX3) belongs to a sub-family of sorting nexins that primarily contain a single Phox homology domain capable of binding phosphoinositides and membranes. We report the complete (1)H, (13)C and (15)N resonance assignments of the full-length human SNX3 protein and identification of its secondary structure elements, revealing a canonical fold and unstructured termini. PMID:25893673

  2. A reduced dimensionality NMR pulse sequence and an efficient protocol for unambiguous assignment in intrinsically disordered proteins.

    PubMed

    Reddy, Jithender G; Hosur, Ramakrishna V

    2014-07-01

    Resonance assignment in intrinsically disordered proteins poses a great challenge because of poor chemical shift dispersion in most of the nuclei that are commonly monitored. Reduced dimensionality (RD) experiments where more than one nuclei are co-evolved simultaneously along one of the time axes of a multi-dimensional NMR experiment help to resolve this problem partially, and one can conceive of different combinations of nuclei for co-evolution depending upon the magnetization transfer pathways and the desired information content in the spectrum. Here, we present a RD experiment, (4,3)D-hNCOCAnH, which uses a combination of CO and CA chemical shifts along one of the axes of the 3-dimensional spectrum, to improve spectral dispersion on one hand, and provide information on four backbone atoms of every residue-HN, N, CA and CO chemical shifts-from a single experiment, on the other. The experiment provides multiple unidirectional sequential (i → i - 1) amide (1)H correlations along different planes of the spectrum enabling easy assignment of most nuclei along the protein backbone. Occasional ambiguities that may arise due to degeneracy of amide proton chemical shifts are proposed to be resolved using the HNN experiment described previously (Panchal et al. in J Biomol NMR 20:135-147, 2001). Applications of the experiment and the assignment protocol have been demonstrated using intrinsically disordered α-synuclein (140 aa) protein. PMID:24854885

  3. Proton nuclear magnetic resonance study of the antihypertensive and antiviral protein BDS-I from the sea anemone Anemonia sulcata: Sequential and stereospecific resonance assignment and secondary structure

    SciTech Connect

    Driscoll, P.C.; Clore, G.M.; Beress, L.; Gronenborn, A.M. )

    1989-03-07

    The sequential resonance assignment of the {sup 1}H NMR spectrum of the antihypertensive and antiviral protein BDS-I from the sea anemone Anemonia sulcata is presented. This is carried out with two-dimensional NMR techniques to identify through-bond and through-space (< 5{angstrom}) connectivities. Added spectral complexity arises from the fact that the sample is an approximately 1:1 mixture of two BDS-I isoproteins, (Leu-18)-BDS-I and (Phe-18)-BDS-I. Complete assignments, however, are obtained, largely due to the increased resolution and sensitivity afforded at 600 MHz. In addition, the stereospecific assignment of a large number of {beta}-methylene protons is achieved from an analysis of the pattern of {sup 3}J{sub {alpha}{beta}} coupling constants and the relative magnitudes of intraresidue NOEs involving the NH, C{sup {alpha}}H, and C{sup {beta}}H protons. Regular secondary structure elements are deduced from a qualitative interpretation of the nuclear Overhauser enhancement, {sup 3}J{sub HN{alpha}} coupling constant, and amide NH exchange data. A triple-stranded antiparallel {beta}-sheet is found to be related to that found in partially homologous sea anemone polypeptide toxins.

  4. NMR assignment of intrinsically disordered self-processing module of the FrpC protein of Neisseria meningitidis.

    PubMed

    Kubáň, Vojtěch; Nováček, Jiří; Bumba, Ladislav; Žídek, Lukáš

    2015-10-01

    The self-processing module (SPM) is an internal segment of the FrpC protein (P415-F591) secreted by the pathogenic Gram-negative bacterium Neisseria meningitidis during meningococcal infection of human upper respiratory tract. SPM mediates 'protein trans-splicing', a unique natural mechanism for editing of proteins, which involves a calcium-dependent autocatalytic cleavage of the peptide bond between D414 and P415 and covalent linkage of the cleaved fragment through its carboxy-terminal group of D414 to [Formula: see text]-amino group of lysine residue within a neighboring polypeptide chain. We present an NMR resonance assignment of the calcium-free SPM, which displays characteristic features of intrinsically disordered proteins. Non-uniformly sampled 5D HN(CA)CONH, 4D HCBCACON, and HCBCANCO spectra were recorded to resolve poorly dispersed resonance frequencies of the disordered protein and 91 % of SPM residues were unambiguously assigned. Analysis of the chemical shifts revealed that two regions of the intrinsically disordered SPM (A95-S101 and R120-I127) have a tendency to form a helical structure, whereas the residues P1-D7 and G36-A40 have the propensity to adopt a [Formula: see text]-structure. PMID:26138689

  5. 1H, 15N and 13C assignment of the amyloidogenic protein medin using fast-pulsing NMR techniques.

    PubMed

    Davies, H A; Phelan, M M; Madine, J

    2016-04-01

    Thirty-one proteins are known to form extracellular fibrillar amyloid in humans. Molecular information about many of these proteins in their monomeric, intermediate or fibrillar form and how they aggregate and interact to form the insoluble fibrils is sparse. This is because amyloid proteins are notoriously difficult to study in their soluble forms, due to their inherent propensity to aggregate. Using recent developments in fast NMR techniques, band-selective excitation short transient and band-selective optimized flip-angle short-transient heteronuclear multiple quantum coherence we have been able to assign a 5 kDa full-length amyloidogenic protein called medin. Medin is the key protein component of the most common form of localised amyloid with a proposed role in aortic aneurysm and dissection. This assignment will now enable the study of the early interactions that could influence initiation and progression of medin aggregation. The chemical shifts have been deposited in the BioMagRes-Bank accession Nos. 25399 and 26576. PMID:26377205

  6. Optimization of amino acid type-specific 13C and 15N labeling for the backbone assignment of membrane proteins by solution- and solid-state NMR with the UPLABEL algorithm.

    PubMed

    Hefke, Frederik; Bagaria, Anurag; Reckel, Sina; Ullrich, Sandra Johanna; Dötsch, Volker; Glaubitz, Clemens; Güntert, Peter

    2011-02-01

    We present a computational method for finding optimal labeling patterns for the backbone assignment of membrane proteins and other large proteins that cannot be assigned by conventional strategies. Following the approach of Kainosho and Tsuji (Biochemistry 21:6273-6279 (1982)), types of amino acids are labeled with (13)C or/and (15)N such that cross peaks between (13)CO(i - 1) and (15)NH(i) result only for pairs of sequentially adjacent amino acids of which the first is labeled with (13)C and the second with (15)N. In this way, unambiguous sequence-specific assignments can be obtained for unique pairs of amino acids that occur exactly once in the sequence of the protein. To be practical, it is crucial to limit the number of differently labeled protein samples that have to be prepared while obtaining an optimal extent of labeled unique amino acid pairs. Our computer algorithm UPLABEL for optimal unique pair labeling, implemented in the program CYANA and in a standalone program, and also available through a web portal, uses combinatorial optimization to find for a given amino acid sequence labeling patterns that maximize the number of unique pair assignments with a minimal number of differently labeled protein samples. Various auxiliary conditions, including labeled amino acid availability and price, previously known partial assignments, and sequence regions of particular interest can be taken into account when determining optimal amino acid type-specific labeling patterns. The method is illustrated for the assignment of the human G-protein coupled receptor bradykinin B2 (B(2)R) and applied as a starting point for the backbone assignment of the membrane protein proteorhodopsin. PMID:21170670

  7. Proteomic Analysis of a Fraction with Intact Eyespots of Chlamydomonas reinhardtii and Assignment of Protein Methylation

    PubMed Central

    Eitzinger, Nicole; Wagner, Volker; Weisheit, Wolfram; Geimer, Stefan; Boness, David; Kreimer, Georg; Mittag, Maria

    2015-01-01

    Flagellate green algae possess a visual system, the eyespot. In Chlamydomonas reinhardtii it is situated at the edge of the chloroplast and consists of two carotenoid rich lipid globule layers subtended by thylakoid membranes (TM) that are attached to both chloroplast envelope membranes and a specialized area of the plasma membrane (PM). A former analysis of an eyespot fraction identified 203 proteins. To increase the understanding of eyespot related processes, knowledge of the protein composition of the membranes in its close vicinity is desirable. Here, we present a purification procedure that allows isolation of intact eyespots. This gain in intactness goes, however, hand in hand with an increase of contaminants from other organelles. Proteomic analysis identified 742 proteins. Novel candidates include proteins for eyespot development, retina-related proteins, ion pumps, and membrane-associated proteins, calcium sensing proteins as well as kinases, phosphatases and 14-3-3 proteins. Methylation of proteins at Arg or Lys is known as an important posttranslational modification involved in, e.g., signal transduction. Here, we identify several proteins from eyespot fractions that are methylated at Arg and/or Lys. Among them is the eyespot specific SOUL3 protein that influences the size and position of the eyespot and EYE2, a protein important for its development. PMID:26697039

  8. (1)H, (15)N and (13)C resonance assignments of translationally-controlled tumor protein from photosynthetic microalga Nannochloropsis oceanica.

    PubMed

    Yao, Xingzhe; Xiao, Yan; Cui, Qiu; Feng, Yingang

    2015-10-01

    Translationally-controlled tumor protein (TCTP) is a eukaryote-conserved protein with crucial roles in cellular growth. It has also been proposed that plant TCTP has functions specific to plant, while no structure of TCTP from photosynthetic organism has been reported. Nannochloropsis is a photosynthetic microalga with high yield of lipid and high-value polyunsaturated fatty acid, which is promising for biodiesel production. Study of growth-related proteins may provide new clue for improving the yield of lipid. TCTP from Nannochloropsis oceanica shares low sequence identity with structure-known TCTPs. Here we reported the NMR resonance assignments of TCTP from N. oceanica for further structural and functional studies. PMID:25680850

  9. Letter to the Editor: H-1, C-13 and N-15 Assignments for the Archaeglobus fulgidis Protein AF2095.

    SciTech Connect

    Powers, Robert; Acton, Thomas; Chiang, Yiwen; Rajan, Paranji K.; Cort, John R.; Kennedy, Michael A.; Liu, Jinfeng; Ma, LiChung; Rost, Burkhard; Montelione, Gaetano

    2004-09-01

    targeted for structural analysis by NESG. AF2095 belongs to the Pfam family PF01981 - UPF0099, protein domain family of unknown function that has been found in yeast, archaebacteria and eubacteria. AF2095 has been assigned to NESG Cluster ID:17431, a set of fourteen protein sequences with high (> {approx}30%) sequence identity (Liu, 2004). This cluster includes proteins of human, Drosophila, Caenorhabditis elegans, Arabidopsis, yeast, archaeal and eubacterial origin. A total of fifty-six proteins are identified when the analysis is expanded to include all available genomes. Therefore, determining the NMR solution structure of AF2095 can be leveraged to infer 3D structural information for at least an additional fifty-five proteins. Here we report the near complete 1H, 15N, 13CO, and 13C NMR assignments and secondary structure of AF2095. These data provide a basis for determining the solution structure of AF2095, for further investigation of the function of this protein and for providing representative structural and functional information for the protein domain family that includes AF2095.

  10. CATH FunFHMMer web server: protein functional annotations using functional family assignments.

    PubMed

    Das, Sayoni; Sillitoe, Ian; Lee, David; Lees, Jonathan G; Dawson, Natalie L; Ward, John; Orengo, Christine A

    2015-07-01

    The widening function annotation gap in protein databases and the increasing number and diversity of the proteins being sequenced presents new challenges to protein function prediction methods. Multidomain proteins complicate the protein sequence-structure-function relationship further as new combinations of domains can expand the functional repertoire, creating new proteins and functions. Here, we present the FunFHMMer web server, which provides Gene Ontology (GO) annotations for query protein sequences based on the functional classification of the domain-based CATH-Gene3D resource. Our server also provides valuable information for the prediction of functional sites. The predictive power of FunFHMMer has been validated on a set of 95 proteins where FunFHMMer performs better than BLAST, Pfam and CDD. Recent validation by an independent international competition ranks FunFHMMer as one of the top function prediction methods in predicting GO annotations for both the Biological Process and Molecular Function Ontology. The FunFHMMer web server is available at http://www.cathdb.info/search/by_funfhmmer. PMID:25964299

  11. CATH FunFHMMer web server: protein functional annotations using functional family assignments

    PubMed Central

    Das, Sayoni; Sillitoe, Ian; Lee, David; Lees, Jonathan G.; Dawson, Natalie L.; Ward, John; Orengo, Christine A.

    2015-01-01

    The widening function annotation gap in protein databases and the increasing number and diversity of the proteins being sequenced presents new challenges to protein function prediction methods. Multidomain proteins complicate the protein sequence–structure–function relationship further as new combinations of domains can expand the functional repertoire, creating new proteins and functions. Here, we present the FunFHMMer web server, which provides Gene Ontology (GO) annotations for query protein sequences based on the functional classification of the domain-based CATH-Gene3D resource. Our server also provides valuable information for the prediction of functional sites. The predictive power of FunFHMMer has been validated on a set of 95 proteins where FunFHMMer performs better than BLAST, Pfam and CDD. Recent validation by an independent international competition ranks FunFHMMer as one of the top function prediction methods in predicting GO annotations for both the Biological Process and Molecular Function Ontology. The FunFHMMer web server is available at http://www.cathdb.info/search/by_funfhmmer. PMID:25964299

  12. Identification and RNA segment assignment of six structural proteins of Scylla serrata reovirus.

    PubMed

    Yuan, Yangyang; Fan, Dongyang; Zhang, Zhao; Yang, Jifang; Liu, Jingwen; Chen, Jigang

    2016-08-01

    Scylla serrata reovirus (SsRV) is one of the most prevalent viral pathogens of the mud crab (S. serrata). The virus represents an unassigned novel genus in the Reoviridae family, and contains 12 double-stranded RNA genomic segments. Previous analysis of virion proteins concluded that SsRV contains at least eight structural proteins, ranging from 25 to 160 kDa. Here, tandem time-of-flight mass spectrometry and Western blotting were used to re-identify the structural proteins. The results indicate that proteins encoded by SsRV segments S1, S3, S6, S9, S11, and S12 are structural proteins. PMID:27023722

  13. NMR resonance assignments of the lantibiotic immunity protein NisI from Lactococcus lactis.

    PubMed

    Hacker, Carolin; Christ, Nina Alexandra; Duchardt-Ferner, Elke; Korn, Sophie; Berninger, Lucija; Kötter, Peter; Entian, Karl-Dieter; Wöhnert, Jens

    2015-10-01

    The lantibiotic nisin is a small antimicrobial peptide which acts against a wide range of Gram-positive bacteria. Nisin-producing Lactococcus lactis strains express four genes for self-protection against their own antimicrobial compound. This immunity system consists of the lipoprotein NisI and the ABC transporter NisFEG. NisI is attached to the outside of the cytoplasmic membrane via a covalently linked diacylglycerol anchor. Both the lipoprotein and the ABC transporter are needed for full immunity but the exact immunity mechanism is still unclear. To gain insights into the highly specific immunity mechanism of nisin producing strains on a structural level we present here the backbone resonance assignment of NisI (25.8 kDa) as well as the virtually complete (1)H,(15)N,(13)C chemical shift assignments for the isolated 12.7 kDa N-terminal and 14.6 kDa C-terminal domains of NisI. PMID:25613223

  14. The Personal Response: A Novel Writing Assignment to Engage First Year Students in Large Human Biology Classes

    ERIC Educational Resources Information Center

    Moni, Roger W.; Moni, Karen B.; Poronnik, Philip

    2007-01-01

    The teaching of highly valued scientific writing skills in the first year of university is challenging. This report describes the design, implementation, and evaluation of a novel written assignment, "The Personal Response" and accompanying Peer Review, in the course, Human Biology (BIOL1015) at The University of Queensland. These assignments were…

  15. ProDomAs, protein domain assignment algorithm using center-based clustering and independent dominating set.

    PubMed

    Ansari, Elnaz Saberi; Eslahchi, Changiz; Pezeshk, Hamid; Sadeghi, Mehdi

    2014-09-01

    Decomposition of structural domains is an essential task in classifying protein structures, predicting protein function, and many other proteomics problems. As the number of known protein structures in PDB grows exponentially, the need for accurate automatic domain decomposition methods becomes more essential. In this article, we introduce a bottom-up algorithm for assigning protein domains using a graph theoretical approach. This algorithm is based on a center-based clustering approach. For constructing initial clusters, members of an independent dominating set for the graph representation of a protein are considered as the centers. A distance matrix is then defined for these clusters. To obtain final domains, these clusters are merged using the compactness principle of domains and a method similar to the neighbor-joining algorithm considering some thresholds. The thresholds are computed using a training set consisting of 50 protein chains. The algorithm is implemented using C++ language and is named ProDomAs. To assess the performance of ProDomAs, its results are compared with seven automatic methods, against five publicly available benchmarks. The results show that ProDomAs outperforms other methods applied on the mentioned benchmarks. The performance of ProDomAs is also evaluated against 6342 chains obtained from ASTRAL SCOP 1.71. ProDomAs is freely available at http://www.bioinf.cs.ipm.ir/software/prodomas. PMID:24596179

  16. METHODOLOGY & CALCULATIONS FOR THE ASSIGNMENT OF WASTE GROUPS FOR THE LARGE UNDERGROUND WASTE STORAGE TANKS AT THE HANFORD SITE

    SciTech Connect

    BARKER, S.A.

    2006-07-27

    Waste stored within tank farm double-shell tanks (DST) and single-shell tanks (SST) generates flammable gas (principally hydrogen) to varying degrees depending on the type, amount, geometry, and condition of the waste. The waste generates hydrogen through the radiolysis of water and organic compounds, thermolytic decomposition of organic compounds, and corrosion of a tank's carbon steel walls. Radiolysis and thermolytic decomposition also generates ammonia. Nonflammable gases, which act as dilutents (such as nitrous oxide), are also produced. Additional flammable gases (e.g., methane) are generated by chemical reactions between various degradation products of organic chemicals present in the tanks. Volatile and semi-volatile organic chemicals in tanks also produce organic vapors. The generated gases in tank waste are either released continuously to the tank headspace or are retained in the waste matrix. Retained gas may be released in a spontaneous or induced gas release event (GRE) that can significantly increase the flammable gas concentration in the tank headspace as described in RPP-7771. The document categorizes each of the large waste storage tanks into one of several categories based on each tank's waste characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement event. Revision 5 is the annual update of the methodology and calculations of the flammable gas Waste Groups for DSTs and SSTs.

  17. METHODOLOGY & CALCULATIONS FOR THE ASSIGNMENT OF WASTE FOR THE LARGE UNDERGROUND WASTE STORAGE TANKS AT THE HANFORD SITE

    SciTech Connect

    TU, T.A.

    2007-01-04

    Waste stored within tank farm double-shell tanks (DST) and single-shell tanks (SST) generates flammable gas (principally hydrogen) to varying degrees depending on the type, amount, geometry, and condition of the waste. The waste generates hydrogen through the radiolysis of water and organic compounds, thermolytic decomposition of organic compounds, and corrosion of a tank's carbon steel walls. Radiolysis and thermolytic decomposition also generates ammonia. Nonflammable gases, which act as dilutents (such as nitrous oxide), are also produced. Additional flammable gases (e.g., methane) are generated by chemical reactions between various degradation products of organic chemicals present in the tanks. Volatile and semi-volatile organic chemicals in tanks also produce organic vapors. The generated gases in tank waste are either released continuously to the tank headspace or are retained in the waste matrix. Retained gas may be released in a spontaneous or induced gas release event (GRE) that can significantly increase the flammable gas concentration in the tank headspace as described in RPP-7771, Flammable Gas Safety Isme Resolution. Appendices A through I provide supporting information. The document categorizes each of the large waste storage tanks into one of several categories based on each tank's waste and characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement event. Revision 6 is the annual update of the flammable gas Waste Groups for DSTs and SSTs.

  18. METHODOLOGY & CALCULATIONS FOR THE ASSIGNMENT OF WASTE FOR THE LARGE UNDERGROUND WASTE STORAGE TANKS AT HANFORD SITE [SEC 1 & 2

    SciTech Connect

    BARKER, S.A.; HEDENGREN, D.C.

    2003-08-28

    Waste stored within tank farm double-shell tanks (DST) and single-shell tanks (SST) generates flammable gas (principally hydrogen) to varying degrees depending on the type, amount, geometry, and condition of the waste. The waste generates hydrogen through the radiolysis of water, thermolytic decomposition of organic compounds, and corrosion of a tank's carbon steel walls. Radiolysis and thermolytic decomposition also generate ammonia. Nonflammable gases, which act as diluents (such as nitrous oxide), are also produced. Additional flammable gases (e.g., methane) are generated by chemical reactions between various degradation products of organic chemicals present in the tanks. Volatile and semivolatile organic chemicals in tanks also produce organic vapors. The generated gases in tank waste are either released continuously to the tank headspace or are retained in the waste matrix. Retained gas may be released in a spontaneous or induced gas release event (GRE) that can significantly increase the flammable gas concentration in tank headspace as described in RPP-7771, Flammable Gas Safety Issue Resolution. Appendices A through L provide supporting information. This document categorizes each of the large waste storage tanks into one of several categories based on each tank's waste and characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement event.

  19. Charge site assignment in native proteins by ultraviolet photodissociation (UVPD) mass spectrometry.

    PubMed

    Morrison, Lindsay J; Brodbelt, Jennifer S

    2016-01-01

    Characterization of all gas-phase charge sites of natively sprayed proteins and peptides is demonstrated using 193 nm UVPD. The high sequence coverage offered by UVPD is exploited for the accurate determination of charge sites in protein systems up to 18 kDa, allowing charge site to be studied as a function of protein conformation and the presence of disulfide bonds. Charging protons are found on both basic sidechains and on the amide backbone of less basic amino acids such as serine, glutamine, and proline. UVPD analysis was performed on the 3+ charge state of melittin, the 5+ to 8+ charge states of ubiquitin, and the 8+ charge state of reduced and oxidized β-lactoglobulin. The location of charges in gas-phase proteins is known to impact structure; molecular modeling of different charge site motifs of 3+ melittin demonstrates how placement of protons in simulations can dramatically impact the predicted structure of the molecule. The location of positive charge sites in ubiquitin and β-lactoglobulin are additionally found to depend on the presence or absence of salt-bridges, columbic repulsion across the length of the peptide, and protein conformation. Charge site isomers are demonstrated for ubiquitin and β-lactoglobulin but found to be much less numerous than previously predicted. PMID:26596460

  20. NMR assignments, secondary structure, and global fold of calerythrin, an EF-hand calcium-binding protein from Saccharopolyspora erythraea.

    PubMed Central

    Aitio, H.; Annila, A.; Heikkinen, S.; Thulin, E.; Drakenberg, T.; Kilpeläinen, I.

    1999-01-01

    Calerythrin is a 20 kDa calcium-binding protein isolated from gram-positive bacterium Saccharopolyspora erythraea. Based on amino acid sequence homology, it has been suggested that calerythrin belongs to the family of invertebrate sarcoplasmic EF-hand calcium-binding proteins (SCPs), and therefore it is expected to function as a calcium buffer. NMR spectroscopy was used to obtain structural information on the protein in solution. Backbone and side chain 1H, 13C, and 15N assignments were obtained from triple resonance experiments HNCACB, HN(CO)CACB, HNCO, CC(CO)NH, and [15N]-edited TOCSY, and HCCH-TOCSY. Secondary structure was determined by using secondary chemical shifts and characteristic NOEs. In addition, backbone N-H residual dipolar couplings were measured from a spin-state selective [1H, 15N] correlation spectrum acquired from a sample dissolved in a dilute liquid crystal. Four EF-hand motifs with characteristic helix-loop-helix patterns were observed. Three of these are typical calcium-binding EF-hands, whereas site 2 is an atypical nonbinding site. The global fold of calerythrin was assessed by dipolar couplings. Measured dipolar couplings were compared with values calculated from four crystal structures of proteins with sequence homology to calerythrin. These data allowed us to recognize an overall similarity between the folds of calerythrin and sarcoplasmic calcium-binding proteins from the sandworm Nereis diversicolor and the amphioxus Branchiostoma lanceolatum. PMID:10631973

  1. Automated assignment of MS/MS cleavable cross-links in protein 3D-structure analysis.

    PubMed

    Götze, Michael; Pettelkau, Jens; Fritzsche, Romy; Ihling, Christian H; Schäfer, Mathias; Sinz, Andrea

    2015-01-01

    CID-MS/MS cleavable cross-linkers hold an enormous potential for an automated analysis of cross-linked products, which is essential for conducting structural proteomics studies. The created characteristic fragment ion patterns can easily be used for an automated assignment and discrimination of cross-linked products. To date, there are only a few software solutions available that make use of these properties, but none allows for an automated analysis of cleavable cross-linked products. The MeroX software fills this gap and presents a powerful tool for protein 3D-structure analysis in combination with MS/MS cleavable cross-linkers. We show that MeroX allows an automatic screening of characteristic fragment ions, considering static and variable peptide modifications, and effectively scores different types of cross-links. No manual input is required for a correct assignment of cross-links and false discovery rates are calculated. The self-explanatory graphical user interface of MeroX provides easy access for an automated cross-link search platform that is compatible with commonly used data file formats, enabling analysis of data originating from different instruments. The combination of an MS/MS cleavable cross-linker with a dedicated software tool for data analysis provides an automated workflow for 3D-structure analysis of proteins. MeroX is available at www.StavroX.com . PMID:25261217

  2. Assigning in vivo carbamylation and acetylation in human lens proteins using tandem mass spectrometry and database searching

    NASA Astrophysics Data System (ADS)

    Park, Zee-Yong; Sadygov, Rovshan; Clark, Judy M.; Clark, John I.; Yates, John R., III

    2007-01-01

    In this paper, we show that ion trap mass spectrometers can differentiate acetylation and carbamylation modifications based on database search results for a lens protein sample. These types of modifications are difficult to distinguish on ion trap instruments because of their lower resolution and mass accuracy. The results were corroborated by using accurate mass information derived from MALDI TOF MS analysis of eluted peptides from a duplicate capillary RPLC separation. Tandem mass spectra of lysine carbamylated peptides were further verified by manual assignments of fragment ions and by the presence of characteristic fragment ions of carbamylated peptides. It was also observed that carbamylated peptides show a strong neutral loss of the carbamyl group in collision induced dissociation (CID), a feature that can be prognostic for carbamylation. In a lens tissue sample of a 67-year-old patient, 12 in vivo carbamylation sites were detected on 7 different lens proteins and 4 lysine acetylation sites were detected on 3 different lens proteins. Among the 12 in vivo carbamylation sites, 9 are novel in vivo carbamylation modification sites. Notably, in vivo carbamylation of [gamma]S crystallin, [beta]A4 crystallin, [beta]B1 crystallin, and [beta]B2 crystallin observed in this study have never been reported before.

  3. Automated Assignment of MS/MS Cleavable Cross-Links in Protein 3D-Structure Analysis

    NASA Astrophysics Data System (ADS)

    Götze, Michael; Pettelkau, Jens; Fritzsche, Romy; Ihling, Christian H.; Schäfer, Mathias; Sinz, Andrea

    2015-01-01

    CID-MS/MS cleavable cross-linkers hold an enormous potential for an automated analysis of cross-linked products, which is essential for conducting structural proteomics studies. The created characteristic fragment ion patterns can easily be used for an automated assignment and discrimination of cross-linked products. To date, there are only a few software solutions available that make use of these properties, but none allows for an automated analysis of cleavable cross-linked products. The MeroX software fills this gap and presents a powerful tool for protein 3D-structure analysis in combination with MS/MS cleavable cross-linkers. We show that MeroX allows an automatic screening of characteristic fragment ions, considering static and variable peptide modifications, and effectively scores different types of cross-links. No manual input is required for a correct assignment of cross-links and false discovery rates are calculated. The self-explanatory graphical user interface of MeroX provides easy access for an automated cross-link search platform that is compatible with commonly used data file formats, enabling analysis of data originating from different instruments. The combination of an MS/MS cleavable cross-linker with a dedicated software tool for data analysis provides an automated workflow for 3D-structure analysis of proteins. MeroX is available at www.StavroX.com .

  4. How do you assign persistent identifiers to extracts from large, complex, dynamic data sets that underpin scholarly publications?

    NASA Astrophysics Data System (ADS)

    Wyborn, Lesley; Car, Nicholas; Evans, Benjamin; Klump, Jens

    2016-04-01

    Persistent identifiers in the form of a Digital Object Identifier (DOI) are becoming more mainstream, assigned at both the collection and dataset level. For static datasets, this is a relatively straight-forward matter. However, many new data collections are dynamic, with new data being appended, models and derivative products being revised with new data, or the data itself revised as processing methods are improved. Further, because data collections are becoming accessible as services, researchers can log in and dynamically create user-defined subsets for specific research projects: they also can easily mix and match data from multiple collections, each of which can have a complex history. Inevitably extracts from such dynamic data sets underpin scholarly publications, and this presents new challenges. The National Computational Infrastructure (NCI) has been experiencing and making progress towards addressing these issues. The NCI is large node of the Research Data Services initiative (RDS) of the Australian Government's research infrastructure, which currently makes available over 10 PBytes of priority research collections, ranging from geosciences, geophysics, environment, and climate, through to astronomy, bioinformatics, and social sciences. Data are replicated to, or are produced at, NCI and then processed there to higher-level data products or directly analysed. Individual datasets range from multi-petabyte computational models and large volume raster arrays, down to gigabyte size, ultra-high resolution datasets. To facilitate access, maximise reuse and enable integration across the disciplines, datasets have been organized on a platform called the National Environmental Research Data Interoperability Platform (NERDIP). Combined, the NERDIP data collections form a rich and diverse asset for researchers: their co-location and standardization optimises the value of existing data, and forms a new resource to underpin data-intensive Science. New publication

  5. Proteomics beyond large-scale protein expression analysis.

    PubMed

    Boersema, Paul J; Kahraman, Abdullah; Picotti, Paola

    2015-08-01

    Proteomics is commonly referred to as the application of high-throughput approaches to protein expression analysis. Typical results of proteomics studies are inventories of the protein content of a sample or lists of differentially expressed proteins across multiple conditions. Recently, however, an explosion of novel proteomics workflows has significantly expanded proteomics beyond the analysis of protein expression. Targeted proteomics methods, for example, enable the analysis of the fine dynamics of protein systems, such as a specific pathway or a network of interacting proteins, and the determination of protein complex stoichiometries. Structural proteomics tools allow extraction of restraints for structural modeling and identification of structurally altered proteins on a proteome-wide scale. Other variations of the proteomic workflow can be applied to the large-scale analysis of protein activity, location, degradation and turnover. These exciting developments provide new tools for multi-level 'omics' analysis and for the modeling of biological networks in the context of systems biology studies. PMID:25636126

  6. (4,2)D Projection--reconstruction experiments for protein backbone assignment: application to human carbonic anhydrase II and calbindin D(28K).

    PubMed

    Venters, Ronald A; Coggins, Brian E; Kojetin, Doug; Cavanagh, John; Zhou, Pei

    2005-06-22

    Projection-reconstruction NMR experiments have been shown to significantly reduce the acquisition time required to obtain protein backbone assignment data. To date, this concept has only been applied to smaller (15)N/(13)C-labeled proteins. Here, we show that projection-reconstruction NMR techniques can be extended to larger protonated and perdeuterated proteins. We present a suite of (4,2)D triple-resonance experiments for protein backbone assignment and a Hybrid Backprojection/Lower-Value algorithm for reconstructing data with relatively weak signal-to-noise ratios. In addition, we propose a sampling theorem and discuss its implication on the choice of projection angles. We demonstrate the efficacy of this approach using the 29 kDa protein, human carbonic anhydrase II and the 30 kDa protein, calbindin D(28K). PMID:15954785

  7. Proton-detected scalar coupling based assignment strategies in MAS solid-state NMR spectroscopy applied to perdeuterated proteins

    NASA Astrophysics Data System (ADS)

    Linser, Rasmus; Fink, Uwe; Reif, Bernd

    2008-07-01

    Assignment of proteins in MAS (magic angle spinning) solid-state NMR relies so far on correlations among heteronuclei. This strategy is based on well dispersed resonances in the 15N dimension. In many complex cases like membrane proteins or amyloid fibrils, an additional frequency dimension is desirable in order to spread the amide resonances. We show here that proton detected HNCO, HNCA, and HNCACB type experiments can successfully be implemented in the solid-state. Coherences are sufficiently long lived to allow pulse schemes of a duration greater than 70 ms before incrementation of the first indirect dimension. The achieved resolution is comparable to the resolution obtained in solution-state NMR experiments. We demonstrate the experiments using a triply labeled sample of the SH3 domain of chicken α-spectrin, which was re-crystallized in H 2O/D 2O using a ratio of 1/9. We employ paramagnetic relaxation enhancement (PRE) using EDTA chelated Cu II to enable rapid data acquisition.

  8. A large-scale evaluation of computational protein function prediction

    PubMed Central

    Radivojac, Predrag; Clark, Wyatt T; Ronnen Oron, Tal; Schnoes, Alexandra M; Wittkop, Tobias; Sokolov, Artem; Graim, Kiley; Funk, Christopher; Verspoor, Karin; Ben-Hur, Asa; Pandey, Gaurav; Yunes, Jeffrey M; Talwalkar, Ameet S; Repo, Susanna; Souza, Michael L; Piovesan, Damiano; Casadio, Rita; Wang, Zheng; Cheng, Jianlin; Fang, Hai; Gough, Julian; Koskinen, Patrik; Törönen, Petri; Nokso-Koivisto, Jussi; Holm, Liisa; Cozzetto, Domenico; Buchan, Daniel W A; Bryson, Kevin; Jones, David T; Limaye, Bhakti; Inamdar, Harshal; Datta, Avik; Manjari, Sunitha K; Joshi, Rajendra; Chitale, Meghana; Kihara, Daisuke; Lisewski, Andreas M; Erdin, Serkan; Venner, Eric; Lichtarge, Olivier; Rentzsch, Robert; Yang, Haixuan; Romero, Alfonso E; Bhat, Prajwal; Paccanaro, Alberto; Hamp, Tobias; Kassner, Rebecca; Seemayer, Stefan; Vicedo, Esmeralda; Schaefer, Christian; Achten, Dominik; Auer, Florian; Böhm, Ariane; Braun, Tatjana; Hecht, Maximilian; Heron, Mark; Hönigschmid, Peter; Hopf, Thomas; Kaufmann, Stefanie; Kiening, Michael; Krompass, Denis; Landerer, Cedric; Mahlich, Yannick; Roos, Manfred; Björne, Jari; Salakoski, Tapio; Wong, Andrew; Shatkay, Hagit; Gatzmann, Fanny; Sommer, Ingolf; Wass, Mark N; Sternberg, Michael J E; Škunca, Nives; Supek, Fran; Bošnjak, Matko; Panov, Panče; Džeroski, Sašo; Šmuc, Tomislav; Kourmpetis, Yiannis A I; van Dijk, Aalt D J; ter Braak, Cajo J F; Zhou, Yuanpeng; Gong, Qingtian; Dong, Xinran; Tian, Weidong; Falda, Marco; Fontana, Paolo; Lavezzo, Enrico; Di Camillo, Barbara; Toppo, Stefano; Lan, Liang; Djuric, Nemanja; Guo, Yuhong; Vucetic, Slobodan; Bairoch, Amos; Linial, Michal; Babbitt, Patricia C; Brenner, Steven E; Orengo, Christine; Rost, Burkhard; Mooney, Sean D; Friedberg, Iddo

    2013-01-01

    Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based Critical Assessment of protein Function Annotation (CAFA) experiment. Fifty-four methods representing the state-of-the-art for protein function prediction were evaluated on a target set of 866 proteins from eleven organisms. Two findings stand out: (i) today’s best protein function prediction algorithms significantly outperformed widely-used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is significant need for improvement of currently available tools. PMID:23353650

  9. A large-scale evaluation of computational protein function prediction.

    PubMed

    Radivojac, Predrag; Clark, Wyatt T; Oron, Tal Ronnen; Schnoes, Alexandra M; Wittkop, Tobias; Sokolov, Artem; Graim, Kiley; Funk, Christopher; Verspoor, Karin; Ben-Hur, Asa; Pandey, Gaurav; Yunes, Jeffrey M; Talwalkar, Ameet S; Repo, Susanna; Souza, Michael L; Piovesan, Damiano; Casadio, Rita; Wang, Zheng; Cheng, Jianlin; Fang, Hai; Gough, Julian; Koskinen, Patrik; Törönen, Petri; Nokso-Koivisto, Jussi; Holm, Liisa; Cozzetto, Domenico; Buchan, Daniel W A; Bryson, Kevin; Jones, David T; Limaye, Bhakti; Inamdar, Harshal; Datta, Avik; Manjari, Sunitha K; Joshi, Rajendra; Chitale, Meghana; Kihara, Daisuke; Lisewski, Andreas M; Erdin, Serkan; Venner, Eric; Lichtarge, Olivier; Rentzsch, Robert; Yang, Haixuan; Romero, Alfonso E; Bhat, Prajwal; Paccanaro, Alberto; Hamp, Tobias; Kaßner, Rebecca; Seemayer, Stefan; Vicedo, Esmeralda; Schaefer, Christian; Achten, Dominik; Auer, Florian; Boehm, Ariane; Braun, Tatjana; Hecht, Maximilian; Heron, Mark; Hönigschmid, Peter; Hopf, Thomas A; Kaufmann, Stefanie; Kiening, Michael; Krompass, Denis; Landerer, Cedric; Mahlich, Yannick; Roos, Manfred; Björne, Jari; Salakoski, Tapio; Wong, Andrew; Shatkay, Hagit; Gatzmann, Fanny; Sommer, Ingolf; Wass, Mark N; Sternberg, Michael J E; Škunca, Nives; Supek, Fran; Bošnjak, Matko; Panov, Panče; Džeroski, Sašo; Šmuc, Tomislav; Kourmpetis, Yiannis A I; van Dijk, Aalt D J; ter Braak, Cajo J F; Zhou, Yuanpeng; Gong, Qingtian; Dong, Xinran; Tian, Weidong; Falda, Marco; Fontana, Paolo; Lavezzo, Enrico; Di Camillo, Barbara; Toppo, Stefano; Lan, Liang; Djuric, Nemanja; Guo, Yuhong; Vucetic, Slobodan; Bairoch, Amos; Linial, Michal; Babbitt, Patricia C; Brenner, Steven E; Orengo, Christine; Rost, Burkhard; Mooney, Sean D; Friedberg, Iddo

    2013-03-01

    Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools. PMID:23353650

  10. Improved Peak Detection and Deconvolution of Native Electrospray Mass Spectra from Large Protein Complexes

    NASA Astrophysics Data System (ADS)

    Lu, Jonathan; Trnka, Michael J.; Roh, Soung-Hun; Robinson, Philip J. J.; Shiau, Carrie; Fujimori, Danica Galonic; Chiu, Wah; Burlingame, Alma L.; Guan, Shenheng

    2015-12-01

    Native electrospray-ionization mass spectrometry (native MS) measures biomolecules under conditions that preserve most aspects of protein tertiary and quaternary structure, enabling direct characterization of large intact protein assemblies. However, native spectra derived from these assemblies are often partially obscured by low signal-to-noise as well as broad peak shapes because of residual solvation and adduction after the electrospray process. The wide peak widths together with the fact that sequential charge state series from highly charged ions are closely spaced means that native spectra containing multiple species often suffer from high degrees of peak overlap or else contain highly interleaved charge envelopes. This situation presents a challenge for peak detection, correct charge state and charge envelope assignment, and ultimately extraction of the relevant underlying mass values of the noncovalent assemblages being investigated. In this report, we describe a comprehensive algorithm developed for addressing peak detection, peak overlap, and charge state assignment in native mass spectra, called PeakSeeker. Overlapped peaks are detected by examination of the second derivative of the raw mass spectrum. Charge state distributions of the molecular species are determined by fitting linear combinations of charge envelopes to the overall experimental mass spectrum. This software is capable of deconvoluting heterogeneous, complex, and noisy native mass spectra of large protein assemblies as demonstrated by analysis of (1) synthetic mononucleosomes containing severely overlapping peaks, (2) an RNA polymerase II/α-amanitin complex with many closely interleaved ion signals, and (3) human TriC complex containing high levels of background noise.

  11. ¹H, ¹⁵N, ¹³C assignment and secondary structure determination of two domains of La protein from D. discoideum.

    PubMed

    Apostolidi, Maria; Vourtsis, Dionysios J; Chasapis, Christos T; Stathopoulos, Constantinos; Bentrop, Detlef; Spyroulias, Georgios A

    2014-04-01

    Biosynthesis of RNA polymerase III transcripts requires binding of the La protein at their 3' end. La is an abundant nuclear RNA-binding protein which protects the nascent transcripts from 3' exonuclease degradation. Here, we report the high yield expression and preliminary structural analysis through NMR spectroscopy of two recombinant RNA binding domains (La motif and NRRM) from the La protein of Dictyostelium discoideum. Both recombinant protein constructs were well-folded and allowed for an almost complete sequence-specific assignment of the (15)N and (13)C labeled domains and their secondary structure prediction using PECAN online tool. PMID:23239108

  12. Two- and three-dimensional sup 1 H NMR studies of a wheat phospholipid transfer protein: Sequential resonance assignments and secondary structure

    SciTech Connect

    Simorre, J.P.; Caille, A. ); Marion, D. ); Marion, D. ); Ptak, M. Univ. d'Orleans )

    1991-12-10

    Two- and three-dimensional {sup 1}H NMR experiments have been used to sequentially assign nearly all proton resonances of the 90 residues of wheat phospholipid transfer protein. Only a few side-chain protons were not identified because of degeneracy or overlapping. The identification of spin systems and the sequential assignment were made at the same time by combining the data of the two- and three-dimensional experiments. The classical two-dimensional COSY, HOHAHA, and NOESY experiments benefit from both good resolution and high sensitivity, allowing the detection of long-range dipolar connectivities. The three-dimensional HOHAHA-NOESY experiment offers the advantage of a faster and unambiguous assignment. As a matter of fact, homonuclear three-dimensional NMR spectroscopy prove to be a very efficient method for resonance assignments of protein {sup 1}H NMR spectra which cannot be unraveled by 2D methods. An assignment strategy which overcomes most of the ambiguities has been proposed, in which each individual assignment toward the C-terminal end is supported by another in the opposite direction originating from a completely different part of the spectrum. Location of secondary structures of the phospholipid transfer protein was determined by using the method of analysis introduced here and was confirmed by {sup 3}J{sub {alpha}NH} coupling and NH exchange rates. Except for the C-terminal part, the polypeptide chain appears to be organized mainly as helical fragments connected by disulfide bridges. Further modeling will display the overall folding of the protein and should provide a better understanding of its interactions with lipids.

  13. 4D non-uniformly sampled HCBCACON and ¹J(NCα)-selective HCBCANCO experiments for the sequential assignment and chemical shift analysis of intrinsically disordered proteins.

    PubMed

    Nováček, Jiří; Haba, Noam Y; Chill, Jordan H; Zídek, Lukáš; Sklenář, Vladimír

    2012-06-01

    A pair of 4D NMR experiments for the backbone assignment of disordered proteins is presented. The experiments exploit (13)C direct detection and non-uniform sampling of the indirectly detected dimensions, and provide correlations of the aliphatic proton (H(α), and H(β)) and carbon (C(α), C(β)) resonance frequencies to the protein backbone. Thus, all the chemical shifts regularly used to map the transient secondary structure motifs in the intrinsically disordered proteins (H(α), C(α), C(β), C', and N) can be extracted from each spectrum. Compared to the commonly used assignment strategy based on matching the C(α) and C(β) chemical shifts, inclusion of the H(α) and H(β) provides up to three extra resonance frequencies that decrease the chance of ambiguous assignment. The experiments were successfully applied to the original assignment of a 12.8 kDa intrinsically disordered protein having a high content of proline residues (26 %) in the sequence. PMID:22580891

  14. Crystal structures of fusion proteins with large-affinity tags.

    PubMed

    Smyth, Douglas R; Mrozkiewicz, Marek K; McGrath, William J; Listwan, Pawel; Kobe, Bostjan

    2003-07-01

    The fusion of a protein of interest to a large-affinity tag, such as the maltose-binding protein (MBP), thioredoxin (TRX), or glutathione-S-transferase (GST), can be advantageous in terms of increased expression, enhanced solubility, protection from proteolysis, improved folding, and protein purification via affinity chromatography. Unfortunately, crystal growth is hindered by the conformational heterogeneity induced by the fusion tag, requiring that the tag is removed by a potentially problematic cleavage step. The first three crystal structures of fusion proteins with large-affinity tags have been reported recently. All three structures used a novel strategy to rigidly fuse the protein of interest to MBP via a short three- to five-amino acid spacer. This strategy has the potential to aid structure determination of proteins that present particular experimental challenges and are not conducive to more conventional crystallization strategies (e.g., membrane proteins). Structural genomics initiatives may also benefit from this approach as a way to crystallize problematic proteins of significant interest. PMID:12824478

  15. Photoswitchable red fluorescent protein with a large Stokes shift

    PubMed Central

    Piatkevich, Kiryl D.; English, Brian P.; Malashkevich, Vladimir N.; Xiao, Hui; Almo, Steven C.; Singer, Robert H.; Verkhusha, Vladislav V.

    2014-01-01

    SUMMARY Subclass of fluorescent proteins, large Stokes shift fluorescent proteins, is characterized by their increased spread between the excitation and emission maxima. Here we report a photoswitchable variant of a red fluorescent protein with a large Stokes shift, PSLSSmKate, which initially exhibits excitation/emission at 445/622 nm, but irradiation with violet light photoswitches PSLSSmKate into a common red form with excitation/emission at 573/621 nm. We characterize spectral, photophysical and biochemical properties of PSLSSmKate in vitro and in mammalian cells, and determine its crystal structure in the large Stokes shift form. Mass-spectrometry, mutagenesis and spectroscopic analysis of PSLSSmKate allow us to propose molecular mechanisms for the large Stokes shift, pH dependence and light-induced chromophore transformation. We demonstrate applicability of PSLSSmKate to superresolution PALM microscopy and protein dynamics in live cells. Given its promising properties, we expect that PSLSSmKate-like phenotype will be further used for photoactivatable imaging and tracking multiple populations of intracellular objects. PMID:25242289

  16. Backbone and Ile-δ1, Leu, Val Methyl 1H, 13C and 15N NMR chemical shift assignments for human interferon-stimulated gene 15 protein

    SciTech Connect

    Yin, Cuifeng; Aramini, James M.; Ma, LiChung; Cort, John R.; Swapna, G.V.T.; Krug, R. M.; Montelione, Gaetano

    2011-10-01

    Human interferon-stimulated gene 15 protein (ISG15), also called ubiquitin cross-reactive protein (UCRP), is the first identified ubiquitin-like protein containing two ubiquitin-like domains fused in tandem. The active form of ISG15 is conjugated to target proteins via the C-terminal glycine residue through an isopeptide bond in a manner similar to ubiquitin. The biological role of ISG15 is strongly associated with the modulation of cell immune function, and there is mounting evidence suggesting that many viral pathogens evade the host innate immune response by interfering with ISG15 conjugation to both host and viral proteins in a variety of ways. Here we report nearly complete backbone 1HN, 15N, 13CO, and 13Ca, as well as side chain 13Cb, methyl (Ile-d1, Leu, Val), amide (Asn, Gln), and indole NH (Trp) NMR resonance assignments for the 157-residue human ISG15 protein. These resonance assignments provide the basis for future structural and functional solution NMR studies of the biologically important human ISG15 protein.

  17. Direct mass spectrometric analysis of intact proteins of the yeast large ribosomal subunit using capillary LC/FTICR

    PubMed Central

    Lee, Sang-Won; Berger, Scott J.; Martinović, Suzana; Paša-Tolić, Ljiljana; Anderson, Gordon A.; Shen, Yufeng; Zhao, Rui; Smith, Richard D.

    2002-01-01

    Electrospray ionization Fourier transform ion cyclotron resonance mass spectrometry coupled with capillary reverse-phase liquid chromatography was used to characterize intact proteins from the large subunit of the yeast ribosome. High mass measurement accuracy, achieved by “mass locking” with an internal standard from a dual electrospray ionization source, allowed identification of ribosomal proteins. Analyses of the intact proteins revealed information on cotranslational and posttranslational modifications of the ribosomal proteins that included loss of the initiating methionine, acetylation, methylation, and proteolytic maturation. High-resolution separations permitted differentiation of protein isoforms having high structural similarity as well as proteins from their modified forms, facilitating unequivocal assignments. The study identified 42 of the 43 core large ribosomal subunit proteins and 58 (of 64 possible) core large subunit protein isoforms having unique masses in a single analysis. These results demonstrate the basis for the high-throughput analyses of complex mixtures of intact proteins, which we believe will be an important complement to other approaches for defining protein modifications and their changes resulting from physiological processes or environmental perturbations. PMID:11983894

  18. Method for Rapid Protein Identification in a Large Database

    PubMed Central

    Zhang, Wenli; Zhao, Xiaofang

    2013-01-01

    Protein identification is an integral part of proteomics research. The available tools to identify proteins in tandem mass spectrometry experiments are not optimized to face current challenges in terms of identification scale and speed owing to the exponential growth of the protein database and the accelerated generation of mass spectrometry data, as well as the demand for nonspecific digestion and post-modifications in complex-sample identification. As a result, a rapid method is required to mitigate such complexity and computation challenges. This paper thus aims to present an open method to prevent enzyme and modification specificity on a large database. This paper designed and developed a distributed program to facilitate application to computer resources. With this optimization, nearly linear speedup and real-time support are achieved on a large database with nonspecific digestion, thus enabling testing with two classical large protein databases in a 20-blade cluster. This work aids in the discovery of more significant biological results, such as modification sites, and enables the identification of more complex samples, such as metaproteomics samples. PMID:24000323

  19. Assignment of the gene for the core protein II (UQCRC2) subunit of the mitochondrial cytochrome bc[sub 1] complex to human chromosome 16p12

    SciTech Connect

    Duncan, A.M.V. Kingston General Hospital ); Ozawa, Takayuki; Suzuki, Hiroshi ); Rozen, R. Montreal Children's Hospital )

    1993-11-01

    The mammalian cytochrome be[sub 1] complex (complex III) of the mitochondrial respiratory chain catalyzes electron transfer from ubiquinol to cytochrome c. The complex consists of 10-11 subunits: Core proteins I and II, cytochromes b and c[sub 1], the Rieske iron-sulfur protein, the ubiquinone-binding protein, the hinge protein, and 3-4 subunits of low molecular weight. Cytochrome b is encoded by the mitochondrial genome; the other subunits are encoded by nuclear genes. Both the human cytochrome c[sub 1] and the human ubiquinone-binding protein subunits have been assigned to chromosome 8 by somatic cell hybrid mapping. In this study, the authors used in situ hybridization to map core protein II. In situ hybridization to BrdU-synchronized peripheral blood lymphocytes was performed using the method of Harper and Saunders. Chromosomes were stained with a modified fluorescence, 0.25% Wright's stain procedure. The positions of silver grains directly over or touching well-banded metaphase chromosomes were mapped to an ISCN idiogram. The analysis of the distribution of 200 silver grams following in situ hybridization revealed a significant clustering of grains in the p12 region of chromosome 16. The assignment of the core II subunit to human chromosome 16p12 confirms that it is encoded by the nuclear, rather than the mitochondrial, genome. The identification of a single strong hybridization signal is indicative of one locus with no pseudogenes. 6 refs., 1 fig.

  20. Insights into Hox Protein Function from a Large Scale Combinatorial Analysis of Protein Domains

    PubMed Central

    Karlsson, Daniel; Dixit, Richa; Saadaoui, Mehdi; Monier, Bruno; Brun, Christine; Thor, Stefan; Vijayraghavan, K.; Perrin, Laurent; Pradel, Jacques; Graba, Yacine

    2011-01-01

    Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA), we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences. PMID:22046139

  1. Insights into Hox protein function from a large scale combinatorial analysis of protein domains.

    PubMed

    Merabet, Samir; Litim-Mecheri, Isma; Karlsson, Daniel; Dixit, Richa; Saadaoui, Mehdi; Monier, Bruno; Brun, Christine; Thor, Stefan; Vijayraghavan, K; Perrin, Laurent; Pradel, Jacques; Graba, Yacine

    2011-10-01

    Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA), we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences. PMID:22046139

  2. (1)H, (15)N and (13)C chemical shift assignment of the Gram-positive conjugative transfer protein TraHpIP501.

    PubMed

    Fercher, Christian; Keller, Walter; Zangger, Klaus; Helge Meyer, N

    2016-04-01

    Conjugative transfer of DNA represents the most important transmission pathway in terms of antibiotic resistance and virulence gene dissemination among bacteria. TraH is a putative transfer protein of the type IV secretion system (T4SS) encoded by the Gram-positive (G+) conjugative plasmid pIP501. This molecular machine involves a multi-protein core complex spanning the bacterial envelope thereby serving as a macromolecular secretion channel. Here, we report the near complete (1)H, (13)C and (15)N resonance assignment of a soluble TraH variant comprising the C-terminal domain. PMID:26559076

  3. A structural dissection of large protein-protein crystal packing contacts

    PubMed Central

    Luo, Jiesi; Liu, Zhongyu; Guo, Yanzhi; Li, Menglong

    2015-01-01

    With the rapid increase in crystal structures of protein-protein complexes deposited in the Protein Data Bank (PDB), more and more crystal contacts have been shown to have similar or even larger interface areas than biological interfaces. However, little attention has been paid to these large crystal packing contacts and their structural principles remain unknown. To address this issue, we used a comparative feature analysis to analyze the geometric and physicochemical properties of large crystal packing contacts by comparing two types of specific protein-protein interactions (PPIs), weak transient complexes and permanent homodimers. Our results show that although large crystal packing contacts have a similar interface area and contact size as permanent homodimers, they tend to be more planar, loosely packed and less hydrophobic than permanent homodimers and cannot form a central core region that is fully buried during interaction. However, the properties of large crystal packing contacts, except for the interface area and contact size, more closely resemble those of weak transient complexes. The large overlap between biological and large crystal packing contacts indicates that interface properties are not efficient indicators for classification of biological interfaces from large crystal packing contacts and finding other specific features urgently needed. PMID:26370141

  4. Analyzing Large Protein Complexes by Structural Mass Spectrometry

    PubMed Central

    Kirshenbaum, Noam; Michaelevski, Izhak; Sharon, Michal

    2010-01-01

    Living cells control and regulate their biological processes through the coordinated action of a large number of proteins that assemble themselves into an array of dynamic, multi-protein complexes1. To gain a mechanistic understanding of the various cellular processes, it is crucial to determine the structure of such protein complexes, and reveal how their structural organization dictates their function. Many aspects of multi-protein complexes are, however, difficult to characterize, due to their heterogeneous nature, asymmetric structure, and dynamics. Therefore, new approaches are required for the study of the tertiary levels of protein organization. One of the emerging structural biology tools for analyzing macromolecular complexes is mass spectrometry (MS)2-5. This method yields information on the complex protein composition, subunit stoichiometry, and structural topology. The power of MS derives from its high sensitivity and, as a consequence, low sample requirement, which enables examination of protein complexes expressed at endogenous levels. Another advantage is the speed of analysis, which allows monitoring of reactions in real time. Moreover, the technique can simultaneously measure the characteristics of separate populations co-existing in a mixture. Here, we describe a detailed protocol for the application of structural MS to the analysis of large protein assemblies. The procedure begins with the preparation of gold-coated capillaries for nanoflow electrospray ionization (nESI). It then continues with sample preparation, emphasizing the buffer conditions which should be compatible with nESI on the one hand, and enable to maintain complexes intact on the other. We then explain, step-by-step, how to optimize the experimental conditions for high mass measurements and acquire MS and tandem MS spectra. Finally, we chart the data processing and analyses that follow. Rather than attempting to characterize every aspect of protein assemblies, this protocol

  5. Mutational effects on stability are largely conserved during protein evolution.

    PubMed

    Ashenberg, Orr; Gong, L Ian; Bloom, Jesse D

    2013-12-24

    Protein stability and folding are the result of cooperative interactions among many residues, yet phylogenetic approaches assume that sites are independent. This discrepancy has engendered concerns about large evolutionary shifts in mutational effects that might confound phylogenetic approaches. Here we experimentally investigate this issue by introducing the same mutations into a set of diverged homologs of the influenza nucleoprotein and measuring the effects on stability. We find that mutational effects on stability are largely conserved across the homologs. We reach qualitatively similar conclusions when we simulate protein evolution with molecular-mechanics force fields. Our results do not mean that proteins evolve without epistasis, which can still arise even when mutational stability effects are conserved. However, our findings indicate that large evolutionary shifts in mutational effects on stability are rare, at least among homologs with similar structures and functions. We suggest that properly describing the clearly observable and highly conserved amino acid preferences at individual sites is likely to be far more important for phylogenetic analyses than accounting for rare shifts in amino acid propensities due to site covariation. PMID:24324165

  6. Predicting protein functions from redundancies in large-scale protein interaction networks

    NASA Technical Reports Server (NTRS)

    Samanta, Manoj Pratim; Liang, Shoudan

    2003-01-01

    Interpreting data from large-scale protein interaction experiments has been a challenging task because of the widespread presence of random false positives. Here, we present a network-based statistical algorithm that overcomes this difficulty and allows us to derive functions of unannotated proteins from large-scale interaction data. Our algorithm uses the insight that if two proteins share significantly larger number of common interaction partners than random, they have close functional associations. Analysis of publicly available data from Saccharomyces cerevisiae reveals >2,800 reliable functional associations, 29% of which involve at least one unannotated protein. By further analyzing these associations, we derive tentative functions for 81 unannotated proteins with high certainty. Our method is not overly sensitive to the false positives present in the data. Even after adding 50% randomly generated interactions to the measured data set, we are able to recover almost all (approximately 89%) of the original associations.

  7. Sizing Large Proteins and Protein Complexes by Electrospray Ionization Mass Spectrometry and Ion Mobility

    PubMed Central

    Kaddis, Catherine S.; Lomeli, Shirley H.; Yin, Sheng; Berhane, Beniam; Apostol, Marcin I.; Kickhoefer, Valerie A.; Rome, Leonard H.; Loo, Joseph A.

    2009-01-01

    Mass spectrometry (MS) and ion mobility with electrospray ionization (ESI) have the capability to measure and detect large noncovalent protein-ligand and protein-protein complexes. Using an ion mobility method termed GEMMA (Gas-Phase Electrophoretic Mobility Molecular Analysis), protein particles representing a range of sizes can be separated by their electrophoretic mobility in air. Highly charged particles produced from a protein complex solution using electrospray can be manipulated to produce singly charged ions which can be separated and quantified by their electrophoretic mobility. Results from ESI-GEMMA analysis from our laboratory and others were compared to other experimental and theoretically determined parameters, such as molecular mass and cryoelectron microscopy and x-ray crystal structure dimensions. There is a strong correlation between the electrophoretic mobility diameter determined from GEMMA analysis and the molecular mass for protein complexes up to 12 MDa, including the 93 kDa enolase dimer, the 480 kDa ferritin 24-mer complex, the 4.6 MDa cowpea chlorotic mottle virus (CCMV), and the 9 MDa MVP-vault assembly. ESI-GEMMA is used to differentiate a number of similarly sized vault complexes that are composed of different N-terminal protein tags on the MVP subunit. The average effective density of the proteins and protein complexes studied was 0.6 g/cm3. Moreover, there is evidence that proteins and protein complexes collapse or become more compact in the gas phase in the absence of water. PMID:17434746

  8. Proton-detected MAS NMR experiments based on dipolar transfers for backbone assignment of highly deuterated proteins

    NASA Astrophysics Data System (ADS)

    Chevelkov, Veniamin; Habenstein, Birgit; Loquet, Antoine; Giller, Karin; Becker, Stefan; Lange, Adam

    2014-05-01

    Proton-detected solid-state NMR was applied to a highly deuterated insoluble, non-crystalline biological assembly, the Salmonella typhimurium type iii secretion system (T3SS) needle. Spectra of very high resolution and sensitivity were obtained at a low protonation level of 10-20% at exchangeable amide positions. We developed efficient experimental protocols for resonance assignment tailored for this system and the employed experimental conditions. Using exclusively dipolar-based interspin magnetization transfers, we recorded two sets of 3D spectra allowing for an almost complete backbone resonance assignment of the needle subunit PrgI. The additional information provided by the well-resolved proton dimension revealed the presence of two sets of resonances in the N-terminal helix of PrgI, while in previous studies employing 13C detection only a single set of resonances was observed.

  9. Reduced dimensionality tailored HN(C)N experiments for facile backbone resonance assignment of proteins through unambiguous identification of sequential HSQC peaks

    NASA Astrophysics Data System (ADS)

    Kumar, Dinesh

    2013-12-01

    Two novel reduced dimensionality (RD) tailored HN(C)N [S.C. Panchal, N.S. Bhavesh, R.V. Hosur, Improved 3D triple resonance experiments, HNN and HN(C)N, for HN and 15N sequential correlations in (13C, 15N) labeled proteins: application to unfolded proteins, J. Biomol. NMR 20 (2001) 135-147] experiments are proposed to facilitate the backbone resonance assignment of proteins both in terms of its accuracy and speed. These experiments - referred here as (4,3)D-hNCOcaNH and (4,3)D-hNcoCANH - exploit the linear combination of backbone 15N and 13C‧/13Cα chemical shifts simultaneously to achieve higher peak dispersion and randomness along their respective F1 dimensions. Simply, this has been achieved by modulating the backbone 15N(i) chemical shifts with that of 13C‧ (i - 1)/13Cα (i - 1) spins following the established reduced dimensionality NMR approach [T. Szyperski, D.C. Yeh, D.K. Sukumaran, H.N. Moseley, G.T. Montelione, Reduced-dimensionality NMR spectroscopy for high-throughput protein resonance assignment, Proc. Natl. Acad. Sci. USA 99 (2002) 8009-8014]. Though the modification is simple it has resulted an ingenious improvement of HN(C)N both in terms of peak dispersion and easiness of establishing the sequential connectivities. The increased dispersion along F1 dimension solves two purposes here: (i) resolves the ambiguities arising because of degenerate 15N chemical shifts and (ii) reduces the signal overlap in F2(15N)-F3(1H) planes (an important requisite in HN(C)N based assignment protocol for facile and unambiguous identification of sequentially connected HSQC peaks). The performance of both these experiments and the assignment protocol has been demonstrated using bovine apo Calbindin-d9k (75 aa) and urea denatured UNC60B (a 152 amino acid ADF/cofilin family protein of Caenorhabditis elegans), as representatives of folded and unfolded protein systems, respectively.

  10. Reduced dimensionality tailored HN(C)N experiments for facile backbone resonance assignment of proteins through unambiguous identification of sequential HSQC peaks.

    PubMed

    Kumar, Dinesh

    2013-12-01

    Two novel reduced dimensionality (RD) tailored HN(C)N [S.C. Panchal, N.S. Bhavesh, R.V. Hosur, Improved 3D triple resonance experiments, HNN and HN(C)N, for HN and 15N sequential correlations in (13C, 15N) labeled proteins: application to unfolded proteins, J. Biomol. NMR 20 (2001) 135-147] experiments are proposed to facilitate the backbone resonance assignment of proteins both in terms of its accuracy and speed. These experiments - referred here as (4,3)D-hNCOcaNH and (4,3)D-hNcoCANH - exploit the linear combination of backbone (15)N and (13)C'/(13)C(α) chemical shifts simultaneously to achieve higher peak dispersion and randomness along their respective F1 dimensions. Simply, this has been achieved by modulating the backbone (15)N(i) chemical shifts with that of (13)C' (i-1)/(13)C(α) (i-1) spins following the established reduced dimensionality NMR approach [T. Szyperski, D.C. Yeh, D.K. Sukumaran, H.N. Moseley, G.T. Montelione, Reduced-dimensionality NMR spectroscopy for high-throughput protein resonance assignment, Proc. Natl. Acad. Sci. USA 99 (2002) 8009-8014]. Though the modification is simple it has resulted an ingenious improvement of HN(C)N both in terms of peak dispersion and easiness of establishing the sequential connectivities. The increased dispersion along F1 dimension solves two purposes here: (i) resolves the ambiguities arising because of degenerate (15)N chemical shifts and (ii) reduces the signal overlap in F2((15)N)-F3((1)H) planes (an important requisite in HN(C)N based assignment protocol for facile and unambiguous identification of sequentially connected HSQC peaks). The performance of both these experiments and the assignment protocol has been demonstrated using bovine apo Calbindin-d9k (75 aa) and urea denatured UNC60B (a 152 amino acid ADF/cofilin family protein of Caenorhabditis elegans), as representatives of folded and unfolded protein systems, respectively. PMID:24161682

  11. Discovery of Manassantin A Protein Targets Using Large-Scale Protein Folding and Stability Measurements.

    PubMed

    Geer Wallace, M Ariel; Kwon, Do-Yeon; Weitzel, Douglas H; Lee, Chen-Ting; Stephenson, Tesia N; Chi, Jen-Tsan; Mook, Robert A; Dewhirst, Mark W; Hong, Jiyong; Fitzgerald, Michael C

    2016-08-01

    Manassantin A is a natural product that has been shown to have anticancer activity in cell-based assays, but has a largely unknown mode-of-action. Described here is the use of two different energetics-based approaches to identify protein targets of manassantin A. Using the stability of proteins from rates of oxidation technique with an isobaric mass tagging strategy (iTRAQ-SPROX) and the pulse proteolysis technique with a stable isotope labeling with amino acids in cell culture strategy (SILAC-PP), over 1000 proteins in a MDA-MB-231 cell lysate grown under hypoxic conditions were assayed for manassantin A interactions (both direct and indirect). A total of 28 protein hits were identified with manassantin A-induced thermodynamic stability changes. Two of the protein hits (filamin A and elongation factor 1α) were identified using both experimental approaches. The remaining 26 hit proteins were only assayed in either the iTRAQ-SPROX or the SILAC-PP experiment. The 28 potential protein targets of manassantin A identified here provide new experimental avenues along which to explore the molecular basis of manassantin A's mode of action. The current work also represents the first application iTRAQ-SPROX and SILAC-PP to the large-scale analysis of protein-ligand binding interactions involving a potential anticancer drug with an unknown mode-of-action. PMID:27322910

  12. An automated system designed for large scale NMR data deposition and annotation: application to over 600 assigned chemical shift data entries to the BioMagResBank from the Riken Structural Genomics/Proteomics Initiative internal database.

    PubMed

    Kobayashi, Naohiro; Harano, Yoko; Tochio, Naoya; Nakatani, Eiichi; Kigawa, Takanori; Yokoyama, Shigeyuki; Mading, Steve; Ulrich, Eldon L; Markley, John L; Akutsu, Hideo; Fujiwara, Toshimichi

    2012-08-01

    Biomolecular NMR chemical shift data are key information for the functional analysis of biomolecules and the development of new techniques for NMR studies utilizing chemical shift statistical information. Structural genomics projects are major contributors to the accumulation of protein chemical shift information. The management of the large quantities of NMR data generated by each project in a local database and the transfer of the data to the public databases are still formidable tasks because of the complicated nature of NMR data. Here we report an automated and efficient system developed for the deposition and annotation of a large number of data sets including (1)H, (13)C and (15)N resonance assignments used for the structure determination of proteins. We have demonstrated the feasibility of our system by applying it to over 600 entries from the internal database generated by the RIKEN Structural Genomics/Proteomics Initiative (RSGI) to the public database, BioMagResBank (BMRB). We have assessed the quality of the deposited chemical shifts by comparing them with those predicted from the PDB coordinate entry for the corresponding protein. The same comparison for other matched BMRB/PDB entries deposited from 2001-2011 has been carried out and the results suggest that the RSGI entries greatly improved the quality of the BMRB database. Since the entries include chemical shifts acquired under strikingly similar experimental conditions, these NMR data can be expected to be a promising resource to improve current technologies as well as to develop new NMR methods for protein studies. PMID:22689068

  13. Detecting differential protein expression in large-scale population proteomics

    SciTech Connect

    Ryu, Soyoung; Qian, Weijun; Camp, David G.; Smith, Richard D.; Tompkins, Ronald G.; Davis, Ronald W.; Xiao, Wenzhong

    2014-06-17

    Mass spectrometry-based high-throughput quantitative proteomics shows great potential in clinical biomarker studies, identifying and quantifying thousands of proteins in biological samples. However, methods are needed to appropriately handle issues/challenges unique to mass spectrometry data in order to detect as many biomarker proteins as possible. One issue is that different mass spectrometry experiments generate quite different total numbers of quantified peptides, which can result in more missing peptide abundances in an experiment with a smaller total number of quantified peptides. Another issue is that the quantification of peptides is sometimes absent, especially for less abundant peptides and such missing values contain the information about the peptide abundance. Here, we propose a Significance Analysis for Large-scale Proteomics Studies (SALPS) that handles missing peptide intensity values caused by the two mechanisms mentioned above. Our model has a robust performance in both simulated data and proteomics data from a large clinical study. Because varying patients’ sample qualities and deviating instrument performances are not avoidable for clinical studies performed over the course of several years, we believe that our approach will be useful to analyze large-scale clinical proteomics data.

  14. Large-volume protein crystal growth for neutron macromolecular crystallography.

    PubMed

    Ng, Joseph D; Baird, James K; Coates, Leighton; Garcia-Ruiz, Juan M; Hodge, Teresa A; Huang, Sijay

    2015-04-01

    Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for the growth of crystals to significant dimensions that are now relevant to NMC are revisited. These include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations. PMID:25849493

  15. Large-volume protein crystal growth for neutron macromolecular crystallography

    SciTech Connect

    Ng, Joseph D.; Baird, James K.; Coates, Leighton; Garcia-Ruiz, Juan M.; Hodge, Teresa A.; Huang, Sijay

    2015-03-30

    Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for the growth of crystals to significant dimensions that are now relevant to NMC are revisited. We report that these include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations.

  16. Large-volume protein crystal growth for neutron macromolecular crystallography

    DOE PAGESBeta

    Ng, Joseph D.; Baird, James K.; Coates, Leighton; Garcia-Ruiz, Juan M.; Hodge, Teresa A.; Huang, Sijay

    2015-03-30

    Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for themore » growth of crystals to significant dimensions that are now relevant to NMC are revisited. We report that these include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations.« less

  17. 1H, 13C, and 15N resonance assignments for the protein coded by gene locus BB0938 of Bordetella bronchiseptica

    SciTech Connect

    Rossi, Paolo; Ramelot, Theresa A.; Xiao, Rong; Ho, Chi K.; Ma, LiChung; Acton, Thomas; Kennedy, Michael A.; Montelione, Gaetano

    2005-11-01

    The product of gene locus BB0938 from Bordetella bronchiseptica (Swiss-Prot ID: Q7WNU7-BORBR; NESG target ID: BoR11; Wunderlich et al., 2004; Pfam ID: PF03476) is a 128-residue protein of unknown function. This broadly conserved protein family is found in eubacteria and eukaryotes. Using triple resonance NMR techniques, we have determined 98% of backbone and 94% of side chain 1H, 13C, and 15N resonance assignments. The chemical shift and 3J(HN?Ha) scalar coupling data reveal a b topology with a seven-residue helical insert, ??????????. BMRB deposit with accession number 6693. Reference: Wunderlich et al. (2004) Proteins, 56, 181?187.

  18. 1H, 13C, and 15N resonance assignments for Escherichia coli ytfP, a member of the broadly conserved UPF0131 protein domain family

    SciTech Connect

    Aramini, James M.; Swapna, G.V.T.; Huang, Yuanpeng; Rajan, Paranji K.; Xiao, Rong; Shastry, Ritu; Acton, Thomas; Cort, John R.; Kennedy, Michael A.; Montelione, Gaetano

    2005-11-01

    Protein ytfP from Escherichia coli (Swiss-Prot ID: YTFP-ECOLI; NESG target ID: ER111; Wunderlich et al., 2004) is a 113-residue member of the UPF0131 protein family (Pfam ID: PF03674) of unknown function. This domain family is found in organisms from all three kingdoms, archaea, eubacteria and eukaryotes. Using triple resonance NMR techniques, we have determined 97% of backbone and 91% of side chain 1H, 13C, and 15N resonance assignments. The chemical shift and 3J(HN?Ha) scalar coupling data reveal a mixed a/b topology,????????. BMRB deposit with Accession No. 6448. Reference: Wunderlich et al. (2004) Proteins, 56, 181?187.

  19. Folding of a large protein at high structural resolution.

    PubMed

    Walters, Benjamin T; Mayne, Leland; Hinshaw, James R; Sosnick, Tobin R; Englander, S Walter

    2013-11-19

    Kinetic folding of the large two-domain maltose binding protein (MBP; 370 residues) was studied at high structural resolution by an advanced hydrogen-exchange pulse-labeling mass-spectrometry method (HX MS). Dilution into folding conditions initiates a fast molecular collapse into a polyglobular conformation (<20 ms), determined by various methods including small angle X-ray scattering. The compaction produces a structurally heterogeneous state with widespread low-level HX protection and spectroscopic signals that match the equilibrium melting posttransition-state baseline. In a much slower step (7-s time constant), all of the MBP molecules, although initially heterogeneously structured, form the same distinct helix plus sheet folding intermediate with the same time constant. The intermediate is composed of segments that are distant in the MBP sequence but adjacent in the native protein where they close the longest residue-to-residue contact. Segments that are most HX protected in the early molecular collapse do not contribute to the initial intermediate, whereas the segments that do participate are among the less protected. The 7-s intermediate persists through the rest of the folding process. It contains the sites of three previously reported destabilizing mutations that greatly slow folding. These results indicate that the intermediate is an obligatory step on the MBP folding pathway. MBP then folds to the native state on a longer time scale (~100 s), suggestively in more than one step, the first of which forms structure adjacent to the 7-s intermediate. These results add a large protein to the list of proteins known to fold through distinct native-like intermediates in distinct pathways. PMID:24191053

  20. Folding of a large protein at high structural resolution

    PubMed Central

    Walters, Benjamin T.; Mayne, Leland; Hinshaw, James R.; Sosnick, Tobin R.; Englander, S. Walter

    2013-01-01

    Kinetic folding of the large two-domain maltose binding protein (MBP; 370 residues) was studied at high structural resolution by an advanced hydrogen-exchange pulse-labeling mass-spectrometry method (HX MS). Dilution into folding conditions initiates a fast molecular collapse into a polyglobular conformation (<20 ms), determined by various methods including small angle X-ray scattering. The compaction produces a structurally heterogeneous state with widespread low-level HX protection and spectroscopic signals that match the equilibrium melting posttransition-state baseline. In a much slower step (7-s time constant), all of the MBP molecules, although initially heterogeneously structured, form the same distinct helix plus sheet folding intermediate with the same time constant. The intermediate is composed of segments that are distant in the MBP sequence but adjacent in the native protein where they close the longest residue-to-residue contact. Segments that are most HX protected in the early molecular collapse do not contribute to the initial intermediate, whereas the segments that do participate are among the less protected. The 7-s intermediate persists through the rest of the folding process. It contains the sites of three previously reported destabilizing mutations that greatly slow folding. These results indicate that the intermediate is an obligatory step on the MBP folding pathway. MBP then folds to the native state on a longer time scale (∼100 s), suggestively in more than one step, the first of which forms structure adjacent to the 7-s intermediate. These results add a large protein to the list of proteins known to fold through distinct native-like intermediates in distinct pathways. PMID:24191053

  1. 1H, 15N and 13C resonance assignments of light organ-associated fatty acid-binding protein of Taiwanese fireflies.

    PubMed

    Tseng, Kai-Li; Lee, Yi-Zong; Chen, Yun-Ru; Lyu, Ping-Chiang

    2016-04-01

    Fatty acid-binding proteins (FABPs) are a family of proteins that modulate the transfer of various fatty acids in the cytosol and constitute a significant portion in many energy-consuming cells. The ligand binding properties and specific functions of a particular type of FABP seem to be diverse and depend on the respective binding cavity as well as the cell type from which this protein is derived. Previously, a novel FABP (lcFABP; lc: Luciola cerata) was identified in the light organ of Taiwanese fireflies. The lcFABP was proved to possess fatty acids binding capabilities, especially for fatty acids of length C14-C18. However, the structural details are unknown, and the structure-function relationship has remained to be further investigated. In this study, we finished the (1)H, (15)N and (13)C chemical shift assignments of (15)N/(13)C-enriched lcFABP by solution NMR spectroscopy. In addition, the secondary structure distribution was revealed based on the backbone N, H, Cα, Hα, C and side chain Cβ assignments. These results can provide the basis for further structural exploration of lcFABP. PMID:26373428

  2. Light-fuelled transport of large dendrimers and proteins.

    PubMed

    Koskela, Jenni E; Liljeström, Ville; Lim, Jongdoo; Simanek, Eric E; Ras, Robin H A; Priimagi, Arri; Kostiainen, Mauri A

    2014-05-14

    This work presents a facile water-based supramolecular approach for light-induced surface patterning. The method is based upon azobenzene-functionalized high-molecular weight triazine dendrimers up to generation 9, demonstrating that even very large globular supramolecular complexes can be made to move in response to light. We also demonstrate light-fuelled macroscopic movements in native biomolecules, showing that complexes of apoferritin protein and azobenzene can effectively form light-induced surface patterns. Fundamentally, the results establish that thin films comprising both flexible and rigid globular particles of large diameter can be moved with light, whereas the presented material concepts offer new possibilities for the yet marginally explored biological applications of azobenzene surface patterning. PMID:24785836

  3. A sequential assignment procedure for proteins that have intermediate line widths in MAS NMR spectra: amyloid fibrils of human CA150.WW2.

    PubMed

    Becker, Johanna; Ferguson, Neil; Flinders, Jeremy; van Rossum, Barth-Jan; Fersht, Alan R; Oschkinat, Hartmut

    2008-08-11

    The second WW domain (WW2) of CA150, a human transcriptional activator, forms amyloid fibrils in vitro under physiological conditions. Based on experimental constraints from MAS NMR spectroscopy experiments, alanine scanning and electron microscopy, a structural model of CA150.WW2 amyloid fibrils was calculated earlier. Here, the assignment strategy is presented and suggested as a general approach for proteins that show intermediate line width. The (13)C,(13)C correlation experiments were recorded on fully or partially (13)C-labelled fibrils. The earlier (13)C assignment (26 residues) was extended to 34 of the 40 residues by direct (13)C-excitation experiments by using a deuterated sample that showed strongly improved line width. A 3D HNC-TEDOR (transferred-echo double-resonance) experiment with deuterated CA150.WW2 fibrils yielded 14 amide nitrogen and proton resonance assignments. The obtained chemical shifts were compared with the chemical shifts determined with the natively folded WW domain. TALOS (Torsion angle likelihood obtained from shift and sequence similarity) predictions confirmed that, under physiological conditions, the fibrillar form of CA150.WW2 adopts a significantly different beta structure than the native WW-domain fold. PMID:18642254

  4. Actin-interacting and flagellar proteins in Leishmania spp.: Bioinformatics predictions to functional assignments in phagosome formation

    PubMed Central

    2009-01-01

    Several motile processes are responsible for the movement of proteins into and within the flagellar membrane, but little is known about the process by which specific proteins (either actin-associated or not) are targeted to protozoan flagellar membranes. Actin is a major cytoskeleton protein, while polymerization and depolymerization of parasite actin and actin-interacting proteins (AIPs) during both processes of motility and host cell entry might be key events for successful infection. For a better understanding the eukaryotic flagellar dynamics, we have surveyed genomes, transcriptomes and proteomes of pathogenic Leishmania spp. to identify pertinent genes/proteins and to build in silico models to properly address their putative roles in trypanosomatid virulence. In a search for AIPs involved in flagellar activities, we applied computational biology and proteomic tools to infer from the biological meaning of coronins and Arp2/3, two important elements in phagosome formation after parasite phagocytosis by macrophages. Results presented here provide the first report of Leishmania coronin and Arp2/3 as flagellar proteins that also might be involved in phagosome formation through actin polymerization within the flagellar environment. This is an issue worthy of further in vitro examination that remains now as a direct, positive bioinformatics-derived inference to be presented. PMID:21637533

  5. Actin-interacting and flagellar proteins in Leishmania spp.: Bioinformatics predictions to functional assignments in phagosome formation.

    PubMed

    Diniz, Michely C; Costa, Marcília P; Pacheco, Ana C L; Kamimura, Michel T; Silva, Samara C; Carneiro, Laura D G; Sousa, Ana P L; Soares, Carlos E A; Souza, Celeste S F; de Oliveira, Diana Magalhães

    2009-07-01

    Several motile processes are responsible for the movement of proteins into and within the flagellar membrane, but little is known about the process by which specific proteins (either actin-associated or not) are targeted to protozoan flagellar membranes. Actin is a major cytoskeleton protein, while polymerization and depolymerization of parasite actin and actin-interacting proteins (AIPs) during both processes of motility and host cell entry might be key events for successful infection. For a better understanding the eukaryotic flagellar dynamics, we have surveyed genomes, transcriptomes and proteomes of pathogenic Leishmania spp. to identify pertinent genes/proteins and to build in silico models to properly address their putative roles in trypanosomatid virulence. In a search for AIPs involved in flagellar activities, we applied computational biology and proteomic tools to infer from the biological meaning of coronins and Arp2/3, two important elements in phagosome formation after parasite phagocytosis by macrophages. Results presented here provide the first report of Leishmania coronin and Arp2/3 as flagellar proteins that also might be involved in phagosome formation through actin polymerization within the flagellar environment. This is an issue worthy of further in vitro examination that remains now as a direct, positive bioinformatics-derived inference to be presented. PMID:21637533

  6. APoc: large-scale identification of similar protein pockets

    PubMed Central

    Gao, Mu; Skolnick, Jeffrey

    2013-01-01

    Motivation: Most proteins interact with small-molecule ligands such as metabolites or drug compounds. Over the past several decades, many of these interactions have been captured in high-resolution atomic structures. From a geometric point of view, most interaction sites for grasping these small-molecule ligands, as revealed in these structures, form concave shapes, or ‘pockets’, on the protein’s surface. An efficient method for comparing these pockets could greatly assist the classification of ligand-binding sites, prediction of protein molecular function and design of novel drug compounds. Results: We introduce a computational method, APoc (Alignment of Pockets), for the large-scale, sequence order-independent, structural comparison of protein pockets. A scoring function, the Pocket Similarity Score (PS-score), is derived to measure the level of similarity between pockets. Statistical models are used to estimate the significance of the PS-score based on millions of comparisons of randomly related pockets. APoc is a general robust method that may be applied to pockets identified by various approaches, such as ligand-binding sites as observed in experimental complex structures, or predicted pockets identified by a pocket-detection method. Finally, we curate large benchmark datasets to evaluate the performance of APoc and present interesting examples to demonstrate the usefulness of the method. We also demonstrate that APoc has better performance than the geometric hashing-based method SiteEngine. Availability and implementation: The APoc software package including the source code is freely available at http://cssb.biology.gatech.edu/APoc. Contact: skolnick@gatech.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23335017

  7. A New Secondary Structure Assignment Algorithm Using Cα Backbone Fragments

    PubMed Central

    Cao, Chen; Wang, Guishen; Liu, An; Xu, Shutan; Wang, Lincong; Zou, Shuxue

    2016-01-01

    The assignment of secondary structure elements in proteins is a key step in the analysis of their structures and functions. We have developed an algorithm, SACF (secondary structure assignment based on Cα fragments), for secondary structure element (SSE) assignment based on the alignment of Cα backbone fragments with central poses derived by clustering known SSE fragments. The assignment algorithm consists of three steps: First, the outlier fragments on known SSEs are detected. Next, the remaining fragments are clustered to obtain the central fragments for each cluster. Finally, the central fragments are used as a template to make assignments. Following a large-scale comparison of 11 secondary structure assignment methods, SACF, KAKSI and PROSS are found to have similar agreement with DSSP, while PCASSO agrees with DSSP best. SACF and PCASSO show preference to reducing residues in N and C cap regions, whereas KAKSI, P-SEA and SEGNO tend to add residues to the terminals when DSSP assignment is taken as standard. Moreover, our algorithm is able to assign subtle helices (310-helix, π-helix and left-handed helix) and make uniform assignments, as well as to detect rare SSEs in β-sheets or long helices as outlier fragments from other programs. The structural uniformity should be useful for protein structure classification and prediction, while outlier fragments underlie the structure–function relationship. PMID:26978354

  8. A New Secondary Structure Assignment Algorithm Using Cα Backbone Fragments.

    PubMed

    Cao, Chen; Wang, Guishen; Liu, An; Xu, Shutan; Wang, Lincong; Zou, Shuxue

    2016-01-01

    The assignment of secondary structure elements in proteins is a key step in the analysis of their structures and functions. We have developed an algorithm, SACF (secondary structure assignment based on Cα fragments), for secondary structure element (SSE) assignment based on the alignment of Cα backbone fragments with central poses derived by clustering known SSE fragments. The assignment algorithm consists of three steps: First, the outlier fragments on known SSEs are detected. Next, the remaining fragments are clustered to obtain the central fragments for each cluster. Finally, the central fragments are used as a template to make assignments. Following a large-scale comparison of 11 secondary structure assignment methods, SACF, KAKSI and PROSS are found to have similar agreement with DSSP, while PCASSO agrees with DSSP best. SACF and PCASSO show preference to reducing residues in N and C cap regions, whereas KAKSI, P-SEA and SEGNO tend to add residues to the terminals when DSSP assignment is taken as standard. Moreover, our algorithm is able to assign subtle helices (310-helix, π-helix and left-handed helix) and make uniform assignments, as well as to detect rare SSEs in β-sheets or long helices as outlier fragments from other programs. The structural uniformity should be useful for protein structure classification and prediction, while outlier fragments underlie the structure-function relationship. PMID:26978354

  9. Systemic transport of Alfalfa mosaic virus can be mediated by the movement proteins of several viruses assigned to five genera of the 30K family.

    PubMed

    Fajardo, Thor V M; Peiró, Ana; Pallás, Vicente; Sánchez-Navarro, Jesús

    2013-03-01

    We previously showed that the movement protein (MP) gene of Alfalfa mosaic virus (AMV) is functionally exchangeable for the cell-to-cell transport of the corresponding genes of Tobacco mosaic virus (TMV), Brome mosaic virus, Prunus necrotic ringspot virus, Cucumber mosaic virus and Cowpea mosaic virus. We have analysed the capacity of the heterologous MPs to systemically transport the corresponding chimeric AMV genome. All MPs were competent in systemic transport but required the fusion at their C terminus of the coat protein-interacting C-terminal 44 aa (A44) of the AMV MP. Except for the TMV MP, the presence of the hybrid virus in upper leaves correlated with the capacity to move locally. These results suggest that all the MPs assigned to the 30K superfamily should be exchangeable not only for local virus movement but also for systemic transport when the A44 fragment is present. PMID:23136366

  10. Studies of individual carbon sites of proteins in solution by natural abundance carbon 13 nuclear magnetic resonance spectroscopy. Strategies for assignments.

    PubMed

    Oldfield, E; Norton, R S; Allerhand, A

    1975-08-25

    Natural abundance 13C Fourier transform NMR spectra (at 15.18 MHz, in 20-mm sample tubes) of aqueous native proteins yield numerous narrow single carbon resonances of nonprotonated aromatic carbons. Techniques for the assignment of these resonances are presented. Each technique is applied to one or more of the following proteins: ferricytochrome c from horse heart and Candida krusei, ferrocytochrome c and cyanoferricytochrome c from horse heart, lysozyme from hen egg white, cyanoferrimyoglobins from horse and sperm whale skeletal muscle, and carbon monoxide myoglobin from horse. In all of the protein spectra we have examined, methine aromatic carbons give rise to broad bands. Studies of the narrow resonances of nonprotonated aromatic carbons of proteins are facilitated by removal of these broad bands by means of the convolution-difference method, preferably from spectra recorded under conditions of noise-modulated off-resonance proton decoupling. We present a summary of the chemical shift ranges for the various types of nonprotonated aromatic carbons of amino acid residues and hemes of diamagnetic proteins, based on our results for hen egg white lysozyme, horse heart ferrocytochrome c, horse carbon monoxide myoglobin, and carbon monoxide hemoglobins from various species... PMID:169240

  11. Hepatitis B virus large surface protein: function and fame

    PubMed Central

    Churin, Yuri; Roderfeld, Martin

    2015-01-01

    Chronic infection with hepatitis B virus (HBV) is the leading cause of liver cirrhosis and hepatocellular carcinoma worldwide. HBV life cycle begins with viral attachment to hepatocytes, mediated by the large HBV surface protein (LHBs). Identification of the sodium-taurocholate cotransporting polypeptide (NTCP) as a HBV receptor has revealed a suitable target for viral entry inhibition. Analysis of serum hepatitis B surface antigen (HBsAg) level is a non-invasive diagnostic parameter that improves HBV treatment opportunities. Furthermore, HBsAg plays an important role in manipulation of host immune response by HBV. However, observations in patients with chronic hepatitis B under conditions of immune suppression and in transgenic mouse models of HBV infection suggest, that in absence of adaptive immune responses cellular mechanisms induced by HBV may also lead to the development of liver diseases. Thus, the multifaceted pathological aspects of HBsAg predetermine the design of new therapeutical options modulating associated biological implications. PMID:25713800

  12. Hepatitis B virus large surface protein: function and fame.

    PubMed

    Churin, Yuri; Roderfeld, Martin; Roeb, Elke

    2015-02-01

    Chronic infection with hepatitis B virus (HBV) is the leading cause of liver cirrhosis and hepatocellular carcinoma worldwide. HBV life cycle begins with viral attachment to hepatocytes, mediated by the large HBV surface protein (LHBs). Identification of the sodium-taurocholate cotransporting polypeptide (NTCP) as a HBV receptor has revealed a suitable target for viral entry inhibition. Analysis of serum hepatitis B surface antigen (HBsAg) level is a non-invasive diagnostic parameter that improves HBV treatment opportunities. Furthermore, HBsAg plays an important role in manipulation of host immune response by HBV. However, observations in patients with chronic hepatitis B under conditions of immune suppression and in transgenic mouse models of HBV infection suggest, that in absence of adaptive immune responses cellular mechanisms induced by HBV may also lead to the development of liver diseases. Thus, the multifaceted pathological aspects of HBsAg predetermine the design of new therapeutical options modulating associated biological implications. PMID:25713800

  13. Redistribution of the discs large tumor suppressor protein during mitosis.

    PubMed

    Massimi, Paola; Gardiol, Daniela; Roberts, Sally; Banks, Lawrence

    2003-11-01

    Drosophila discs large (Dlg) has been shown to be an essential regulator of cell polarity and attachment, and is classified as a potential tumour suppressor in higher eukaryotes. Human Dlg is expressed in epithelial cells at sites of cell-cell contact and acts as a negative regulator of cell growth. Although hDlg has been shown to be phosphorylated during mitosis, little is known about its activity during this stage of the cell cycle. To investigate this further we have analysed in detail the pattern of hDlg expression during mitotic cell division. In early mitosis there is a marked increase in membrane-bound hDlg which is then retained throughout mitosis, while during cytokinesis, there is a specific concentration of hDlg at the midbody. Using mutants of Dlg we show that this is mediated by sequences in the carboxy terminal region of Dlg, but it does not require the SH3 or PDZ domains, and is independent of binding to protein 4.1. Finally, using a mutant of Dlg that consists of just this carboxy terminal region of the protein, we show that it can compete with endogenous hDlg for midbody accumulation, and this mutant also gives rise to altered cell growth. We conclude that localisation of Dlg to the midbody indicates a role for Dlg at this critical point in cytokinesis. PMID:14567986

  14. Large-Scale Biophysical Evaluation of Protein PEGylation Effects: In Vitro Properties of 61 Protein Entities.

    PubMed

    Vernet, Erik; Popa, Gina; Pozdnyakova, Irina; Rasmussen, Jakob E; Grohganz, Holger; Giehm, Lise; Jensen, Malene H; Wang, Huabing; Plesner, Bitten; Nielsen, Hanne M; Jensen, Knud J; Berthelsen, Jens; Sundström, Michael; van de Weert, Marco

    2016-05-01

    PEGylation is the most widely used method to chemically modify protein biopharmaceuticals, but surprisingly limited public data is available on the biophysical effects of protein PEGylation. Here we report the first large-scale study, with site-specific mono-PEGylation of 15 different proteins and characterization of 61 entities in total using a common set of analytical methods. Predictions of molecular size were typically accurate in comparison with actual size determined by size-exclusion chromatography (SEC) or dynamic light scattering (DLS). In contrast, there was no universal trend regarding the effect of PEGylation on the thermal stability of a protein based on data generated by circular dichroism (CD), differential scanning calorimetry (DSC), or differential scanning fluorimetry (DSF). In addition, DSF was validated as a fast and inexpensive screening method for thermal unfolding studies of PEGylated proteins. Multivariate data analysis revealed clear trends in biophysical properties upon PEGylation for a subset of proteins, although no universal trends were found. Taken together, these findings are important in the consideration of biophysical methods and evaluation of second-generation biopharmaceutical drug candidates. PMID:27043713

  15. Sequence-specific (1)H, (15)N, and (13)C resonance assignments of the autophagy-related protein LC3C.

    PubMed

    Krichel, Carsten; Weiergräber, Oliver H; Pavlidou, Marina; Mohrlüder, Jeannine; Schwarten, Melanie; Willbold, Dieter; Neudecker, Philipp

    2016-04-01

    Autophagy is a versatile catabolic pathway for lysosomal degradation of cytoplasmic material. While the phenomenological and molecular characteristics of autophagic non-selective (bulk) decomposition have been investigated for decades, the focus of interest is increasingly shifting towards the selective mechanisms of autophagy. Both, selective as well as bulk autophagy critically depend on ubiquitin-like modifiers belonging to the Atg8 (autophagy-related 8) protein family. During evolution, Atg8 has diversified into eight different human genes. While all human homologues participate in the formation of autophagosomal membrane compartments, microtubule-associated protein light chain 3C (LC3C) additionally plays a unique role in selective autophagic clearance of intracellular pathogens (xenophagy), which relies on specific protein-protein recognition events mediated by conserved motifs. The sequence-specific (1)H, (15)N, and (13)C resonance assignments presented here form the stepping stone to investigate the high-resolution structure and dynamics of LC3C and to delineate LC3C's complex network of molecular interactions with the autophagic machinery by NMR spectroscopy. PMID:26280529

  16. Re-fraction: a machine learning approach for deterministic identification of protein homologues and splice variants in large-scale MS-based proteomics.

    PubMed

    Yang, Pengyi; Humphrey, Sean J; Fazakerley, Daniel J; Prior, Matthew J; Yang, Guang; James, David E; Yang, Jean Yee-Hwa

    2012-05-01

    A key step in the analysis of mass spectrometry (MS)-based proteomics data is the inference of proteins from identified peptide sequences. Here we describe Re-Fraction, a novel machine learning algorithm that enhances deterministic protein identification. Re-Fraction utilizes several protein physical properties to assign proteins to expected protein fractions that comprise large-scale MS-based proteomics data. This information is then used to appropriately assign peptides to specific proteins. This approach is sensitive, highly specific, and computationally efficient. We provide algorithms and source code for the current version of Re-Fraction, which accepts output tables from the MaxQuant environment. Nevertheless, the principles behind Re-Fraction can be applied to other protein identification pipelines where data are generated from samples fractionated at the protein level. We demonstrate the utility of this approach through reanalysis of data from a previously published study and generate lists of proteins deterministically identified by Re-Fraction that were previously only identified as members of a protein group. We find that this approach is particularly useful in resolving protein groups composed of splice variants and homologues, which are frequently expressed in a cell- or tissue-specific manner and may have important biological consequences. PMID:22428558

  17. Large-scale protein-protein interactions detection by integrating big biosensing data with computational model.

    PubMed

    You, Zhu-Hong; Li, Shuai; Gao, Xin; Luo, Xin; Ji, Zhen

    2014-01-01

    Protein-protein interactions are the basis of biological functions, and studying these interactions on a molecular level is of crucial importance for understanding the functionality of a living cell. During the past decade, biosensors have emerged as an important tool for the high-throughput identification of proteins and their interactions. However, the high-throughput experimental methods for identifying PPIs are both time-consuming and expensive. On the other hand, high-throughput PPI data are often associated with high false-positive and high false-negative rates. Targeting at these problems, we propose a method for PPI detection by integrating biosensor-based PPI data with a novel computational model. This method was developed based on the algorithm of extreme learning machine combined with a novel representation of protein sequence descriptor. When performed on the large-scale human protein interaction dataset, the proposed method achieved 84.8% prediction accuracy with 84.08% sensitivity at the specificity of 85.53%. We conducted more extensive experiments to compare the proposed method with the state-of-the-art techniques, support vector machine. The achieved results demonstrate that our approach is very promising for detecting new PPIs, and it can be a helpful supplement for biosensor-based PPI data detection. PMID:25215285

  18. Extra-Large G Proteins Expand the Repertoire of Subunits in Arabidopsis Heterotrimeric G Protein Signaling.

    PubMed

    Chakravorty, David; Gookin, Timothy E; Milner, Matthew J; Yu, Yunqing; Assmann, Sarah M

    2015-09-01

    Heterotrimeric G proteins, consisting of Gα, Gβ, and Gγ subunits, are a conserved signal transduction mechanism in eukaryotes. However, G protein subunit numbers in diploid plant genomes are greatly reduced as compared with animals and do not correlate with the diversity of functions and phenotypes in which heterotrimeric G proteins have been implicated. In addition to GPA1, the sole canonical Arabidopsis (Arabidopsis thaliana) Gα subunit, Arabidopsis has three related proteins: the extra-large GTP-binding proteins XLG1, XLG2, and XLG3. We demonstrate that the XLGs can bind Gβγ dimers (AGB1 plus a Gγ subunit: AGG1, AGG2, or AGG3) with differing specificity in yeast (Saccharomyces cerevisiae) three-hybrid assays. Our in silico structural analysis shows that XLG3 aligns closely to the crystal structure of GPA1, and XLG3 also competes with GPA1 for Gβγ binding in yeast. We observed interaction of the XLGs with all three Gβγ dimers at the plasma membrane in planta by bimolecular fluorescence complementation. Bioinformatic and localization studies identified and confirmed nuclear localization signals in XLG2 and XLG3 and a nuclear export signal in XLG3, which may facilitate intracellular shuttling. We found that tunicamycin, salt, and glucose hypersensitivity and increased stomatal density are agb1-specific phenotypes that are not observed in gpa1 mutants but are recapitulated in xlg mutants. Thus, XLG-Gβγ heterotrimers provide additional signaling modalities for tuning plant G protein responses and increase the repertoire of G protein heterotrimer combinations from three to 12. The potential for signal partitioning and competition between the XLGs and GPA1 is a new paradigm for plant-specific cell signaling. PMID:26157115

  19. Tracking metal ions through a Cu/Ag efflux pump assigns the functional roles of the periplasmic proteins

    PubMed Central

    Mealman, Tiffany D.; McEvoy, Megan M.; Blackburn, Ninian J.

    2014-01-01

    Copper is an essential nutrient for all aerobic organisms but is toxic in excess. At the host–pathogen interface, macrophages respond to bacterial infection by copper-dependent killing mechanisms, whereas the invading bacteria are thought to counter with an up-regulation of copper transporters and efflux pumps. The tripartite efflux pump CusCBA and its metallochaperone CusF are vital to the detoxification of copper and silver ions in the periplasm of Escherichia coli. However, the mechanism of efflux by this complex, which requires the activation of the inner membrane pump CusA, is poorly understood. Here, we use selenomethionine (SeM) active site labels in a series of biological X-ray absorption studies at the selenium, copper, and silver edges to establish a “switch” role for the membrane fusion protein CusB. We determine that metal-bound CusB is required for activation of cuprous ion transfer from CusF directly to a site in the CusA antiporter, showing for the first time (to our knowledge) the in vitro activation of the Cus efflux pump. This metal-binding site of CusA is unlike that observed in the crystal structures of the CusA protein and is composed of one oxygen and two sulfur ligands. Our results suggest that metal transfer occurs between CusF and apo-CusB, and that, when metal-loaded, CusB plays a role in the regulation of metal ion transfer from CusF to CusA in the periplasm. PMID:25313055

  20. A large volume flat coil probe for oriented membrane proteins.

    PubMed

    Gor'kov, Peter L; Chekmenev, Eduard Y; Fu, Riqiang; Hu, Jun; Cross, Timothy A; Cotten, Myriam; Brey, William W

    2006-07-01

    15N detection of mechanically aligned membrane proteins benefits from large sample volumes that compensate for the low sensitivity of the observe nuclei, dilute sample preparation, and for the poor filling factor arising from the presence of alignment plates. Use of larger multi-tuned solenoids, however, is limited by wavelength effects that lead to inhomogeneous RF fields across the sample, complicating cross-polarization experiments. We describe a 600 MHz 15N-1H solid-state NMR probe with large (580 mm3) RF solenoid for high-power, multi-pulse sequence experiments, such as polarization inversion spin exchange at the magic angle (PISEMA). In order to provide efficient detection for 15N, a 4-turn solenoidal sample coil is used that exceeds 0.27 lambda at the 600 MHz 1H resonance. A balanced tuning-matching circuit is employed to preserve RF homogeneity across the sample for adequate magnetization transfer from 1H to 15N. We describe a procedure for optimization of the shorted 1/4 lambda coaxial trap that allows for the sufficiently strong RF fields in both 1H and 15N channels to be achieved within the power limits of 300 W 1H and 1 kW 15N amplifiers. The 8 x 6 x 12 mm solenoid sustains simultaneous B1 irradiation of 100 kHz at 1H frequency and 51 kHz at 15N frequency for at least 5 ms with 265 and 700 W of input power in the respective channels. The probe functionality is demonstrated by 2D 15N-1H PISEMA spectroscopy for two applications at 600 MHz. PMID:16580852

  1. The HN(COCA)HAHB NMR experiment for the stereospecific assignment of Hbeta-protons in non-native states of proteins.

    PubMed

    Hähnke, Martin J; Richter, Christian; Heinicke, Friederike; Schwalbe, Harald

    2010-01-27

    (3)J(H(alpha),H(beta))-coupling constants deliver precious information on the population of the three favored chi(1)-rotamers in unfolded states of proteins. Here, a novel pulse sequence, tailored toward the NMR analysis of non-native states of proteins, the HN(COCA)HAHB experiment, is developed to measure (3)J(H(alpha),H(beta)). In four subsequent INEPT steps, magnetization is transferred from H(N) to H(alpha). In a COSY-like magnetization transfer step, dephasing of magnetization on H(alpha) is quantified to determine the (3)J(H(alpha),H(beta))-coupling constants. Analysis of the measured homonuclear coupling constants, together with measurement of heteronuclear (3)J(N,C(gamma))- and (3)J(C',C(gamma))-coupling constants, allows stereospecific assignment of the two diastereotopic H(beta)-protons even in unfolded states of proteins, and the derivation of populations according to a Pachler-type analysis. PMID:20039672

  2. Large-scale proteomic analysis of membrane proteins

    SciTech Connect

    Ahram, Mamoun; Springer, David L.

    2004-10-01

    Proteomic analysis of membrane proteins is promising in identification of novel candidates as drug targets and/or disease biomarkers. Despite notable technological developments, obstacles related to extraction and solubilization of membrane proteins are frequently encountered. A critical discussion of the different preparative methods of membrane proteins is offered in relation to downstream proteomic applications, mainly gel-based analyses and mass spectrometry. Unknown proteins are often identified by high-throughput profiling of membrane proteins. In search for novel membrane proteins, analysis of protein sequences using computational tools is performed to predict for the presence of transmembrane domains. Here, we also present these bioinformatic tools with the human proteome as a case study. Along with technological innovations, advancements in the areas of sample preparation and computational prediction of membrane proteins will lead to exciting discoveries.

  3. Heteronuclear NMR assignments and secondary structure of the coiled coil trimerization domain from cartilage matrix protein in oxidized and reduced forms.

    PubMed Central

    Wiltscheck, R.; Kammerer, R. A.; Dames, S. A.; Schulthess, T.; Blommers, M. J.; Engel, J.; Alexandrescu, A. T.

    1997-01-01

    The C-terminal oligomerization domain of chicken cartilage matrix protein is a trimeric coiled coil comprised of three identical 43-residue chains. NMR spectra of the protein show equivalent magnetic environments for each monomer, indicating a parallel coiled coil structure with complete threefold symmetry. Sequence-specific assignments for 1H-, 15N-, and 13C-NMR resonances have been obtained from 2D 1H NOESY and TOCSY spectra, and from 3D HNCA, 15N NOESY-HSQC, and HCCH-TOCSY spectra. A stretch of alpha-helix encompassing five heptad repeats (35 residues) has been identified from intra-chain HN-HN and HN-H alpha NOE connectivities. 3JHNH alpha coupling constants, and chemical shift indices. The alpha-helix begins immediately downstream of inter-chain disulfide bonds between residues Cys 5 and Cys 7, and extends to near the C-terminus of the molecule. The threefold symmetry of the molecule is maintained when the inter-chain disulfide bonds that flank the N-terminus of the coiled coil are reduced. Residues Ile 21 through Glu 36 show conserved chemical shifts and NOE connectivities, as well as strong protection from solvent exchange in the oxidized and reduced forms of the protein. By contrast, residues Ile 10 through Val 17 show pronounced chemical shift differences between the oxidized and reduced protein. Strong chemical exchange NOEs between HN resonances and water indicate solvent exchange on time scales faster than 10 s, and suggests a dynamic fraying of the N-terminus of the coiled coil upon reduction of the disulfide bonds. Possible roles for the disulfide crosslinks of the oligomerization domain in the function of cartilage matrix protein are proposed. PMID:9260286

  4. Assignment of IR bands of isolated and protein-bound Peridinin in its fundamental and triplet state by static FTIR, time-resolved step-scan FTIR and DFT calculations

    NASA Astrophysics Data System (ADS)

    Mezzetti, Alberto; Kish, Elizabeth; Robert, Bruno; Spezia, Riccardo

    2015-06-01

    The vibrational properties of Peridinin in its fundamental state and in the excited triplet state have been investigated by DFT calculations and static and time-resolved FTIR spectroscopy. The infrared spectrum of Peridinin in its fundamental state has been explored in the whole 2000-600 cm-1 range, and interpreted in term of molecular vibrations. In particular, new infrared bands have been identified and assigned to specific molecular vibrations. 3Peridinin molecular vibrations have also been investigated by DFT calculations. In addition, putative IR bands belonging to Peridinin and 3Peridinin have been identified in the step-scan FTIR difference spectrum of the Peridinin-Chlorophyll a-Protein from Amphidinium carterae, where light induce formation of a triplet state localized on one or more Peridinins. The exact nature of the triplet state formed in Peridinin-Chlorophyll a-Protein from dinoflagellates, in particular the possible involvement in this triplet state of 3Chlorophyll a, has been largely debated in the last few years (see Carbonera et al., 2014 [3]); time-resolved differential FTIR experiments have played a key role in this debate. Identification of IR marker bands for the main molecule (Peridinin) implicated in this photophysical process is therefore particularly important and makes this study a significant step towards the full understanding of Peridinin-Chlorophyll-a-Proteins photophysics.

  5. Negotiating Assignment Pathways: Students and Academic Assignments

    ERIC Educational Resources Information Center

    McDowell, Liz

    2008-01-01

    Existing research identifies that students' approaches to assignments are related to their general approaches to study. It is suggested that students need to better understand the requirements of assignments and acquire new concepts such as "argument". This fine-grained study proposes four qualitatively distinct assignment pathways: gathering,…

  6. Simultaneous Acquisition of 2D and 3D Solid-State NMR Experiments for Sequential Assignment of Oriented Membrane Protein Samples

    PubMed Central

    Gopinath, T.; Mote, Kaustubh R; Veglia, Gianluigi

    2016-01-01

    We present a new method called DAISY (Dual Acquisition orIented ssNMR spectroScopY) for the simultaneous acquisition of 2D and 3D oriented solid-state NMR experiments for membrane proteins aligned in mechanically or magnetically lipid bilayers. DAISY utilizes dual acquisition of sine and cosine dipolar or chemical shift coherences and long living 15N longitudinal polarization to obtain two multi-dimensional spectra, simultaneously. In these new experiments, the first acquisition gives the polarization inversion spin exchange at the magic angle (PISEMA) or heteronuclear correlation (HETCOR) spectra, the second acquisition gives PISEMA-mixing or HETCOR-mixing spectra, where the mixing element enables inter-residue correlations through 15N-15N homonuclear polarization transfer. The analysis of the two 2D spectra (first and second acquisitions) enables one to distinguish 15N-15N inter-residue correlations for sequential assignment of membrane proteins. DAISY can be implemented in 3D experiments that include the polarization inversion spin exchange at magic angle via I spin coherence (PISEMAI) sequence, as we show for the simultaneous acquisition of 3D PISEMAI-HETCOR and 3D PISEMAI-HETCOR-mixing experiments. PMID:25749871

  7. Simultaneous acquisition of 2D and 3D solid-state NMR experiments for sequential assignment of oriented membrane protein samples.

    PubMed

    Gopinath, T; Mote, Kaustubh R; Veglia, Gianluigi

    2015-05-01

    We present a new method called DAISY (Dual Acquisition orIented ssNMR spectroScopY) for the simultaneous acquisition of 2D and 3D oriented solid-state NMR experiments for membrane proteins reconstituted in mechanically or magnetically aligned lipid bilayers. DAISY utilizes dual acquisition of sine and cosine dipolar or chemical shift coherences and long living (15)N longitudinal polarization to obtain two multi-dimensional spectra, simultaneously. In these new experiments, the first acquisition gives the polarization inversion spin exchange at the magic angle (PISEMA) or heteronuclear correlation (HETCOR) spectra, the second acquisition gives PISEMA-mixing or HETCOR-mixing spectra, where the mixing element enables inter-residue correlations through (15)N-(15)N homonuclear polarization transfer. The analysis of the two 2D spectra (first and second acquisitions) enables one to distinguish (15)N-(15)N inter-residue correlations for sequential assignment of membrane proteins. DAISY can be implemented in 3D experiments that include the polarization inversion spin exchange at magic angle via I spin coherence (PISEMAI) sequence, as we show for the simultaneous acquisition of 3D PISEMAI-HETCOR and 3D PISEMAI-HETCOR-mixing experiments. PMID:25749871

  8. Large-scale crystallization of proteins for purification and formulation.

    PubMed

    Hekmat, Dariusch

    2015-07-01

    Since about 170 years, salts were used to create supersaturated solutions and crystallize proteins. The dehydrating effect of salts as well as their kosmotropic or chaotropic character was revealed. Even the suitability of organic solvents for crystallization was already recognized. Interestingly, what was performed during the early times is still practiced today. A lot of effort was put into understanding the underlying physico-chemical interaction mechanisms leading to protein crystallization. However, it was understood that already the solvation of proteins is a highly complex process not to mention the intricate interrelation of electrostatic and hydrophobic interactions taking place. Although many basic questions are still unanswered, preparative protein crystallization was attempted as illustrated in the presented case studies. Due to the highly variable nature of crystallization, individual design of the crystallization process is needed in every single case. It was shown that preparative crystallization from impure protein solutions as a capture step is possible after applying adequate pre-treatment procedures like precipitation or extraction. Protein crystallization can replace one or more chromatography steps. It was further shown that crystallization can serve as an attractive alternative means for formulation of therapeutic proteins. Crystalline proteins can offer enhanced purity and enable highly concentrated doses of the active ingredient. Easy scalability of the proposed protein crystallization processes was shown using the maximum local energy dissipation as a suitable scale-up criterion. Molecular modeling and target-oriented protein engineering may allow protein crystallization to become part of a platform purification process in the near future. PMID:25700885

  9. Quality control test for sequence-phenotype assignments.

    PubMed

    Ortiz, Maria Teresa Lara; Rosario, Pablo Benjamín Leon; Luna-Nevarez, Pablo; Gamez, Alba Savin; Martínez-del Campo, Ana; Del Rio, Gabriel

    2015-01-01

    Relating a gene mutation to a phenotype is a common task in different disciplines such as protein biochemistry. In this endeavour, it is common to find false relationships arising from mutations introduced by cells that may be depurated using a phenotypic assay; yet, such phenotypic assays may introduce additional false relationships arising from experimental errors. Here we introduce the use of high-throughput DNA sequencers and statistical analysis aimed to identify incorrect DNA sequence-phenotype assignments and observed that 10-20% of these false assignments are expected in large screenings aimed to identify critical residues for protein function. We further show that this level of incorrect DNA sequence-phenotype assignments may significantly alter our understanding about the structure-function relationship of proteins. We have made available an implementation of our method at http://bis.ifc.unam.mx/en/software/chispas. PMID:25700273

  10. Quality Control Test for Sequence-Phenotype Assignments

    PubMed Central

    Ortiz, Maria Teresa Lara; Rosario, Pablo Benjamín Leon; Luna-Nevarez, Pablo; Gamez, Alba Savin; Martínez-del Campo, Ana; Del Rio, Gabriel

    2015-01-01

    Relating a gene mutation to a phenotype is a common task in different disciplines such as protein biochemistry. In this endeavour, it is common to find false relationships arising from mutations introduced by cells that may be depurated using a phenotypic assay; yet, such phenotypic assays may introduce additional false relationships arising from experimental errors. Here we introduce the use of high-throughput DNA sequencers and statistical analysis aimed to identify incorrect DNA sequence-phenotype assignments and observed that 10–20% of these false assignments are expected in large screenings aimed to identify critical residues for protein function. We further show that this level of incorrect DNA sequence-phenotype assignments may significantly alter our understanding about the structure-function relationship of proteins. We have made available an implementation of our method at http://bis.ifc.unam.mx/en/software/chispas. PMID:25700273

  11. The Nifty Assignments Site.

    ERIC Educational Resources Information Center

    Parlante, Nick

    2001-01-01

    Describes a Web site called Nifty Assignments that offers assignments for computer science education. Topics include programming assignments; student appeal; appropriateness for high school classes; and links to other related Web sites. (LRW)

  12. Large-scale analysis of phosphorylated proteins in maize leaf.

    PubMed

    Bi, Ying-Dong; Wang, Hong-Xia; Lu, Tian-Cong; Li, Xiao-Hui; Shen, Zhuo; Chen, Yi-Bo; Wang, Bai-Chen

    2011-02-01

    Phosphorylation is an ubiquitous regulatory mechanism governing the activity, subcellular localization, and intermolecular interactions of proteins. To identify a broad range of phosphoproteins from Zea mays, we enriched phosphopeptides from Zea mays leaves using titanium dioxide microcolumns and then extensively fractionated and identified the phosphopeptides by mass spectrometry. A total of 165 unique phosphorylation sites with a putative role in biological processes were identified in 125 phosphoproteins. Most of these proteins are involved in metabolism, including carbohydrate and protein metabolism. We identified novel phosphorylation sites on translation initiation factors, splicing factors, nucleolar RNA helicases, and chromatin-remodeling proteins such as histone deacetylases. Intriguingly, we also identified phosphorylation sites on several proteins associated with photosynthesis, and we speculate that these sites may be involved in carbohydrate metabolism or electron transport. Among these phosphoproteins, phosphoenolpyruvate carboxylase and NADH: nitrate reductase (NR) which catalyzes the rate-limiting and regulated step in the pathway of inorganic nitrogen assimilation were identified. A conserved phosphorylation site was found in the cytochrome b5 heme-binding domain of NADH: nitrate reductase, suggesting that NADH: nitrate reductase is phosphorylated by the same protein kinase or highly related kinases. These data demonstrate that the pathways that regulate diverse processes in plants are major targets of phosphorylation. PMID:21053013

  13. Backbone 1H, 15N, and 13C resonance assignments and secondary structure of a novel protein OGL-20P(T)-358 from hyperthermophile Thermococcus thioreducens sp. nov.

    PubMed

    Wilson, Randall; Hughes, Ronny; Curto, Ernest; Ng, Joseph; Twigg, Pamela

    2007-12-31

    OGL-20P(T)-358 is a novel 66 amino acid residue protein from the hyperthermophile Thermococcus thioreducens sp. nov., strain OGL-20PT, which was collected from the wall of the hydrothermal black smoker in the Rainbow Vent along the mid-Atlantic ridge. This protein, which has no detectable sequence homology with proteins or domains of known function, has a calculated pI of 4.76 and a molecular mass of 8.2 kDa. We report here the backbone 1H, 15N, and 13C resonance assignments of OGL-20PT-358. Assignments are 97.5% (316/324) complete. Chemical shift index was used to determine the secondary structure of the protein, which appears to consist of primarily alpha-helical regions. This work is the foundation for future studies to determine the three-dimensional solution structure of the protein. PMID:18182861

  14. BACHSCORE. A tool for evaluating efficiently and reliably the quality of large sets of protein structures

    NASA Astrophysics Data System (ADS)

    Sarti, E.; Zamuner, S.; Cossio, P.; Laio, A.; Seno, F.; Trovato, A.

    2013-12-01

    In protein structure prediction it is of crucial importance, especially at the refinement stage, to score efficiently large sets of models by selecting the ones that are closest to the native state. We here present a new computational tool, BACHSCORE, that allows its users to rank different structural models of the same protein according to their quality, evaluated by using the BACH++ (Bayesian Analysis Conformation Hunt) scoring function. The original BACH statistical potential was already shown to discriminate with very good reliability the protein native state in large sets of misfolded models of the same protein. BACH++ features a novel upgrade in the solvation potential of the scoring function, now computed by adapting the LCPO (Linear Combination of Pairwise Orbitals) algorithm. This change further enhances the already good performance of the scoring function. BACHSCORE can be accessed directly through the web server: bachserver.pd.infn.it. Catalogue identifier: AEQD_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEQD_v1_0.html Program obtainable from: CPC Program Library, Queen’s University, Belfast, N. Ireland Licensing provisions: GNU General Public License version 3 No. of lines in distributed program, including test data, etc.: 130159 No. of bytes in distributed program, including test data, etc.: 24 687 455 Distribution format: tar.gz Programming language: C++. Computer: Any computer capable of running an executable produced by a g++ compiler (4.6.3 version). Operating system: Linux, Unix OS-es. RAM: 1 073 741 824 bytes Classification: 3. Nature of problem: Evaluate the quality of a protein structural model, taking into account the possible “a priori” knowledge of a reference primary sequence that may be different from the amino-acid sequence of the model; the native protein structure should be recognized as the best model. Solution method: The contact potential scores the occurrence of any given type of residue pair in 5 possible

  15. Recent advances in large-scale protein interactome mapping

    PubMed Central

    Mehta, Virja; Trinkle-Mulcahy, Laura

    2016-01-01

    Protein-protein interactions (PPIs) underlie most, if not all, cellular functions. The comprehensive mapping of these complex networks of stable and transient associations thus remains a key goal, both for systems biology-based initiatives (where it can be combined with other ‘omics’ data to gain a better understanding of functional pathways and networks) and for focused biological studies. Despite the significant challenges of such an undertaking, major strides have been made over the past few years. They include improvements in the computation prediction of PPIs and the literature curation of low-throughput studies of specific protein complexes, but also an increase in the deposition of high-quality data from non-biased high-throughput experimental PPI mapping strategies into publicly available databases. PMID:27158474

  16. Recent advances in large-scale protein interactome mapping.

    PubMed

    Mehta, Virja; Trinkle-Mulcahy, Laura

    2016-01-01

    Protein-protein interactions (PPIs) underlie most, if not all, cellular functions. The comprehensive mapping of these complex networks of stable and transient associations thus remains a key goal, both for systems biology-based initiatives (where it can be combined with other 'omics' data to gain a better understanding of functional pathways and networks) and for focused biological studies. Despite the significant challenges of such an undertaking, major strides have been made over the past few years. They include improvements in the computation prediction of PPIs and the literature curation of low-throughput studies of specific protein complexes, but also an increase in the deposition of high-quality data from non-biased high-throughput experimental PPI mapping strategies into publicly available databases. PMID:27158474

  17. Scaling approach to the folding kinetics of large proteins.

    PubMed

    Nelson, Erik D; Grishin, Nick V

    2006-01-01

    We study a nucleation-growth model of protein folding and extend it to describe larger proteins with multiple folding units. The model is of one of an extremely simple type in which amino acids are allowed just two states--either folded (frozen) or unfolded. Its energetics are heterogeneous and Gō-like, the energy being defined in terms of the number of atom-to-atom contacts that would occur between frozen amino acids in the native crystal structure of the protein. Each collective state of the amino acids is intended to represent a small free energy microensemble consisting of the possible configurations of unfolded loops, open segments, and free ends constrained by the cross-links that form between folded parts of the molecule. We approximate protein free energy landscapes by an infinite subset of these microensemble topologies in which loops and open unfolded segments can be viewed roughly as independent objects for the purpose of calculating their entropy, and we develop a means to implement this approximation in Monte Carlo simulations. We show that this approach describes transition state structures (phi values) more accurately and identifies folding intermediates that were unavailable to previous versions of the model that restricted the number of loops and nuclei. PMID:16486182

  18. Sequence-specific {sup 1}H, {sup 13}C, and {sup 15}N resonance assignments for intestinal fatty-acid-binding protein complexed with palmitate (15.4 kDA)

    SciTech Connect

    Hodsdon, M.E.; Toner, J.J.; Cistola, D.P.

    1994-12-01

    Intestinal fatty-acid-binding protein (I-FABP) belongs to a family of soluble, cytoplasmic proteins that are thought to function in the intracellular transport and trafficking of polar lipids. Individual members of this protein family have distinct specificities and affinities for fatty acids, cholesterol, bile salts, and retinoids. We are comparing several retinol- and fatty-acid-binding proteins from intestine in order to define the factors that control molecular recognition in this family of proteins. We have established sequential resonance assignments for uniformly {sup 13}C/{sup 15}N-enriched I-FABP complexed with perdeuterated palmitate at pH7.2 and 37{degrees}C. The assignment strategy was similar to that introduced for calmodulin. We employed seven three-dimensional NMR experiments to establish scalar couplings between backbone and sidechain atoms. Backbone atoms were correlated using triple-resonance HNCO, HNCA, TOCSY-HMQC, HCACO, and HCA(CO)N experiments. Sidechain atoms were correlated using CC-TOCSY, HCCH-TOCSY, and TOCSY-HMQC. The correlations of peaks between three-dimensional spectra were established in a computer-assisted manner using NMR COMPASS (Molecular Simulations, Inc.) Using this approach, {sup 1}H, {sup 13}C, and {sup 15}N resonance assignments have been established for 120 of the 131 residues of I-FABP. For 18 residues, amide {sup 1}H and {sup 15}N resonances were unobservable, apparently because of the rapid exchange of amide protons with bulk water at pH 7.2. The missing amide protons correspond to distinct amino acid patterns in the protein sequence, which will be discussed. During the assignment process, several sources of ambiguity in spin correlations were observed. To overcome this ambiguity, the additional inter-residue correlations often observed in the HNCA experiment were used as cross-checks for the sequential backbone assignments.

  19. A General Assignment Method for Oriented Sample (OS) Solid-state NMR of Proteins Based on The Correlation of Resonances through Heteronuclear Dipolar Couplings in Samples Aligned Parallel and Perpendicular to the Magnetic Field

    PubMed Central

    Lu, George J.; Son, Woo Sung; Opella, Stanley J.

    2011-01-01

    A general method for assigning oriented sample (OS) solid-state NMR spectra of proteins is demonstrated. In principle, this method requires only a single sample of a uniformly 15N-labeled membrane protein in magnetically aligned bilayers, and a previously assigned isotropic chemical shift spectrum obtained either from solution NMR on micelle or isotropic bicelle samples or from magic angle spinning (MAS) solid-state NMR on unoriented proteoliposomes. The sequential isotropic resonance assignments are transferred to the OS solid-state NMR spectra of aligned samples by correlating signals from the same residue observed in protein-containing bilayers aligned with their normals parallel and perpendicular to the magnetic field. The underlying principle is that the resonances from the same residue have heteronuclear dipolar couplings that differ by exactly a factor of two between parallel and perpendicular alignments. The method is demonstrated on the membrane-bound form of Pf1 coat protein in phospholipid bilayers, whose assignments have been previously made using an earlier generation of methods that relied on the preparation of many selectively labeled (by residue type) samples. The new method provides the correct resonance assignments using only a single uniformly 15N-labeled sample, two solid-state NMR spectra, and a previously assigned isotropic spectrum. Significantly, this approach is equally applicable to residues in alpha helices, beta sheets, loops, and any other elements of tertiary structure. Moreover, the strategy bridges between OS solid-state NMR of aligned samples and solution NMR or MAS solid-state NMR of unoriented samples. In combination with the development of complementary experimental methods, it provides a step towards unifying these apparently different NMR approaches. PMID:21316275

  20. (1)H, (13)C and (15)N backbone and side-chain resonance assignment of the LAM-RRM1 N-terminal module of La protein from Dictyostelium discoideum.

    PubMed

    Chasapis, Christos T; Argyriou, Aikaterini I; Apostolidi, Maria; Konstantinidou, Parthena; Bentrop, Detlef; Stathopoulos, Constantinos; Spyroulias, Georgios A

    2015-10-01

    The N-terminal half of La protein consists of two concatenated motifs: La motif (LAM) and the N-terminal RNA recognition motif (RRM1) both of which are responsible for poly(U) RNA binding. Here, we present the backbone and side-chain assignments of the (1)H, (13)C and (15)N resonances of the 191-residue LAM-RRM1 region of the La protein from the lower eukaryote Dictyostelium discoideum and its secondary structure prediction. PMID:25687647

  1. 43 CFR 2521.3 - Assignment.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused...

  2. 43 CFR 2521.3 - Assignment.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused...

  3. 43 CFR 2521.3 - Assignment.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused...

  4. 43 CFR 2521.3 - Assignment.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused...

  5. A Simple and Effective Protein Folding Activity Suitable for Large Lectures

    ERIC Educational Resources Information Center

    White, Brian

    2006-01-01

    This article describes a simple and inexpensive hands-on simulation of protein folding suitable for use in large lecture classes. This activity uses a minimum of parts, tools, and skill to simulate some of the fundamental principles of protein folding. The major concepts targeted are that proteins begin as linear polypeptides and fold to…

  6. Photophysical behavior and assignment of the low-energy chlorophyll states in the CP43 proximal antenna protein of higher plant photosystem II.

    PubMed

    Hughes, Joseph L; Picorel, Rafael; Seibert, Michael; Krausz, Elmars

    2006-10-10

    We have employed absorption, circular dichroism (CD), and persistent spectral hole-burning measurements at 1.7 K to study the photoconversion properties and exciton coupling of low-energy chlorophylls (Chls) in the CP43 proximal antenna light-harvesting subunit of photosystem II (PSII) isolated from spinach. These approximately 683 nm states act as traps for excitation energy in isolated CP43. They "bleach" at 683 nm upon illumination and photoconvert to a form absorbing in the range approximately 660-680 nm. We present new data that show the changes in the CD spectrum due to the photoconversion process. These changes occur in parallel with those in absorption, providing evidence that the feature undergoing the apparent bleach is a component of a weakly exciton-coupled system. From our photoconversion difference spectra, we assign four states in the Chl long-wavelength region of CP43, two of which are the known trap states and are both highly localized on single Chls. The other two states are associated with weak exciton coupling (maximally approximately 50 cm(-)(1)) to one of these traps. We propose a mechanism for photoconversion that involves Chl-protein hydrogen bonding. New hole-burning data are presented that indicate this mechanism is distinct to that for narrow-band spectral hole burning in CP43. We discuss the photophysical behavior of the Chl trap states in isolated CP43 compared to their behavior in intact PSII preparations. The latter represent a more intact, physiological complex, and we find no clear evidence that they exhibit the photoconversion process reported here. PMID:17014087

  7. Sampling small-scale and large-scale conformational changes in proteins and molecular complexes

    NASA Astrophysics Data System (ADS)

    Yun, Mi-Ran; Mousseau, N.; Derreumaux, P.

    2007-03-01

    Sampling of small-scale and large-scale motions is important in various computational tasks, such as protein-protein docking and ligand binding. Here, we report further development and applications of the activation-relaxation technique for internal coordinate space trajectories (ARTIST). This method generates conformational moves of any complexity and size by identifying and crossing well-defined saddle points connecting energy minima. Simulations on two all-atom proteins and three protein complexes containing between 70 and 300 amino acids indicate that ARTIST opens the door to the full treatment of all degrees of freedom in dense systems such as protein-protein complexes.

  8. Automated assignment of NMR chemical shifts based on a known structure and 4D spectra.

    PubMed

    Trautwein, Matthias; Fredriksson, Kai; Möller, Heiko M; Exner, Thomas E

    2016-08-01

    Apart from their central role during 3D structure determination of proteins the backbone chemical shift assignment is the basis for a number of applications, like chemical shift perturbation mapping and studies on the dynamics of proteins. This assignment is not a trivial task even if a 3D protein structure is known and needs almost as much effort as the assignment for structure prediction if performed manually. We present here a new algorithm based solely on 4D [(1)H,(15)N]-HSQC-NOESY-[(1)H,(15)N]-HSQC spectra which is able to assign a large percentage of chemical shifts (73-82 %) unambiguously, demonstrated with proteins up to a size of 250 residues. For the remaining residues, a small number of possible assignments is filtered out. This is done by comparing distances in the 3D structure to restraints obtained from the peak volumes in the 4D spectrum. Using dead-end elimination, assignments are removed in which at least one of the restraints is violated. Including additional information from chemical shift predictions, a complete unambiguous assignment was obtained for Ubiquitin and 95 % of the residues were correctly assigned in the 251 residue-long N-terminal domain of enzyme I. The program including source code is available at https://github.com/thomasexner/4Dassign . PMID:27484442

  9. Very simple combination of TROSY, CRINEPT and multiple quantum coherence for signal enhancement in an HN(CO)CA experiment for large proteins

    NASA Astrophysics Data System (ADS)

    Bayrhuber, Monika; Riek, Roland

    2011-04-01

    Sensitivity enhancement in liquid state nuclear magnetic resonance (NMR) triple resonance experiments for the sequential assignment of proteins is important for the investigation of large proteins or protein complexes. We present here the 3D TROSY-MQ/CRINEPT-HN(CO)CA which makes use of a 15N- 1H-TROSY element and a 13C'- 13CA CRINEPT step combined with a multiple quantum coherence during the 13CA evolution period. Because of the introduction of these relaxation-optimized elements and 10 less pulses required, when compared with the conventional TROSY-HN(CO)CA experiment an average signal enhancement of a factor of 1.8 was observed for the membrane protein-detergent complex KcsA with a rotational correlation time τ c of around 60 ns.

  10. Nano-Mole Scale Side-Chain Signal Assignment by 1H-Detected Protein Solid-State NMR by Ultra-Fast Magic-Angle Spinning and Stereo-Array Isotope Labeling

    PubMed Central

    Nishiyama, Yusuke; Endo, Yuki; Nemoto, Takahiro; Yamauchi, Kazuo; Asakura, Tetsuo; Takeda, Mitsuhiro; Terauchi, Tsutomu; Kainosho, Masatsune; Ishii, Yoshitaka

    2015-01-01

    We present a general approach in 1H-detected 13C solid-state NMR (SSNMR) for side-chain signal assignments of 10-50 nmol quantities of proteins using a combination of a high magnetic field, ultra-fast magic-angle spinning (MAS) at ~80 kHz, and stereo-array-isotope-labeled (SAIL) proteins [Kainosho M. et al., Nature 440, 52–57, 2006]. First, we demonstrate that 1H indirect detection improves the sensitivity and resolution of 13C SSNMR of SAIL proteins for side-chain assignments in the ultra-fast MAS condition. 1H-detected SSNMR was performed for micro-crystalline ubiquitin (~55 nmol or ~0.5mg) that was SAIL-labeled at seven isoleucine (Ile) residues. Sensitivity was dramatically improved by 1H-detected 2D 1H/13C SSNMR by factors of 5.4-9.7 and 2.1-5.0, respectively, over 13C-detected 2D 1H/13C SSNMR and 1D 13C CPMAS, demonstrating that 2D 1H-detected SSNMR offers not only additional resolution but also sensitivity advantage over 1D 13C detection for the first time. High 1H resolution for the SAIL-labeled side-chain residues offered reasonable resolution even in the 2D data. A 1H-detected 3D 13C/13C/1H experiment on SAIL-ubiquitin provided nearly complete 1H and 13C assignments for seven Ile residues only within ~2.5 h. The results demonstrate the feasibility of side-chain signal assignment in this approach for as little as 10 nmol of a protein sample within ~3 days. The approach is likely applicable to a variety of proteins of biological interest without any requirements of highly efficient protein expression systems. PMID:25856081

  11. Nano-mole scale side-chain signal assignment by 1H-detected protein solid-state NMR by ultra-fast magic-angle spinning and stereo-array isotope labeling.

    PubMed

    Wang, Songlin; Parthasarathy, Sudhakar; Nishiyama, Yusuke; Endo, Yuki; Nemoto, Takahiro; Yamauchi, Kazuo; Asakura, Tetsuo; Takeda, Mitsuhiro; Terauchi, Tsutomu; Kainosho, Masatsune; Ishii, Yoshitaka

    2015-01-01

    We present a general approach in 1H-detected 13C solid-state NMR (SSNMR) for side-chain signal assignments of 10-50 nmol quantities of proteins using a combination of a high magnetic field, ultra-fast magic-angle spinning (MAS) at ~80 kHz, and stereo-array-isotope-labeled (SAIL) proteins [Kainosho M. et al., Nature 440, 52-57, 2006]. First, we demonstrate that 1H indirect detection improves the sensitivity and resolution of 13C SSNMR of SAIL proteins for side-chain assignments in the ultra-fast MAS condition. 1H-detected SSNMR was performed for micro-crystalline ubiquitin (~55 nmol or ~0.5mg) that was SAIL-labeled at seven isoleucine (Ile) residues. Sensitivity was dramatically improved by 1H-detected 2D 1H/13C SSNMR by factors of 5.4-9.7 and 2.1-5.0, respectively, over 13C-detected 2D 1H/13C SSNMR and 1D 13C CPMAS, demonstrating that 2D 1H-detected SSNMR offers not only additional resolution but also sensitivity advantage over 1D 13C detection for the first time. High 1H resolution for the SAIL-labeled side-chain residues offered reasonable resolution even in the 2D data. A 1H-detected 3D 13C/13C/1H experiment on SAIL-ubiquitin provided nearly complete 1H and 13C assignments for seven Ile residues only within ~2.5 h. The results demonstrate the feasibility of side-chain signal assignment in this approach for as little as 10 nmol of a protein sample within ~3 days. The approach is likely applicable to a variety of proteins of biological interest without any requirements of highly efficient protein expression systems. PMID:25856081

  12. Assignments That Work.

    ERIC Educational Resources Information Center

    Hashimoto, I.

    1986-01-01

    Suggests, on a humorous note, a game-plan for assignment justification and elaboration that utilizes, in a constructive and professional manner, the best of what is known about assignment-making. (EL)

  13. ¹H, ¹³C and ¹⁵N chemical shift assignments of Na-FAR-1, a helix-rich fatty acid and retinol binding protein of the parasitic nematode Necator americanus.

    PubMed

    Rey-Burusco, M Florencia; Ibañez-Shimabukuro, Marina; Cooper, Alan; Kennedy, Malcolm W; Córsico, Betina; Smith, Brian O

    2014-04-01

    The fatty acid and retinol-binding (FAR) proteins are a family of unusual helix-rich lipid binding proteins found exclusively in nematodes, and are secreted by a range of parasites of humans, animals and plants. Na-FAR-1 is from the parasitic nematode Necator americanus, an intestinal blood-feeding parasite of humans. Sequence-specific (1)H, (13)C and (15)N resonance assignments have been obtained for the recombinant 170 amino acid protein, using three-dimensional triple-resonance heteronuclear magnetic resonance experiments. Backbone assignments have been obtained for 99.3% of the non-proline HN/N pairs (146 out of 147). The amide resonance of T45 was not observed, probably due to rapid exchange with solvent water. A total of 96.9% of backbone resonances were identified, while 97.7% assignment of amino acid sidechain protons is complete. All Hα(166), Hβ(250) and Hγ(160) and 98.4% of the Hδ (126 out of 128) atoms were assigned. In addition, 99.4% Cα (154 out of 155) and 99.3% Cβ (143 out of 144) resonances have been assigned. No resonances were observed for the NH(n) groups of R93 NεHε, arginine, N(η1)H2, N(η2)H2, histidine N(δ1)H(δ1), N(ε1)H(ε1) and lysine N(ζ3)H3. Na-FAR-1 has a similar overall arrangement of α-helices to Ce-FAR-7 of the free-living Caeorhabditis elegans, but with an extra C-terminal helix. PMID:23179061

  14. Large-scale determination of previously unsolved protein structures using evolutionary information.

    PubMed

    Ovchinnikov, Sergey; Kinch, Lisa; Park, Hahnbeom; Liao, Yuxing; Pei, Jimin; Kim, David E; Kamisetty, Hetunandan; Grishin, Nick V; Baker, David

    2015-01-01

    The prediction of the structures of proteins without detectable sequence similarity to any protein of known structure remains an outstanding scientific challenge. Here we report significant progress in this area. We first describe de novo blind structure predictions of unprecendented accuracy we made for two proteins in large families in the recent CASP11 blind test of protein structure prediction methods by incorporating residue-residue co-evolution information in the Rosetta structure prediction program. We then describe the use of this method to generate structure models for 58 of the 121 large protein families in prokaryotes for which three-dimensional structures are not available. These models, which are posted online for public access, provide structural information for the over 400,000 proteins belonging to the 58 families and suggest hypotheses about mechanism for the subset for which the function is known, and hypotheses about function for the remainder. PMID:26335199

  15. NPHP4 controls ciliary trafficking of membrane proteins and large soluble proteins at the transition zone

    PubMed Central

    Awata, Junya; Takada, Saeko; Standley, Clive; Lechtreck, Karl F.; Bellvé, Karl D.; Pazour, Gregory J.; Fogarty, Kevin E.; Witman, George B.

    2014-01-01

    ABSTRACT The protein nephrocystin-4 (NPHP4) is widespread in ciliated organisms, and defects in NPHP4 cause nephronophthisis and blindness in humans. To learn more about the function of NPHP4, we have studied it in Chlamydomonas reinhardtii. NPHP4 is stably incorporated into the distal part of the flagellar transition zone, close to the membrane and distal to CEP290, another transition zone protein. Therefore, these two proteins, which are incorporated into the transition zone independently of each other, define different domains of the transition zone. An nphp4-null mutant forms flagella with nearly normal length, ultrastructure and intraflagellar transport. When fractions from isolated wild-type and nphp4 flagella were compared, few differences were observed between the axonemes, but the amounts of certain membrane proteins were greatly reduced in the mutant flagella, and cellular housekeeping proteins >50 kDa were no longer excluded from mutant flagella. Therefore, NPHP4 functions at the transition zone as an essential part of a barrier that regulates both membrane and soluble protein composition of flagella. The phenotypic consequences of NPHP4 mutations in humans likely follow from protein mislocalization due to defects in the transition zone barrier. PMID:25150219

  16. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.

    PubMed

    Savitski, Mikhail M; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus

    2015-09-01

    Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in

  17. "C.R.E.A.T.E."-ing Unique Primary-Source Research Paper Assignments for a Pleasure and Pain Course Teaching Neuroscientific Principles in a Large General Education Undergraduate Course.

    PubMed

    Bodnar, Richard J; Rotella, Francis M; Loiacono, Ilyssa; Coke, Tricia; Olsson, Kerstin; Barrientos, Alicia; Blachorsky, Lauren; Warshaw, Deena; Buras, Agata; Sanchez, Ciara M; Azad, Raihana; Stellar, James R

    2016-01-01

    A large (250 registrants) General Education lecture course, Pleasure and Pain, presented basic neuroscience principles as they related to animal and human models of pleasure and pain by weaving basic findings related to food and drug addiction and analgesic states with human studies examining empathy, social neuroscience and neuroeconomics. In its first four years, the course grade was based on weighted scores from two multiple-choice exams and a five-page review of three unique peer-reviewed research articles. Although well-registered and well-received, 18% of the students received Incomplete grades, primarily due to failing to submit the paper that went largely unresolved and eventually resulted in a failing grade. To rectify this issue, a modified version of the C.R.E.A.T.E. (Consider, Read, Elucidate hypotheses, Analyze and interpret data, Think of the next Experiment) method replaced the paper with eight structured assignments focusing on an initial general-topic article, the introduction-methods, and results-discussion of each of three related peer-review neuroscience-related articles, and a final summary. Compliance in completing these assignments was very high, resulting in only 11 INC grades out of 228 students. Thus, use of the C.R.E.A.T.E. method reduced the percentage of problematic INC grades from 18% to 4.8%, a 73% decline, without changing the overall grade distribution. Other analyses suggested the students achieved a deeper understanding of the scientific process using the C.R.E.A.T.E. method relative to the original term paper assignment. PMID:27385918

  18. “C.R.E.A.T.E.”-ing Unique Primary-Source Research Paper Assignments for a Pleasure and Pain Course Teaching Neuroscientific Principles in a Large General Education Undergraduate Course

    PubMed Central

    Bodnar, Richard J.; Rotella, Francis M.; Loiacono, Ilyssa; Coke, Tricia; Olsson, Kerstin; Barrientos, Alicia; Blachorsky, Lauren; Warshaw, Deena; Buras, Agata; Sanchez, Ciara M.; Azad, Raihana; Stellar, James R.

    2016-01-01

    A large (250 registrants) General Education lecture course, Pleasure and Pain, presented basic neuroscience principles as they related to animal and human models of pleasure and pain by weaving basic findings related to food and drug addiction and analgesic states with human studies examining empathy, social neuroscience and neuroeconomics. In its first four years, the course grade was based on weighted scores from two multiple-choice exams and a five-page review of three unique peer-reviewed research articles. Although well-registered and well-received, 18% of the students received Incomplete grades, primarily due to failing to submit the paper that went largely unresolved and eventually resulted in a failing grade. To rectify this issue, a modified version of the C.R.E.A.T.E. (Consider, Read, Elucidate hypotheses, Analyze and interpret data, Think of the next Experiment) method replaced the paper with eight structured assignments focusing on an initial general-topic article, the introduction-methods, and results-discussion of each of three related peer-review neuroscience-related articles, and a final summary. Compliance in completing these assignments was very high, resulting in only 11 INC grades out of 228 students. Thus, use of the C.R.E.A.T.E. method reduced the percentage of problematic INC grades from 18% to 4.8%, a 73% decline, without changing the overall grade distribution. Other analyses suggested the students achieved a deeper understanding of the scientific process using the C.R.E.A.T.E. method relative to the original term paper assignment. PMID:27385918

  19. Large-scale de novo prediction of physical protein-protein association.

    PubMed

    Elefsinioti, Antigoni; Saraç, Ömer Sinan; Hegele, Anna; Plake, Conrad; Hubner, Nina C; Poser, Ina; Sarov, Mihail; Hyman, Anthony; Mann, Matthias; Schroeder, Michael; Stelzl, Ulrich; Beyer, Andreas

    2011-11-01

    Information about the physical association of proteins is extensively used for studying cellular processes and disease mechanisms. However, complete experimental mapping of the human interactome will remain prohibitively difficult in the near future. Here we present a map of predicted human protein interactions that distinguishes functional association from physical binding. Our network classifies more than 5 million protein pairs predicting 94,009 new interactions with high confidence. We experimentally tested a subset of these predictions using yeast two-hybrid analysis and affinity purification followed by quantitative mass spectrometry. Thus we identified 462 new protein-protein interactions and confirmed the predictive power of the network. These independent experiments address potential issues of circular reasoning and are a distinctive feature of this work. Analysis of the physical interactome unravels subnetworks mediating between different functional and physical subunits of the cell. Finally, we demonstrate the utility of the network for the analysis of molecular mechanisms of complex diseases by applying it to genome-wide association studies of neurodegenerative diseases. This analysis provides new evidence implying TOMM40 as a factor involved in Alzheimer's disease. The network provides a high-quality resource for the analysis of genomic data sets and genetic association studies in particular. Our interactome is available via the hPRINT web server at: www.print-db.org. PMID:21836163

  20. Large-scale De Novo Prediction of Physical Protein-Protein Association*

    PubMed Central

    Elefsinioti, Antigoni; Saraç, Ömer Sinan; Hegele, Anna; Plake, Conrad; Hubner, Nina C.; Poser, Ina; Sarov, Mihail; Hyman, Anthony; Mann, Matthias; Schroeder, Michael; Stelzl, Ulrich; Beyer, Andreas

    2011-01-01

    Information about the physical association of proteins is extensively used for studying cellular processes and disease mechanisms. However, complete experimental mapping of the human interactome will remain prohibitively difficult in the near future. Here we present a map of predicted human protein interactions that distinguishes functional association from physical binding. Our network classifies more than 5 million protein pairs predicting 94,009 new interactions with high confidence. We experimentally tested a subset of these predictions using yeast two-hybrid analysis and affinity purification followed by quantitative mass spectrometry. Thus we identified 462 new protein-protein interactions and confirmed the predictive power of the network. These independent experiments address potential issues of circular reasoning and are a distinctive feature of this work. Analysis of the physical interactome unravels subnetworks mediating between different functional and physical subunits of the cell. Finally, we demonstrate the utility of the network for the analysis of molecular mechanisms of complex diseases by applying it to genome-wide association studies of neurodegenerative diseases. This analysis provides new evidence implying TOMM40 as a factor involved in Alzheimer's disease. The network provides a high-quality resource for the analysis of genomic data sets and genetic association studies in particular. Our interactome is available via the hPRINT web server at: www.print-db.org. PMID:21836163

  1. Large-scale screening for novel low-affinity extracellular protein interactions

    PubMed Central

    Bushell, K. Mark; Söllner, Christian; Schuster-Boeckler, Benjamin; Bateman, Alex; Wright, Gavin J.

    2008-01-01

    Extracellular protein–protein interactions are essential for both intercellular communication and cohesion within multicellular organisms. Approximately a fifth of human genes encode membrane-tethered or secreted proteins, but they are largely absent from recent large-scale protein interaction datasets, making current interaction networks biased and incomplete. This discrepancy is due to the unsuitability of popular high-throughput methods to detect extracellular interactions because of the biochemical intractability of membrane proteins and their interactions. For example, cell surface proteins contain insoluble hydrophobic transmembrane regions, and their extracellular interactions are often highly transient, having half-lives of less than a second. To detect transient extracellular interactions on a large scale, we developed AVEXIS (avidity-based extracellular interaction screen), a high-throughput assay that overcomes these technical issues and can detect very transient interactions (half-lives ≤ 0.1 sec) with a low false-positive rate. We used it to systematically screen for receptor–ligand pairs within the zebrafish immunoglobulin superfamily and identified novel ligands for both well-known and orphan receptors. Genes encoding receptor–ligand pairs were often clustered phylogenetically and expressed in the same or adjacent tissues, immediately implying their involvement in similar biological processes. Using AVEXIS, we have determined the first systematic low–affinity extracellular protein interaction network, supported by independent biological data. This technique will now allow large-scale extracellular protein interaction mapping in a broad range of experimental contexts. PMID:18296487

  2. Ion exchange using poorly activated supports, an easy way for purification of large proteins.

    PubMed

    Pessela, Benevides C C; Munilla, Roberto; Betancor, Lorena; Fuentes, Manuel; Carrascosa, Alfonso V; Vian, Alejandro; Fernandez-Lafuente, Roberto; Guisán, Jose M

    2004-04-23

    Ion-exchange chromatography using commercial ionic supports is a commonly used technique for protein purification. However, selective adsorption of a target protein from a given extract onto commercial ion exchangers seems to be quite complex since they are designed to adsorb the maximum percentage of proteins with the opposite charge. In this paper, ion-exchanger supports with different activation degrees (from 1 to 40 micromol of amino groups per g of agarose) have been prepared and used for the purification of large proteins. These kinds of proteins have large surfaces to interact by many points with the support. Therefore, it was possible to purify large proteins as beta-galactosidase from Thermus sp. strain T2 from a crude extract from Escherichia coli or bovine liver catalase from a commercial preparation, with tailor-made ion-exchanger supports. A simple step of adsorption/desorption on lowly activated supports rendered both enzymes rather pure as confirmed by sodium dodecyl sulphate-polyacrylamide gel electrophoresis. Moreover, this strategy makes also easy the desorption step that requires rather low NaCl concentrations, which may become a serious problem for desorption of large proteins when using conventional supports, due to their ability of generating a very strong adsorption. PMID:15116925

  3. Human ESP1/CRP2, a member of the LIM domain protein family: Characterization of the cDNA and assignment of the gene locus to chromosome 14q32.3

    SciTech Connect

    Karim, Mohammad Azharul; Ohta, Kohji; Matsuda, Ichiro

    1996-01-15

    The LIM domain is present in a wide variety of proteins with diverse functions and exhibits characteristic arrangements of Cys and His residues with a novel zinc-binding motif. LIM domain proteins have been implicated in development, cell regulation, and cell structure. A LIM domain protein was identified by screening a human cDNA library with rat cysteine-rich intestinal protein (CRIP) as a probe, under conditions of low stringency. Comparison of the predicted amino acid sequence with several LIM domain proteins revealed 93% of the residues to be identical to rat LIM domain protein, termed ESP1 or CRP2. Thus, the protein is hereafter referred to as human ESP1/CRP2. The cDNA encompasses a 1171-base region, including 26, 624, and 521 bases in the 5{prime}-noncoding region, coding region, and 3{prime}-noncoding regions, respectively, and encodes the entire ESP1/CRP2 protein has two LIM domains, and each shares 35.1% and 77 or 79% identical residues with human cysteine-rich protein (CRP) and rat CRIP, respectively. Northern blot analysis of ESP1/CRP2 in various human tissues showed distinct tissue distributions compared with CRP and CRIP, suggesting that each might serve related but specific roles in tissue organization or function. Using a panel of human-rodent somatic cell hybrids, the ESP1/CRP2 locus was assigned to chromosome 14. Fluorescence in situ hybridization, using cDNA and a genome DNA fragment of the ESP1/CRP2 as probes, confirms this assignment and relegates regional localization to band 14q32.3 47 refs., 7 figs.

  4. Large-scale identification of encystment-related proteins and genes in Pseudourostyla cristata

    PubMed Central

    Gao, Xiuxia; Chen, Fenfen; Niu, Tao; Qu, Ruidan; Chen, Jiwu

    2015-01-01

    The transformation of a ciliate into cyst is an advance strategy against an adverse situation. However, the molecular mechanism for the encystation of free-living ciliates is poorly understood. A large-scale identification of the encystment-related proteins and genes in ciliate would provide us with deeper insights into the molecular mechanisms for the encystations of ciliate. We identified the encystment-related proteins and genes in Pseudourostyla cristata with shotgun LC-MS/MS and scale qRT-PCR, respectively, in this report. A total of 668 proteins were detected in the resting cysts, 102 of these proteins were high credible proteins, whereas 88 high credible proteins of the 724 total proteins were found in the vegetative cells. Compared with the vegetative cell, 6 specific proteins were found in the resting cyst. However, the majority of high credible proteins in the resting cyst and the vegetative cell were co-expressed. We compared 47 genes of the co-expressed proteins with known functions in both the cyst and the vegetative cell using scale qRT-PCR. Twenty-seven of 47 genes were differentially expressed in the cyst compared with the vegetative cell. In our identifications, many uncharacterized proteins were also found. These results will help reveal the molecular mechanism for the formation of cyst in ciliates. PMID:26079518

  5. HUGE: a database for human large proteins identified in the Kazusa cDNA sequencing project.

    PubMed

    Kikuno, R; Nagase, T; Suyama, M; Waki, M; Hirosawa, M; Ohara, O

    2000-01-01

    HUGE is a database for human large proteins newly identified in the Kazusa cDNA project, the aim of which is to predict the primary structure of proteins from the sequences of human large cDNAs (>4 kb). In particular, cDNA clones capable of coding for large proteins (>50 kDa) are the current targets of the project. HUGE contains >1100 cDNA sequences and detailed information obtained through analysis of the sequences of cDNAs and the predicted proteins. Besides an increase in the number of cDNA entries, the amount of experimental data for expression profiling has been largely increased and data on chromosomal locations have been newly added. All of the protein-coding regions were examined by GeneMark analysis, and the results of a motif/domain search of each predicted protein sequence against the Pfam database have been newly added. HUGE is available through the WWW at http://www.kazusa.or.jp/huge PMID:10592264

  6. A Large Gene Cluster Encoding Several Magnetosome Proteins Is Conserved in Different Species of Magnetotactic Bacteria

    PubMed Central

    Grünberg, Karen; Wawer, Cathrin; Tebo, Bradley M.; Schüler, Dirk

    2001-01-01

    In magnetotactic bacteria, a number of specific proteins are associated with the magnetosome membrane (MM) and may have a crucial role in magnetite biomineralization. We have cloned and sequenced the genes of several of these polypeptides in the magnetotactic bacterium Magnetospirillum gryphiswaldense that could be assigned to two different genomic regions. Except for mamA, none of these genes have been previously reported to be related to magnetosome formation. Homologous genes were found in the genome sequences of M. magnetotacticum and magnetic coccus strain MC-1. The MM proteins identified display homology to tetratricopeptide repeat proteins (MamA), cation diffusion facilitators (MamB), and HtrA-like serine proteases (MamE) or bear no similarity to known proteins (MamC and MamD). A major gene cluster containing several magnetosome genes (including mamA and mamB) was found to be conserved in all three of the strains investigated. The mamAB cluster also contains additional genes that have no known homologs in any nonmagnetic organism, suggesting a specific role in magnetosome formation. PMID:11571158

  7. Small-scale batch crystallization of proteins revisited: an underutilized way to grow large protein crystals.

    PubMed

    Rayment, Ivan

    2002-02-01

    Growth of high-quality crystals is a major obstacle in many structural investigations. In recent years, the techniques for screening crystals have improved dramatically, whereas the methods for obtaining large crystals have progressed more slowly. This is an important issue since, although many structures can be solved from small crystals with synchrotron radiation, it is far easier to solve and refine structures when strong data is recorded from large crystals. In an effort to improve the size of crystals, a strategy for a small-scale batch method has been developed that in many cases yields far larger crystals than attainable by vapor diffusion. PMID:11839300

  8. BCSearch: fast structural fragment mining over large collections of protein structures

    PubMed Central

    Guyon, Frédéric; Martz, François; Vavrusa, Marek; Bécot, Jérôme; Rey, Julien; Tufféry, Pierre

    2015-01-01

    Resources to mine the large amount of protein structures available today are necessary to better understand how amino acid variations are compatible with conformation preservation, to assist protein design, engineering and, further, the development of biologic therapeutic compounds. BCSearch is a versatile service to efficiently mine large collections of protein structures. It relies on a new approach based on a Binet–Cauchy kernel that is more discriminative than the widely used root mean square deviation criterion. It has statistics independent of size even for short fragments, and is fast. The systematic mining of large collections of structures such as the complete SCOPe protein structural classification or comprehensive subsets of the Protein Data Bank can be performed in few minutes. Based on this new score, we propose four innovative applications: BCFragSearch and BCMirrorSearch, respectively, search for fragments similar and anti-similar to a query and return information on the diversity of the sequences of the hits. BCLoopSearch identifies candidate fragments of fixed size matching the flanks of a gaped structure. BCSpecificitySearch analyzes a complete protein structure and returns information about sites having few similar fragments. BCSearch is available at http://bioserv.rpbs.univ-paris-diderot.fr/services/BCSearch. PMID:25977292

  9. BCSearch: fast structural fragment mining over large collections of protein structures.

    PubMed

    Guyon, Frédéric; Martz, François; Vavrusa, Marek; Bécot, Jérôme; Rey, Julien; Tufféry, Pierre

    2015-07-01

    Resources to mine the large amount of protein structures available today are necessary to better understand how amino acid variations are compatible with conformation preservation, to assist protein design, engineering and, further, the development of biologic therapeutic compounds. BCSearch is a versatile service to efficiently mine large collections of protein structures. It relies on a new approach based on a Binet-Cauchy kernel that is more discriminative than the widely used root mean square deviation criterion. It has statistics independent of size even for short fragments, and is fast. The systematic mining of large collections of structures such as the complete SCOPe protein structural classification or comprehensive subsets of the Protein Data Bank can be performed in few minutes. Based on this new score, we propose four innovative applications: BCFragSearch and BCMirrorSearch, respectively, search for fragments similar and anti-similar to a query and return information on the diversity of the sequences of the hits. BCLoopSearch identifies candidate fragments of fixed size matching the flanks of a gaped structure. BCSpecificitySearch analyzes a complete protein structure and returns information about sites having few similar fragments. BCSearch is available at http://bioserv.rpbs.univ-paris-diderot.fr/services/BCSearch. PMID:25977292

  10. Analyses of the Sequence and Structural Properties Corresponding to Pentapeptide and Large Palindromes in Proteins.

    PubMed

    Sridhar, Settu; Nagamruta, Mallapragada; Guruprasad, Kunchur

    2015-01-01

    The analyses of 3967 representative proteins selected from the Protein Data Bank revealed the presence of 2803 pentapeptide and large palindrome sequences with known secondary structure conformation. These represent 2014 unique palindrome sequences. 60% palindromes are not associated with any regular secondary structure and 28% are in helix conformation, 11% in strand conformation and 1% in the coil conformation. The average solvent accessibility values are in the range between 0-155.28 Å2 suggesting that the palindromes in proteins can be either buried, exposed to the solvent or share an intermittent property. The number of residue neighborhood contacts defined by interactions ≤ 3.2 Ǻ is in the range between 0-29 residues. Palindromes of the same length in helix, strand and coil conformation are associated with different amino acid residue preferences at the individual positions. Nearly, 20% palindromes interact with catalytic/active site residues, ligand or metal ions in proteins and may therefore be important for function in the corresponding protein. The average hydrophobicity values for the pentapeptide and large palindromes range between -4.3 to +4.32 and the number of palindromes is almost equally distributed between the negative and positive hydrophobicity values. The palindromes represent 107 different protein families and the hydrolases, transferases, oxidoreductases and lyases contain relatively large number of palindromes. PMID:26465610

  11. Analyses of the Sequence and Structural Properties Corresponding to Pentapeptide and Large Palindromes in Proteins

    PubMed Central

    Sridhar, Settu; Nagamruta, Mallapragada; Guruprasad, Kunchur

    2015-01-01

    The analyses of 3967 representative proteins selected from the Protein Data Bank revealed the presence of 2803 pentapeptide and large palindrome sequences with known secondary structure conformation. These represent 2014 unique palindrome sequences. 60% palindromes are not associated with any regular secondary structure and 28% are in helix conformation, 11% in strand conformation and 1% in the coil conformation. The average solvent accessibility values are in the range between 0–155.28 Å2 suggesting that the palindromes in proteins can be either buried, exposed to the solvent or share an intermittent property. The number of residue neighborhood contacts defined by interactions ≤ 3.2 Ǻ is in the range between 0–29 residues. Palindromes of the same length in helix, strand and coil conformation are associated with different amino acid residue preferences at the individual positions. Nearly, 20% palindromes interact with catalytic/active site residues, ligand or metal ions in proteins and may therefore be important for function in the corresponding protein. The average hydrophobicity values for the pentapeptide and large palindromes range between -4.3 to +4.32 and the number of palindromes is almost equally distributed between the negative and positive hydrophobicity values. The palindromes represent 107 different protein families and the hydrolases, transferases, oxidoreductases and lyases contain relatively large number of palindromes. PMID:26465610

  12. Strategy for large scale solubilization of coal - characterization of Neurospora protein and gene

    SciTech Connect

    Patel, A.; Chen, Y.P.; Mishra, N.C.

    1995-12-31

    Low grade coal placed on mycelial mat of Neurospora crassa growing on Petri plate was found to be solubilized by this fungus. A heat stable protein has been purified to near homogeneity which can solubilize low grade coal in in vitro. The biochemical properties of the Neurospora protein will be presented. The nature of the product obtained after solubilization of coal by Neurospora protein in vivo and in vitro will also be presented. The N-terminus sequence of the amino acids of this protein will be used to design primer for possible cloning of gene for Neurospora protein capable of solubilization of coal in order to develop methodology for coal solubilization on a large scale.

  13. Interactive Effects of Indigestible Carbohydrates, Protein Type, and Protein Level on Biomarkers of Large Intestine Health in Rats

    PubMed Central

    Taciak, Marcin; Barszcz, Marcin; Tuśnio, Anna; Pastuszewska, Barbara

    2015-01-01

    The effects of indigestible carbohydrates, protein type, and protein level on large intestine health were examined in rats. For 21 days, 12 groups of six 12-week-old male Wistar rats were fed diets with casein (CAS), or potato protein concentrate (PPC), providing 14% (lower protein level; LP), or 20% (higher protein level; HP) protein, and containing cellulose, resistant potato starch, or pectin. Fermentation end-products, pH, and β-glucuronidase levels in cecal digesta, and ammonia levels in colonic digesta were determined. Cecal digesta, tissue weights, cecal and colon morphology, and colonocyte DNA damage were also analyzed. Digesta pH was lower, whereas relative mass of cecal tissue and digesta were higher in rats fed pectin diets than in those fed cellulose. Cecal parameters were greater in rats fed PPC and HP diets than in those fed CAS and LP diets, respectively. Short-chain fatty acid (SCFA) concentrations were unaffected by protein or carbohydrate type. Total SCFA, acetic acid, and propionic acid concentrations were greater in rats fed LP diets than in those fed HP. Cecal pool of isobutyric and isovaleric acids was greater in rats fed PPC than in those fed CAS diets. PPC diets decreased phenol concentration and increased ammonia concentration in cecal and colonic digesta, respectively. Cecal crypt depth was greater in rats fed PPC and HP diets, and was unaffected by carbohydrates; whereas colonic crypt depth was greater in rats fed cellulose. Myenteron thickness in the cecum was unaffected by nutrition, but was greater in the colon of rats fed cellulose. Colonocyte DNA damage was greater in rats fed LP diets than in those fed HP diets, and was unaffected by carbohydrate or protein type. It was found that nutritional factors decreasing cecal digesta weight contribute to greater phenol production, increased DNA damage, and reduced ammonia concentration in the colon. PMID:26536028

  14. Interactive Effects of Indigestible Carbohydrates, Protein Type, and Protein Level on Biomarkers of Large Intestine Health in Rats.

    PubMed

    Taciak, Marcin; Barszcz, Marcin; Tuśnio, Anna; Pastuszewska, Barbara

    2015-01-01

    The effects of indigestible carbohydrates, protein type, and protein level on large intestine health were examined in rats. For 21 days, 12 groups of six 12-week-old male Wistar rats were fed diets with casein (CAS), or potato protein concentrate (PPC), providing 14% (lower protein level; LP), or 20% (higher protein level; HP) protein, and containing cellulose, resistant potato starch, or pectin. Fermentation end-products, pH, and β-glucuronidase levels in cecal digesta, and ammonia levels in colonic digesta were determined. Cecal digesta, tissue weights, cecal and colon morphology, and colonocyte DNA damage were also analyzed. Digesta pH was lower, whereas relative mass of cecal tissue and digesta were higher in rats fed pectin diets than in those fed cellulose. Cecal parameters were greater in rats fed PPC and HP diets than in those fed CAS and LP diets, respectively. Short-chain fatty acid (SCFA) concentrations were unaffected by protein or carbohydrate type. Total SCFA, acetic acid, and propionic acid concentrations were greater in rats fed LP diets than in those fed HP. Cecal pool of isobutyric and isovaleric acids was greater in rats fed PPC than in those fed CAS diets. PPC diets decreased phenol concentration and increased ammonia concentration in cecal and colonic digesta, respectively. Cecal crypt depth was greater in rats fed PPC and HP diets, and was unaffected by carbohydrates; whereas colonic crypt depth was greater in rats fed cellulose. Myenteron thickness in the cecum was unaffected by nutrition, but was greater in the colon of rats fed cellulose. Colonocyte DNA damage was greater in rats fed LP diets than in those fed HP diets, and was unaffected by carbohydrate or protein type. It was found that nutritional factors decreasing cecal digesta weight contribute to greater phenol production, increased DNA damage, and reduced ammonia concentration in the colon. PMID:26536028

  15. Comprehensive characterization of protein 4.1 expression in epithelium of large intestine.

    PubMed

    Zhang, Jingxin; Yang, Shaomin; An, Chao; Wang, Jie; Yan, Hongxia; Huang, Yumin; Song, Jinlei; Yin, Changcheng; Baines, Anthony J; Mohandas, Narla; An, Xiuli

    2014-11-01

    The protein 4.1 family consists of four members, 4.1R, 4.1N, 4.1B and 4.1G, each encoded by a distinct gene. All 4.1 mRNAs undergo extensive alternative splicing. Functionally, they usually serve as adapters that link actin-based cytoskeleton to plasma membrane proteins. It has been reported that 4.1 proteins are expressed in most animal cell types and tissues including epithelial cells and epithelial tissues. However, the expression of 4.1 proteins in large intestine has not been well characterized. In the present study, we performed RT-PCR, western blot and immunohistochemistry analysis to characterize the transcripts, the protein expression and cellular localization of 4.1 proteins in the epithelia of mouse large intestine. We show that multiple transcripts derive from each gene, including eight 4.1R isoforms, four 4.1N isoforms, four 4.1B isoforms and six 4.1G isoforms. However, at the protein level, only one or two major bands were detected, implying that not all transcripts are translated and/or the proteins do not accumulate at detectable levels. Immunohistochemistry revealed that 4.1R, 4.1N and 4.1B are all expressed at the lateral membrane as well as cytoplasm of epithelial cells, suggesting a potentially redundant role of these proteins. Our findings not only provide new insights into the structure of protein 4.1 genes but also lay the foundation for future functional studies. PMID:24912669

  16. Genome-scale phylogenetic function annotation of large and diverse protein families

    PubMed Central

    Engelhardt, Barbara E.; Jordan, Michael I.; Srouji, John R.; Brenner, Steven E.

    2011-01-01

    The Statistical Inference of Function Through Evolutionary Relationships (SIFTER) framework uses a statistical graphical model that applies phylogenetic principles to automate precise protein function prediction. Here we present a revised approach (SIFTER version 2.0) that enables annotations on a genomic scale. SIFTER 2.0 produces equivalently precise predictions compared to the earlier version on a carefully studied family and on a collection of 100 protein families. We have added an approximation method to SIFTER 2.0 and show a 500-fold improvement in speed with minimal impact on prediction results in the functionally diverse sulfotransferase protein family. On the Nudix protein family, previously inaccessible to the SIFTER framework because of the 66 possible molecular functions, SIFTER achieved 47.4% accuracy on experimental data (where BLAST achieved 34.0%). Finally, we used SIFTER to annotate all of the Schizosaccharomyces pombe proteins with experimental functional characterizations, based on annotations from proteins in 46 fungal genomes. SIFTER precisely predicted molecular function for 45.5% of the characterized proteins in this genome, as compared with four current function prediction methods that precisely predicted function for 62.6%, 30.6%, 6.0%, and 5.7% of these proteins. We use both precision-recall curves and ROC analyses to compare these genome-scale predictions across the different methods and to assess performance on different types of applications. SIFTER 2.0 is capable of predicting protein molecular function for large and functionally diverse protein families using an approximate statistical model, enabling phylogenetics-based protein function prediction for genome-wide analyses. The code for SIFTER and protein family data are available at http://sifter.berkeley.edu. PMID:21784873

  17. Correlated motion of protein subdomains and large-scale conformational flexibility of RecA protein filament

    NASA Astrophysics Data System (ADS)

    Yu, Garmay; A, Shvetsov; D, Karelov; D, Lebedev; A, Radulescu; M, Petukhov; V, Isaev-Ivanov

    2012-02-01

    Based on X-ray crystallographic data available at Protein Data Bank, we have built molecular dynamics (MD) models of homologous recombinases RecA from E. coli and D. radiodurans. Functional form of RecA enzyme, which is known to be a long helical filament, was approximated by a trimer, simulated in periodic water box. The MD trajectories were analyzed in terms of large-scale conformational motions that could be detectable by neutron and X-ray scattering techniques. The analysis revealed that large-scale RecA monomer dynamics can be described in terms of relative motions of 7 subdomains. Motion of C-terminal domain was the major contributor to the overall dynamics of protein. Principal component analysis (PCA) of the MD trajectories in the atom coordinate space showed that rotation of C-domain is correlated with the conformational changes in the central domain and N-terminal domain, that forms the monomer-monomer interface. Thus, even though C-terminal domain is relatively far from the interface, its orientation is correlated with large-scale filament conformation. PCA of the trajectories in the main chain dihedral angle coordinate space implicates a co-existence of a several different large-scale conformations of the modeled trimer. In order to clarify the relationship of independent domain orientation with large-scale filament conformation, we have performed analysis of independent domain motion and its implications on the filament geometry.

  18. Assignment of the human FKBP12-rapamycin-associated protein (FRAP) gene to chromosome 1p36 by fluorescence in situ hybridization

    SciTech Connect

    Moore, P.A.; Rosen, C.A.; Carter, K.C.

    1996-04-15

    This report describes the localization of the human FKBP12-rapamycin-associated protein (FRAP) gene to human chromosome 1p36 using fluorescence in situ hybridization. This protein is the binding site for rapamycin and FK506, two potent immunosuppressive drugs. 12 refs., 1 fig.

  19. My Favorite Assignment.

    ERIC Educational Resources Information Center

    Post, Robert E.; Johnson, Jack E.

    1982-01-01

    Presents two assignments that show (1) how George Orwell's "Politics and the English Language" can be applied to business writing and (2) how structured student-teacher conferences can generate enthusiasm for oral expression in a business communication course. (AEA)

  20. Extra-Large G Proteins Expand the Repertoire of Subunits in Arabidopsis Heterotrimeric G Protein Signaling1[OPEN

    PubMed Central

    Chakravorty, David; Gookin, Timothy E.; Milner, Matthew J.; Yu, Yunqing; Assmann, Sarah M.

    2015-01-01

    Heterotrimeric G proteins, consisting of Gα, Gβ, and Gγ subunits, are a conserved signal transduction mechanism in eukaryotes. However, G protein subunit numbers in diploid plant genomes are greatly reduced as compared with animals and do not correlate with the diversity of functions and phenotypes in which heterotrimeric G proteins have been implicated. In addition to GPA1, the sole canonical Arabidopsis (Arabidopsis thaliana) Gα subunit, Arabidopsis has three related proteins: the extra-large GTP-binding proteins XLG1, XLG2, and XLG3. We demonstrate that the XLGs can bind Gβγ dimers (AGB1 plus a Gγ subunit: AGG1, AGG2, or AGG3) with differing specificity in yeast (Saccharomyces cerevisiae) three-hybrid assays. Our in silico structural analysis shows that XLG3 aligns closely to the crystal structure of GPA1, and XLG3 also competes with GPA1 for Gβγ binding in yeast. We observed interaction of the XLGs with all three Gβγ dimers at the plasma membrane in planta by bimolecular fluorescence complementation. Bioinformatic and localization studies identified and confirmed nuclear localization signals in XLG2 and XLG3 and a nuclear export signal in XLG3, which may facilitate intracellular shuttling. We found that tunicamycin, salt, and glucose hypersensitivity and increased stomatal density are agb1-specific phenotypes that are not observed in gpa1 mutants but are recapitulated in xlg mutants. Thus, XLG-Gβγ heterotrimers provide additional signaling modalities for tuning plant G protein responses and increase the repertoire of G protein heterotrimer combinations from three to 12. The potential for signal partitioning and competition between the XLGs and GPA1 is a new paradigm for plant-specific cell signaling. PMID:26157115

  1. Genetics of single-cell protein abundance variation in large yeast populations

    NASA Astrophysics Data System (ADS)

    Albert, Frank W.; Treusch, Sebastian; Shockley, Arthur H.; Bloom, Joshua S.; Kruglyak, Leonid

    2014-02-01

    Variation among individuals arises in part from differences in DNA sequences, but the genetic basis for variation in most traits, including common diseases, remains only partly understood. Many DNA variants influence phenotypes by altering the expression level of one or several genes. The effects of such variants can be detected as expression quantitative trait loci (eQTL). Traditional eQTL mapping requires large-scale genotype and gene expression data for each individual in the study sample, which limits sample sizes to hundreds of individuals in both humans and model organisms and reduces statistical power. Consequently, many eQTL are probably missed, especially those with smaller effects. Furthermore, most studies use messenger RNA rather than protein abundance as the measure of gene expression. Studies that have used mass-spectrometry proteomics reported unexpected differences between eQTL and protein QTL (pQTL) for the same genes, but these studies have been even more limited in scope. Here we introduce a powerful method for identifying genetic loci that influence protein expression in the yeast Saccharomyces cerevisiae. We measure single-cell protein abundance through the use of green fluorescent protein tags in very large populations of genetically variable cells, and use pooled sequencing to compare allele frequencies across the genome in thousands of individuals with high versus low protein abundance. We applied this method to 160 genes and detected many more loci per gene than previous studies. We also observed closer correspondence between loci that influence protein abundance and loci that influence mRNA abundance of a given gene. Most loci that we detected were clustered in `hotspots' that influence multiple proteins, and some hotspots were found to influence more than half of the proteins that we examined. The variants that underlie these hotspots have profound effects on the gene regulatory network and provide insights into genetic variation in cell

  2. Genetics of single-cell protein abundance variation in large yeast populations.

    PubMed

    Albert, Frank W; Treusch, Sebastian; Shockley, Arthur H; Bloom, Joshua S; Kruglyak, Leonid

    2014-02-27

    Variation among individuals arises in part from differences in DNA sequences, but the genetic basis for variation in most traits, including common diseases, remains only partly understood. Many DNA variants influence phenotypes by altering the expression level of one or several genes. The effects of such variants can be detected as expression quantitative trait loci (eQTL). Traditional eQTL mapping requires large-scale genotype and gene expression data for each individual in the study sample, which limits sample sizes to hundreds of individuals in both humans and model organisms and reduces statistical power. Consequently, many eQTL are probably missed, especially those with smaller effects. Furthermore, most studies use messenger RNA rather than protein abundance as the measure of gene expression. Studies that have used mass-spectrometry proteomics reported unexpected differences between eQTL and protein QTL (pQTL) for the same genes, but these studies have been even more limited in scope. Here we introduce a powerful method for identifying genetic loci that influence protein expression in the yeast Saccharomyces cerevisiae. We measure single-cell protein abundance through the use of green fluorescent protein tags in very large populations of genetically variable cells, and use pooled sequencing to compare allele frequencies across the genome in thousands of individuals with high versus low protein abundance. We applied this method to 160 genes and detected many more loci per gene than previous studies. We also observed closer correspondence between loci that influence protein abundance and loci that influence mRNA abundance of a given gene. Most loci that we detected were clustered in 'hotspots' that influence multiple proteins, and some hotspots were found to influence more than half of the proteins that we examined. The variants that underlie these hotspots have profound effects on the gene regulatory network and provide insights into genetic variation in cell

  3. Revealing the global map of protein folding space by large-scale simulations

    NASA Astrophysics Data System (ADS)

    Sinner, Claude; Lutz, Benjamin; Verma, Abhinav; Schug, Alexander

    2015-12-01

    The full characterization of protein folding is a remarkable long-standing challenge both for experiment and simulation. Working towards a complete understanding of this process, one needs to cover the full diversity of existing folds and identify the general principles driving the process. Here, we want to understand and quantify the diversity in folding routes for a large and representative set of protein topologies covering the full range from all alpha helical topologies towards beta barrels guided by the key question: Does the majority of the observed routes contribute to the folding process or only a particular route? We identified a set of two-state folders among non-homologous proteins with a sequence length of 40-120 residues. For each of these proteins, we ran native-structure based simulations both with homogeneous and heterogeneous contact potentials. For each protein, we simulated dozens of folding transitions in continuous uninterrupted simulations and constructed a large database of kinetic parameters. We investigate folding routes by tracking the formation of tertiary structure interfaces and discuss whether a single specific route exists for a topology or if all routes are equiprobable. These results permit us to characterize the complete folding space for small proteins in terms of folding barrier ΔG‡, number of routes, and the route specificity RT.

  4. Structure and evolutionary history of a large family of NLR proteins in the zebrafish.

    PubMed

    Howe, Kerstin; Schiffer, Philipp H; Zielinski, Julia; Wiehe, Thomas; Laird, Gavin K; Marioni, John C; Soylemez, Onuralp; Kondrashov, Fyodor; Leptin, Maria

    2016-04-01

    Multicellular eukaryotes have evolved a range of mechanisms for immune recognition. A widespread family involved in innate immunity are the NACHT-domain and leucine-rich-repeat-containing (NLR) proteins. Mammals have small numbers of NLR proteins, whereas in some species, mostly those without adaptive immune systems, NLRs have expanded into very large families. We describe a family of nearly 400 NLR proteins encoded in the zebrafish genome. The proteins share a defining overall structure, which arose in fishes after a fusion of the core NLR domains with a B30.2 domain, but can be subdivided into four groups based on their NACHT domains. Gene conversion acting differentially on the NACHT and B30.2 domains has shaped the family and created the groups. Evidence of positive selection in the B30.2 domain indicates that this domain rather than the leucine-rich repeats acts as the pathogen recognition module. In an unusual chromosomal organization, the majority of the genes are located on one chromosome arm, interspersed with other large multigene families, including a new family encoding zinc-finger proteins. The NLR-B30.2 proteins represent a new family with diversity in the specific recognition module that is present in fishes in spite of the parallel existence of an adaptive immune system. PMID:27248802

  5. Structure and evolutionary history of a large family of NLR proteins in the zebrafish

    PubMed Central

    Zielinski, Julia; Kondrashov, Fyodor

    2016-01-01

    Multicellular eukaryotes have evolved a range of mechanisms for immune recognition. A widespread family involved in innate immunity are the NACHT-domain and leucine-rich-repeat-containing (NLR) proteins. Mammals have small numbers of NLR proteins, whereas in some species, mostly those without adaptive immune systems, NLRs have expanded into very large families. We describe a family of nearly 400 NLR proteins encoded in the zebrafish genome. The proteins share a defining overall structure, which arose in fishes after a fusion of the core NLR domains with a B30.2 domain, but can be subdivided into four groups based on their NACHT domains. Gene conversion acting differentially on the NACHT and B30.2 domains has shaped the family and created the groups. Evidence of positive selection in the B30.2 domain indicates that this domain rather than the leucine-rich repeats acts as the pathogen recognition module. In an unusual chromosomal organization, the majority of the genes are located on one chromosome arm, interspersed with other large multigene families, including a new family encoding zinc-finger proteins. The NLR-B30.2 proteins represent a new family with diversity in the specific recognition module that is present in fishes in spite of the parallel existence of an adaptive immune system. PMID:27248802

  6. Large-scale determination of previously unsolved protein structures using evolutionary information

    PubMed Central

    Ovchinnikov, Sergey; Kinch, Lisa; Park, Hahnbeom; Liao, Yuxing; Pei, Jimin; Kim, David E; Kamisetty, Hetunandan; Grishin, Nick V; Baker, David

    2015-01-01

    The prediction of the structures of proteins without detectable sequence similarity to any protein of known structure remains an outstanding scientific challenge. Here we report significant progress in this area. We first describe de novo blind structure predictions of unprecendented accuracy we made for two proteins in large families in the recent CASP11 blind test of protein structure prediction methods by incorporating residue–residue co-evolution information in the Rosetta structure prediction program. We then describe the use of this method to generate structure models for 58 of the 121 large protein families in prokaryotes for which three-dimensional structures are not available. These models, which are posted online for public access, provide structural information for the over 400,000 proteins belonging to the 58 families and suggest hypotheses about mechanism for the subset for which the function is known, and hypotheses about function for the remainder. DOI: http://dx.doi.org/10.7554/eLife.09248.001 PMID:26335199

  7. Hyperfine-shifted 13C resonance assignments in an iron-sulfur protein with quantum chemical verification: aliphatic C-H···S 3-center-4-electron interactions.

    PubMed

    Westler, William M; Lin, I-Jin; Perczel, András; Weinhold, Frank; Markley, John L

    2011-02-01

    Although the majority of noncovalent interactions associated with hydrogen and heavy atoms in proteins and other biomolecules are classical hydrogen bonds between polar N-H or O-H moieties and O atoms or aromatic π electrons, high-resolution X-ray crystallographic models deposited in the Protein Data Bank show evidence for weaker C-H···O hydrogen bonds, including ones involving sp(3)-hybridized carbon atoms. Little evidence is available in proteins for the (even) weaker C-H···S interactions described in the crystallographic literature on small molecules. Here, we report experimental evidence and theoretical verification for the existence of nine aliphatic (sp(3)-hybridized) C-H···S 3-center-4-electron interactions in the protein Clostridium pasteurianum rubredoxin. Our evidence comes from the analysis of carbon-13 NMR chemical shifts assigned to atoms near the iron at the active site of this protein. We detected anomalous chemical shifts for these carbon-13 nuclei and explained their origin in terms of unpaired spin density from the iron atom being delocalized through interactions of the type: C-H···S-Fe, where S is the sulfur of one of the four cysteine side chains covalently bonded to the iron. These results suggest that polarized sulfur atoms in proteins can engage in multiple weak interactions with surrounding aliphatic groups. We analyze the strength and angular dependence of these interactions and conclude that they may contribute small, but significant, stabilization to the molecule. PMID:21207994

  8. 42 CFR 433.146 - Rights assigned; assignment method.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 42 Public Health 4 2010-10-01 2010-10-01 false Rights assigned; assignment method. 433.146 Section 433.146 Public Health CENTERS FOR MEDICARE & MEDICAID SERVICES, DEPARTMENT OF HEALTH AND HUMAN... Assignment of Rights to Benefits § 433.146 Rights assigned; assignment method. (a) Except as specified...

  9. Understanding the Physical Properties that Control Protein Crystallization by Analysis of Large-Scale Experimental Data

    SciTech Connect

    Price, W.; Chen, Y; Handelman, S; Neely, H; Manor, P; Karlin, R; Nair, R; Montelione, G; Hunt, J; et. al.

    2008-01-01

    Crystallization is the most serious bottleneck in high-throughput protein-structure determination by diffraction methods. We have used data mining of the large-scale experimental results of the Northeast Structural Genomics Consortium and experimental folding studies to characterize the biophysical properties that control protein crystallization. This analysis leads to the conclusion that crystallization propensity depends primarily on the prevalence of well-ordered surface epitopes capable of mediating interprotein interactions and is not strongly influenced by overall thermodynamic stability. We identify specific sequence features that correlate with crystallization propensity and that can be used to estimate the crystallization probability of a given construct. Analyses of entire predicted proteomes demonstrate substantial differences in the amino acid-sequence properties of human versus eubacterial proteins, which likely reflect differences in biophysical properties, including crystallization propensity. Our thermodynamic measurements do not generally support previous claims regarding correlations between sequence properties and protein stability.

  10. Large, dynamic, multi-protein complexes: a challenge for structural biology.

    PubMed

    Różycki, Bartosz; Boura, Evzen

    2014-11-19

    Structural biology elucidates atomic structures of macromolecules such as proteins, DNA, RNA, and their complexes to understand the basic mechanisms of their functions. Among proteins that pose the most difficult problems to current efforts are those which have several large domains connected by long, flexible polypeptide segments. Although abundant and critically important in biological cells, such proteins have proven intractable by conventional techniques. This gap has recently led to the advancement of hybrid methods that use state-of-the-art computational tools to combine complementary data from various high- and low-resolution experiments. In this review, we briefly discuss the individual experimental techniques to illustrate their strengths and limitations, and then focus on the use of hybrid methods in structural biology. We describe how representative structures of dynamic multi-protein complexes are obtained utilizing the EROS hybrid method that we have co-developed. PMID:25335513

  11. Large, dynamic, multi-protein complexes: a challenge for structural biology

    NASA Astrophysics Data System (ADS)

    Różycki, Bartosz; Boura, Evzen

    2014-11-01

    Structural biology elucidates atomic structures of macromolecules such as proteins, DNA, RNA, and their complexes to understand the basic mechanisms of their functions. Among proteins that pose the most difficult problems to current efforts are those which have several large domains connected by long, flexible polypeptide segments. Although abundant and critically important in biological cells, such proteins have proven intractable by conventional techniques. This gap has recently led to the advancement of hybrid methods that use state-of-the-art computational tools to combine complementary data from various high- and low-resolution experiments. In this review, we briefly discuss the individual experimental techniques to illustrate their strengths and limitations, and then focus on the use of hybrid methods in structural biology. We describe how representative structures of dynamic multi-protein complexes are obtained utilizing the EROS hybrid method that we have co-developed.

  12. Protein-bound molecules: a large family with a bad character.

    PubMed

    Sirich, Tammy L; Meyer, Timothy W; Gondouin, Bertrand; Brunet, Philippe; Niwa, Toshimitsu

    2014-03-01

    Many small solutes excreted by the kidney are bound to plasma proteins, chiefly albumin, in the circulation. The combination of protein binding and tubular secretion allows the kidney to reduce the free, unbound concentrations of such solutes to lower levels than could be obtained by tubular secretion alone. Protein-bound solutes accumulate in the plasma when the kidneys fail, and the free, unbound levels of these solutes increase more than their total plasma levels owing to competition for binding sites on plasma proteins. Given the efficiency by which the kidney can clear protein-bound solutes, it is tempting to speculate that some compounds in this class are important uremic toxins. Studies to date have focused largely on two specific protein-bound solutes: indoxyl sulfate and p-cresyl sulfate. The largest body of evidence suggests that both of these compounds contribute to cardiovascular disease, and that indoxyl sulfate contributes to the progression of chronic kidney disease. Other protein-bound solutes have been investigated to a much lesser extent, and could in the future prove to be even more important uremic toxins. PMID:24780467

  13. Unfolding the Role of Large Heat Shock Proteins: New Insights and Therapeutic Implications

    PubMed Central

    Zuo, Daming; Subjeck, John; Wang, Xiang-Yang

    2016-01-01

    Heat shock proteins (HSPs) of eukaryotes are evolutionarily conserved molecules present in all the major intracellular organelles. They mainly function as molecular chaperones and participate in maintenance of protein homeostasis in physiological state and under stressful conditions. Despite their relative abundance, the large HSPs, i.e., Hsp110 and glucose-regulated protein 170 (Grp170), have received less attention compared to other conventional HSPs. These proteins are distantly related to the Hsp70 and belong to Hsp70 superfamily. Increased sizes of Hsp110 and Grp170, due to the presence of a loop structure, result in their exceptional capability in binding to polypeptide substrates or non-protein ligands, such as pathogen-associated molecules. These interactions that occur in the extracellular environment during tissue injury or microbial infection may lead to amplification of an immune response engaging both innate and adaptive immune components. Here, we review the current advances in understanding these large HSPs as molecular chaperones in proteostasis control and immune modulation as well as their therapeutic implications in treatment of cancer and neurodegeneration. Given their unique immunoregulatory activities, we also discuss the emerging evidence of their potential involvement in inflammatory and immune-related diseases. PMID:26973652

  14. Effects of dietary restriction followed by high dietary energy or protein on compensatory growth of Ashanti Black × Large White crossbred weaner pigs.

    PubMed

    Addah, Weseh; Dzewu, Reuben Rudolph Kafui; Alenyorege, Benjamin

    2016-01-01

    The study determined the effect of re-alimenting dietary protein or energy on compensatory growth. Eighteen Ashanti Black × Large White crossbred weaner pigs (7.5 ± 0.30 kg) were randomly assigned to one of three dietary treatments in a completely randomized design resulting in three replicate pens per treatment (n = 3) and two pigs per pen. In the first treatment, pigs were fed ad libitum a diet containing 12.0 MJ/kg of metabolizable energy (ME) and 14.4% crude protein (CP) (maintenance diet) for 56 days. In the second and third dietary treatments, pigs were fed the maintenance diet for the initial 28 days and then switched to a high protein (17.4% dry matter (DM) CP; protein) or high (14.0 MJ/kg DM; energy) diet for the rest of the 28-day period. Dry matter intake and growth performance were similar (P ≥ 0.52) among treatments during the first 28 days of restrictive feeding, but pigs re-alimented with the protein diet achieved superior (P = 0.004) DM intake, average daily gain (ADG), and feed efficiency than those fed the maintenance diet or re-alimented with the energy diet in the re-alimentation period. At the end of the entire 56-day period, pigs re-alimented with the protein diet had higher (P ≥ 0.01) live weight gains and ADG compared with those fed the maintenance diet or re-alimented with the energy diet, but DM intake was similar (P = 0.66) among treatments. It was concluded that re-alimentation with protein rather than energy can improve compensatory growth of Ashanti Black × Large White crossbred weaner pigs. PMID:26494544

  15. Approach for growth of high-quality and large protein crystals

    PubMed Central

    Matsumura, Hiroyoshi; Sugiyama, Shigeru; Hirose, Mika; Kakinouchi, Keisuke; Maruyama, Mihoko; Murai, Ryota; Adachi, Hiroaki; Takano, Kazufumi; Murakami, Satoshi; Mori, Yusuke; Inoue, Tsuyoshi

    2011-01-01

    Three crystallization methods for growing large high-quality protein crystals, i.e. crystallization in the presence of a semi-solid agarose gel, top-seeded solution growth (TSSG) and a large-scale hanging-drop method, have previously been presented. In this study the effectiveness of crystallization in the presence of a semi-solid agarose gel has been further evaluated by crystallizing additional proteins in the presence of 2.0% (w/v) agarose gel, resulting in complete gelification with high mechanical strength. In TSSG the seed crystals are hung by a seed holder protruding from the top of the growth vessel to prevent polycrystallization. In the large-scale hanging-drop method, a cut pipette tip was used to maintain large-scale droplets consisting of protein–precipitant solution. Here a novel crystallization method that combines TSSG and the large-scale hanging-drop method is reported. A large and single crystal of lysozyme was obtained by this method. PMID:21169683

  16. Neandertals' large lower thorax may represent adaptation to high protein diet.

    PubMed

    Ben-Dor, Miki; Gopher, Avi; Barkai, Ran

    2016-07-01

    Humans are limited in their capacity to convert protein into energy. We present a hypothesis that a "bell" shaped thorax and a wide pelvis evolved in Neandertals, at least in part, as an adaptation to a high protein diet. A high protein diet created a need to house an enlarged liver and urinary system in a wider lower trunk. To test the hypothesis, we applied a model developed to identify points of nutritional stress. A ratio of obligatory dietary fat to total animal fat and protein sourced calories is calculated based on various known and estimated parameters. Stress is identified when the obligatory dietary fat ratio is higher than fat content ratios in available prey. The model predicts that during glacial winters, when carbohydrates weren't available, 74%-85% of Neandertals' caloric intake would have had to come from animal fat. Large animals contain around 50% fat calories, and their fat content is diminished during winter, so a significant stressful dietary fat deficit was identified by the model. This deficit could potentially be ameliorated by an increased capability to convert protein into energy. Given that high protein consumption is associated with larger liver and kidneys in animal models, it appears likely that the enlarged inferior section of the Neandertals thorax and possibly, in part, also his wide pelvis, represented an adaptation to provide encasement for those enlarged organs. Behavioral and evolutionary implications of the hypothesis are also discussed. Am J Phys Anthropol 160:367-378, 2016. © 2016 Wiley Periodicals, Inc. PMID:26973080

  17. A silk hydrogel-based delivery system of bone morphogenetic protein for the treatment of large bone defects.

    PubMed

    Diab, Tamim; Pritchard, Eleanor M; Uhrig, Brent A; Boerckel, Joel D; Kaplan, David L; Guldberg, Robert E

    2012-07-01

    The use of tissue grafting for the repair of large bone defects has numerous limitations including donor site morbidity and the risk of disease transmission. These limitations have prompted research efforts to investigate the effects of combining biomaterial scaffolds with biochemical cues to augment bone repair. The goal of this study was to use a critically-sized rat femoral segmental defect model to investigate the efficacy of a delivery system consisting of an electrospun polycaprolactone (PCL) nanofiber mesh tube with a silk fibroin hydrogel for local recombinant bone morphogenetic protein 2 (BMP-2) delivery. Bilateral 8 mm segmental femoral defects were formed in 13-week-old Sprague Dawley rats. Perforated electrospun PCL nanofiber mesh tubes were fitted into the adjacent native bone such that the lumen of the tubes contained the defect (Kolambkar et al., 2011b). Silk hydrogels with or without BMP-2 were injected into the defect. Bone regeneration was longitudinally assessed using 2D X-ray radiography and 3D microcomputed topography (μCT). Following sacrifice at 12 weeks after surgery, the extracted femurs were either subjected to biomechanical testing or assigned for histology. The results demonstrated that silk was an effective carrier for BMP-2. Compared to the delivery system without BMP-2, the delivery system that contained BMP-2 resulted in more bone formation (p<0.05) at 4, 8, 12 weeks after surgery. Biomechanical properties were also significantly improved in the presence of BMP-2 (p<0.05) and were comparable to age-matched intact femurs. Histological evaluation of the defect region indicated that the silk hydrogel has been completely degraded by the end of the study. Based on these results, we conclude that a BMP-2 delivery system consisting of an electrospun PCL nanofiber mesh tube with a silk hydrogel presents an effective strategy for functional repair of large bone defects. PMID:22658161

  18. Automated resonance assignment of the 21 kDa stereo-array isotope labeled thioldisulfide oxidoreductase DsbA

    NASA Astrophysics Data System (ADS)

    Schmidt, Elena; Ikeya, Teppei; Takeda, Mitsuhiro; Löhr, Frank; Buchner, Lena; Ito, Yutaka; Kainosho, Masatsune; Güntert, Peter

    2014-12-01

    The automated chemical shift assignment algorithm FLYA has been extended for use with stereo-array isotope labeled (SAIL) proteins to determine the sequence-specific resonance assignments of large proteins. Here we present the assignment of the backbone and sidechain chemical shifts of the 21 kDa thioldisulfide oxidoreductase DsbA from Escherichia coli that were determined with the SAIL-FLYA algorithm in conjunction with automated peak picking. No manual corrections of peak lists or assignments were applied. The assignments agreed with manually determined reference assignments in 95.4% of the cases if 16 input spectra were used, 94.1% if only 3D 13C/15N-resolved NOESY, CBCA(CO)NH, and 2D [13C/15N,1H]-HSQC were used, and 86.8% if exclusively 3D 13C/15N-resolved NOESY spectra were used. Considering only the assignments that are classified as reliable by the SAIL-FLYA algorithm, the degrees of agreement increased to 97.5%, 96.5%, and 94.2%, respectively. With our approach it is thus possible to automatically obtain almost complete and correct assignments of proteins larger than 20 kDa.

  19. Large Proteins Have a Great Tendency to Aggregate but a Low Propensity to Form Amyloid Fibrils

    PubMed Central

    Ramshini, Hassan; Parrini, Claudia; Relini, Annalisa; Zampagni, Mariagioia; Mannini, Benedetta; Pesce, Alessandra; Saboury, Ali Akbar; Nemat-Gorgani, Mohsen; Chiti, Fabrizio

    2011-01-01

    The assembly of soluble proteins into ordered fibrillar aggregates with cross-β structure is an essential event of many human diseases. The polypeptides undergoing aggregation are generally small in size. To explore if the small size is a primary determinant for the formation of amyloids under pathological conditions we have created two databases of proteins, forming amyloid-related and non-amyloid deposits in human diseases, respectively. The size distributions of the two protein populations are well separated, with the systems forming non-amyloid deposits appearing significantly larger. We have then investigated the propensity of the 486-residue hexokinase-B from Saccharomyces cerevisiae (YHKB) to form amyloid-like fibrils in vitro. This size is intermediate between the size distributions of amyloid and non-amyloid forming proteins. Aggregation was induced under conditions known to be most effective for amyloid formation by normally globular proteins: (i) low pH with salts, (ii) pH 5.5 with trifluoroethanol. In both situations YHKB aggregated very rapidly into species with significant β-sheet structure, as detected using circular dichroism and X-ray diffraction, but a weak Thioflavin T and Congo red binding. Moreover, atomic force microscopy indicated a morphology distinct from typical amyloid fibrils. Both types of aggregates were cytotoxic to human neuroblastoma cells, as indicated by the MTT assay. This analysis indicates that large proteins have a high tendency to form toxic aggregates, but low propensity to form regular amyloid in vivo and that such a behavior is intrinsically determined by the size of the protein, as suggested by the in vitro analysis of our sample protein. PMID:21249193

  20. My Favorite Assignment.

    ERIC Educational Resources Information Center

    Hebert, Margaret; And Others

    1991-01-01

    Contains seven brief articles which offer assignments designed to help students perform job searches, write job application letters, answer difficult questions, write letters of resignation, alleviate fears of public speaking, use the interview effectively in the business communication, and develop listening skills. (PRA)

  1. Principals Make Assignments Matter

    ERIC Educational Resources Information Center

    Dougherty, Eleanor

    2013-01-01

    The inner-city high school in Washington, DC, that Guillaume Gendre joined as an assistant principal had a modest reputation for achievement but was nevertheless challenged to raise expectations for student work. In other schools, Gendre had used assignments--a specific kind of instructional task in which students are charged to think about an…

  2. Making Effective Assignments.

    ERIC Educational Resources Information Center

    McLeod, Alan M., Ed.

    1982-01-01

    Although the focus of this issue of the "Virginia English Bulletin" is on making effective assignments, most of the articles also emphasize the importance and power of writing. Articles deal with the following topics: (1) the use of I-search (as explained by Kenneth Macrorie in "Searching Writing") as a form of research paper that narrates the…

  3. Direct detection of x-rays for protein crystallography employing a thick, large area CCD

    DOEpatents

    Atac, Muzaffer; McKay, Timothy

    1999-01-01

    An apparatus and method for directly determining the crystalline structure of a protein crystal. The crystal is irradiated by a finely collimated x-ray beam. The interaction of the x-ray beam with the crystal produces scattered x-rays. These scattered x-rays are detected by means of a large area, thick CCD which is capable of measuring a significant number of scattered x-rays which impact its surface. The CCD is capable of detecting the position of impact of the scattered x-ray on the surface of the CCD and the quantity of scattered x-rays which impact the same cell or pixel. This data is then processed in real-time and the processed data is outputted to produce a image of the structure of the crystal. If this crystal is a protein the molecular structure of the protein can be determined from the data received.

  4. Coarse-Grained Simulations of Heme Proteins: Validation and Study of Large Conformational Transitions.

    PubMed

    Ramírez, Claudia L; Petruk, Ariel; Bringas, Mauro; Estrin, Dario A; Roitberg, Adrian E; Marti, Marcelo A; Capece, Luciana

    2016-07-12

    Heme proteins are ubiquitous in nature and perform many diverse functions in all kingdoms of life. Many of these functions are related to large-scale conformational transitions and allosteric processes. Sampling of these large conformational changes is computationally very challenging. In this context, coarse-grain simulations emerge as an efficient approach to explore the conformational landscape. In this work, we present a coarse-grained model of the heme group and thoroughly validate this model in different benchmark examples, which include the monomeric heme proteins myoglobin and neuroglobin and the tetrameric human hemoglobin where we evaluated the method's ability to explore conformational changes (as the formation of hexacoordinated species) and allosteric transitions (as the well-known R → T transition). The obtained results are compared with atomistic molecular dynamics simulations. Overall, the results indicate that this approach conserves the essential dynamical information on different allosteric processes. PMID:27267322

  5. Large-scale serum protein biomarker discovery in Duchenne muscular dystrophy

    PubMed Central

    Hathout, Yetrib; Brody, Edward; Clemens, Paula R.; Cripe, Linda; DeLisle, Robert Kirk; Furlong, Pat; Gordish-Dressman, Heather; Hache, Lauren; Henricson, Erik; Hoffman, Eric P.; Kobayashi, Yvonne Monique; Lorts, Angela; Mah, Jean K.; McDonald, Craig; Mehler, Bob; Nelson, Sally; Nikrad, Malti; Singer, Britta; Steele, Fintan; Sterling, David; Sweeney, H. Lee; Williams, Steve; Gold, Larry

    2015-01-01

    Serum biomarkers in Duchenne muscular dystrophy (DMD) may provide deeper insights into disease pathogenesis, suggest new therapeutic approaches, serve as acute read-outs of drug effects, and be useful as surrogate outcome measures to predict later clinical benefit. In this study a large-scale biomarker discovery was performed on serum samples from patients with DMD and age-matched healthy volunteers using a modified aptamer-based proteomics technology. Levels of 1,125 proteins were quantified in serum samples from two independent DMD cohorts: cohort 1 (The Parent Project Muscular Dystrophy–Cincinnati Children’s Hospital Medical Center), 42 patients with DMD and 28 age-matched normal volunteers; and cohort 2 (The Cooperative International Neuromuscular Research Group, Duchenne Natural History Study), 51 patients with DMD and 17 age-matched normal volunteers. Forty-four proteins showed significant differences that were consistent in both cohorts when comparing DMD patients and healthy volunteers at a 1% false-discovery rate, a large number of significant protein changes for such a small study. These biomarkers can be classified by known cellular processes and by age-dependent changes in protein concentration. Our findings demonstrate both the utility of this unbiased biomarker discovery approach and suggest potential new diagnostic and therapeutic avenues for ameliorating the burden of DMD and, we hope, other rare and devastating diseases. PMID:26039989

  6. Transport of large breakdown products of dietary protein through the gut wall.

    PubMed Central

    Hemmings, W A; Williams, E W

    1978-01-01

    Ferritin or tritium labelled immunoglobulin G may, by electron microscopy, be demonstrated entering, within, and leaving the epithelial cells. Quantitative studies using various proteins labelled with radioiodine show that large amounts of protein bound radioactivity may be demonstrated in the tissues after feeding the labelled protein to adult rats by stomach tube. The molecular size of this material as determined by sugar gradient ultracentrifugation of tissue extracts ranges when IgG is fed from 50,000-20,000 Daltons. The material retains its ability to react as antigen with antisera specific to the original molecule: precipitation reactions may be obtained in gels and quantitative studies show that cnosiderable amounts of the protein-bound radioactivity are still specifically precipitable. Such studies have been carried out with alpha-gliadin as well as bovine IgG. At 100 days old rats may absorb as much as 40% of a dose of bovine IgG in the form of these large molecular breakdown products. PMID:680603

  7. A Scalable Parallel Algorithm for Large-Scale Protein Sequence Homology Detection

    SciTech Connect

    Wu, Changjun; Kalyanaraman, Anantharaman; Cannon, William R.

    2010-09-13

    Protein sequence homology detection is a fundamental problem in computational molecular biology, with a pervasive application in nearly all analyses that aim to structurally and functionally characterize protein molecules. While detecting homology between two protein sequences is computationally inexpensive, detecting pairwise homology at a large-scale becomes prohibitive, requiring millions of CPU hours. Yet, there is currently no efficient method available to parallelize this kernel. In this paper, we present the key characteristics that make this problem particularly hard to parallelize, and then propose a new parallel algorithm that is suited for large-scale protein sequence data. Our method, called pGraph, is designed using a hierarchical multiple-master multiple-worker model, where the processor space is partitioned into subgroups and the hierarchy helps in ensuring the workload is load balanced fashion despite the inherent irregularity that may originate in the input. Experimental evaluation demonstrates that our method scales linearly on all input sizes tested (up to 640K sequences) on a 1,024 node supercomputer. In addition to demonstrating strong scaling, we present an extensive study of the various components of the system and related parametric studies.

  8. pGraph: Efficient Parallel Construction of Large-Scale Protein Sequence Homology Graphs

    SciTech Connect

    Wu, Changjun; Kalyanaraman, Anantharaman; Cannon, William R.

    2012-09-15

    Detecting sequence homology between protein sequences is a fundamental problem in computational molecular biology, with a pervasive application in nearly all analyses that aim to structurally and functionally characterize protein molecules. While detecting the homology between two protein sequences is relatively inexpensive, detecting pairwise homology for a large number of protein sequences can become computationally prohibitive for modern inputs, often requiring millions of CPU hours. Yet, there is currently no robust support to parallelize this kernel. In this paper, we identify the key characteristics that make this problemparticularly hard to parallelize, and then propose a new parallel algorithm that is suited for detecting homology on large data sets using distributed memory parallel computers. Our method, called pGraph, is a novel hybrid between the hierarchical multiple-master/worker model and producer-consumer model, and is designed to break the irregularities imposed by alignment computation and work generation. Experimental results show that pGraph achieves linear scaling on a 2,048 processor distributed memory cluster for a wide range of inputs ranging from as small as 20,000 sequences to 2,560,000 sequences. In addition to demonstrating strong scaling, we present an extensive report on the performance of the various system components and related parametric studies.

  9. A large scale membrane-binding protein conformational change that initiates at small length scales

    NASA Astrophysics Data System (ADS)

    Grandpre, Trevor; Andorf, Matthew; Chakravarthy, Srinivas; Lamb, Robert; Poor, Taylor; Landahl, Eric

    2013-03-01

    The fusion (F) protein of parainfluenza virus 5 (PIV5) is a membrane-bound, homotrimeric glycoprotein located on the surface of PIV5 viral envelopes. Upon being triggered by the receptor-binding protein (HN), F undergoes a greater than 100Å ATP-independent refolding event. This refolding event results in the insertion of a hydrophobic fusion peptide into the membrane of the target cell, followed by the desolvation and subsequent fusion event as the two membranes are brought together. Isothermal calorimetry and hydrophobic dye incorporation experiments indicate that the soluble construct of the F protein undergoes a conformational rearrangement event at around 55 deg C. We present the results of an initial Time-Resolved Small-Angle X-Ray Scattering (TR-SAXS) study of this large scale, entropically driven conformational change using a temperature jump. Although we the measured radius of gyration of this protein changes on a 110 second timescale, we find that the x-ray scattering intensity at higher angles (corresponding to smaller length scales in the protein) changes nearly an order of magnitude faster. We believe this may be a signature of entropically-driven conformational change. To whom correspondence should be addressed

  10. Live-cell multiphoton fluorescence correlation spectroscopy with an improved large Stokes shift fluorescent protein

    PubMed Central

    Guan, Yinghua; Meurer, Matthias; Raghavan, Sarada; Rebane, Aleksander; Lindquist, Jake R.; Santos, Sofia; Kats, Ilia; Davidson, Michael W.; Mazitschek, Ralph; Hughes, Thomas E.; Drobizhev, Mikhail; Knop, Michael; Shah, Jagesh V.

    2015-01-01

    We report an improved variant of mKeima, a monomeric long Stokes shift red fluorescent protein, hmKeima8.5. The increased intracellular brightness and large Stokes shift (∼180 nm) make it an excellent partner with teal fluorescent protein (mTFP1) for multiphoton, multicolor applications. Excitation of this pair by a single multiphoton excitation wavelength (MPE, 850 nm) yields well-separable emission peaks (∼120-nm separation). Using this pair, we measure homo- and hetero-oligomerization interactions in living cells via multiphoton excitation fluorescence correlation spectroscopy (MPE-FCS). Using tandem dimer proteins and small-molecule inducible dimerization domains, we demonstrate robust and quantitative detection of intracellular protein–protein interactions. We also use MPE-FCCS to detect drug–protein interactions in the intracellular environment using a Coumarin 343 (C343)-conjugated drug and hmKeima8.5 as a fluorescence pair. The mTFP1/hmKeima8.5 and C343/hmKeima8.5 combinations, together with our calibration constructs, provide a practical and broadly applicable toolbox for the investigation of molecular interactions in the cytoplasm of living cells. PMID:25877871