Kinsinger, Christopher R.; Apffel, James; Baker, Mark; Bian, Xiaopeng; Borchers, Christoph H.; Bradshaw, Ralph; Brusniak, Mi-Youn; Chan, Daniel W.; Deutsch, Eric W.; Domon, Bruno; Gorman, Jeff; Grimm, Rudolf; Hancock, William; Hermjakob, Henning; Horn, David; Hunter, Christie; Kolar, Patrik; Kraus, Hans-Joachim; Langen, Hanno; Linding, Rune; Moritz, Robert L.; Omenn, Gilbert S.; Orlando, Ron; Pandey, Akhilesh; Ping, Peipei; Rahbar, Amir; Rivers, Robert; Seymour, Sean L.; Simpson, Richard J.; Slotta, Douglas; Smith, Richard D.; Stein, Stephen E.; Tabb, David L.; Tagle, Danilo; Yates, John R.; Rodriguez, Henry
2011-01-01
Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the U.S. National Cancer Institute (NCI) convened the “International Workshop on Proteomic Data Quality Metrics” in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed up on two primary needs for the wide use of quality metrics: (1) an evolving list of comprehensive quality metrics and (2) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in the Journal of Proteome Research, Molecular and Cellular Proteomics, Proteomics, and Proteomics Clinical Applications as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals. PMID:22053864
Kinsinger, Christopher R.; Apffel, James; Baker, Mark; Bian, Xiaopeng; Borchers, Christoph H.; Bradshaw, Ralph; Brusniak, Mi-Youn; Chan, Daniel W.; Deutsch, Eric W.; Domon, Bruno; Gorman, Jeff; Grimm, Rudolf; Hancock, William; Hermjakob, Henning; Horn, David; Hunter, Christie; Kolar, Patrik; Kraus, Hans-Joachim; Langen, Hanno; Linding, Rune; Moritz, Robert L.; Omenn, Gilbert S.; Orlando, Ron; Pandey, Akhilesh; Ping, Peipei; Rahbar, Amir; Rivers, Robert; Seymour, Sean L.; Simpson, Richard J.; Slotta, Douglas; Smith, Richard D.; Stein, Stephen E.; Tabb, David L.; Tagle, Danilo; Yates, John R.; Rodriguez, Henry
2011-01-01
Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the United States National Cancer Institute convened the “International Workshop on Proteomic Data Quality Metrics” in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed up on two primary needs for the wide use of quality metrics: 1) an evolving list of comprehensive quality metrics and 2) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in the Journal of Proteome Research, Molecular and Cellular Proteomics, Proteomics, and Proteomics Clinical Applications as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals. PMID:22052993
Williamson, Nicholas A
2018-03-01
This paper discusses the successful adoption of a subscription-based, open-access model of service delivery for a mass spectrometry and proteomics facility. In 2009, the Mass Spectrometry and Proteomics Facility at the University of Melbourne (Australia) moved away from the standard fee for service model of service provision. Instead, the facility adopted a subscription- or membership-based, open-access model of service delivery. For a low fixed yearly cost, users could directly operate the instrumentation but, more importantly, there were no limits on usage other than the necessity to share available instrument time with all other users. All necessary training from platform staff and many of the base reagents were also provided as part of the membership cost. These changes proved to be very successful in terms of financial outcomes for the facility, instrument access and usage, and overall research output. This article describes the systems put in place as well as the overall successes and challenges associated with the operation of a mass spectrometry/proteomics core in this manner. Graphical abstract ᅟ.
NASA Astrophysics Data System (ADS)
Williamson, Nicholas A.
2018-03-01
This paper discusses the successful adoption of a subscription-based, open-access model of service delivery for a mass spectrometry and proteomics facility. In 2009, the Mass Spectrometry and Proteomics Facility at the University of Melbourne (Australia) moved away from the standard fee for service model of service provision. Instead, the facility adopted a subscription- or membership-based, open-access model of service delivery. For a low fixed yearly cost, users could directly operate the instrumentation but, more importantly, there were no limits on usage other than the necessity to share available instrument time with all other users. All necessary training from platform staff and many of the base reagents were also provided as part of the membership cost. These changes proved to be very successful in terms of financial outcomes for the facility, instrument access and usage, and overall research output. This article describes the systems put in place as well as the overall successes and challenges associated with the operation of a mass spectrometry/proteomics core in this manner. [Figure not available: see fulltext.
Reverse-phase protein arrays (RPPA) represent a powerful functional proteomic approach to elucidate cancer-related molecular mechanisms and to develop novel cancer therapies. To facilitate community-based investigation of the large-scale protein expression data generated by this platform, we have developed a user-friendly, open-access bioinformatic resource, The Cancer Proteome Atlas (TCPA, http://tcpaportal.org), which contains two separate web applications.
FunRich proteomics software analysis, let the fun begin!
Benito-Martin, Alberto; Peinado, Héctor
2015-08-01
Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Construction of a nasopharyngeal carcinoma 2D/MS repository with Open Source XML database--Xindice.
Li, Feng; Li, Maoyu; Xiao, Zhiqiang; Zhang, Pengfei; Li, Jianling; Chen, Zhuchu
2006-01-11
Many proteomics initiatives require integration of all information with uniformcriteria from collection of samples and data display to publication of experimental results. The integration and exchanging of these data of different formats and structure imposes a great challenge to us. The XML technology presents a promise in handling this task due to its simplicity and flexibility. Nasopharyngeal carcinoma (NPC) is one of the most common cancers in southern China and Southeast Asia, which has marked geographic and racial differences in incidence. Although there are some cancer proteome databases now, there is still no NPC proteome database. The raw NPC proteome experiment data were captured into one XML document with Human Proteome Markup Language (HUP-ML) editor and imported into native XML database Xindice. The 2D/MS repository of NPC proteome was constructed with Apache, PHP and Xindice to provide access to the database via Internet. On our website, two methods, keyword query and click query, were provided at the same time to access the entries of the NPC proteome database. Our 2D/MS repository can be used to share the raw NPC proteomics data that are generated from gel-based proteomics experiments. The database, as well as the PHP source codes for constructing users' own proteome repository, can be accessed at http://www.xyproteomics.org/.
Morisawa, Hiraku; Hirota, Mikako; Toda, Tosifusa
2006-01-01
Background In the post-genome era, most research scientists working in the field of proteomics are confronted with difficulties in management of large volumes of data, which they are required to keep in formats suitable for subsequent data mining. Therefore, a well-developed open source laboratory information management system (LIMS) should be available for their proteomics research studies. Results We developed an open source LIMS appropriately customized for 2-D gel electrophoresis-based proteomics workflow. The main features of its design are compactness, flexibility and connectivity to public databases. It supports the handling of data imported from mass spectrometry software and 2-D gel image analysis software. The LIMS is equipped with the same input interface for 2-D gel information as a clickable map on public 2DPAGE databases. The LIMS allows researchers to follow their own experimental procedures by reviewing the illustrations of 2-D gel maps and well layouts on the digestion plates and MS sample plates. Conclusion Our new open source LIMS is now available as a basic model for proteome informatics, and is accessible for further improvement. We hope that many research scientists working in the field of proteomics will evaluate our LIMS and suggest ways in which it can be improved. PMID:17018156
Fast and Efficient XML Data Access for Next-Generation Mass Spectrometry.
Röst, Hannes L; Schmitt, Uwe; Aebersold, Ruedi; Malmström, Lars
2015-01-01
In mass spectrometry-based proteomics, XML formats such as mzML and mzXML provide an open and standardized way to store and exchange the raw data (spectra and chromatograms) of mass spectrometric experiments. These file formats are being used by a multitude of open-source and cross-platform tools which allow the proteomics community to access algorithms in a vendor-independent fashion and perform transparent and reproducible data analysis. Recent improvements in mass spectrometry instrumentation have increased the data size produced in a single LC-MS/MS measurement and put substantial strain on open-source tools, particularly those that are not equipped to deal with XML data files that reach dozens of gigabytes in size. Here we present a fast and versatile parsing library for mass spectrometric XML formats available in C++ and Python, based on the mature OpenMS software framework. Our library implements an API for obtaining spectra and chromatograms under memory constraints using random access or sequential access functions, allowing users to process datasets that are much larger than system memory. For fast access to the raw data structures, small XML files can also be completely loaded into memory. In addition, we have improved the parsing speed of the core mzML module by over 4-fold (compared to OpenMS 1.11), making our library suitable for a wide variety of algorithms that need fast access to dozens of gigabytes of raw mass spectrometric data. Our C++ and Python implementations are available for the Linux, Mac, and Windows operating systems. All proposed modifications to the OpenMS code have been merged into the OpenMS mainline codebase and are available to the community at https://github.com/OpenMS/OpenMS.
Fast and Efficient XML Data Access for Next-Generation Mass Spectrometry
Röst, Hannes L.; Schmitt, Uwe; Aebersold, Ruedi; Malmström, Lars
2015-01-01
Motivation In mass spectrometry-based proteomics, XML formats such as mzML and mzXML provide an open and standardized way to store and exchange the raw data (spectra and chromatograms) of mass spectrometric experiments. These file formats are being used by a multitude of open-source and cross-platform tools which allow the proteomics community to access algorithms in a vendor-independent fashion and perform transparent and reproducible data analysis. Recent improvements in mass spectrometry instrumentation have increased the data size produced in a single LC-MS/MS measurement and put substantial strain on open-source tools, particularly those that are not equipped to deal with XML data files that reach dozens of gigabytes in size. Results Here we present a fast and versatile parsing library for mass spectrometric XML formats available in C++ and Python, based on the mature OpenMS software framework. Our library implements an API for obtaining spectra and chromatograms under memory constraints using random access or sequential access functions, allowing users to process datasets that are much larger than system memory. For fast access to the raw data structures, small XML files can also be completely loaded into memory. In addition, we have improved the parsing speed of the core mzML module by over 4-fold (compared to OpenMS 1.11), making our library suitable for a wide variety of algorithms that need fast access to dozens of gigabytes of raw mass spectrometric data. Availability Our C++ and Python implementations are available for the Linux, Mac, and Windows operating systems. All proposed modifications to the OpenMS code have been merged into the OpenMS mainline codebase and are available to the community at https://github.com/OpenMS/OpenMS. PMID:25927999
Colaert, Niklaas; Barsnes, Harald; Vaudel, Marc; Helsens, Kenny; Timmerman, Evy; Sickmann, Albert; Gevaert, Kris; Martens, Lennart
2011-08-05
The Thermo Proteome Discoverer program integrates both peptide identification and quantification into a single workflow for peptide-centric proteomics. Furthermore, its close integration with Thermo mass spectrometers has made it increasingly popular in the field. Here, we present a Java library to parse the msf files that constitute the output of Proteome Discoverer. The parser is also implemented as a graphical user interface allowing convenient access to the information found in the msf files, and in Rover, a program to analyze and validate quantitative proteomics information. All code, binaries, and documentation is freely available at http://thermo-msf-parser.googlecode.com.
The November 1, 2017 issue of Cancer Research is dedicated to a collection of computational resource papers in genomics, proteomics, animal models, imaging, and clinical subjects for non-bioinformaticists looking to incorporate computing tools into their work. Scientists at Pacific Northwest National Laboratory have developed P-MartCancer, an open, web-based interactive software tool that enables statistical analyses of peptide or protein data generated from mass-spectrometry (MS)-based global proteomics experiments.
Qi, Da; Zhang, Huaizhong; Fan, Jun; Perkins, Simon; Pisconti, Addolorata; Simpson, Deborah M; Bessant, Conrad; Hubbard, Simon; Jones, Andrew R
2015-09-01
The mzQuantML standard has been developed by the Proteomics Standards Initiative for capturing, archiving and exchanging quantitative proteomic data, derived from mass spectrometry. It is a rich XML-based format, capable of representing data about two-dimensional features from LC-MS data, and peptides, proteins or groups of proteins that have been quantified from multiple samples. In this article we report the development of an open source Java-based library of routines for mzQuantML, called the mzqLibrary, and associated software for visualising data called the mzqViewer. The mzqLibrary contains routines for mapping (peptide) identifications on quantified features, inference of protein (group)-level quantification values from peptide-level values, normalisation and basic statistics for differential expression. These routines can be accessed via the command line, via a Java programming interface access or a basic graphical user interface. The mzqLibrary also contains several file format converters, including import converters (to mzQuantML) from OpenMS, Progenesis LC-MS and MaxQuant, and exporters (from mzQuantML) to other standards or useful formats (mzTab, HTML, csv). The mzqViewer contains in-built routines for viewing the tables of data (about features, peptides or proteins), and connects to the R statistical library for more advanced plotting options. The mzqLibrary and mzqViewer packages are available from https://code.google.com/p/mzq-lib/. © 2015 The Authors. PROTEOMICS Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Introducing the PRIDE Archive RESTful web services.
Reisinger, Florian; del-Toro, Noemi; Ternent, Tobias; Hermjakob, Henning; Vizcaíno, Juan Antonio
2015-07-01
The PRIDE (PRoteomics IDEntifications) database is one of the world-leading public repositories of mass spectrometry (MS)-based proteomics data and it is a founding member of the ProteomeXchange Consortium of proteomics resources. In the original PRIDE database system, users could access data programmatically by accessing the web services provided by the PRIDE BioMart interface. New REST (REpresentational State Transfer) web services have been developed to serve the most popular functionality provided by BioMart (now discontinued due to data scalability issues) and address the data access requirements of the newly developed PRIDE Archive. Using the API (Application Programming Interface) it is now possible to programmatically query for and retrieve peptide and protein identifications, project and assay metadata and the originally submitted files. Searching and filtering is also possible by metadata information, such as sample details (e.g. species and tissues), instrumentation (mass spectrometer), keywords and other provided annotations. The PRIDE Archive web services were first made available in April 2014. The API has already been adopted by a few applications and standalone tools such as PeptideShaker, PRIDE Inspector, the Unipept web application and the Python-based BioServices package. This application is free and open to all users with no login requirement and can be accessed at http://www.ebi.ac.uk/pride/ws/archive/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
2017-05-19
LightCycler® 96 desktop software. Positive and negative samples were identified using the “ Qualitative Detection” analysis function using the default...Institute of Infectious Diseases, Fort Detrick, MD 21702, United States A R T I C L E I N F O Keywords: West Nile virus Virus inactivation Sample buffer... samples using a commercially available SDS- PAGE sample buffer for proteomic studies. Using this method, we demonstrate its utility by identification
The HUPO proteomics standards initiative--overcoming the fragmentation of proteomics data.
Hermjakob, Henning
2006-09-01
Proteomics is a key field of modern biomolecular research, with many small and large scale efforts producing a wealth of proteomics data. However, the vast majority of this data is never exploited to its full potential. Even in publicly funded projects, often the raw data generated in a specific context is analysed, conclusions are drawn and published, but little attention is paid to systematic documentation, archiving, and public access to the data supporting the scientific results. It is often difficult to validate the results stated in a particular publication, and even simple global questions like "In which cellular contexts has my protein of interest been observed?" can currently not be answered with realistic effort, due to a lack of standardised reporting and collection of proteomics data. The Proteomics Standards Initiative (PSI), a work group of the Human Proteome Organisation (HUPO), defines community standards for data representation in proteomics to facilitate systematic data capture, comparison, exchange and verification. In this article we provide an overview of PSI organisational structure, activities, and current results, as well as ways to get involved in the broad-based, open PSI process.
Elguoshy, Amr; Hirao, Yoshitoshi; Xu, Bo; Saito, Suguru; Quadery, Ali F; Yamamoto, Keiko; Mitsui, Toshiaki; Yamamoto, Tadashi
2017-12-01
In an attempt to complete human proteome project (HPP), Chromosome-Centric Human Proteome Project (C-HPP) launched the journey of missing protein (MP) investigation in 2012. However, 2579 and 572 protein entries in the neXtProt (2017-1) are still considered as missing and uncertain proteins, respectively. Thus, in this study, we proposed a pipeline to analyze, identify, and validate human missing and uncertain proteins in open-access transcriptomics and proteomics databases. Analysis of RNA expression pattern for missing proteins in Human protein Atlas showed that 28% of them, such as Olfactory receptor 1I1 ( O60431 ), had no RNA expression, suggesting the necessity to consider uncommon tissues for transcriptomic and proteomic studies. Interestingly, 21% had elevated expression level in a particular tissue (tissue-enriched proteins), indicating the importance of targeting such proteins in their elevated tissues. Additionally, the analysis of RNA expression level for missing proteins showed that 95% had no or low expression level (0-10 transcripts per million), indicating that low abundance is one of the major obstacles facing the detection of missing proteins. Moreover, missing proteins are predicted to generate fewer predicted unique tryptic peptides than the identified proteins. Searching for these predicted unique tryptic peptides that correspond to missing and uncertain proteins in the experimental peptide list of open-access MS-based databases (PA, GPM) resulted in the detection of 402 missing and 19 uncertain proteins with at least two unique peptides (≥9 aa) at <(5 × 10 -4 )% FDR. Finally, matching the native spectra for the experimentally detected peptides with their SRMAtlas synthetic counterparts at three transition sources (QQQ, QTOF, QTRAP) gave us an opportunity to validate 41 missing proteins by ≥2 proteotypic peptides.
Zhang, Huaizhong; Fan, Jun; Perkins, Simon; Pisconti, Addolorata; Simpson, Deborah M.; Bessant, Conrad; Hubbard, Simon; Jones, Andrew R.
2015-01-01
The mzQuantML standard has been developed by the Proteomics Standards Initiative for capturing, archiving and exchanging quantitative proteomic data, derived from mass spectrometry. It is a rich XML‐based format, capable of representing data about two‐dimensional features from LC‐MS data, and peptides, proteins or groups of proteins that have been quantified from multiple samples. In this article we report the development of an open source Java‐based library of routines for mzQuantML, called the mzqLibrary, and associated software for visualising data called the mzqViewer. The mzqLibrary contains routines for mapping (peptide) identifications on quantified features, inference of protein (group)‐level quantification values from peptide‐level values, normalisation and basic statistics for differential expression. These routines can be accessed via the command line, via a Java programming interface access or a basic graphical user interface. The mzqLibrary also contains several file format converters, including import converters (to mzQuantML) from OpenMS, Progenesis LC‐MS and MaxQuant, and exporters (from mzQuantML) to other standards or useful formats (mzTab, HTML, csv). The mzqViewer contains in‐built routines for viewing the tables of data (about features, peptides or proteins), and connects to the R statistical library for more advanced plotting options. The mzqLibrary and mzqViewer packages are available from https://code.google.com/p/mzq‐lib/. PMID:26037908
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whiteaker, Jeffrey R.; Halusa, Goran; Hoofnagle, Andrew N.
2016-02-12
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI) has launched an Assay Portal (http://assays.cancer.gov) to serve as an open-source repository of well-characterized targeted proteomic assays. The portal is designed to curate and disseminate highly characterized, targeted mass spectrometry (MS)-based assays by providing detailed assay performance characterization data, standard operating procedures, and access to reagents. Assay content is accessed via the portal through queries to find assays targeting proteins associated with specific cellular pathways, protein complexes, or specific chromosomal regions. The position of the peptide analytes for which there are available assays are mapped relative tomore » other features of interest in the protein, such as sequence domains, isoforms, single nucleotide polymorphisms, and post-translational modifications. The overarching goals are to enable robust quantification of all human proteins and to standardize the quantification of targeted MS-based assays to ultimately enable harmonization of results over time and across laboratories.« less
Whiteaker, Jeffrey R; Halusa, Goran N; Hoofnagle, Andrew N; Sharma, Vagisha; MacLean, Brendan; Yan, Ping; Wrobel, John A; Kennedy, Jacob; Mani, D R; Zimmerman, Lisa J; Meyer, Matthew R; Mesri, Mehdi; Boja, Emily; Carr, Steven A; Chan, Daniel W; Chen, Xian; Chen, Jing; Davies, Sherri R; Ellis, Matthew J C; Fenyö, David; Hiltke, Tara; Ketchum, Karen A; Kinsinger, Chris; Kuhn, Eric; Liebler, Daniel C; Liu, Tao; Loss, Michael; MacCoss, Michael J; Qian, Wei-Jun; Rivers, Robert; Rodland, Karin D; Ruggles, Kelly V; Scott, Mitchell G; Smith, Richard D; Thomas, Stefani; Townsend, R Reid; Whiteley, Gordon; Wu, Chaochao; Zhang, Hui; Zhang, Zhen; Rodriguez, Henry; Paulovich, Amanda G
2016-01-01
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI) has launched an Assay Portal (http://assays.cancer.gov) to serve as an open-source repository of well-characterized targeted proteomic assays. The portal is designed to curate and disseminate highly characterized, targeted mass spectrometry (MS)-based assays by providing detailed assay performance characterization data, standard operating procedures, and access to reagents. Assay content is accessed via the portal through queries to find assays targeting proteins associated with specific cellular pathways, protein complexes, or specific chromosomal regions. The position of the peptide analytes for which there are available assays are mapped relative to other features of interest in the protein, such as sequence domains, isoforms, single nucleotide polymorphisms, and posttranslational modifications. The overarching goals are to enable robust quantification of all human proteins and to standardize the quantification of targeted MS-based assays to ultimately enable harmonization of results over time and across laboratories.
PIQMIe: a web server for semi-quantitative proteomics data management and analysis
Kuzniar, Arnold; Kanaar, Roland
2014-01-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. PMID:24861615
PIQMIe: a web server for semi-quantitative proteomics data management and analysis.
Kuzniar, Arnold; Kanaar, Roland
2014-07-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
pyOpenMS: a Python-based interface to the OpenMS mass-spectrometry algorithm library.
Röst, Hannes L; Schmitt, Uwe; Aebersold, Ruedi; Malmström, Lars
2014-01-01
pyOpenMS is an open-source, Python-based interface to the C++ OpenMS library, providing facile access to a feature-rich, open-source algorithm library for MS-based proteomics analysis. It contains Python bindings that allow raw access to the data structures and algorithms implemented in OpenMS, specifically those for file access (mzXML, mzML, TraML, mzIdentML among others), basic signal processing (smoothing, filtering, de-isotoping, and peak-picking) and complex data analysis (including label-free, SILAC, iTRAQ, and SWATH analysis tools). pyOpenMS thus allows fast prototyping and efficient workflow development in a fully interactive manner (using the interactive Python interpreter) and is also ideally suited for researchers not proficient in C++. In addition, our code to wrap a complex C++ library is completely open-source, allowing other projects to create similar bindings with ease. The pyOpenMS framework is freely available at https://pypi.python.org/pypi/pyopenms while the autowrap tool to create Cython code automatically is available at https://pypi.python.org/pypi/autowrap (both released under the 3-clause BSD licence). © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Perez-Riverol, Yasset; Xu, Qing-Wei; Wang, Rui; Uszkoreit, Julian; Griss, Johannes; Sanchez, Aniel; Reisinger, Florian; Csordas, Attila; Ternent, Tobias; Del-Toro, Noemi; Dianes, Jose A; Eisenacher, Martin; Hermjakob, Henning; Vizcaíno, Juan Antonio
2016-01-01
The original PRIDE Inspector tool was developed as an open source standalone tool to enable the visualization and validation of mass-spectrometry (MS)-based proteomics data before data submission or already publicly available in the Proteomics Identifications (PRIDE) database. The initial implementation of the tool focused on visualizing PRIDE data by supporting the PRIDE XML format and a direct access to private (password protected) and public experiments in PRIDE.The ProteomeXchange (PX) Consortium has been set up to enable a better integration of existing public proteomics repositories, maximizing its benefit to the scientific community through the implementation of standard submission and dissemination pipelines. Within the Consortium, PRIDE is focused on supporting submissions of tandem MS data. The increasing use and popularity of the new Proteomics Standards Initiative (PSI) data standards such as mzIdentML and mzTab, and the diversity of workflows supported by the PX resources, prompted us to design and implement a new suite of algorithms and libraries that would build upon the success of the original PRIDE Inspector and would enable users to visualize and validate PX "complete" submissions. The PRIDE Inspector Toolsuite supports the handling and visualization of different experimental output files, ranging from spectra (mzML, mzXML, and the most popular peak lists formats) and peptide and protein identification results (mzIdentML, PRIDE XML, mzTab) to quantification data (mzTab, PRIDE XML), using a modular and extensible set of open-source, cross-platform libraries. We believe that the PRIDE Inspector Toolsuite represents a milestone in the visualization and quality assessment of proteomics data. It is freely available at http://github.com/PRIDE-Toolsuite/. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Perez-Riverol, Yasset; Xu, Qing-Wei; Wang, Rui; Uszkoreit, Julian; Griss, Johannes; Sanchez, Aniel; Reisinger, Florian; Csordas, Attila; Ternent, Tobias; del-Toro, Noemi; Dianes, Jose A.; Eisenacher, Martin; Hermjakob, Henning; Vizcaíno, Juan Antonio
2016-01-01
The original PRIDE Inspector tool was developed as an open source standalone tool to enable the visualization and validation of mass-spectrometry (MS)-based proteomics data before data submission or already publicly available in the Proteomics Identifications (PRIDE) database. The initial implementation of the tool focused on visualizing PRIDE data by supporting the PRIDE XML format and a direct access to private (password protected) and public experiments in PRIDE. The ProteomeXchange (PX) Consortium has been set up to enable a better integration of existing public proteomics repositories, maximizing its benefit to the scientific community through the implementation of standard submission and dissemination pipelines. Within the Consortium, PRIDE is focused on supporting submissions of tandem MS data. The increasing use and popularity of the new Proteomics Standards Initiative (PSI) data standards such as mzIdentML and mzTab, and the diversity of workflows supported by the PX resources, prompted us to design and implement a new suite of algorithms and libraries that would build upon the success of the original PRIDE Inspector and would enable users to visualize and validate PX “complete” submissions. The PRIDE Inspector Toolsuite supports the handling and visualization of different experimental output files, ranging from spectra (mzML, mzXML, and the most popular peak lists formats) and peptide and protein identification results (mzIdentML, PRIDE XML, mzTab) to quantification data (mzTab, PRIDE XML), using a modular and extensible set of open-source, cross-platform libraries. We believe that the PRIDE Inspector Toolsuite represents a milestone in the visualization and quality assessment of proteomics data. It is freely available at http://github.com/PRIDE-Toolsuite/. PMID:26545397
Bensaddek, Dalila; Narayan, Vikram; Nicolas, Armel; Murillo, Alejandro Brenes; Gartner, Anton; Kenyon, Cynthia J; Lamond, Angus I
2016-02-01
Proteomics studies typically analyze proteins at a population level, using extracts prepared from tens of thousands to millions of cells. The resulting measurements correspond to average values across the cell population and can mask considerable variation in protein expression and function between individual cells or organisms. Here, we report the development of micro-proteomics for the analysis of Caenorhabditis elegans, a eukaryote composed of 959 somatic cells and ∼1500 germ cells, measuring the worm proteome at a single organism level to a depth of ∼3000 proteins. This includes detection of proteins across a wide dynamic range of expression levels (>6 orders of magnitude), including many chromatin-associated factors involved in chromosome structure and gene regulation. We apply the micro-proteomics workflow to measure the global proteome response to heat-shock in individual nematodes. This shows variation between individual animals in the magnitude of proteome response following heat-shock, including variable induction of heat-shock proteins. The micro-proteomics pipeline thus facilitates the investigation of stochastic variation in protein expression between individuals within an isogenic population of C. elegans. All data described in this study are available online via the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd), an open access, searchable database resource. © 2015 The Authors. PROTEOMICS Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
ProteoWizard: open source software for rapid proteomics tools development.
Kessner, Darren; Chambers, Matt; Burke, Robert; Agus, David; Mallick, Parag
2008-11-01
The ProteoWizard software project provides a modular and extensible set of open-source, cross-platform tools and libraries. The tools perform proteomics data analyses; the libraries enable rapid tool creation by providing a robust, pluggable development framework that simplifies and unifies data file access, and performs standard proteomics and LCMS dataset computations. The library contains readers and writers of the mzML data format, which has been written using modern C++ techniques and design principles and supports a variety of platforms with native compilers. The software has been specifically released under the Apache v2 license to ensure it can be used in both academic and commercial projects. In addition to the library, we also introduce a rapidly growing set of companion tools whose implementation helps to illustrate the simplicity of developing applications on top of the ProteoWizard library. Cross-platform software that compiles using native compilers (i.e. GCC on Linux, MSVC on Windows and XCode on OSX) is available for download free of charge, at http://proteowizard.sourceforge.net. This website also provides code examples, and documentation. It is our hope the ProteoWizard project will become a standard platform for proteomics development; consequently, code use, contribution and further development are strongly encouraged.
Skyline: an open source document editor for creating and analyzing targeted proteomics experiments.
MacLean, Brendan; Tomazela, Daniela M; Shulman, Nicholas; Chambers, Matthew; Finney, Gregory L; Frewen, Barbara; Kern, Randall; Tabb, David L; Liebler, Daniel C; MacCoss, Michael J
2010-04-01
Skyline is a Windows client application for targeted proteomics method creation and quantitative data analysis. It is open source and freely available for academic and commercial use. The Skyline user interface simplifies the development of mass spectrometer methods and the analysis of data from targeted proteomics experiments performed using selected reaction monitoring (SRM). Skyline supports using and creating MS/MS spectral libraries from a wide variety of sources to choose SRM filters and verify results based on previously observed ion trap data. Skyline exports transition lists to and imports the native output files from Agilent, Applied Biosystems, Thermo Fisher Scientific and Waters triple quadrupole instruments, seamlessly connecting mass spectrometer output back to the experimental design document. The fast and compact Skyline file format is easily shared, even for experiments requiring many sample injections. A rich array of graphs displays results and provides powerful tools for inspecting data integrity as data are acquired, helping instrument operators to identify problems early. The Skyline dynamic report designer exports tabular data from the Skyline document model for in-depth analysis with common statistical tools. Single-click, self-updating web installation is available at http://proteome.gs.washington.edu/software/skyline. This web site also provides access to instructional videos, a support board, an issues list and a link to the source code project.
GeneLab: NASA's Open Access, Collaborative Platform for Systems Biology and Space Medicine
NASA Technical Reports Server (NTRS)
Berrios, Daniel C.; Thompson, Terri G.; Fogle, Homer W.; Rask, Jon C.; Coughlan, Joseph C.
2015-01-01
NASA is investing in GeneLab1 (http:genelab.nasa.gov), a multi-year effort to maximize utilization of the limited resources to conduct biological and medical research in space, principally aboard the International Space Station (ISS). High-throughput genomic, transcriptomic, proteomic or other omics analyses from experiments conducted on the ISS will be stored in the GeneLab Data Systems (GLDS), an open-science information system that will also include a biocomputation platform with collaborative science capabilities, to enable the discovery and validation of molecular networks.
Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline*
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W.; Moritz, Robert L.
2015-01-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. PMID:25418363
Rigbolt, Kristoffer T G; Vanselow, Jens T; Blagoev, Blagoy
2011-08-01
Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data we developed the Graphical Proteomics Data Explorer (GProX)(1). The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data is available and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net.
Rigbolt, Kristoffer T. G.; Vanselow, Jens T.; Blagoev, Blagoy
2011-01-01
Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data we developed the Graphical Proteomics Data Explorer (GProX)1. The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data is available and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net. PMID:21602510
Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W; Moritz, Robert L
2015-02-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Skyline: an open source document editor for creating and analyzing targeted proteomics experiments
MacLean, Brendan; Tomazela, Daniela M.; Shulman, Nicholas; Chambers, Matthew; Finney, Gregory L.; Frewen, Barbara; Kern, Randall; Tabb, David L.; Liebler, Daniel C.; MacCoss, Michael J.
2010-01-01
Summary: Skyline is a Windows client application for targeted proteomics method creation and quantitative data analysis. It is open source and freely available for academic and commercial use. The Skyline user interface simplifies the development of mass spectrometer methods and the analysis of data from targeted proteomics experiments performed using selected reaction monitoring (SRM). Skyline supports using and creating MS/MS spectral libraries from a wide variety of sources to choose SRM filters and verify results based on previously observed ion trap data. Skyline exports transition lists to and imports the native output files from Agilent, Applied Biosystems, Thermo Fisher Scientific and Waters triple quadrupole instruments, seamlessly connecting mass spectrometer output back to the experimental design document. The fast and compact Skyline file format is easily shared, even for experiments requiring many sample injections. A rich array of graphs displays results and provides powerful tools for inspecting data integrity as data are acquired, helping instrument operators to identify problems early. The Skyline dynamic report designer exports tabular data from the Skyline document model for in-depth analysis with common statistical tools. Availability: Single-click, self-updating web installation is available at http://proteome.gs.washington.edu/software/skyline. This web site also provides access to instructional videos, a support board, an issues list and a link to the source code project. Contact: brendanx@u.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20147306
A new reference implementation of the PSICQUIC web service.
del-Toro, Noemi; Dumousseau, Marine; Orchard, Sandra; Jimenez, Rafael C; Galeota, Eugenia; Launay, Guillaume; Goll, Johannes; Breuer, Karin; Ono, Keiichiro; Salwinski, Lukasz; Hermjakob, Henning
2013-07-01
The Proteomics Standard Initiative Common QUery InterfaCe (PSICQUIC) specification was created by the Human Proteome Organization Proteomics Standards Initiative (HUPO-PSI) to enable computational access to molecular-interaction data resources by means of a standard Web Service and query language. Currently providing >150 million binary interaction evidences from 28 servers globally, the PSICQUIC interface allows the concurrent search of multiple molecular-interaction information resources using a single query. Here, we present an extension of the PSICQUIC specification (version 1.3), which has been released to be compliant with the enhanced standards in molecular interactions. The new release also includes a new reference implementation of the PSICQUIC server available to the data providers. It offers augmented web service capabilities and improves the user experience. PSICQUIC has been running for almost 5 years, with a user base growing from only 4 data providers to 28 (April 2013) allowing access to 151 310 109 binary interactions. The power of this web service is shown in PSICQUIC View web application, an example of how to simultaneously query, browse and download results from the different PSICQUIC servers. This application is free and open to all users with no login requirement (http://www.ebi.ac.uk/Tools/webservices/psicquic/view/main.xhtml).
xiSPEC: web-based visualization, analysis and sharing of proteomics data.
Kolbowski, Lars; Combe, Colin; Rappsilber, Juri
2018-05-08
We present xiSPEC, a standard compliant, next-generation web-based spectrum viewer for visualizing, analyzing and sharing mass spectrometry data. Peptide-spectrum matches from standard proteomics and cross-linking experiments are supported. xiSPEC is to date the only browser-based tool supporting the standardized file formats mzML and mzIdentML defined by the proteomics standards initiative. Users can either upload data directly or select files from the PRIDE data repository as input. xiSPEC allows users to save and share their datasets publicly or password protected for providing access to collaborators or readers and reviewers of manuscripts. The identification table features advanced interaction controls and spectra are presented in three interconnected views: (i) annotated mass spectrum, (ii) peptide sequence fragmentation key and (iii) quality control error plots of matched fragments. Highlighting or selecting data points in any view is represented in all other views. Views are interactive scalable vector graphic elements, which can be exported, e.g. for use in publication. xiSPEC allows for re-annotation of spectra for easy hypothesis testing by modifying input data. xiSPEC is freely accessible at http://spectrumviewer.org and the source code is openly available on https://github.com/Rappsilber-Laboratory/xiSPEC.
CPTAC Proteomics Data on UCSC Genome Browser | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium scientists are working together with the University of California, Santa Cruz (UCSC) Genomics Institute to provide public access to cancer proteomics data via the UCSC Genome Browser. This effort extends accessibility of the CPTAC data to more researchers and provides an additional level of analysis to assist the cancer biology community.
M2Lite: An Open-source, Light-weight, Pluggable and Fast Proteome Discoverer MSF to mzIdentML Tool.
Aiyetan, Paul; Zhang, Bai; Chen, Lily; Zhang, Zhen; Zhang, Hui
2014-04-28
Proteome Discoverer is one of many tools used for protein database search and peptide to spectrum assignment in mass spectrometry-based proteomics. However, the inadequacy of conversion tools makes it challenging to compare and integrate its results to those of other analytical tools. Here we present M2Lite, an open-source, light-weight, easily pluggable and fast conversion tool. M2Lite converts proteome discoverer derived MSF files to the proteomics community defined standard - the mzIdentML file format. M2Lite's source code is available as open-source at https://bitbucket.org/paiyetan/m2lite/src and its compiled binaries and documentation can be freely downloaded at https://bitbucket.org/paiyetan/m2lite/downloads.
An object model and database for functional genomics.
Jones, Andrew; Hunt, Ela; Wastling, Jonathan M; Pizarro, Angel; Stoeckert, Christian J
2004-07-10
Large-scale functional genomics analysis is now feasible and presents significant challenges in data analysis, storage and querying. Data standards are required to enable the development of public data repositories and to improve data sharing. There is an established data format for microarrays (microarray gene expression markup language, MAGE-ML) and a draft standard for proteomics (PEDRo). We believe that all types of functional genomics experiments should be annotated in a consistent manner, and we hope to open up new ways of comparing multiple datasets used in functional genomics. We have created a functional genomics experiment object model (FGE-OM), developed from the microarray model, MAGE-OM and two models for proteomics, PEDRo and our own model (Gla-PSI-Glasgow Proposal for the Proteomics Standards Initiative). FGE-OM comprises three namespaces representing (i) the parts of the model common to all functional genomics experiments; (ii) microarray-specific components; and (iii) proteomics-specific components. We believe that FGE-OM should initiate discussion about the contents and structure of the next version of MAGE and the future of proteomics standards. A prototype database called RNA And Protein Abundance Database (RAPAD), based on FGE-OM, has been implemented and populated with data from microbial pathogenesis. FGE-OM and the RAPAD schema are available from http://www.gusdb.org/fge.html, along with a set of more detailed diagrams. RAPAD can be accessed by registration at the site.
IdentiPy: An Extensible Search Engine for Protein Identification in Shotgun Proteomics.
Levitsky, Lev I; Ivanov, Mark V; Lobas, Anna A; Bubis, Julia A; Tarasova, Irina A; Solovyeva, Elizaveta M; Pridatchenko, Marina L; Gorshkov, Mikhail V
2018-06-18
We present an open-source, extensible search engine for shotgun proteomics. Implemented in Python programming language, IdentiPy shows competitive processing speed and sensitivity compared with the state-of-the-art search engines. It is equipped with a user-friendly web interface, IdentiPy Server, enabling the use of a single server installation accessed from multiple workstations. Using a simplified version of X!Tandem scoring algorithm and its novel "autotune" feature, IdentiPy outperforms the popular alternatives on high-resolution data sets. Autotune adjusts the search parameters for the particular data set, resulting in improved search efficiency and simplifying the user experience. IdentiPy with the autotune feature shows higher sensitivity compared with the evaluated search engines. IdentiPy Server has built-in postprocessing and protein inference procedures and provides graphic visualization of the statistical properties of the data set and the search results. It is open-source and can be freely extended to use third-party scoring functions or processing algorithms and allows customization of the search workflow for specialized applications.
2013-01-01
Background Contemporary coral reef research has firmly established that a genomic approach is urgently needed to better understand the effects of anthropogenic environmental stress and global climate change on coral holobiont interactions. Here we present KEGG orthology-based annotation of the complete genome sequence of the scleractinian coral Acropora digitifera and provide the first comprehensive view of the genome of a reef-building coral by applying advanced bioinformatics. Description Sequences from the KEGG database of protein function were used to construct hidden Markov models. These models were used to search the predicted proteome of A. digitifera to establish complete genomic annotation. The annotated dataset is published in ZoophyteBase, an open access format with different options for searching the data. A particularly useful feature is the ability to use a Google-like search engine that links query words to protein attributes. We present features of the annotation that underpin the molecular structure of key processes of coral physiology that include (1) regulatory proteins of symbiosis, (2) planula and early developmental proteins, (3) neural messengers, receptors and sensory proteins, (4) calcification and Ca2+-signalling proteins, (5) plant-derived proteins, (6) proteins of nitrogen metabolism, (7) DNA repair proteins, (8) stress response proteins, (9) antioxidant and redox-protective proteins, (10) proteins of cellular apoptosis, (11) microbial symbioses and pathogenicity proteins, (12) proteins of viral pathogenicity, (13) toxins and venom, (14) proteins of the chemical defensome and (15) coral epigenetics. Conclusions We advocate that providing annotation in an open-access searchable database available to the public domain will give an unprecedented foundation to interrogate the fundamental molecular structure and interactions of coral symbiosis and allow critical questions to be addressed at the genomic level based on combined aspects of evolutionary, developmental, metabolic, and environmental perspectives. PMID:23889801
University of Victoria Genome British Columbia Proteomics Centre, a leader in proteomic technology development, has partnered with the U.S. National Cancer Institute (NCI) to make targeted proteomic assays accessible to the community through NCI’s CPTAC Assay Portal (https://assays.cancer.gov).
The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce the opening of the leaderboard to its Proteogenomics Computational DREAM Challenge. The leadership board remains open for submissions during September 25, 2017 through October 8, 2017, with the Challenge expected to run until November 17, 2017.
compomics-utilities: an open-source Java library for computational proteomics.
Barsnes, Harald; Vaudel, Marc; Colaert, Niklaas; Helsens, Kenny; Sickmann, Albert; Berven, Frode S; Martens, Lennart
2011-03-08
The growing interest in the field of proteomics has increased the demand for software tools and applications that process and analyze the resulting data. And even though the purpose of these tools can vary significantly, they usually share a basic set of features, including the handling of protein and peptide sequences, the visualization of (and interaction with) spectra and chromatograms, and the parsing of results from various proteomics search engines. Developers typically spend considerable time and effort implementing these support structures, which detracts from working on the novel aspects of their tool. In order to simplify the development of proteomics tools, we have implemented an open-source support library for computational proteomics, called compomics-utilities. The library contains a broad set of features required for reading, parsing, and analyzing proteomics data. compomics-utilities is already used by a long list of existing software, ensuring library stability and continued support and development. As a user-friendly, well-documented and open-source library, compomics-utilities greatly simplifies the implementation of the basic features needed in most proteomics tools. Implemented in 100% Java, compomics-utilities is fully portable across platforms and architectures. Our library thus allows the developers to focus on the novel aspects of their tools, rather than on the basic functions, which can contribute substantially to faster development, and better tools for proteomics.
Halligan, Brian D.; Geiger, Joey F.; Vallejos, Andrew K.; Greene, Andrew S.; Twigger, Simon N.
2009-01-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step by step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center website (http://proteomics.mcw.edu/vipdac). PMID:19358578
Halligan, Brian D; Geiger, Joey F; Vallejos, Andrew K; Greene, Andrew S; Twigger, Simon N
2009-06-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step-by-step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center Web site ( http://proteomics.mcw.edu/vipdac ).
DAPD: A Knowledgebase for Diabetes Associated Proteins.
Gopinath, Krishnasamy; Jayakumararaj, Ramaraj; Karthikeyan, Muthusamy
2015-01-01
Recent advancements in genomics and proteomics provide a solid foundation for understanding the pathogenesis of diabetes. Proteomics of diabetes associated pathways help to identify the most potent target for the management of diabetes. The relevant datasets are scattered in various prominent sources which takes much time to select the therapeutic target for the clinical management of diabetes. However, additional information about target proteins is needed for validation. This lacuna may be resolved by linking diabetes associated genes, pathways and proteins and it will provide a strong base for the treatment and planning management strategies of diabetes. Thus, a web source "Diabetes Associated Proteins Database (DAPD)" has been developed to link the diabetes associated genes, pathways and proteins using PHP, MySQL. The current version of DAPD has been built with proteins associated with different types of diabetes. In addition, DAPD has been linked to external sources to gain the access to more participatory proteins and their pathway network. DAPD will reduce the time and it is expected to pave the way for the discovery of novel anti-diabetic leads using computational drug designing for diabetes management. DAPD is open accessed via following url www.mkarthikeyan.bioinfoau.org/dapd.
Deutsch, Eric W.; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L.
2015-01-01
Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include mass spectrometry to define protein sequence, protein:protein interactions, and protein post-translational modifications. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative mass spectrometry proteomics. It supports all major operating systems and instrument vendors via open data formats. Here we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of tandem mass spectrometry datasets, as well as some major upcoming features. PMID:25631240
Deutsch, Eric W; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L
2015-08-01
Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include MS to define protein sequence, protein:protein interactions, and protein PTMs. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative MS proteomics. It supports all major operating systems and instrument vendors via open data formats. Here, we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of MS/MS datasets, as well as some major upcoming features. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
jmzML, an open-source Java API for mzML, the PSI standard for MS data.
Côté, Richard G; Reisinger, Florian; Martens, Lennart
2010-04-01
We here present jmzML, a Java API for the Proteomics Standards Initiative mzML data standard. Based on the Java Architecture for XML Binding and XPath-based XML indexer random-access XML parser, jmzML can handle arbitrarily large files in minimal memory, allowing easy and efficient processing of mzML files using the Java programming language. jmzML also automatically resolves internal XML references on-the-fly. The library (which includes a viewer) can be downloaded from http://jmzml.googlecode.com.
IPeak: An open source tool to combine results from multiple MS/MS search engines.
Wen, Bo; Du, Chaoqin; Li, Guilin; Ghali, Fawaz; Jones, Andrew R; Käll, Lukas; Xu, Shaohang; Zhou, Ruo; Ren, Zhe; Feng, Qiang; Xu, Xun; Wang, Jun
2015-09-01
Liquid chromatography coupled tandem mass spectrometry (LC-MS/MS) is an important technique for detecting peptides in proteomics studies. Here, we present an open source software tool, termed IPeak, a peptide identification pipeline that is designed to combine the Percolator post-processing algorithm and multi-search strategy to enhance the sensitivity of peptide identifications without compromising accuracy. IPeak provides a graphical user interface (GUI) as well as a command-line interface, which is implemented in JAVA and can work on all three major operating system platforms: Windows, Linux/Unix and OS X. IPeak has been designed to work with the mzIdentML standard from the Proteomics Standards Initiative (PSI) as an input and output, and also been fully integrated into the associated mzidLibrary project, providing access to the overall pipeline, as well as modules for calling Percolator on individual search engine result files. The integration thus enables IPeak (and Percolator) to be used in conjunction with any software packages implementing the mzIdentML data standard. IPeak is freely available and can be downloaded under an Apache 2.0 license at https://code.google.com/p/mzidentml-lib/. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Helsens, Kenny; Colaert, Niklaas; Barsnes, Harald; Muth, Thilo; Flikka, Kristian; Staes, An; Timmerman, Evy; Wortelkamp, Steffi; Sickmann, Albert; Vandekerckhove, Joël; Gevaert, Kris; Martens, Lennart
2010-03-01
MS-based proteomics produces large amounts of mass spectra that require processing, identification and possibly quantification before interpretation can be undertaken. High-throughput studies require automation of these various steps, and management of the data in association with the results obtained. We here present ms_lims (http://genesis.UGent.be/ms_lims), a freely available, open-source system based on a central database to automate data management and processing in MS-driven proteomics analyses.
A proteomic approach to obesity and type 2 diabetes
López-Villar, Elena; Martos-Moreno, Gabriel Á; Chowen, Julie A; Okada, Shigeru; Kopchick, John J; Argente, Jesús
2015-01-01
The incidence of obesity and type diabetes 2 has increased dramatically resulting in an increased interest in its biomedical relevance. However, the mechanisms that trigger the development of diabetes type 2 in obese patients remain largely unknown. Scientific, clinical and pharmaceutical communities are dedicating vast resources to unravel this issue by applying different omics tools. During the last decade, the advances in proteomic approaches and the Human Proteome Organization have opened and are opening a new door that may be helpful in the identification of patients at risk and to improve current therapies. Here, we briefly review some of the advances in our understanding of type 2 diabetes that have occurred through the application of proteomics. We also review, in detail, the current improvements in proteomic methodologies and new strategies that could be employed to further advance our understanding of this pathology. By applying these new proteomic advances, novel therapeutic and/or diagnostic protein targets will be discovered in the obesity/Type 2 diabetes area. PMID:25960181
Fernandes, Pedro; Gevaert, Kris; Rothacker, Julie; Saiyed, Taslimarif; Detwiler, Michelle
2012-01-01
This roundtable will feature four international speakers who will discuss national and international collaborative initiatives and outreach efforts in which they participate. They will share how these efforts have facilitated access to cutting-edge technology, fostered new generations of scientists, and ultimately advanced the progression of global scientific research. Open discussion will follow the presentations! Centre for Cellular and Molecular Platforms, National Centre for Biological Sciences, India: experiences in implementing a national high-end core facility organization with the goals of improving regional technology access and enhancing the quality of research for scientists in academia, biotechnology companies, and the biopharmaceutical industry.Monash University Technology Platforms and Broader Victorian and Australian Networks: Australian initiatives to build global research capabilities and identify means to internationally benchmark regional capabilities to ensure delivery of world class infrastructure. Within the context of the current Australian strategic framework, funding considerations will be discussed, along with expectations for partner facilities to collaborate and be fully accessible to academia and industry.Instituto Gulbenkian de Ciencia, Portugal and beyond: Multiple roles of networking in science and extending outreach while consolidating community integration. Discussion will include achievement of community building and integration using concepts of sharing, training, resource availability, and the value and empowerment gained using acquired skills. The role of networking and institutional visibility will also be discussed.PRIME-XS: This EU-funded consortium provides an infrastructure of proteomics technologies to the European research community. The core is formed by six access facilities through which the consortium provides access to their technologies. Twelve partners work together to develop new resources to aid the community including the development of bioinformatic tools to analyze large-scale proteomics data and novel technologies to analyze protein interaction networks, post-translational modifications and more sensitive ways to detect protein and peptide biomarkers in complex samples.
Top-Down Proteomics and Farm Animal and Aquatic Sciences.
Campos, Alexandre M O; de Almeida, André M
2016-12-21
Proteomics is a field of growing importance in animal and aquatic sciences. Similar to other proteomic approaches, top-down proteomics is slowly making its way within the vast array of proteomic approaches that researchers have access to. This opinion and mini-review article is dedicated to top-down proteomics and how its use can be of importance to animal and aquatic sciences. Herein, we include an overview of the principles of top-down proteomics and how it differs regarding other more commonly used proteomic methods, especially bottom-up proteomics. In addition, we provide relevant sections on how the approach was or can be used as a research tool and conclude with our opinions of future use in animal and aquatic sciences.
NASA's GeneLab Phase II: Federated Search and Data Discovery
NASA Technical Reports Server (NTRS)
Berrios, Daniel C.; Costes, Sylvain V.; Tran, Peter B.
2017-01-01
GeneLab is currently being developed by NASA to accelerate 'open science' biomedical research in support of the human exploration of space and the improvement of life on earth. Phase I of the four-phase GeneLab Data Systems (GLDS) project emphasized capabilities for submission, curation, search, and retrieval of genomics, transcriptomics and proteomics ('omics') data from biomedical research of space environments. The focus of development of the GLDS for Phase II has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta-investigations are key to corroborating findings from many kinds of assays and translating them into systems biology knowledge and, eventually, therapeutics.
NASAs GeneLab Phase II: Federated Search and Data Discovery
NASA Technical Reports Server (NTRS)
Berrios, Daniel C.; Costes, Sylvain; Tran, Peter
2017-01-01
GeneLab is currently being developed by NASA to accelerate open science biomedical research in support of the human exploration of space and the improvement of life on earth. Phase I of the four-phase GeneLab Data Systems (GLDS) project emphasized capabilities for submission, curation, search, and retrieval of genomics, transcriptomics and proteomics (omics) data from biomedical research of space environments. The focus of development of the GLDS for Phase II has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta-investigations are key to corroborating findings from many kinds of assays and translating them into systems biology knowledge and, eventually, therapeutics.
Proteogenomics, integration of proteomics, genomics, and transcriptomics, is an emerging approach that promises to advance basic, translational and clinical research. By combining genomic and proteomic information, leading scientists are gaining new insights due to a more complete and unified understanding of complex biological processes.
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23467006
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Keller, Andrew; Bader, Samuel L.; Shteynberg, David; Hood, Leroy; Moritz, Robert L.
2015-01-01
Proteomics by mass spectrometry technology is widely used for identifying and quantifying peptides and proteins. The breadth and sensitivity of peptide detection have been advanced by the advent of data-independent acquisition mass spectrometry. Analysis of such data, however, is challenging due to the complexity of fragment ion spectra that have contributions from multiple co-eluting precursor ions. We present SWATHProphet software that identifies and quantifies peptide fragment ion traces in data-independent acquisition data, provides accurate probabilities to ensure results are correct, and automatically detects and removes contributions to quantitation originating from interfering precursor ions. Integration in the widely used open source Trans-Proteomic Pipeline facilitates subsequent analyses such as combining results of multiple data sets together for improved discrimination using iProphet and inferring sample proteins using ProteinProphet. This novel development should greatly help make data-independent acquisition mass spectrometry accessible to large numbers of users. PMID:25713123
Syn, Genevieve; Blackwell, Jenefer M; Jamieson, Sarra E; Francis, Richard W
2018-01-01
Toxoplasma gondii uses epigenetic mechanisms to regulate both endogenous and host cell gene expression. To identify genes with putative epigenetic functions, we developed an in silico pipeline to interrogate the T. gondii proteome of 8313 proteins. Step 1 employs PredictNLS and NucPred to identify genes predicted to target eukaryotic nuclei. Step 2 uses GOLink to identify proteins of epigenetic function based on Gene Ontology terms. This resulted in 611 putative nuclear localised proteins with predicted epigenetic functions. Step 3 filtered for secretory proteins using SignalP, SecretomeP, and experimental data. This identified 57 of the 611 putative epigenetic proteins as likely to be secreted. The pipeline is freely available online, uses open access tools and software with user-friendly Perl scripts to automate and manage the results, and is readily adaptable to undertake any such in silico search for genes contributing to particular functions.
Comet: an open-source MS/MS sequence database search tool.
Eng, Jimmy K; Jahan, Tahmina A; Hoopmann, Michael R
2013-01-01
Proteomics research routinely involves identifying peptides and proteins via MS/MS sequence database search. Thus the database search engine is an integral tool in many proteomics research groups. Here, we introduce the Comet search engine to the existing landscape of commercial and open-source database search tools. Comet is open source, freely available, and based on one of the original sequence database search tools that has been widely used for many years. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
QCloud: A cloud-based quality control system for mass spectrometry-based proteomics laboratories
Chiva, Cristina; Olivella, Roger; Borràs, Eva; Espadas, Guadalupe; Pastor, Olga; Solé, Amanda
2018-01-01
The increasing number of biomedical and translational applications in mass spectrometry-based proteomics poses new analytical challenges and raises the need for automated quality control systems. Despite previous efforts to set standard file formats, data processing workflows and key evaluation parameters for quality control, automated quality control systems are not yet widespread among proteomics laboratories, which limits the acquisition of high-quality results, inter-laboratory comparisons and the assessment of variability of instrumental platforms. Here we present QCloud, a cloud-based system to support proteomics laboratories in daily quality assessment using a user-friendly interface, easy setup, automated data processing and archiving, and unbiased instrument evaluation. QCloud supports the most common targeted and untargeted proteomics workflows, it accepts data formats from different vendors and it enables the annotation of acquired data and reporting incidences. A complete version of the QCloud system has successfully been developed and it is now open to the proteomics community (http://qcloud.crg.eu). QCloud system is an open source project, publicly available under a Creative Commons License Attribution-ShareAlike 4.0. PMID:29324744
A proteomic approach to obesity and type 2 diabetes.
López-Villar, Elena; Martos-Moreno, Gabriel Á; Chowen, Julie A; Okada, Shigeru; Kopchick, John J; Argente, Jesús
2015-07-01
The incidence of obesity and type diabetes 2 has increased dramatically resulting in an increased interest in its biomedical relevance. However, the mechanisms that trigger the development of diabetes type 2 in obese patients remain largely unknown. Scientific, clinical and pharmaceutical communities are dedicating vast resources to unravel this issue by applying different omics tools. During the last decade, the advances in proteomic approaches and the Human Proteome Organization have opened and are opening a new door that may be helpful in the identification of patients at risk and to improve current therapies. Here, we briefly review some of the advances in our understanding of type 2 diabetes that have occurred through the application of proteomics. We also review, in detail, the current improvements in proteomic methodologies and new strategies that could be employed to further advance our understanding of this pathology. By applying these new proteomic advances, novel therapeutic and/or diagnostic protein targets will be discovered in the obesity/Type 2 diabetes area. © 2015 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
CPTAC Contributes to Healthdata.gov | Office of Cancer Clinical Proteomics Research
Recently, proteomic data generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) funded by National Cancer Institute (NCI) was highlighted to the wider research community at Healthdata.gov. Healthdata.gov aims to make health data more accessible to entrepreneurs, researchers, and policy makers in the hopes of improving health outcomes f
Proteome of Caulobacter crescentus cell cycle publicly accessible on SWICZ server.
Vohradsky, Jiri; Janda, Ivan; Grünenfelder, Björn; Berndt, Peter; Röder, Daniel; Langen, Hanno; Weiser, Jaroslav; Jenal, Urs
2003-10-01
Here we present the Swiss-Czech Proteomics Server (SWICZ), which hosts the proteomic database summarizing information about the cell cycle of the aquatic bacterium Caulobacter crescentus. The database provides a searchable tool for easy access of global protein synthesis and protein stability data as examined during the C. crescentus cell cycle. Protein synthesis data collected from five different cell cycle stages were determined for each protein spot as a relative value of the total amount of [(35)S]methionine incorporation. Protein stability of pulse-labeled extracts were measured during a chase period equivalent to one cell cycle unit. Quantitative information for individual proteins together with descriptive data such as protein identities, apparent molecular masses and isoelectric points, were combined with information on protein function, genomic context, and the cell cycle stage, and were then assembled in a relational database with a world wide web interface (http://proteom.biomed.cas.cz), which allows the database records to be searched and displays the recovered information. A total of 1250 protein spots were reproducibly detected on two-dimensional gel electropherograms, 295 of which were identified by mass spectroscopy. The database is accessible either through clickable two-dimensional gel electrophoretic maps or by means of a set of dedicated search engines. Basic characterization of the experimental procedures, data processing, and a comprehensive description of the web site are presented. In its current state, the SWICZ proteome database provides a platform for the incorporation of new data emerging from extended functional studies on the C. crescentus proteome.
Efficient visualization of high-throughput targeted proteomics experiments: TAPIR.
Röst, Hannes L; Rosenberger, George; Aebersold, Ruedi; Malmström, Lars
2015-07-15
Targeted mass spectrometry comprises a set of powerful methods to obtain accurate and consistent protein quantification in complex samples. To fully exploit these techniques, a cross-platform and open-source software stack based on standardized data exchange formats is required. We present TAPIR, a fast and efficient Python visualization software for chromatograms and peaks identified in targeted proteomics experiments. The input formats are open, community-driven standardized data formats (mzML for raw data storage and TraML encoding the hierarchical relationships between transitions, peptides and proteins). TAPIR is scalable to proteome-wide targeted proteomics studies (as enabled by SWATH-MS), allowing researchers to visualize high-throughput datasets. The framework integrates well with existing automated analysis pipelines and can be extended beyond targeted proteomics to other types of analyses. TAPIR is available for all computing platforms under the 3-clause BSD license at https://github.com/msproteomicstools/msproteomicstools. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Proteogenomics Dashboard for the Human Proteome Project.
Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto
2015-09-04
dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analog genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell lines experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modifications information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.
The complete genome sequence and proteomics of Yersinia pestis phage Yep-phi.
Zhao, Xiangna; Wu, Weili; Qi, Zhizhen; Cui, Yujun; Yan, Yanfeng; Guo, Zhaobiao; Wang, Zuyun; Wang, Hu; Deng, Haijun; Xue, Yan; Chen, Weijun; Wang, Xiaoyi; Yang, Ruifu
2011-01-01
Yep-phi, a lytic phage of Yersinia pestis, was isolated in China and is routinely used as a diagnostic phage for the identification of the plague pathogen. Yep-phi has an isometric hexagonal head containing dsDNA and a short non-contractile conical tail. In this study, we sequenced the Yep-phi genome (GenBank accession no. HQ333270) and performed proteomics analysis. The genome consists of 38 ,616 bp of DNA, including direct terminal repeats of 222 bp, and is predicted to contain 45 ORFs. Most structural proteins were identified by proteomics analysis. Compared with the three available genome sequences of lytic phages for Y. pestis, the phages could be divided into two subgroups. Yep-phi displays marked homology to the bacteriophages Berlin (GenBank accession no. AM183667) and Yepe2 (GenBank accession no. EU734170), and these comprise one subgroup. The other subgroup is represented by bacteriophage ΦA1122 (GenBank accession no. AY247822). Potential recombination was detected among the Yep-phi subgroup.
Vanderperre, Benoît; Lucier, Jean-François; Bissonnette, Cyntia; Motard, Julie; Tremblay, Guillaume; Vanderperre, Solène; Wisztorski, Maxence; Salzet, Michel; Boisvert, François-Michel; Roucou, Xavier
2013-01-01
A fully mature mRNA is usually associated to a reference open reading frame encoding a single protein. Yet, mature mRNAs contain unconventional alternative open reading frames (AltORFs) located in untranslated regions (UTRs) or overlapping the reference ORFs (RefORFs) in non-canonical +2 and +3 reading frames. Although recent ribosome profiling and footprinting approaches have suggested the significant use of unconventional translation initiation sites in mammals, direct evidence of large-scale alternative protein expression at the proteome level is still lacking. To determine the contribution of alternative proteins to the human proteome, we generated a database of predicted human AltORFs revealing a new proteome mainly composed of small proteins with a median length of 57 amino acids, compared to 344 amino acids for the reference proteome. We experimentally detected a total of 1,259 alternative proteins by mass spectrometry analyses of human cell lines, tissues and fluids. In plasma and serum, alternative proteins represent up to 55% of the proteome and may be a potential unsuspected new source for biomarkers. We observed constitutive co-expression of RefORFs and AltORFs from endogenous genes and from transfected cDNAs, including tumor suppressor p53, and provide evidence that out-of-frame clones representing AltORFs are mistakenly rejected as false positive in cDNAs screening assays. Functional importance of alternative proteins is strongly supported by significant evolutionary conservation in vertebrates, invertebrates, and yeast. Our results imply that coding of multiple proteins in a single gene by the use of AltORFs may be a common feature in eukaryotes, and confirm that translation of unconventional ORFs generates an as yet unexplored proteome. PMID:23950983
Chalivendra, Subbaiah C.; Lopez-Casado, Gloria; Bedinger, Patricia A.
2013-01-01
Although self-incompatibility (SI) in plants has been studied extensively, far less is known about interspecific reproductive barriers. One interspecific barrier, known as unilateral incongruity or incompatibility (UI), occurs when species display unidirectional compatibility in interspecific crosses. In the wild tomato species Solanum pennellii, both SI and self-compatible (SC) populations express UI when crossed with domesticated tomato, offering a useful model system to dissect the molecular mechanisms involved in reproductive barriers. In this study, the timing of reproductive barrier establishment during pistil development was determined in SI and SC accessions of S. pennellii using a semi-in vivo system to track pollen-tube growth in developing styles. Both SI and UI barriers were absent in styles 5 days prior to flower opening, but were established by 2 days before flower opening, with partial barriers detected during a transition period 3–4 days before flower opening. The developmental expression dynamics of known SI factors, S-RNases and HT proteins, was also examined. The accumulation of HT-A protein coincided temporally and spatially with UI barriers in developing pistils. Proteomic analysis of stigma/styles from key developmental stages showed a switch in protein profiles from cell-division-associated proteins in immature stigma/styles to a set of proteins in mature stigma/styles that included S-RNases, HT-A protein and proteins associated with cell-wall loosening and defense responses, which could be involved in pollen–pistil interactions. Other prominent proteins in mature stigma/styles were those involved in lipid metabolism, consistent with the accumulation of lipid-rich material during pistil maturation. PMID:23166371
Using the Proteomics Identifications Database (PRIDE).
Martens, Lennart; Jones, Phil; Côté, Richard
2008-03-01
The Proteomics Identifications Database (PRIDE) is a public data repository designed to store, disseminate, and analyze mass spectrometry based proteomics datasets. The PRIDE database can accommodate any level of detailed metadata about the submitted results, which can be queried, explored, viewed, or downloaded via the PRIDE Web interface. The PRIDE database also provides a simple, yet powerful, access control mechanism that fully supports confidential peer-reviewing of data related to a manuscript, ensuring that these results remain invisible to the general public while allowing referees and journal editors anonymized access to the data. This unit describes in detail the functionality that PRIDE provides with regards to searching, viewing, and comparing the available data, as well as different options for submitting data to PRIDE.
BatMass: a Java Software Platform for LC-MS Data Visualization in Proteomics and Metabolomics.
Avtonomov, Dmitry M; Raskind, Alexander; Nesvizhskii, Alexey I
2016-08-05
Mass spectrometry (MS) coupled to liquid chromatography (LC) is a commonly used technique in metabolomic and proteomic research. As the size and complexity of LC-MS-based experiments grow, it becomes increasingly more difficult to perform quality control of both raw data and processing results. In a practical setting, quality control steps for raw LC-MS data are often overlooked, and assessment of an experiment's success is based on some derived metrics such as "the number of identified compounds". The human brain interprets visual data much better than plain text, hence the saying "a picture is worth a thousand words". Here, we present the BatMass software package, which allows for performing quick quality control of raw LC-MS data through its fast visualization capabilities. It also serves as a testbed for developers of LC-MS data processing algorithms by providing a data access library for open mass spectrometry file formats and a means of visually mapping processing results back to the original data. We illustrate the utility of BatMass with several use cases of quality control and data exploration.
BatMass: a Java software platform for LC/MS data visualization in proteomics and metabolomics
Avtonomov, Dmitry; Raskind, Alexander; Nesvizhskii, Alexey I.
2017-01-01
Mass spectrometry (MS) coupled to liquid chromatography (LC) is a commonly used technique in metabolomic and proteomic research. As the size and complexity of LC/MS based experiments grow, it becomes increasingly more difficult to perform quality control of both raw data and processing results. In a practical setting, quality control steps for raw LC/MS data are often overlooked and assessment of an experiment's success is based on some derived metrics such as “the number of identified compounds”. Human brain interprets visual data much better than plain text, hence the saying “a picture is worth a thousand words”. Here we present BatMass software package which allows to perform quick quality control of raw LC/MS data through its fast visualization capabilities. It also serves as a testbed for developers of LC/MS data processing algorithms by providing a data access library for open mass spectrometry file formats and a means of visually mapping processing results back to the original data. We illustrate the utility of BatMass with several use cases of quality control and data exploration. PMID:27306858
MannDB: A microbial annotation database for protein characterization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, C; Lam, M; Smith, J
2006-05-19
MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-sourcemore » tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports.« less
A 2-D guinea pig lung proteome map
USDA-ARS?s Scientific Manuscript database
Guinea pigs represent an important model for a number of infectious and non-infectious pulmonary diseases. The guinea pig genome has recently been sequenced to full coverage, opening up new research avenues using genomics, transcriptomics and proteomics techniques in this species. In order to furth...
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V.; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J.; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wiśniewski, Jacek R.; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes; including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools. PMID:17090601
Proteomics technique opens new frontiers in mobilome research.
Davidson, Andrew D; Matthews, David A; Maringer, Kevin
2017-01-01
A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti ). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.
Computer applications making rapid advances in high throughput microbial proteomics (HTMP).
Anandkumar, Balakrishna; Haga, Steve W; Wu, Hui-Fen
2014-02-01
The last few decades have seen the rise of widely-available proteomics tools. From new data acquisition devices, such as MALDI-MS and 2DE to new database searching softwares, these new products have paved the way for high throughput microbial proteomics (HTMP). These tools are enabling researchers to gain new insights into microbial metabolism, and are opening up new areas of study, such as protein-protein interactions (interactomics) discovery. Computer software is a key part of these emerging fields. This current review considers: 1) software tools for identifying the proteome, such as MASCOT or PDQuest, 2) online databases of proteomes, such as SWISS-PROT, Proteome Web, or the Proteomics Facility of the Pathogen Functional Genomics Resource Center, and 3) software tools for applying proteomic data, such as PSI-BLAST or VESPA. These tools allow for research in network biology, protein identification, functional annotation, target identification/validation, protein expression, protein structural analysis, metabolic pathway engineering and drug discovery.
Sys-BodyFluid: a systematical database for human body fluid proteome research
Li, Su-Jun; Peng, Mao; Li, Hong; Liu, Bo-Shu; Wang, Chuan; Wu, Jia-Rui; Li, Yi-Xue; Zeng, Rong
2009-01-01
Recently, body fluids have widely become an important target for proteomic research and proteomic study has produced more and more body fluid related protein data. A database is needed to collect and analyze these proteome data. Thus, we developed this web-based body fluid proteome database Sys-BodyFluid. It contains eleven kinds of body fluid proteomes, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, seminal fluid, human milk and amniotic fluid. Over 10 000 proteins are presented in the Sys-BodyFluid. Sys-BodyFluid provides the detailed protein annotations, including protein description, Gene Ontology, domain information, protein sequence and involved pathways. These proteome data can be retrieved by using protein name, protein accession number and sequence similarity. In addition, users can query between these different body fluids to get the different proteins identification information. Sys-BodyFluid database can facilitate the body fluid proteomics and disease proteomics research as a reference database. It is available at http://www.biosino.org/bodyfluid/. PMID:18978022
Sys-BodyFluid: a systematical database for human body fluid proteome research.
Li, Su-Jun; Peng, Mao; Li, Hong; Liu, Bo-Shu; Wang, Chuan; Wu, Jia-Rui; Li, Yi-Xue; Zeng, Rong
2009-01-01
Recently, body fluids have widely become an important target for proteomic research and proteomic study has produced more and more body fluid related protein data. A database is needed to collect and analyze these proteome data. Thus, we developed this web-based body fluid proteome database Sys-BodyFluid. It contains eleven kinds of body fluid proteomes, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, seminal fluid, human milk and amniotic fluid. Over 10,000 proteins are presented in the Sys-BodyFluid. Sys-BodyFluid provides the detailed protein annotations, including protein description, Gene Ontology, domain information, protein sequence and involved pathways. These proteome data can be retrieved by using protein name, protein accession number and sequence similarity. In addition, users can query between these different body fluids to get the different proteins identification information. Sys-BodyFluid database can facilitate the body fluid proteomics and disease proteomics research as a reference database. It is available at http://www.biosino.org/bodyfluid/.
Riffle, Michael; Eng, Jimmy K.
2010-01-01
The field of proteomics, particularly the application of mass spectrometry analysis to protein samples, is well-established and growing rapidly. Proteomics studies generate large volumes of raw experimental data and inferred biological results. To facilitate the dissemination of these data, centralized data repositories have been developed that make the data and results accessible to proteomics researchers and biologists alike. This review of proteomics data repositories focuses exclusively on freely-available, centralized data resources that disseminate or store experimental mass spectrometry data and results. The resources chosen reflect a current “snapshot” of the state of resources available with an emphasis placed on resources that may be of particular interest to yeast researchers. Resources are described in terms of their intended purpose and the features and functionality provided to users. PMID:19795424
Palmblad, Magnus; van der Burgt, Yuri E M; Dalebout, Hans; Derks, Rico J E; Schoenmaker, Bart; Deelder, André M
2009-05-02
Accurate mass determination enhances peptide identification in mass spectrometry based proteomics. We here describe the combination of two previously published open source software tools to improve mass measurement accuracy in Fourier transform ion cyclotron resonance mass spectrometry (FTICRMS). The first program, msalign, aligns one MS/MS dataset with one FTICRMS dataset. The second software, recal2, uses peptides identified from the MS/MS data for automated internal calibration of the FTICR spectra, resulting in sub-ppm mass measurement errors.
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes.
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wisniewski, Jacek R; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes; including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at http://www.mapuproteome.com using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools.
Making open data work for plant scientists.
Leonelli, Sabina; Smirnoff, Nicholas; Moore, Jonathan; Cook, Charis; Bastow, Ruth
2013-11-01
Despite the clear demand for open data sharing, its implementation within plant science is still limited. This is, at least in part, because open data-sharing raises several unanswered questions and challenges to current research practices. In this commentary, some of the challenges encountered by plant researchers at the bench when generating, interpreting, and attempting to disseminate their data have been highlighted. The difficulties involved in sharing sequencing, transcriptomics, proteomics, and metabolomics data are reviewed. The benefits and drawbacks of three data-sharing venues currently available to plant scientists are identified and assessed: (i) journal publication; (ii) university repositories; and (iii) community and project-specific databases. It is concluded that community and project-specific databases are the most useful to researchers interested in effective data sharing, since these databases are explicitly created to meet the researchers' needs, support extensive curation, and embody a heightened awareness of what it takes to make data reuseable by others. Such bottom-up and community-driven approaches need to be valued by the research community, supported by publishers, and provided with long-term sustainable support by funding bodies and government. At the same time, these databases need to be linked to generic databases where possible, in order to be discoverable to the majority of researchers and thus promote effective and efficient data sharing. As we look forward to a future that embraces open access to data and publications, it is essential that data policies, data curation, data integration, data infrastructure, and data funding are linked together so as to foster data access and research productivity.
SearchGUI: A Highly Adaptable Common Interface for Proteomics Search and de Novo Engines.
Barsnes, Harald; Vaudel, Marc
2018-05-25
Mass-spectrometry-based proteomics has become the standard approach for identifying and quantifying proteins. A vital step consists of analyzing experimentally generated mass spectra to identify the underlying peptide sequences for later mapping to the originating proteins. We here present the latest developments in SearchGUI, a common open-source interface for the most frequently used freely available proteomics search and de novo engines that has evolved into a central component in numerous bioinformatics workflows.
Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M.; LaBaer, Joshua
2014-01-01
Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies. PMID:24955142
Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M; LaBaer, Joshua
2014-01-01
Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.
Public Access for Teaching Genomics, Proteomics, and Bioinformatics
ERIC Educational Resources Information Center
Campbell, A. Malcolm
2003-01-01
When the human genome project was conceived, its leaders wanted all researchers to have equal access to the data and associated research tools. Their vision of equal access provides an unprecedented teaching opportunity. Teachers and students have free access to the same databases that researchers are using. Furthermore, the recent movement to…
Proteomics data exchange and storage: the need for common standards and public repositories.
Jiménez, Rafael C; Vizcaíno, Juan Antonio
2013-01-01
Both the existence of data standards and public databases or repositories have been key factors behind the development of the existing "omics" approaches. In this book chapter we first review the main existing mass spectrometry (MS)-based proteomics resources: PRIDE, PeptideAtlas, GPMDB, and Tranche. Second, we report on the current status of the different proteomics data standards developed by the Proteomics Standards Initiative (PSI): the formats mzML, mzIdentML, mzQuantML, TraML, and PSI-MI XML are then reviewed. Finally, we present an easy way to query and access MS proteomics data in the PRIDE database, as a representative of the existing repositories, using the workflow management system (WMS) tool Taverna. Two different publicly available workflows are explained and described.
Alexander, William M; Ficarro, Scott B; Adelmant, Guillaume; Marto, Jarrod A
2017-08-01
The continued evolution of modern mass spectrometry instrumentation and associated methods represents a critical component in efforts to decipher the molecular mechanisms which underlie normal physiology and understand how dysregulation of biological pathways contributes to human disease. The increasing scale of these experiments combined with the technological diversity of mass spectrometers presents several challenges for community-wide data access, analysis, and distribution. Here we detail a redesigned version of multiplierz, our Python software library which leverages our common application programming interface (mzAPI) for analysis and distribution of proteomic data. New features include support for a wider range of native mass spectrometry file types, interfaces to additional database search engines, compatibility with new reporting formats, and high-level tools to perform post-search proteomic analyses. A GUI desktop environment, mzDesktop, provides access to multiplierz functionality through a user friendly interface. multiplierz is available for download from: https://github.com/BlaisProteomics/multiplierz; and mzDesktop is available for download from: https://sourceforge.net/projects/multiplierz/. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Jolivet, Pascale; Aymé, Laure; Giuliani, Alexandre; Wien, Frank; Chardot, Thierry; Gohon, Yann
2017-10-03
Lipid droplets are the major stock of lipids in oleaginous plant seeds. Despite their economic importance for oil production and biotechnological issues (biofuels, lubricants and plasticizers), numerous questions about their formation, structure and regulation are still unresolved. To determine water accessible domains of protein coating at lipid droplets surface, a structural proteomic approach has been performed. This technique relies on the millisecond timescale production of hydroxyl radicals by the radiolysis of water using Synchrotron X-ray white beam. Thanks to the evolution of mass spectrometry analysis techniques this approach allows the creation of a map of the solvent accessibility for proteins difficult to study by other means. Using these results, a S3 oleosin water accessibility map is proposed. This is the first time that such a map on an oleosin co-purified with plant lipid droplets and other associated protein is presented. Lipid droplet associated proteins function is linked to stability, structure and probably formation and lipid mobilization of droplets. Structure of these proteins in their native environment, at the interface between bulk water and the lipidic core of these organelles is only based on hydrophobicity plot. Using hydroxyl radical footprinting and proteomics approaches we studied water accessibility of one major protein in these droplets: S3 oleosin of Arabidopsis thaliana seeds. Copyright © 2017 Elsevier B.V. All rights reserved.
Proteomic profiling of human plasma for cancer biomarker discovery.
Huang, Zhao; Ma, Linguang; Huang, Canhua; Li, Qifu; Nice, Edouard C
2017-03-01
Over the past decades, substantial advances have been made in both the early diagnosis and accurate prognosis of many cancers because of the impressive development of novel proteomic strategies. However, it remains difficult to standardize proteomic approaches. In addition, the heterogeneity of proteins in distinct tissues results in incomplete population of the whole proteome, which inevitably limits its clinical practice. As one of the most complex proteomes in the human body, the plasma proteome contains secreted proteins originating from multiple organs and tissues, making it a favorable matrix for comprehensive biomarker discovery. Here, we will discuss the roles of plasma proteome profiling in cancer biomarker discovery and validation, and highlight both the inherent advantages and disadvantages. Although several hurdles lay ahead, further advances in this technology will greatly increase our understanding of cancer biology, reveal new biomarkers and biomarker panels, and open a new avenue for more efficient early diagnosis and surveillance of cancer, leading toward personalized medicine. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomics technique opens new frontiers in mobilome research
Davidson, Andrew D.; Matthews, David A.
2017-01-01
ABSTRACT A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the “mobilome,” which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the “domestication” of transposon proteins for cellular functions. Although ‘omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called “proteomics informed by transcriptomics” (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease. PMID:28932623
Perez-Riverol, Yasset; Alpi, Emanuele; Wang, Rui; Hermjakob, Henning; Vizcaíno, Juan Antonio
2015-01-01
Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics, data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data. PMID:25158685
mz5: space- and time-efficient storage of mass spectrometry data sets.
Wilhelm, Mathias; Kirchner, Marc; Steen, Judith A J; Steen, Hanno
2012-01-01
Across a host of MS-driven-omics fields, researchers witness the acquisition of ever increasing amounts of high throughput MS data and face the need for their compact yet efficiently accessible storage. Addressing the need for an open data exchange format, the Proteomics Standards Initiative and the Seattle Proteome Center at the Institute for Systems Biology independently developed the mzData and mzXML formats, respectively. In a subsequent joint effort, they defined an ontology and associated controlled vocabulary that specifies the contents of MS data files, implemented as the newer mzML format. All three formats are based on XML and are thus not particularly efficient in either storage space requirements or read/write speed. This contribution introduces mz5, a complete reimplementation of the mzML ontology that is based on the efficient, industrial strength storage backend HDF5. Compared with the current mzML standard, this strategy yields an average file size reduction to ∼54% and increases linear read and write speeds ∼3-4-fold. The format is implemented as part of the ProteoWizard project and is available under a permissive Apache license. Additional information and download links are available from http://software.steenlab.org/mz5.
Open reading frames associated with cancer in the dark matter of the human genome.
Delgado, Ana Paula; Brandao, Pamela; Chapado, Maria Julia; Hamid, Sheilin; Narayanan, Ramaswamy
2014-01-01
The uncharacterized proteins (open reading frames, ORFs) in the human genome offer an opportunity to discover novel targets for cancer. A systematic analysis of the dark matter of the human proteome for druggability and biomarker discovery is crucial to mining the genome. Numerous data mining tools are available to mine these ORFs to develop a comprehensive knowledge base for future target discovery and validation. Using the Genetic Association Database, the ORFs of the human dark matter proteome were screened for evidence of association with neoplasms. The Phenome-Genome Integrator tool was used to establish phenotypic association with disease traits including cancer. Batch analysis of the tools for protein expression analysis, gene ontology and motifs and domains was used to characterize the ORFs. Sixty-two ORFs were identified for neoplasm association. The expression Quantitative Trait Loci (eQTL) analysis identified thirteen ORFs related to cancer traits. Protein expression, motifs and domain analysis and genome-wide association studies verified the relevance of these OncoORFs in diverse tumors. The OncoORFs are also associated with a wide variety of human diseases and disorders. Our results link the OncoORFs to diverse diseases and disorders. This suggests a complex landscape of the uncharacterized proteome in human diseases. These results open the dark matter of the proteome to novel cancer target research. Copyright© 2014, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
BioMart Central Portal: an open database network for the biological community
Guberman, Jonathan M.; Ai, J.; Arnaiz, O.; Baran, Joachim; Blake, Andrew; Baldock, Richard; Chelala, Claude; Croft, David; Cros, Anthony; Cutts, Rosalind J.; Di Génova, A.; Forbes, Simon; Fujisawa, T.; Gadaleta, E.; Goodstein, D. M.; Gundem, Gunes; Haggarty, Bernard; Haider, Syed; Hall, Matthew; Harris, Todd; Haw, Robin; Hu, S.; Hubbard, Simon; Hsu, Jack; Iyer, Vivek; Jones, Philip; Katayama, Toshiaki; Kinsella, R.; Kong, Lei; Lawson, Daniel; Liang, Yong; Lopez-Bigas, Nuria; Luo, J.; Lush, Michael; Mason, Jeremy; Moreews, Francois; Ndegwa, Nelson; Oakley, Darren; Perez-Llamas, Christian; Primig, Michael; Rivkin, Elena; Rosanoff, S.; Shepherd, Rebecca; Simon, Reinhard; Skarnes, B.; Smedley, Damian; Sperling, Linda; Spooner, William; Stevenson, Peter; Stone, Kevin; Teague, J.; Wang, Jun; Wang, Jianxin; Whitty, Brett; Wong, D. T.; Wong-Erasmus, Marie; Yao, L.; Youens-Clark, Ken; Yung, Christina; Zhang, Junjun; Kasprzyk, Arek
2011-01-01
BioMart Central Portal is a first of its kind, community-driven effort to provide unified access to dozens of biological databases spanning genomics, proteomics, model organisms, cancer data, ontology information and more. Anybody can contribute an independently maintained resource to the Central Portal, allowing it to be exposed to and shared with the research community, and linking it with the other resources in the portal. Users can take advantage of the common interface to quickly utilize different sources without learning a new system for each. The system also simplifies cross-database searches that might otherwise require several complicated steps. Several integrated tools streamline common tasks, such as converting between ID formats and retrieving sequences. The combination of a wide variety of databases, an easy-to-use interface, robust programmatic access and the array of tools make Central Portal a one-stop shop for biological data querying. Here, we describe the structure of Central Portal and show example queries to demonstrate its capabilities. Database URL: http://central.biomart.org. PMID:21930507
NCBI GEO: mining millions of expression profiles--database and tools.
Barrett, Tanya; Suzek, Tugba O; Troup, Dennis B; Wilhite, Stephen E; Ngau, Wing-Chi; Ledoux, Pierre; Rudnev, Dmitry; Lash, Alex E; Fujibuchi, Wataru; Edgar, Ron
2005-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest fully public repository for high-throughput molecular abundance data, primarily gene expression data. The database has a flexible and open design that allows the submission, storage and retrieval of many data types. These data include microarray-based experiments measuring the abundance of mRNA, genomic DNA and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. GEO currently holds over 30,000 submissions representing approximately half a billion individual molecular abundance measurements, for over 100 organisms. Here, we describe recent database developments that facilitate effective mining and visualization of these data. Features are provided to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
BioMart Central Portal: an open database network for the biological community.
Guberman, Jonathan M; Ai, J; Arnaiz, O; Baran, Joachim; Blake, Andrew; Baldock, Richard; Chelala, Claude; Croft, David; Cros, Anthony; Cutts, Rosalind J; Di Génova, A; Forbes, Simon; Fujisawa, T; Gadaleta, E; Goodstein, D M; Gundem, Gunes; Haggarty, Bernard; Haider, Syed; Hall, Matthew; Harris, Todd; Haw, Robin; Hu, S; Hubbard, Simon; Hsu, Jack; Iyer, Vivek; Jones, Philip; Katayama, Toshiaki; Kinsella, R; Kong, Lei; Lawson, Daniel; Liang, Yong; Lopez-Bigas, Nuria; Luo, J; Lush, Michael; Mason, Jeremy; Moreews, Francois; Ndegwa, Nelson; Oakley, Darren; Perez-Llamas, Christian; Primig, Michael; Rivkin, Elena; Rosanoff, S; Shepherd, Rebecca; Simon, Reinhard; Skarnes, B; Smedley, Damian; Sperling, Linda; Spooner, William; Stevenson, Peter; Stone, Kevin; Teague, J; Wang, Jun; Wang, Jianxin; Whitty, Brett; Wong, D T; Wong-Erasmus, Marie; Yao, L; Youens-Clark, Ken; Yung, Christina; Zhang, Junjun; Kasprzyk, Arek
2011-01-01
BioMart Central Portal is a first of its kind, community-driven effort to provide unified access to dozens of biological databases spanning genomics, proteomics, model organisms, cancer data, ontology information and more. Anybody can contribute an independently maintained resource to the Central Portal, allowing it to be exposed to and shared with the research community, and linking it with the other resources in the portal. Users can take advantage of the common interface to quickly utilize different sources without learning a new system for each. The system also simplifies cross-database searches that might otherwise require several complicated steps. Several integrated tools streamline common tasks, such as converting between ID formats and retrieving sequences. The combination of a wide variety of databases, an easy-to-use interface, robust programmatic access and the array of tools make Central Portal a one-stop shop for biological data querying. Here, we describe the structure of Central Portal and show example queries to demonstrate its capabilities.
Perez-Riverol, Yasset; Alpi, Emanuele; Wang, Rui; Hermjakob, Henning; Vizcaíno, Juan Antonio
2015-03-01
Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics, data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data. © 2014 The Authors. PROTEOMICS published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomics in the genome engineering era.
Vandemoortele, Giel; Gevaert, Kris; Eyckerman, Sven
2016-01-01
Genome engineering experiments used to be lengthy, inefficient, and often expensive, preventing a widespread adoption of such experiments for the full assessment of endogenous protein functions. With the revolutionary clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 technology, genome engineering became accessible to the broad life sciences community and is now implemented in several research areas. One particular field that can benefit significantly from this evolution is proteomics where a substantial impact on experimental design and general proteome biology can be expected. In this review, we describe the main applications of genome engineering in proteomics, including the use of engineered disease models and endogenous epitope tagging. In addition, we provide an overview on current literature and highlight important considerations when launching genome engineering technologies in proteomics workflows. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The full proteomics analysis of a small tumor sample (similar in mass to a few grains of rice) produces well over 500 megabytes of unprocessed "raw" data when analyzed on a mass spectrometer (MS). Thus, for every proteomics experiment there is a vast amount of raw data that must be analyzed and interrogated in order to extract biological information. Moreover, the raw data output from different MS vendors are generally in different formats inhibiting the ability of labs to productively work together.
Tandem Mass Spectrum Sequencing: An Alternative to Database Search Engines in Shotgun Proteomics.
Muth, Thilo; Rapp, Erdmann; Berven, Frode S; Barsnes, Harald; Vaudel, Marc
2016-01-01
Protein identification via database searches has become the gold standard in mass spectrometry based shotgun proteomics. However, as the quality of tandem mass spectra improves, direct mass spectrum sequencing gains interest as a database-independent alternative. In this chapter, the general principle of this so-called de novo sequencing is introduced along with pitfalls and challenges of the technique. The main tools available are presented with a focus on user friendly open source software which can be directly applied in everyday proteomic workflows.
Wimmer, Helge; Gundacker, Nina C; Griss, Johannes; Haudek, Verena J; Stättner, Stefan; Mohr, Thomas; Zwickl, Hannes; Paulitschke, Verena; Baron, David M; Trittner, Wolfgang; Kubicek, Markus; Bayer, Editha; Slany, Astrid; Gerner, Christopher
2009-06-01
Interpretation of proteome data with a focus on biomarker discovery largely relies on comparative proteome analyses. Here, we introduce a database-assisted interpretation strategy based on proteome profiles of primary cells. Both 2-D-PAGE and shotgun proteomics are applied. We obtain high data concordance with these two different techniques. When applying mass analysis of tryptic spot digests from 2-D gels of cytoplasmic fractions, we typically identify several hundred proteins. Using the same protein fractions, we usually identify more than thousand proteins by shotgun proteomics. The data consistency obtained when comparing these independent data sets exceeds 99% of the proteins identified in the 2-D gels. Many characteristic differences in protein expression of different cells can thus be independently confirmed. Our self-designed SQL database (CPL/MUW - database of the Clinical Proteomics Laboratories at the Medical University of Vienna accessible via www.meduniwien.ac.at/proteomics/database) facilitates (i) quality management of protein identification data, which are based on MS, (ii) the detection of cell type-specific proteins and (iii) of molecular signatures of specific functional cell states. Here, we demonstrate, how the interpretation of proteome profiles obtained from human liver tissue and hepatocellular carcinoma tissue is assisted by the Clinical Proteomics Laboratories at the Medical University of Vienna-database. Therefore, we suggest that the use of reference experiments supported by a tailored database may substantially facilitate data interpretation of proteome profiling experiments.
Parasites, proteomes and systems: has Descartes' clock run out of time?
Wastling, J M; Armstrong, S D; Krishna, R; Xia, D
2012-08-01
Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types.
Parasites, proteomes and systems: has Descartes’ clock run out of time?
WASTLING, J. M.; ARMSTRONG, S. D.; KRISHNA, R.; XIA, D.
2012-01-01
SUMMARY Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types. PMID:22828391
Proteome reference map and regulation network of neonatal rat cardiomyocyte
Li, Zi-jian; Liu, Ning; Han, Qi-de; Zhang, You-yi
2011-01-01
Aim: To study and establish a proteome reference map and regulation network of neonatal rat cardiomyocyte. Methods: Cultured cardiomyocytes of neonatal rats were used. All proteins expressed in the cardiomyocytes were separated and identified by two-dimensional polyacrylamide gel electrophoresis (2-DE) and matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS). Biological networks and pathways of the neonatal rat cardiomyocytes were analyzed using the Ingenuity Pathway Analysis (IPA) program (www.ingenuity.com). A 2-DE database was made accessible on-line by Make2ddb package on a web server. Results: More than 1000 proteins were separated on 2D gels, and 148 proteins were identified. The identified proteins were used for the construction of an extensible markup language-based database. Biological networks and pathways were constructed to analyze the functions associate with cardiomyocyte proteins in the database. The 2-DE database of rat cardiomyocyte proteins can be accessed at http://2d.bjmu.edu.cn. Conclusion: A proteome reference map and regulation network of the neonatal rat cardiomyocytes have been established, which may serve as an international platform for storage, analysis and visualization of cardiomyocyte proteomic data. PMID:21841810
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; Ebrahim, Ali; Saunders, Michael A.; Palsson, Bernhard O.
2016-01-01
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models. PMID:27857205
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; ...
2016-11-18
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
GeneLab Phase 2: Integrated Search Data Federation of Space Biology Experimental Data
NASA Technical Reports Server (NTRS)
Tran, P. B.; Berrios, D. C.; Gurram, M. M.; Hashim, J. C. M.; Raghunandan, S.; Lin, S. Y.; Le, T. Q.; Heher, D. M.; Thai, H. T.; Welch, J. D.;
2016-01-01
The GeneLab project is a science initiative to maximize the scientific return of omics data collected from spaceflight and from ground simulations of microgravity and radiation experiments, supported by a data system for a public bioinformatics repository and collaborative analysis tools for these data. The mission of GeneLab is to maximize the utilization of the valuable biological research resources aboard the ISS by collecting genomic, transcriptomic, proteomic and metabolomic (so-called omics) data to enable the exploration of the molecular network responses of terrestrial biology to space environments using a systems biology approach. All GeneLab data are made available to a worldwide network of researchers through its open-access data system. GeneLab is currently being developed by NASA to support Open Science biomedical research in order to enable the human exploration of space and improve life on earth. Open access to Phase 1 of the GeneLab Data Systems (GLDS) was implemented in April 2015. Download volumes have grown steadily, mirroring the growth in curated space biology research data sets (61 as of June 2016), now exceeding 10 TB/month, with over 10,000 file downloads since the start of Phase 1. For the period April 2015 to May 2016, most frequently downloaded were data from studies of Mus musculus (39) followed closely by Arabidopsis thaliana (30), with the remaining downloads roughly equally split across 12 other organisms (each 10 of total downloads). GLDS Phase 2 is focusing on interoperability, supporting data federation, including integrated search capabilities, of GLDS-housed data sets with external data sources, such as gene expression data from NIHNCBIs Gene Expression Omnibus (GEO), proteomic data from EBIs PRIDE system, and metagenomic data from Argonne National Laboratory's MG-RAST. GEO and MG-RAST employ specifications for investigation metadata that are different from those used by the GLDS and PRIDE (e.g., ISA-Tab). The GLDS Phase 2 system will implement a Google-like, full-text search engine using a Service-Oriented Architecture by utilizing publicly available RESTful web services Application Programming Interfaces (e.g., GEO Entrez Programming Utilities) and a Common Metadata Model (CMM) in order to accommodate the different metadata formats between the heterogeneous bioinformatics databases. GLDS Phase 2 completion with fully implemented capabilities will be made available to the general public in September 2017.
Cassidy, Liam; Prasse, Daniela; Linke, Dennis; Schmitz, Ruth A; Tholey, Andreas
2016-10-07
The recent discovery of an increasing number of small open reading frames (sORF) creates the need for suitable analytical technologies for the comprehensive identification of the corresponding gene products. For biological and functional studies the knowledge of the entire set of proteins and sORF gene products is essential. Consequently in the present study we evaluated analytical approaches that will allow for simultaneous analysis of widest parts of the proteome together with the predicted sORF. We performed a full proteome analysis of the methane producing archaeon Methanosarcina mazei strain Gö1 cytosolic proteome using a high/low pH reversed phase LC-MS bottom-up approach. The second analytical approach was based on semi-top-down strategy, encompassing a separation at intact protein level using a GelFree system, followed by digestion and LC-MS analysis. A high overlap in identified proteins was found for both approaches yielding the most comprehensive coverage of the cytosolic proteome of this organism achieved so far. The application of the second approach in combination with an adjustment of the search criteria for database searches further led to a significant increase of sORF peptide identifications, finally allowing to detect and identify 28 sORF gene products.
Veit, Johannes; Sachsenberg, Timo; Chernev, Aleksandar; Aicheler, Fabian; Urlaub, Henning; Kohlbacher, Oliver
2016-09-02
Modern mass spectrometry setups used in today's proteomics studies generate vast amounts of raw data, calling for highly efficient data processing and analysis tools. Software for analyzing these data is either monolithic (easy to use, but sometimes too rigid) or workflow-driven (easy to customize, but sometimes complex). Thermo Proteome Discoverer (PD) is a powerful software for workflow-driven data analysis in proteomics which, in our eyes, achieves a good trade-off between flexibility and usability. Here, we present two open-source plugins for PD providing additional functionality: LFQProfiler for label-free quantification of peptides and proteins, and RNP(xl) for UV-induced peptide-RNA cross-linking data analysis. LFQProfiler interacts with existing PD nodes for peptide identification and validation and takes care of the entire quantitative part of the workflow. We show that it performs at least on par with other state-of-the-art software solutions for label-free quantification in a recently published benchmark ( Ramus, C.; J. Proteomics 2016 , 132 , 51 - 62 ). The second workflow, RNP(xl), represents the first software solution to date for identification of peptide-RNA cross-links including automatic localization of the cross-links at amino acid resolution and localization scoring. It comes with a customized integrated cross-link fragment spectrum viewer for convenient manual inspection and validation of the results.
The Gel Electrophoresis Markup Language (GelML) from the Proteomics Standards Initiative
Gibson, Frank; Hoogland, Christine; Martinez-Bartolomé, Salvador; Medina-Aunon, J. Alberto; Albar, Juan Pablo; Babnigg, Gyorgy; Wipat, Anil; Hermjakob, Henning; Almeida, Jonas S; Stanislaus, Romesh; Paton, Norman W; Jones, Andrew R
2011-01-01
The Human Proteome Organisation’s Proteomics Standards Initiative (HUPO-PSI) has developed the GelML data exchange format for representing gel electrophoresis experiments performed in proteomics investigations. The format closely follows the reporting guidelines for gel electrophoresis, which are part of the Minimum Information About a Proteomics Experiment (MIAPE) set of modules. GelML supports the capture of metadata (such as experimental protocols) and data (such as gel images) resulting from gel electrophoresis so that laboratories can be compliant with the MIAPE Gel Electrophoresis guidelines, while allowing such data sets to be exchanged or downloaded from public repositories. The format is sufficiently flexible to capture data from a broad range of experimental processes, and complements other PSI formats for mass spectrometry data and the results of protein and peptide identifications to capture entire gel-based proteome workflows. GelML has resulted from the open standardisation process of PSI consisting of both public consultation and anonymous review of the specifications. PMID:20677327
The gel electrophoresis markup language (GelML) from the Proteomics Standards Initiative.
Gibson, Frank; Hoogland, Christine; Martinez-Bartolomé, Salvador; Medina-Aunon, J Alberto; Albar, Juan Pablo; Babnigg, Gyorgy; Wipat, Anil; Hermjakob, Henning; Almeida, Jonas S; Stanislaus, Romesh; Paton, Norman W; Jones, Andrew R
2010-09-01
The Human Proteome Organisation's Proteomics Standards Initiative has developed the GelML (gel electrophoresis markup language) data exchange format for representing gel electrophoresis experiments performed in proteomics investigations. The format closely follows the reporting guidelines for gel electrophoresis, which are part of the Minimum Information About a Proteomics Experiment (MIAPE) set of modules. GelML supports the capture of metadata (such as experimental protocols) and data (such as gel images) resulting from gel electrophoresis so that laboratories can be compliant with the MIAPE Gel Electrophoresis guidelines, while allowing such data sets to be exchanged or downloaded from public repositories. The format is sufficiently flexible to capture data from a broad range of experimental processes, and complements other PSI formats for MS data and the results of protein and peptide identifications to capture entire gel-based proteome workflows. GelML has resulted from the open standardisation process of PSI consisting of both public consultation and anonymous review of the specifications.
Virtual Labs in proteomics: new E-learning tools.
Ray, Sandipan; Koshy, Nicole Rachel; Reddy, Panga Jaipal; Srivastava, Sanjeeva
2012-05-17
Web-based educational resources have gained enormous popularity recently and are increasingly becoming a part of modern educational systems. Virtual Labs are E-learning platforms where learners can gain the experience of practical experimentation without any direct physical involvement on real bench work. They use computerized simulations, models, videos, animations and other instructional technologies to create interactive content. Proteomics being one of the most rapidly growing fields of the biological sciences is now an important part of college and university curriculums. Consequently, many E-learning programs have started incorporating the theoretical and practical aspects of different proteomic techniques as an element of their course work in the form of Video Lectures and Virtual Labs. To this end, recently we have developed a Virtual Proteomics Lab at the Indian Institute of Technology Bombay, which demonstrates different proteomics techniques, including basic and advanced gel and MS-based protein separation and identification techniques, bioinformatics tools and molecular docking methods, and their applications in different biological samples. This Tutorial will discuss the prominent Virtual Labs featuring proteomics content, including the Virtual Proteomics Lab of IIT-Bombay, and E-resources available for proteomics study that are striving to make proteomic techniques and concepts available and accessible to the student and research community. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 14). Details can be found at: http://www.proteomicstutorials.org/. Copyright © 2012 Elsevier B.V. All rights reserved.
COMPASS: a suite of pre- and post-search proteomics software tools for OMSSA
Wenger, Craig D.; Phanstiel, Douglas H.; Lee, M. Violet; Bailey, Derek J.; Coon, Joshua J.
2011-01-01
Here we present the Coon OMSSA Proteomic Analysis Software Suite (COMPASS): a free and open-source software pipeline for high-throughput analysis of proteomics data, designed around the Open Mass Spectrometry Search Algorithm. We detail a synergistic set of tools for protein database generation, spectral reduction, peptide false discovery rate analysis, peptide quantitation via isobaric labeling, protein parsimony and protein false discovery rate analysis, and protein quantitation. We strive for maximum ease of use, utilizing graphical user interfaces and working with data files in the original instrument vendor format. Results are stored in plain text comma-separated values files, which are easy to view and manipulate with a text editor or spreadsheet program. We illustrate the operation and efficacy of COMPASS through the use of two LC–MS/MS datasets. The first is a dataset of a highly annotated mixture of standard proteins and manually validated contaminants that exhibits the identification workflow. The second is a dataset of yeast peptides, labeled with isobaric stable isotope tags and mixed in known ratios, to demonstrate the quantitative workflow. For these two datasets, COMPASS performs equivalently or better than the current de facto standard, the Trans-Proteomic Pipeline. PMID:21298793
Informed-Proteomics: open-source software package for top-down proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Park, Jungkap; Piehowski, Paul D.; Wilkins, Christopher
Top-down proteomics involves the analysis of intact proteins. This approach is very attractive as it allows for analyzing proteins in their endogenous form without proteolysis, preserving valuable information about post-translation modifications, isoforms, proteolytic processing or their combinations collectively called proteoforms. Moreover, the quality of the top-down LC-MS/MS datasets is rapidly increasing due to advances in the liquid chromatography and mass spectrometry instrumentation and sample processing protocols. However, the top-down mass spectra are substantially more complex compare to the more conventional bottom-up data. To take full advantage of the increasing quality of the top-down LC-MS/MS datasets there is an urgent needmore » to develop algorithms and software tools for confident proteoform identification and quantification. In this study we present a new open source software suite for top-down proteomics analysis consisting of an LC-MS feature finding algorithm, a database search algorithm, and an interactive results viewer. The presented tool along with several other popular tools were evaluated using human-in-mouse xenograft luminal and basal breast tumor samples that are known to have significant differences in protein abundance based on bottom-up analysis.« less
Making a protein extract from plant pathogenic fungi for gel- and LC-based proteomics.
Fernández, Raquel González; Redondo, Inmaculada; Jorrin-Novo, Jesus V
2014-01-01
Proteomic technologies have become a successful tool to provide relevant information on fungal biology. In the case of plant pathogenic fungi, this approach would allow a deeper knowledge of the interaction and the biological cycle of the pathogen, as well as the identification of pathogenicity and virulence factors. These two elements open up new possibilities for crop disease diagnosis and environment-friendly crop protection. Phytopathogenic fungi, due to its particular cellular characteristics, can be considered as a recalcitrant biological material, which makes it difficult to obtain quality protein samples for proteomic analysis. This chapter focuses on protein extraction for gel- and LC-based proteomics with specific protocols of our current research with Botrytis cinerea.
Proteomics of Plant Pathogenic Fungi
González-Fernández, Raquel; Prats, Elena; Jorrín-Novo, Jesús V.
2010-01-01
Plant pathogenic fungi cause important yield losses in crops. In order to develop efficient and environmental friendly crop protection strategies, molecular studies of the fungal biological cycle, virulence factors, and interaction with its host are necessary. For that reason, several approaches have been performed using both classical genetic, cell biology, and biochemistry and the modern, holistic, and high-throughput, omic techniques. This work briefly overviews the tools available for studying Plant Pathogenic Fungi and is amply focused on MS-based Proteomics analysis, based on original papers published up to December 2009. At a methodological level, different steps in a proteomic workflow experiment are discussed. Separate sections are devoted to fungal descriptive (intracellular, subcellular, extracellular) and differential expression proteomics and interactomics. From the work published we can conclude that Proteomics, in combination with other techniques, constitutes a powerful tool for providing important information about pathogenicity and virulence factors, thus opening up new possibilities for crop disease diagnosis and crop protection. PMID:20589070
Proteomics of plant pathogenic fungi.
González-Fernández, Raquel; Prats, Elena; Jorrín-Novo, Jesús V
2010-01-01
Plant pathogenic fungi cause important yield losses in crops. In order to develop efficient and environmental friendly crop protection strategies, molecular studies of the fungal biological cycle, virulence factors, and interaction with its host are necessary. For that reason, several approaches have been performed using both classical genetic, cell biology, and biochemistry and the modern, holistic, and high-throughput, omic techniques. This work briefly overviews the tools available for studying Plant Pathogenic Fungi and is amply focused on MS-based Proteomics analysis, based on original papers published up to December 2009. At a methodological level, different steps in a proteomic workflow experiment are discussed. Separate sections are devoted to fungal descriptive (intracellular, subcellular, extracellular) and differential expression proteomics and interactomics. From the work published we can conclude that Proteomics, in combination with other techniques, constitutes a powerful tool for providing important information about pathogenicity and virulence factors, thus opening up new possibilities for crop disease diagnosis and crop protection.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
Bhatia, Vivek N.; Perlman, David H.; Costello, Catherine E.; McComb, Mark E.
2009-01-01
In order that biological meaning may be derived and testable hypotheses may be built from proteomics experiments, assignments of proteins identified by mass spectrometry or other techniques must be supplemented with additional notation, such as information on known protein functions, protein-protein interactions, or biological pathway associations. Collecting, organizing, and interpreting this data often requires the input of experts in the biological field of study, in addition to the time-consuming search for and compilation of information from online protein databases. Furthermore, visualizing this bulk of information can be challenging due to the limited availability of easy-to-use and freely available tools for this process. In response to these constraints, we have undertaken the design of software to automate annotation and visualization of proteomics data in order to accelerate the pace of research. Here we present the Software Tool for Researching Annotations of Proteins (STRAP) – a user-friendly, open-source C# application. STRAP automatically obtains gene ontology (GO) terms associated with proteins in a proteomics results ID list using the freely accessible UniProtKB and EBI GOA databases. Summarized in an easy-to-navigate tabular format, STRAP includes meta-information on the protein in addition to complimentary GO terminology. Additionally, this information can be edited by the user so that in-house expertise on particular proteins may be integrated into the larger dataset. STRAP provides a sortable tabular view for all terms, as well as graphical representations of GO-term association data in pie (biological process, cellular component and molecular function) and bar charts (cross comparison of sample sets) to aid in the interpretation of large datasets and differential analyses experiments. Furthermore, proteins of interest may be exported as a unique FASTA-formatted file to allow for customizable re-searching of mass spectrometry data, and gene names corresponding to the proteins in the lists may be encoded in the Gaggle microformat for further characterization, including pathway analysis. STRAP, a tutorial, and the C# source code are freely available from http://cpctools.sourceforge.net. PMID:19839595
CrossCheck: an open-source web tool for high-throughput screen data analysis.
Najafov, Jamil; Najafov, Ayaz
2017-07-19
Modern high-throughput screening methods allow researchers to generate large datasets that potentially contain important biological information. However, oftentimes, picking relevant hits from such screens and generating testable hypotheses requires training in bioinformatics and the skills to efficiently perform database mining. There are currently no tools available to general public that allow users to cross-reference their screen datasets with published screen datasets. To this end, we developed CrossCheck, an online platform for high-throughput screen data analysis. CrossCheck is a centralized database that allows effortless comparison of the user-entered list of gene symbols with 16,231 published datasets. These datasets include published data from genome-wide RNAi and CRISPR screens, interactome proteomics and phosphoproteomics screens, cancer mutation databases, low-throughput studies of major cell signaling mediators, such as kinases, E3 ubiquitin ligases and phosphatases, and gene ontological information. Moreover, CrossCheck includes a novel database of predicted protein kinase substrates, which was developed using proteome-wide consensus motif searches. CrossCheck dramatically simplifies high-throughput screen data analysis and enables researchers to dig deep into the published literature and streamline data-driven hypothesis generation. CrossCheck is freely accessible as a web-based application at http://proteinguru.com/crosscheck.
Shrestha, Prashanta; Smith, Mark Thomas; Bundy, Bradley Charles
2014-01-25
Site-specific incorporation of unnatural amino acids (uAAs) during protein synthesis expands the proteomic code through the addition of unique residue chemistry. This field provides a unique tool to improve pharmacokinetics, cancer treatments, vaccine development, proteomics and protein engineering. The limited ability to predict the characteristics of proteins with uAA-incorporation creates a need for a low-cost system with the potential for rapid screening. Escherichia coli-based cell-free protein synthesis is a compelling platform for uAA incorporation due to the open and accessible nature of the reaction environment. However, typical cell-free systems can be expensive due to the high cost of energizing reagents. By employing alternative energy sources, we reduce the cost of uAA-incorporation in CFPS by 55%. While alternative energy systems reduce cost, the time investment to develop gene libraries can remain cumbersome. Cell-free systems allow the direct use of PCR products known as linear expression templates, thus alleviating tedious plasmid library preparations steps. We report the specific costs of CFPS with uAA incorporation, demonstrate that LETs are suitable expression templates with uAA-incorporation, and consider the substantial reduction in labor intensity using LET-based expression for CFPS uAA incorporation. Copyright © 2013 Elsevier B.V. All rights reserved.
mz5: Space- and Time-efficient Storage of Mass Spectrometry Data Sets*
Wilhelm, Mathias; Kirchner, Marc; Steen, Judith A. J.; Steen, Hanno
2012-01-01
Across a host of MS-driven-omics fields, researchers witness the acquisition of ever increasing amounts of high throughput MS data and face the need for their compact yet efficiently accessible storage. Addressing the need for an open data exchange format, the Proteomics Standards Initiative and the Seattle Proteome Center at the Institute for Systems Biology independently developed the mzData and mzXML formats, respectively. In a subsequent joint effort, they defined an ontology and associated controlled vocabulary that specifies the contents of MS data files, implemented as the newer mzML format. All three formats are based on XML and are thus not particularly efficient in either storage space requirements or read/write speed. This contribution introduces mz5, a complete reimplementation of the mzML ontology that is based on the efficient, industrial strength storage backend HDF5. Compared with the current mzML standard, this strategy yields an average file size reduction to ∼54% and increases linear read and write speeds ∼3–4-fold. The format is implemented as part of the ProteoWizard project and is available under a permissive Apache license. Additional information and download links are available from http://software.steenlab.org/mz5. PMID:21960719
2013-01-01
Background The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing - the matching of peptide measurements across samples. Results We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Conclusions Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods. PMID:24341404
Benjamin, Ashlee M; Thompson, J Will; Soderblom, Erik J; Geromanos, Scott J; Henao, Ricardo; Kraus, Virginia B; Moseley, M Arthur; Lucas, Joseph E
2013-12-16
The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing--the matching of peptide measurements across samples. We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods.
Highlights of the Biology and Disease-driven Human Proteome Project, 2015-2016.
Van Eyk, Jennifer E; Corrales, Fernando J; Aebersold, Ruedi; Cerciello, Ferdinando; Deutsch, Eric W; Roncada, Paola; Sanchez, Jean-Charles; Yamamoto, Tadashi; Yang, Pengyuan; Zhang, Hui; Omenn, Gilbert S
2016-11-04
The Biology and Disease-driven Human Proteome Project (B/D-HPP) is aimed at supporting and enhancing the broad use of state-of-the-art proteomic methods to characterize and quantify proteins for in-depth understanding of the molecular mechanisms of biological processes and human disease. Based on a foundation of the pre-existing HUPO initiatives begun in 2002, the B/D-HPP is designed to provide standardized methods and resources for mass spectrometry and specific protein affinity reagents and facilitate accessibility of these resources to the broader life sciences research and clinical communities. Currently there are 22 B/D-HPP initiatives and 3 closely related HPP resource pillars. The B/D-HPP groups are working to define sets of protein targets that are highly relevant to each particular field to deliver relevant assays for the measurement of these selected targets and to disseminate and make publicly accessible the information and tools generated. Major developments are the 2016 publications of the Human SRM Atlas and of "popular protein sets" for six organ systems. Here we present the current activities and plans of the BD-HPP initiatives as highlighted in numerous B/D-HPP workshops at the 14th annual HUPO 2015 World Congress of Proteomics in Vancouver, Canada.
DeNovoGUI: An Open Source Graphical User Interface for de Novo Sequencing of Tandem Mass Spectra
2013-01-01
De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com. PMID:24295440
DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra.
Muth, Thilo; Weilnböck, Lisa; Rapp, Erdmann; Huber, Christian G; Martens, Lennart; Vaudel, Marc; Barsnes, Harald
2014-02-07
De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com .
Microfluidic-Mass Spectrometry Interfaces for Translational Proteomics.
Pedde, R Daniel; Li, Huiyan; Borchers, Christoph H; Akbari, Mohsen
2017-10-01
Interfacing mass spectrometry (MS) with microfluidic chips (μchip-MS) holds considerable potential to transform a clinician's toolbox, providing translatable methods for the early detection, diagnosis, monitoring, and treatment of noncommunicable diseases by streamlining and integrating laborious sample preparation workflows on high-throughput, user-friendly platforms. Overcoming the limitations of competitive immunoassays - currently the gold standard in clinical proteomics - μchip-MS can provide unprecedented access to complex proteomic assays having high sensitivity and specificity, but without the labor, costs, and complexities associated with conventional MS sample processing. This review surveys recent μchip-MS systems for clinical applications and examines their emerging role in streamlining the development and translation of MS-based proteomic assays by alleviating many of the challenges that currently inhibit widespread clinical adoption. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
Vascular Sap Proteomics: Providing Insight into Long-Distance Signaling during Stress
Carella, Philip; Wilson, Daniel C.; Kempthorne, Christine J.; Cameron, Robin K.
2016-01-01
The plant vascular system, composed of the xylem and phloem, is important for the transport of water, mineral nutrients, and photosynthate throughout the plant body. The vasculature is also the primary means by which developmental and stress signals move from one organ to another. Due to practical and technological limitations, proteomics analysis of xylem and phloem sap has been understudied in comparison to accessible sample types such as leaves and roots. However, recent advances in sample collection techniques and mass spectrometry technology are making it possible to comprehensively analyze vascular sap proteomes. In this mini-review, we discuss the emerging field of vascular sap proteomics, with a focus on recent comparative studies to identify vascular proteins that may play roles in long-distance signaling and other processes during stress responses in plants. PMID:27242852
GeneLab Analysis Working Group Kick-Off Meeting
NASA Technical Reports Server (NTRS)
Costes, Sylvain V.
2018-01-01
Goals to achieve for GeneLab AWG - GL vision - Review of GeneLab AWG charter Timeline and milestones for 2018 Logistics - Monthly Meeting - Workshop - Internship - ASGSR Introduction of team leads and goals of each group Introduction of all members Q/A Three-tier Client Strategy to Democratize Data Physiological changes, pathway enrichment, differential expression, normalization, processing metadata, reproducibility, Data federation/integration with heterogeneous bioinformatics external databases The GLDS currently serves over 100 omics investigations to the biomedical community via open access. In order to expand the scope of metadata record searches via the GLDS, we designed a metadata warehouse that collects and updates metadata records from external systems housing similar data. To demonstrate the capabilities of federated search and retrieval of these data, we imported metadata records from three open-access data systems into the GLDS metadata warehouse: NCBI's Gene Expression Omnibus (GEO), EBI's PRoteomics IDEntifications (PRIDE) repository, and the Metagenomics Analysis server (MG-RAST). Each of these systems defines metadata for omics data sets differently. One solution to bridge such differences is to employ a common object model (COM) to which each systems' representation of metadata can be mapped. Warehoused metadata records are then transformed at ETL to this single, common representation. Queries generated via the GLDS are then executed against the warehouse, and matching records are shown in the COM representation (Fig. 1). While this approach is relatively straightforward to implement, the volume of the data in the omics domain presents challenges in dealing with latency and currency of records. Furthermore, the lack of a coordinated has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta-investigations are key to corroborating findings from many kinds of assays and translating them into systems biology knowledge and, eventually, therapeutics.
Expanding proteome coverage with orthogonal-specificity α-Lytic proteases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyer, Jesse G.; Kim, Sangtae; Maltby, David A.
2014-03-01
Bottom-up proteomics studies traditionally involve proteome digestion with a single protease, trypsin. However, trypsin alone does not generate peptides that encompass the entire proteome. Alternative proteases have been explored, but most have specificity for charged amino acid side chains. Therefore, additional proteases that improve proteome coverage by cleavage at sequences complimentary to trypsin may increase proteome coverage. We demonstrate the novel application of two proteases for bottom-up proteomics: wild type alpha-lytic protease (WaLP), and an active site mutant of WaLP, M190A alpha-lytic protease (MaLP). We assess several relevant factors including MS/MS fragmentation, peptide length, peptide yield, and protease specificity. Bymore » combining data from separate digestions with trypsin, LysC, WaLP, and MaLP, proteome coverage was increased 101% compared to trypsin digestion alone. To demonstrate how the gained sequence coverage can access additional PTM information, we show identification of a number of novel phosphorylation sites in the S. pombe proteome and include an illustrative example from the protein MPD2, wherein two novel sites are identified, one in a tryptic peptide too short to identify and the other in a sequence devoid of tryptic sites. The specificity of WaLP and MaLP for aliphatic amino acid side chains was particularly valuable for coverage of membrane protein sequences, which increased 350% when the data from trypsin, LysC, WaLP, and MaLP were combined.« less
ProteoSign: an end-user online differential proteomics statistical analysis platform.
Efstathiou, Georgios; Antonakis, Andreas N; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Divanach, Peter; Trudgian, David C; Thomas, Benjamin; Papanikolaou, Nikolas; Aivaliotis, Michalis; Acuto, Oreste; Iliopoulos, Ioannis
2017-07-03
Profiling of proteome dynamics is crucial for understanding cellular behavior in response to intrinsic and extrinsic stimuli and maintenance of homeostasis. Over the last 20 years, mass spectrometry (MS) has emerged as the most powerful tool for large-scale identification and characterization of proteins. Bottom-up proteomics, the most common MS-based proteomics approach, has always been challenging in terms of data management, processing, analysis and visualization, with modern instruments capable of producing several gigabytes of data out of a single experiment. Here, we present ProteoSign, a freely available web application, dedicated in allowing users to perform proteomics differential expression/abundance analysis in a user-friendly and self-explanatory way. Although several non-commercial standalone tools have been developed for post-quantification statistical analysis of proteomics data, most of them are not end-user appealing as they often require very stringent installation of programming environments, third-party software packages and sometimes further scripting or computer programming. To avoid this bottleneck, we have developed a user-friendly software platform accessible via a web interface in order to enable proteomics laboratories and core facilities to statistically analyse quantitative proteomics data sets in a resource-efficient manner. ProteoSign is available at http://bioinformatics.med.uoc.gr/ProteoSign and the source code at https://github.com/yorgodillo/ProteoSign. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Deutsch, Eric W.; Csordas, Attila; Sun, Zhi; Jarnuczak, Andrew; Perez-Riverol, Yasset; Ternent, Tobias; Campbell, David S.; Bernal-Llinares, Manuel; Okuda, Shujiro; Kawano, Shin; Moritz, Robert L.; Carver, Jeremy J.; Wang, Mingxun; Ishihama, Yasushi; Bandeira, Nuno; Hermjakob, Henning; Vizcaíno, Juan Antonio
2017-01-01
The ProteomeXchange (PX) Consortium of proteomics resources (http://www.proteomexchange.org) was formally started in 2011 to standardize data submission and dissemination of mass spectrometry proteomics data worldwide. We give an overview of the current consortium activities and describe the advances of the past few years. Augmenting the PX founding members (PRIDE and PeptideAtlas, including the PASSEL resource), two new members have joined the consortium: MassIVE and jPOST. ProteomeCentral remains as the common data access portal, providing the ability to search for data sets in all participating PX resources, now with enhanced data visualization components. We describe the updated submission guidelines, now expanded to include four members instead of two. As demonstrated by data submission statistics, PX is supporting a change in culture of the proteomics field: public data sharing is now an accepted standard, supported by requirements for journal submissions resulting in public data release becoming the norm. More than 4500 data sets have been submitted to the various PX resources since 2012. Human is the most represented species with approximately half of the data sets, followed by some of the main model organisms and a growing list of more than 900 diverse species. Data reprocessing activities are becoming more prominent, with both MassIVE and PeptideAtlas releasing the results of reprocessed data sets. Finally, we outline the upcoming advances for ProteomeXchange. PMID:27924013
A Computational Tool to Detect and Avoid Redundancy in Selected Reaction Monitoring
Röst, Hannes; Malmström, Lars; Aebersold, Ruedi
2012-01-01
Selected reaction monitoring (SRM), also called multiple reaction monitoring, has become an invaluable tool for targeted quantitative proteomic analyses, but its application can be compromised by nonoptimal selection of transitions. In particular, complex backgrounds may cause ambiguities in SRM measurement results because peptides with interfering transitions similar to those of the target peptide may be present in the sample. Here, we developed a computer program, the SRMCollider, that calculates nonredundant theoretical SRM assays, also known as unique ion signatures (UIS), for a given proteomic background. We show theoretically that UIS of three transitions suffice to conclusively identify 90% of all yeast peptides and 85% of all human peptides. Using predicted retention times, the SRMCollider also simulates time-scheduled SRM acquisition, which reduces the number of interferences to consider and leads to fewer transitions necessary to construct an assay. By integrating experimental fragment ion intensities from large scale proteome synthesis efforts (SRMAtlas) with the information content-based UIS, we combine two orthogonal approaches to create high quality SRM assays ready to be deployed. We provide a user friendly, open source implementation of an algorithm to calculate UIS of any order that can be accessed online at http://www.srmcollider.org to find interfering transitions. Finally, our tool can also simulate the specificity of novel data-independent MS acquisition methods in Q1–Q3 space. This allows us to predict parameters for these methods that deliver a specificity comparable with that of SRM. Using SRM interference information in addition to other sources of information can increase the confidence in an SRM measurement. We expect that the consideration of information content will become a standard step in SRM assay design and analysis, facilitated by the SRMCollider. PMID:22535207
Xu, Ruilian; Tang, Jun; Deng, Quantong; He, Wan; Sun, Xiujie; Xia, Ligang; Cheng, Zhiqiang; He, Lisheng; You, Shuyuan; Hu, Jintao; Fu, Yuxiang; Zhu, Jian; Chen, Yixin; Gao, Weina; He, An; Guo, Zhengyu; Lin, Lin; Li, Hua; Hu, Chaofeng; Tian, Ruijun
2018-05-01
Increasing attention has been focused on cell type proteome profiling for understanding the heterogeneous multicellular microenvironment in tissue samples. However, current cell type proteome profiling methods need large amounts of starting materials which preclude their application to clinical tumor specimens with limited access. Here, by seamlessly combining laser capture microdissection and integrated proteomics sample preparation technology SISPROT, specific cell types in tumor samples could be precisely dissected with single cell resolution and processed for high-sensitivity proteome profiling. Sample loss and contamination due to the multiple transfer steps are significantly reduced by the full integration and noncontact design. H&E staining dyes which are necessary for cell type investigation could be selectively removed by the unique two-stage design of the spintip device. This easy-to-use proteome profiling technology achieved high sensitivity with the identification of more than 500 proteins from only 0.1 mm 2 and 10 μm thickness colon cancer tissue section. The first cell type proteome profiling of four cell types from one colon tumor and surrounding normal tissue, including cancer cells, enterocytes, lymphocytes, and smooth muscle cells, was obtained. 5271, 4691, 4876, and 2140 protein groups were identified, respectively, from tissue section of only 5 mm 2 and 10 μm thickness. Furthermore, spatially resolved proteome distribution profiles of enterocytes, lymphocytes, and smooth muscle cells on the same tissue slices and across four consecutive sections with micrometer distance were successfully achieved. This fully integrated proteomics technology, termed LCM-SISPROT, is therefore promising for spatial-resolution cell type proteome profiling of tumor microenvironment with a minute amount of clinical starting materials.
Analyzing large-scale proteomics projects with latent semantic indexing.
Klie, Sebastian; Martens, Lennart; Vizcaíno, Juan Antonio; Côté, Richard; Jones, Phil; Apweiler, Rolf; Hinneburg, Alexander; Hermjakob, Henning
2008-01-01
Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.
Bio Warfare and Terrorism: Toxins and Other Mid-Spectrum Agents
2005-01-01
biotechnology, toxicogenomics, toxin, tetrodotoxin, and others. Once an agent has and proteomics may also help to open the door to the 276 Bio Warfare...also interferon gamma, interleukin-6, and tumor alsointrfern gmma intrlekin6, ad tmor by the mold Aspergillus flavus and commonly conta- necrosis factor...as bullets. No the new sciences of genomics and proteomics to alter toxoid or antitoxin is available, genetic code and to affect the expression of
Proteomics Standards Initiative: Fifteen Years of Progress and Future Work
2017-01-01
The Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO) has now been developing and promoting open community standards and software tools in the field of proteomics for 15 years. Under the guidance of the chair, cochairs, and other leadership positions, the PSI working groups are tasked with the development and maintenance of community standards via special workshops and ongoing work. Among the existing ratified standards, the PSI working groups continue to update PSI-MI XML, MITAB, mzML, mzIdentML, mzQuantML, mzTab, and the MIAPE (Minimum Information About a Proteomics Experiment) guidelines with the advance of new technologies and techniques. Furthermore, new standards are currently either in the final stages of completion (proBed and proBAM for proteogenomics results as well as PEFF) or in early stages of design (a spectral library standard format, a universal spectrum identifier, the qcML quality control format, and the Protein Expression Interface (PROXI) web services Application Programming Interface). In this work we review the current status of all of these aspects of the PSI, describe synergies with other efforts such as the ProteomeXchange Consortium, the Human Proteome Project, and the metabolomics community, and provide a look at future directions of the PSI. PMID:28849660
Proteomics Standards Initiative: Fifteen Years of Progress and Future Work.
Deutsch, Eric W; Orchard, Sandra; Binz, Pierre-Alain; Bittremieux, Wout; Eisenacher, Martin; Hermjakob, Henning; Kawano, Shin; Lam, Henry; Mayer, Gerhard; Menschaert, Gerben; Perez-Riverol, Yasset; Salek, Reza M; Tabb, David L; Tenzer, Stefan; Vizcaíno, Juan Antonio; Walzer, Mathias; Jones, Andrew R
2017-12-01
The Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO) has now been developing and promoting open community standards and software tools in the field of proteomics for 15 years. Under the guidance of the chair, cochairs, and other leadership positions, the PSI working groups are tasked with the development and maintenance of community standards via special workshops and ongoing work. Among the existing ratified standards, the PSI working groups continue to update PSI-MI XML, MITAB, mzML, mzIdentML, mzQuantML, mzTab, and the MIAPE (Minimum Information About a Proteomics Experiment) guidelines with the advance of new technologies and techniques. Furthermore, new standards are currently either in the final stages of completion (proBed and proBAM for proteogenomics results as well as PEFF) or in early stages of design (a spectral library standard format, a universal spectrum identifier, the qcML quality control format, and the Protein Expression Interface (PROXI) web services Application Programming Interface). In this work we review the current status of all of these aspects of the PSI, describe synergies with other efforts such as the ProteomeXchange Consortium, the Human Proteome Project, and the metabolomics community, and provide a look at future directions of the PSI.
MitoMiner: a data warehouse for mitochondrial proteomics data
Smith, Anthony C.; Blackshaw, James A.; Robinson, Alan J.
2012-01-01
MitoMiner (http://mitominer.mrc-mbu.cam.ac.uk/) is a data warehouse for the storage and analysis of mitochondrial proteomics data gathered from publications of mass spectrometry and green fluorescent protein tagging studies. In MitoMiner, these data are integrated with data from UniProt, Gene Ontology, Online Mendelian Inheritance in Man, HomoloGene, Kyoto Encyclopaedia of Genes and Genomes and PubMed. The latest release of MitoMiner stores proteomics data sets from 46 studies covering 11 different species from eumetazoa, viridiplantae, fungi and protista. MitoMiner is implemented by using the open source InterMine data warehouse system, which provides a user interface allowing users to upload data for analysis, personal accounts to store queries and results and enables queries of any data in the data model. MitoMiner also provides lists of proteins for use in analyses, including the new MitoMiner mitochondrial proteome reference sets that specify proteins with substantial experimental evidence for mitochondrial localization. As further mitochondrial proteomics data sets from normal and diseased tissue are published, MitoMiner can be used to characterize the variability of the mitochondrial proteome between tissues and investigate how changes in the proteome may contribute to mitochondrial dysfunction and mitochondrial-associated diseases such as cancer, neurodegenerative diseases, obesity, diabetes, heart failure and the ageing process. PMID:22121219
Proteomic Analysis of the Human Skin Proteome after In Vivo Treatment with Sodium Dodecyl Sulphate
Parkinson, Erika; Skipp, Paul; Aleksic, Maja; Garrow, Andrew; Dadd, Tony; Hughes, Michael; Clough, Geraldine; O′Connor, C. David
2014-01-01
Background Skin has a variety of functions that are incompletely understood at the molecular level. As the most accessible tissue in the body it often reveals the first signs of inflammation or infection and also represents a potentially valuable source of biomarkers for several diseases. In this study we surveyed the skin proteome qualitatively using gel electrophoresis, liquid chromatography tandem mass spectrometry (GeLC-MS/MS) and quantitatively using an isobaric tagging strategy (iTRAQ) to characterise the response of human skin following exposure to sodium dodecyl sulphate (SDS). Results A total of 653 skin proteins were assigned, 159 of which were identified using GeLC-MS/MS and 616 using iTRAQ, representing the most comprehensive proteomic study in human skin tissue. Statistical analysis of the available iTRAQ data did not reveal any significant differences in the measured skin proteome after 4 hours exposure to the model irritant SDS. Conclusions This study represents the first step in defining the critical response to an irritant at the level of the proteome and provides a valuable resource for further studies at the later stages of irritant exposure. PMID:24849295
Yeast proteome map (last update).
Perrot, Michel; Moes, Suzette; Massoni, Aurélie; Jenoe, Paul; Boucherie, Hélian
2009-10-01
The identification of proteins separated on 2-D gels is essential to exploit the full potential of 2-D gel electrophoresis for proteomic investigations. For this purpose we have undertaken the systematic identification of Saccharomyces cerevisiae proteins separated on 2-D gels. We report here the identification by mass spectrometry of 100 novel yeast protein spots that have so far not been tackled due to their scarcity on our standard 2-D gels. These identifications extend the number of protein spots identified on our yeast 2-D proteome map to 716. They correspond to 485 unique proteins. Among these, 154 were resolved into several isoforms. The present data set can now be expanded to report for the first time a map of 363 protein isoforms that significantly deepens our knowledge of the yeast proteome. The reference map and a list of all identified proteins can be accessed on the Yeast Protein Map server (www.ibgc.u-bordeaux2.fr/YPM).
Progress in Top-Down Proteomics and the Analysis of Proteoforms
Toby, Timothy K.; Fornelli, Luca; Kelleher, Neil L.
2017-01-01
From a molecular perspective, enactors of function in biology are intact proteins that can be variably modified at the genetic, transcriptional, or post-translational level. Over the past 30 years, mass spectrometry (MS) has become a powerful method for the analysis of proteomes. Prevailing bottom-up proteomics operates at the level of the peptide, leading to issues with protein inference, connectivity, and incomplete sequence/modification information. Top-down proteomics (TDP), alternatively, applies MS at the proteoform level to analyze intact proteins with diverse sources of intramolecular complexity preserved during analysis. Fortunately, advances in prefractionation workflows, MS instrumentation, and dissociation methods for whole-protein ions have helped TDP emerge as an accessible and potentially disruptive modality with increasingly translational value. In this review, we discuss technical and conceptual advances in TDP, along with the growing power of proteoform-resolved measurements in clinical and translational research. PMID:27306313
P-MartCancer-Interactive Online Software to Enable Analysis of Shotgun Cancer Proteomic Datasets.
Webb-Robertson, Bobbie-Jo M; Bramer, Lisa M; Jensen, Jeffrey L; Kobold, Markus A; Stratton, Kelly G; White, Amanda M; Rodland, Karin D
2017-11-01
P-MartCancer is an interactive web-based software environment that enables statistical analyses of peptide or protein data, quantitated from mass spectrometry-based global proteomics experiments, without requiring in-depth knowledge of statistical programming. P-MartCancer offers a series of statistical modules associated with quality assessment, peptide and protein statistics, protein quantification, and exploratory data analyses driven by the user via customized workflows and interactive visualization. Currently, P-MartCancer offers access and the capability to analyze multiple cancer proteomic datasets generated through the Clinical Proteomics Tumor Analysis Consortium at the peptide, gene, and protein levels. P-MartCancer is deployed as a web service (https://pmart.labworks.org/cptac.html), alternatively available via Docker Hub (https://hub.docker.com/r/pnnl/pmart-web/). Cancer Res; 77(21); e47-50. ©2017 AACR . ©2017 American Association for Cancer Research.
2006-05-01
410. 3. Baek, J-M, and C. M. Kenerley. 1998. Detection and enumeration of a genetically modified fungus in soil environments by quantitative...deduced proteome of D. ethenogenes using the complete sequence of the alcohol dehydrogenase with the most similar N-terminus (accession number ZP_00128696...chlorophenol respiration (3 1). Four orfs similru· to VcrC were also found in the deduced proteome of D. ethenogenes (38 - 47 % identity). Two of those are
Genomics, transcriptomics and proteomics to elucidate the pathogenesis of rheumatoid arthritis.
Song, Xinqiang; Lin, Qingsong
2017-08-01
Rheumatoid arthritis is an autoimmune disease that affects several organs and tissues, predominantly the synovial joints. The pathogenesis of this disease is not completely understood, which maybe involved in the genomic variations, gene expression, protein translation and post-translational modifications. These system variations in genomics, transcriptomics and proteomics are dynamic in nature and their crosstalk is overwhelmingly complex, thus analyzing them separately may not be very informative. However, various '-omics' techniques developed in recent years have opened up new possibilities for clarifying disease pathways and thereby facilitating early diagnosis and specific therapies. This review examines how recent advances in the fields of genomics, transcriptomics and proteomics have contributed to our understanding of rheumatoid arthritis.
Differential proteome analysis of diabetes mellitus type 2 and its pathophysiological complications.
Sohail, Waleed; Majeed, Fatimah; Afroz, Amber
2018-06-11
The prevalence of Diabetes Mellitus Type 2 (DM 2) is increasing every passing year due to some global changes in lifestyles of people. The exact underlying mechanisms of the progression of this disease are not yet known. However recent advances in the combined omics more particularly in proteomics and genomics have opened a gateway towards the understanding of predetermined genetic factors, progression, complications and treatment of this disease. Here we shall review the recent advances in proteomics that have led to an early and better diagnostic approaches in controlling DM 2 more importantly the comparison of structural and functional protein biomarkers that are modified in the diseased state. By applying these advanced and promising proteomic strategies with bioinformatics applications and bio-statistical tools the prevalence of DM 2 and its associated disorders i-e nephropathy and retinopathy are expected to be controlled. Copyright © 2018 Diabetes India. Published by Elsevier Ltd. All rights reserved.
Modulating the protein content of complex proteomes using acetonitrile.
Prates, João; Martins, Gonçalo; López-Fernández, Hugo; Lodeiro, Carlos; Capelo, J L; Santos, Hugo M
2018-05-15
In this work we present acetonitrile as a tool to modulate the dynamic range of the proteome of complex samples. Different concentrations of acetonitrile ranging from 15% v/v to 65% v/v were used to modulate the protein content of serum samples from healthy people and patients with lymphoma and myeloma. We show that the proteome above 70 kDa is pelleted as a function of the concentration of acetonitrile and that profiling with PCA or Clustering is only possible using the supernatants obtained for concentrations of acetonitrile higher than 45% v/v or the pellets for concentrations of acetonitrile of 35% and 45%. The differentiation and classification of the three groups of sera samples (healthy, lymphoma and myeloma) were possible using acetonitrile at 55% v/v concentration. This work opens new avenues for the application of acetonitrile as a cost-effective tool in proteomics applications. Copyright © 2018 Elsevier B.V. All rights reserved.
Schwenk, Jochen M; Omenn, Gilbert S; Sun, Zhi; Campbell, David S; Baker, Mark S; Overall, Christopher M; Aebersold, Ruedi; Moritz, Robert L; Deutsch, Eric W
2017-12-01
Human blood plasma provides a highly accessible window to the proteome of any individual in health and disease. Since its inception in 2002, the Human Proteome Organization's Human Plasma Proteome Project (HPPP) has been promoting advances in the study and understanding of the full protein complement of human plasma and on determining the abundance and modifications of its components. In 2017, we review the history of the HPPP and the advances of human plasma proteomics in general, including several recent achievements. We then present the latest 2017-04 build of Human Plasma PeptideAtlas, which yields ∼43 million peptide-spectrum matches and 122,730 distinct peptide sequences from 178 individual experiments at a 1% protein-level FDR globally across all experiments. Applying the latest Human Proteome Project Data Interpretation Guidelines, we catalog 3509 proteins that have at least two non-nested uniquely mapping peptides of nine amino acids or more and >1300 additional proteins with ambiguous evidence. We apply the same two-peptide guideline to historical PeptideAtlas builds going back to 2006 and examine the progress made in the past ten years in plasma proteome coverage. We also compare the distribution of proteins in historical PeptideAtlas builds in various RNA abundance and cellular localization categories. We then discuss advances in plasma proteomics based on targeted mass spectrometry as well as affinity assays, which during early 2017 target ∼2000 proteins. Finally, we describe considerations about sample handling and study design, concluding with an outlook for future advances in deciphering the human plasma proteome.
Thiele, Herbert; Glandorf, Jörg; Hufnagel, Peter
2010-05-27
With the large variety of Proteomics workflows, as well as the large variety of instruments and data-analysis software available, researchers today face major challenges validating and comparing their Proteomics data. Here we present a new generation of the ProteinScape bioinformatics platform, now enabling researchers to manage Proteomics data from the generation and data warehousing to a central data repository with a strong focus on the improved accuracy, reproducibility and comparability demanded by many researchers in the field. It addresses scientists; current needs in proteomics identification, quantification and validation. But producing large protein lists is not the end point in Proteomics, where one ultimately aims to answer specific questions about the biological condition or disease model of the analyzed sample. In this context, a new tool has been developed at the Spanish Centro Nacional de Biotecnologia Proteomics Facility termed PIKE (Protein information and Knowledge Extractor) that allows researchers to control, filter and access specific information from genomics and proteomic databases, to understand the role and relationships of the proteins identified in the experiments. Additionally, an EU funded project, ProDac, has coordinated systematic data collection in public standards-compliant repositories like PRIDE. This will cover all aspects from generating MS data in the laboratory, assembling the whole annotation information and storing it together with identifications in a standardised format.
mzStudio: A Dynamic Digital Canvas for User-Driven Interrogation of Mass Spectrometry Data.
Ficarro, Scott B; Alexander, William M; Marto, Jarrod A
2017-08-01
Although not yet truly 'comprehensive', modern mass spectrometry-based experiments can generate quantitative data for a meaningful fraction of the human proteome. Importantly for large-scale protein expression analysis, robust data pipelines are in place for identification of un-modified peptide sequences and aggregation of these data to protein-level quantification. However, interoperable software tools that enable scientists to computationally explore and document novel hypotheses for peptide sequence, modification status, or fragmentation behavior are not well-developed. Here, we introduce mzStudio, an open-source Python module built on our multiplierz project. This desktop application provides a highly-interactive graphical user interface (GUI) through which scientists can examine and annotate spectral features, re-search existing PSMs to test different modifications or new spectral matching algorithms, share results with colleagues, integrate other domain-specific software tools, and finally create publication-quality graphics. mzStudio leverages our common application programming interface (mzAPI) for access to native data files from multiple instrument platforms, including ion trap, quadrupole time-of-flight, Orbitrap, matrix-assisted laser desorption ionization, and triple quadrupole mass spectrometers and is compatible with several popular search engines including Mascot, Proteome Discoverer, X!Tandem, and Comet. The mzStudio toolkit enables researchers to create a digital provenance of data analytics and other evidence that support specific peptide sequence assignments.
López-Pedrouso, María; Bernal, Javier; Franco, Daniel; Zapata, Carlos
2014-07-23
High-resolution two-dimensional electrophoresis (2-DE) profiles of the protein phaseolin, the major seed storage protein of common bean, display great number of spots with differentially glycosylated and phosphorylated α- and β-type polypeptides. This work aims to test whether these complex profiles can be useful markers of genetic differentiation and seed protein quality in bean populations. The 2-DE phaseolin profile and the amino acid composition were examined in bean seeds from 18 domesticated and wild accessions belonging to the Mesoamerican and Andean gene pools. We found that proteomic distances based on 2-DE profiles were successful in identifying the accessions belonging to each gene pool and outliers distantly related. In addition, accessions identified as outliers from proteomic distances showed the highest levels of methionine content, an essential amino acid deficient in bean seeds. These findings suggest that 2-DE phaseolin profiles provide valuable information with potential of being used in common bean genetic improvement.
Saliva Proteomics Analysis Offers Insights on Type 1 Diabetes Pathology in a Pediatric Population
Pappa, Eftychia; Vastardis, Heleni; Mermelekas, George; Gerasimidi-Vazeou, Andriani; Zoidakis, Jerome; Vougas, Konstantinos
2018-01-01
The composition of the salivary proteome is affected by pathological conditions. We analyzed by high resolution mass spectrometry approaches saliva samples collected from children and adolescents with type 1 diabetes and healthy controls. The list of more than 2000 high confidence protein identifications constitutes a comprehensive characterization of the salivary proteome. Patients with good glycemic regulation and healthy individuals have comparable proteomic profiles. In contrast, a significant number of differentially expressed proteins were identified in the saliva of patients with poor glycemic regulation compared to patients with good glycemic control and healthy children. These proteins are involved in biological processes relevant to diabetic pathology such as endothelial damage and inflammation. Moreover, a putative preventive therapeutic approach was identified based on bioinformatic analysis of the deregulated salivary proteins. Thus, thorough characterization of saliva proteins in diabetic pediatric patients established a connection between molecular changes and disease pathology. This proteomic and bioinformatic approach highlights the potential of salivary diagnostics in diabetes pathology and opens the way for preventive treatment of the disease. PMID:29755368
Public data and open source tools for multi-assay genomic investigation of disease.
Kannan, Lavanya; Ramos, Marcel; Re, Angela; El-Hachem, Nehme; Safikhani, Zhaleh; Gendoo, Deena M A; Davis, Sean; Gomez-Cabrero, David; Castelo, Robert; Hansen, Kasper D; Carey, Vincent J; Morgan, Martin; Culhane, Aedín C; Haibe-Kains, Benjamin; Waldron, Levi
2016-07-01
Molecular interrogation of a biological sample through DNA sequencing, RNA and microRNA profiling, proteomics and other assays, has the potential to provide a systems level approach to predicting treatment response and disease progression, and to developing precision therapies. Large publicly funded projects have generated extensive and freely available multi-assay data resources; however, bioinformatic and statistical methods for the analysis of such experiments are still nascent. We review multi-assay genomic data resources in the areas of clinical oncology, pharmacogenomics and other perturbation experiments, population genomics and regulatory genomics and other areas, and tools for data acquisition. Finally, we review bioinformatic tools that are explicitly geared toward integrative genomic data visualization and analysis. This review provides starting points for accessing publicly available data and tools to support development of needed integrative methods. © The Author 2015. Published by Oxford University Press.
Questioning the efficacy of 'gold' open access to published articles.
Fredericks, Suzanne
2015-07-01
To question the efficacy of 'gold' open access to published articles. Open access is unrestricted access to academic, theoretical and research literature that is scholarly and peer-reviewed. Two models of open access exist: 'gold' and 'green'. Gold open access provides everyone with access to articles during all stages of publication, with processing charges paid by the author(s). Green open access involves placing an already published article into a repository to provide unrestricted access, with processing charges incurred by the publisher. This is a discussion paper. An exploration of the relative benefits and drawbacks of the 'gold' and 'green' open access systems. Green open access is a more economic and efficient means of granting open access to scholarly literature but a large number of researchers select gold open access journals as their first choices for manuscript submissions. This paper questions the efficacy of gold open access models and presents an examination of green open access models to encourage nurse researchers to consider this approach. In the current academic environment, with increased pressures to publish and low funding success rates, it is difficult to understand why gold open access still exists. Green open access enhances the visibility of an academic's work, as increased downloads of articles tend to lead to increased citations. Green open access is the cheaper option, as well as the most beneficial choice, for universities that want to provide unrestricted access to all literature at minimal risk.
Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi
2017-06-23
The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from genome sequences, though there are over lapped proteins. Based on the demonstrated application of data stored in the database for functional analyses, it is suggested that these data will be useful for analyses of biological mechanisms in soybean. Furthermore, coupled with recent advances in information and communication technology, the usefulness of this database would increase in the analyses of biological mechanisms. Copyright © 2017 Elsevier B.V. All rights reserved.
Development of a proteomic approach to monitor protein synthesis in mycotoxin producing moulds.
Milles, J; Krämer, J; Prange, A
2007-12-01
In general, proteome studies compare different states of metabolism to investigate external or internal influences on protein expression. In the context of mycotoxin production the method could open another view on this complex and could be helpful to gain knowledge about proteins which are involved in metabolism (enzymes, transporters). In this short technical report, we describe a new protocol suitable for protein preparation for whole proteome analysis ofFusarium graminearum. Cell lysis was performed by grinding the mycelium with liquid nitrogen. Proteins were extracted with TCA/acetone and then cleaned; the isolated proteins were separated in a 2D-gel electrophoresis system (BioRad) using different pH gradients. The protocol established seems also generally applicable for other mycotoxin producing fungi.
Aasebø, Elise; Forthun, Rakel B.; Berven, Frode; Selheim, Frode; Hernandez-Valladares, Maria
2016-01-01
The identification of protein biomarkers for acute myeloid leukemia (AML) that could find applications in AML diagnosis and prognosis, treatment and the selection for bone marrow transplant requires substantial comparative analyses of the proteomes from AML patients. In the past years, several studies have suggested some biomarkers for AML diagnosis or AML classification using methods for sample preparation with low proteome coverage and low resolution mass spectrometers. However, most of the studies did not follow up, confirm or validate their candidates with more patient samples. Current proteomics methods, new high resolution and fast mass spectrometers allow the identification and quantification of several thousands of proteins obtained from few tens of μg of AML cell lysate. Enrichment methods for posttranslational modifications (PTM), such as phosphorylation, can isolate several thousands of site-specific phosphorylated peptides from AML patient samples, which subsequently can be quantified with high confidence in new mass spectrometers. While recent reports aiming to propose proteomic or phosphoproteomic biomarkers on the studied AML patient samples have taken advantage of the technological progress, the access to large cohorts of AML patients to sample from and the availability of appropriate control samples still remain challenging. PMID:26306748
Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L
2010-07-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.
Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.
2010-01-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087
Natural variation in floral nectar proteins of two Nicotiana attenuata accessions.
Seo, Pil Joon; Wielsch, Natalie; Kessler, Danny; Svatos, Ales; Park, Chung-Mo; Baldwin, Ian T; Kim, Sang-Gyu
2013-07-13
Floral nectar (FN) contains not only energy-rich compounds to attract pollinators, but also defense chemicals and several proteins. However, proteomic analysis of FN has been hampered by the lack of publically available sequence information from nectar-producing plants. Here we used next-generation sequencing and advanced proteomics to profile FN proteins in the opportunistic outcrossing wild tobacco, Nicotiana attenuata. We constructed a transcriptome database of N. attenuata and characterized its nectar proteome using LC-MS/MS. The FN proteins of N. attenuata included nectarins, sugar-cleaving enzymes (glucosidase, galactosidase, and xylosidase), RNases, pathogen-related proteins, and lipid transfer proteins. Natural variation in FN proteins of eleven N. attenuata accessions revealed a negative relationship between the accumulation of two abundant proteins, nectarin1b and nectarin5. In addition, microarray analysis of nectary tissues revealed that protein accumulation in FN is not simply correlated with the accumulation of transcripts encoding FN proteins and identified a group of genes that were specifically expressed in the nectary. Natural variation of identified FN proteins in the ecological model plant N. attenuata suggests that nectar chemistry may have a complex function in plant-pollinator-microbe interactions.
Natural variation in floral nectar proteins of two Nicotiana attenuata accessions
2013-01-01
Background Floral nectar (FN) contains not only energy-rich compounds to attract pollinators, but also defense chemicals and several proteins. However, proteomic analysis of FN has been hampered by the lack of publically available sequence information from nectar-producing plants. Here we used next-generation sequencing and advanced proteomics to profile FN proteins in the opportunistic outcrossing wild tobacco, Nicotiana attenuata. Results We constructed a transcriptome database of N. attenuata and characterized its nectar proteome using LC-MS/MS. The FN proteins of N. attenuata included nectarins, sugar-cleaving enzymes (glucosidase, galactosidase, and xylosidase), RNases, pathogen-related proteins, and lipid transfer proteins. Natural variation in FN proteins of eleven N. attenuata accessions revealed a negative relationship between the accumulation of two abundant proteins, nectarin1b and nectarin5. In addition, microarray analysis of nectary tissues revealed that protein accumulation in FN is not simply correlated with the accumulation of transcripts encoding FN proteins and identified a group of genes that were specifically expressed in the nectary. Conclusions Natural variation of identified FN proteins in the ecological model plant N. attenuata suggests that nectar chemistry may have a complex function in plant-pollinator-microbe interactions. PMID:23848992
How well can morphology assess cell death modality? A proteomics study
Chernobrovkin, Alexey L; Zubarev, Roman A
2016-01-01
While the focus of attempts to classify cell death programs has finally shifted in 2010s from microscopy-based morphological characteristics to biochemical assays, more recent discoveries have put the underlying assumptions of many such assays under severe stress, mostly because of the limited specificity of the assays. On the other hand, proteomics can quantitatively measure the abundances of thousands of proteins in a single experiment. Thus proteomics could develop a modern alternative to both semiquantitative morphology assessment as well as single-molecule biochemical assays. Here we tested this hypothesis by analyzing the proteomes of cells dying after been treated with various chemical agents. The most striking finding is that, for a multivariate model based on the proteome changes in three cells lines, the regulation patterns of the 200–500 most abundant proteins typically attributed to household type more accurately reflect that of the proteins directly interacting with the drug than any other protein subset grouped by common function or biological process, including cell death. This is in broad agreement with the 'rigid cell death mechanics' model where drug action mechanism and morphological changes caused by it are bijectively linked. This finding, if confirmed, will open way for a broad use of proteomics in death modality assessment. PMID:27752363
Maillard Proteomics: Opening New Pages
Soboleva, Alena; Schmidt, Rico; Vikhnina, Maria; Grishina, Tatiana; Frolov, Andrej
2017-01-01
Protein glycation is a ubiquitous non-enzymatic post-translational modification, formed by reaction of protein amino and guanidino groups with carbonyl compounds, presumably reducing sugars and α-dicarbonyls. Resulting advanced glycation end products (AGEs) represent a highly heterogeneous group of compounds, deleterious in mammals due to their pro-inflammatory effect, and impact in pathogenesis of diabetes mellitus, Alzheimer’s disease and ageing. The body of information on the mechanisms and pathways of AGE formation, acquired during the last decades, clearly indicates a certain site-specificity of glycation. It makes characterization of individual glycation sites a critical pre-requisite for understanding in vivo mechanisms of AGE formation and developing adequate nutritional and therapeutic approaches to reduce it in humans. In this context, proteomics is the methodology of choice to address site-specific molecular changes related to protein glycation. Therefore, here we summarize the methods of Maillard proteomics, specifically focusing on the techniques providing comprehensive structural and quantitative characterization of glycated proteome. Further, we address the novel break-through areas, recently established in the field of Maillard research, i.e., in vitro models based on synthetic peptides, site-based diagnostics of metabolism-related diseases (e.g., diabetes mellitus), proteomics of anti-glycative defense, and dynamics of plant glycated proteome during ageing and response to environmental stress. PMID:29231845
Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I
2018-05-08
Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value<10-10) and -0.46 (p-value<10-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. shakhnovich@chemistry.harvard.edu.
Goeminne, Ludger J E; Gevaert, Kris; Clement, Lieven
2018-01-16
Label-free shotgun proteomics is routinely used to assess proteomes. However, extracting relevant information from the massive amounts of generated data remains difficult. This tutorial provides a strong foundation on analysis of quantitative proteomics data. We provide key statistical concepts that help researchers to design proteomics experiments and we showcase how to analyze quantitative proteomics data using our recent free and open-source R package MSqRob, which was developed to implement the peptide-level robust ridge regression method for relative protein quantification described by Goeminne et al. MSqRob can handle virtually any experimental proteomics design and outputs proteins ordered by statistical significance. Moreover, its graphical user interface and interactive diagnostic plots provide easy inspection and also detection of anomalies in the data and flaws in the data analysis, allowing deeper assessment of the validity of results and a critical review of the experimental design. Our tutorial discusses interactive preprocessing, data analysis and visualization of label-free MS-based quantitative proteomics experiments with simple and more complex designs. We provide well-documented scripts to run analyses in bash mode on GitHub, enabling the integration of MSqRob in automated pipelines on cluster environments (https://github.com/statOmics/MSqRob). The concepts outlined in this tutorial aid in designing better experiments and analyzing the resulting data more appropriately. The two case studies using the MSqRob graphical user interface will contribute to a wider adaptation of advanced peptide-based models, resulting in higher quality data analysis workflows and more reproducible results in the proteomics community. We also provide well-documented scripts for experienced users that aim at automating MSqRob on cluster environments. Copyright © 2017 Elsevier B.V. All rights reserved.
The Human Genome Project: big science transforms biology and medicine.
Hood, Leroy; Rowen, Lee
2013-01-01
The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called 'big science' - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and analytical tools, and how it brought the expertise of engineers, computer scientists and mathematicians together with biologists. It established an open approach to data sharing and open-source software, thereby making the data resulting from the project accessible to all. The genome sequences of microbes, plants and animals have revolutionized many fields of science, including microbiology, virology, infectious disease and plant biology. Moreover, deeper knowledge of human sequence variation has begun to alter the practice of medicine. The Human Genome Project has inspired subsequent large-scale data acquisition initiatives such as the International HapMap Project, 1000 Genomes, and The Cancer Genome Atlas, as well as the recently announced Human Brain Project and the emerging Human Proteome Project.
The Human Genome Project: big science transforms biology and medicine
2013-01-01
The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called ‘big science’ - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and analytical tools, and how it brought the expertise of engineers, computer scientists and mathematicians together with biologists. It established an open approach to data sharing and open-source software, thereby making the data resulting from the project accessible to all. The genome sequences of microbes, plants and animals have revolutionized many fields of science, including microbiology, virology, infectious disease and plant biology. Moreover, deeper knowledge of human sequence variation has begun to alter the practice of medicine. The Human Genome Project has inspired subsequent large-scale data acquisition initiatives such as the International HapMap Project, 1000 Genomes, and The Cancer Genome Atlas, as well as the recently announced Human Brain Project and the emerging Human Proteome Project. PMID:24040834
Eckhard, Ulrich; Huesgen, Pitter F; Schilling, Oliver; Bellac, Caroline L; Butler, Georgina S; Cox, Jennifer H; Dufour, Antoine; Goebeler, Verena; Kappelhoff, Reinhild; Auf dem Keller, Ulrich; Klein, Theo; Lange, Philipp F; Marino, Giada; Morrison, Charlotte J; Prudova, Anna; Rodriguez, David; Starr, Amanda E; Wang, Yili; Overall, Christopher M
2016-06-01
The data described provide a comprehensive resource for the family-wide active site specificity portrayal of the human matrix metalloproteinase family. We used the high-throughput proteomic technique PICS (Proteomic Identification of protease Cleavage Sites) to comprehensively assay 9 different MMPs. We identified more than 4300 peptide cleavage sites, spanning both the prime and non-prime sides of the scissile peptide bond allowing detailed subsite cooperativity analysis. The proteomic cleavage data were expanded by kinetic analysis using a set of 6 quenched-fluorescent peptide substrates designed using these results. These datasets represent one of the largest specificity profiling efforts with subsequent structural follow up for any protease family and put the spotlight on the specificity similarities and differences of the MMP family. A detailed analysis of this data may be found in Eckhard et al. (2015) [1]. The raw mass spectrometry data and the corresponding metadata have been deposited in PRIDE/ProteomeXchange with the accession number PXD002265.
P-MartCancer–Interactive Online Software to Enable Analysis of Shotgun Cancer Proteomic Datasets
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Bramer, Lisa M.; Jensen, Jeffrey L.
P-MartCancer is a new interactive web-based software environment that enables biomedical and biological scientists to perform in-depth analyses of global proteomics data without requiring direct interaction with the data or with statistical software. P-MartCancer offers a series of statistical modules associated with quality assessment, peptide and protein statistics, protein quantification and exploratory data analyses driven by the user via customized workflows and interactive visualization. Currently, P-MartCancer offers access to multiple cancer proteomic datasets generated through the Clinical Proteomics Tumor Analysis Consortium (CPTAC) at the peptide, gene and protein levels. P-MartCancer is deployed using Azure technologies (http://pmart.labworks.org/cptac.html), the web-service is alternativelymore » available via Docker Hub (https://hub.docker.com/r/pnnl/pmart-web/) and many statistical functions can be utilized directly from an R package available on GitHub (https://github.com/pmartR).« less
Global Proteomics Analysis of the Response to Starvation in C. elegans*
Larance, Mark; Pourkarimi, Ehsan; Wang, Bin; Brenes Murillo, Alejandro; Kent, Robert; Lamond, Angus I.; Gartner, Anton
2015-01-01
Periodic starvation of animals induces large shifts in metabolism but may also influence many other cellular systems and can lead to adaption to prolonged starvation conditions. To date, there is limited understanding of how starvation affects gene expression, particularly at the protein level. Here, we have used mass-spectrometry-based quantitative proteomics to identify global changes in the Caenorhabditis elegans proteome due to acute starvation of young adult animals. Measuring changes in the abundance of over 5,000 proteins, we show that acute starvation rapidly alters the levels of hundreds of proteins, many involved in central metabolic pathways, highlighting key regulatory responses. Surprisingly, we also detect changes in the abundance of chromatin-associated proteins, including specific linker histones, histone variants, and histone posttranslational modifications associated with the epigenetic control of gene expression. To maximize community access to these data, they are presented in an online searchable database, the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd/). PMID:25963834
Progress and challenges for abiotic stress proteomics of crop plants.
Barkla, Bronwyn J; Vera-Estrella, Rosario; Pantoja, Omar
2013-06-01
Plants are continually challenged to recognize and respond to adverse changes in their environment to avoid detrimental effects on growth and development. Understanding the mechanisms that crop plants employ to resist and tolerate abiotic stress is of considerable interest for designing agriculture breeding strategies to ensure sustainable productivity. The application of proteomics technologies to advance our knowledge in crop plant abiotic stress tolerance has increased dramatically in the past few years as evidenced by the large amount of publications in this area. This is attributed to advances in various technology platforms associated with MS-based techniques as well as the accessibility of proteomics units to a wider plant research community. This review summarizes the work which has been reported for major crop plants and evaluates the findings in context of the approaches that are widely employed with the aim to encourage broadening the strategies used to increase coverage of the proteome. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kusebauch, Ulrike; Deutsch, Eric W.; Campbell, David S.; Sun, Zhi; Farrah, Terry; Moritz, Robert L.
2014-01-01
PeptideAtlas, SRMAtlas and PASSEL are web-accessible resources to support discovery and targeted proteomics research. PeptideAtlas is a multi-species compendium of shotgun proteomic data provided by the scientific community, SRMAtlas is a resource of high-quality, complete proteome SRM assays generated in a consistent manner for the targeted identification and quantification of proteins, and PASSEL is a repository that compiles and represents selected reaction monitoring data, all in an easy to use interface. The databases are generated from native mass spectrometry data files that are analyzed in a standardized manner including statistical validation of the results. Each resource offers search functionalities and can be queried by user defined constraints; the query results are provided in tables or are graphically displayed. PeptideAtlas, SRMAtlas and PASSEL are publicly available freely via the website http://www.peptideatlas.org. In this protocol, we describe the use of these resources, we highlight how to submit, search, collate and download data. PMID:24939129
Steiner, Carine; Ducret, Axel; Tille, Jean-Christophe; Thomas, Marlene; McKee, Thomas A; Rubbia-Brandt, Laura A; Scherl, Alexander; Lescuyer, Pierre; Cutler, Paul
2014-01-01
Proteomic analysis of tissues has advanced in recent years as instruments and methodologies have evolved. The ability to retrieve peptides from formalin-fixed paraffin-embedded tissues followed by shotgun or targeted proteomic analysis is offering new opportunities in biomedical research. In particular, access to large collections of clinically annotated samples should enable the detailed analysis of pathologically relevant tissues in a manner previously considered unfeasible. In this paper, we review the current status of proteomic analysis of formalin-fixed paraffin-embedded tissues with a particular focus on targeted approaches and the potential for this technique to be used in clinical research and clinical diagnosis. We also discuss the limitations and perspectives of the technique, particularly with regard to application in clinical diagnosis and drug discovery. PMID:24339433
OpenKnowledge for peer-to-peer experimentation in protein identification by MS/MS
2011-01-01
Background Traditional scientific workflow platforms usually run individual experiments with little evaluation and analysis of performance as required by automated experimentation in which scientists are being allowed to access numerous applicable workflows rather than being committed to a single one. Experimental protocols and data under a peer-to-peer environment could potentially be shared freely without any single point of authority to dictate how experiments should be run. In such environment it is necessary to have mechanisms by which each individual scientist (peer) can assess, locally, how he or she wants to be involved with others in experiments. This study aims to implement and demonstrate simple peer ranking under the OpenKnowledge peer-to-peer infrastructure by both simulated and real-world bioinformatics experiments involving multi-agent interactions. Methods A simulated experiment environment with a peer ranking capability was specified by the Lightweight Coordination Calculus (LCC) and automatically executed under the OpenKnowledge infrastructure. The peers such as MS/MS protein identification services (including web-enabled and independent programs) were made accessible as OpenKnowledge Components (OKCs) for automated execution as peers in the experiments. The performance of the peers in these automated experiments was monitored and evaluated by simple peer ranking algorithms. Results Peer ranking experiments with simulated peers exhibited characteristic behaviours, e.g., power law effect (a few dominant peers dominate), similar to that observed in the traditional Web. Real-world experiments were run using an interaction model in LCC involving two different types of MS/MS protein identification peers, viz., peptide fragment fingerprinting (PFF) and de novo sequencing with another peer ranking algorithm simply based on counting the successful and failed runs. This study demonstrated a novel integration and useful evaluation of specific proteomic peers and found MASCOT to be a dominant peer as judged by peer ranking. Conclusion The simulated and real-world experiments in the present study demonstrated that the OpenKnowledge infrastructure with peer ranking capability can serve as an evaluative environment for automated experimentation. PMID:22192521
Hiller, Karsten; Grote, Andreas; Maneck, Matthias; Münch, Richard; Jahn, Dieter
2006-10-01
After the publication of JVirGel 1.0 in 2003 we got many requests and suggestions from the proteomics community to further improve the performance of the software and to add additional useful new features. The integration of the PrediSi algorithm for the prediction of signal peptides for the Sec-dependent protein export into JVirGel 2.0 allows the exclusion of most exported preproteins from calculated proteomic maps and provides the basis for the calculation of Sec-based secretomes. A tool for the identification of transmembrane helices carrying proteins (JCaMelix) and the prediction of the corresponding membrane proteome was added. Finally, in order to directly compare experimental and calculated proteome data, a function to overlay and evaluate predicted and experimental two-dimensional gels was included. JVirGel 2.0 is freely available as precompiled package for the installation on Windows or Linux operating systems. Furthermore, there is a completely platform-independent Java version available for download. Additionally, we provide a Java Server Pages based version of JVirGel 2.0 which can be operated in nearly all web browsers. All versions are accessible at http://www.jvirgel.de
High-Throughput Cloning and Expression Library Creation for Functional Proteomics
Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua
2013-01-01
The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particular important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047
A draft map of the mouse pluripotent stem cell spatial proteome
Christoforou, Andy; Mulvey, Claire M.; Breckels, Lisa M.; Geladaki, Aikaterini; Hurrell, Tracey; Hayward, Penelope C.; Naake, Thomas; Gatto, Laurent; Viner, Rosa; Arias, Alfonso Martinez; Lilley, Kathryn S.
2016-01-01
Knowledge of the subcellular distribution of proteins is vital for understanding cellular mechanisms. Capturing the subcellular proteome in a single experiment has proven challenging, with studies focusing on specific compartments or assigning proteins to subcellular niches with low resolution and/or accuracy. Here we introduce hyperLOPIT, a method that couples extensive fractionation, quantitative high-resolution accurate mass spectrometry with multivariate data analysis. We apply hyperLOPIT to a pluripotent stem cell population whose subcellular proteome has not been extensively studied. We provide localization data on over 5,000 proteins with unprecedented spatial resolution to reveal the organization of organelles, sub-organellar compartments, protein complexes, functional networks and steady-state dynamics of proteins and unexpected subcellular locations. The method paves the way for characterizing the impact of post-transcriptional and post-translational modification on protein location and studies involving proteome-level locational changes on cellular perturbation. An interactive open-source resource is presented that enables exploration of these data. PMID:26754106
Arbuscular mycorrhizal symbiosis affects the grain proteome of Zea mays: a field study
Bona, Elisa; Scarafoni, Alessio; Marsano, Francesco; Boatti, Lara; Copetta, Andrea; Massa, Nadia; Gamalero, Elisa; D’Agostino, Giovanni; Cesaro, Patrizia; Cavaletto, Maria; Berta, Graziella
2016-01-01
Maize is one of the most important crops worldwide and is strongly dependent on arbuscular mycorrhiza (AM) fungi, organisms that form a mutualistic association with land plants. In maize, AM symbiosis enhances spike dry weight, spike length, spike circumference, and the dry weight and dimensions of the grain. Notwithstanding its ubiquitous nature, the detailed relationship between AM fungal colonization and plant development is not completely understood. To facilitate a better understanding of the effects of AM fungi on plants, the work reported here assessed the effects of a consortium of AM fungi on the kernel proteome of maize, cultivated in open-field conditions. To our knowledge, this is the first report of the modulation of a plant seed proteome following AM fungal inoculation in the field. Here, it was found that AM fungi modify the maize seed proteome by up-regulating enzymes involved in energetic metabolism, embryo development, nucleotide metabolism, seed storage and stress responses. PMID:27216714
Mspire-Simulator: LC-MS shotgun proteomic simulator for creating realistic gold standard data.
Noyce, Andrew B; Smith, Rob; Dalgleish, James; Taylor, Ryan M; Erb, K C; Okuda, Nozomu; Prince, John T
2013-12-06
The most important step in any quantitative proteomic pipeline is feature detection (aka peak picking). However, generating quality hand-annotated data sets to validate the algorithms, especially for lower abundance peaks, is nearly impossible. An alternative for creating gold standard data is to simulate it with features closely mimicking real data. We present Mspire-Simulator, a free, open-source shotgun proteomic simulator that goes beyond previous simulation attempts by generating LC-MS features with realistic m/z and intensity variance along with other noise components. It also includes machine-learned models for retention time and peak intensity prediction and a genetic algorithm to custom fit model parameters for experimental data sets. We show that these methods are applicable to data from three different mass spectrometers, including two fundamentally different types, and show visually and analytically that simulated peaks are nearly indistinguishable from actual data. Researchers can use simulated data to rigorously test quantitation software, and proteomic researchers may benefit from overlaying simulated data on actual data sets.
Interaction Analysis through Proteomic Phage Display
2014-01-01
Phage display is a powerful technique for profiling specificities of peptide binding domains. The method is suited for the identification of high-affinity ligands with inhibitor potential when using highly diverse combinatorial peptide phage libraries. Such experiments further provide consensus motifs for genome-wide scanning of ligands of potential biological relevance. A complementary but considerably less explored approach is to display expression products of genomic DNA, cDNA, open reading frames (ORFs), or oligonucleotide libraries designed to encode defined regions of a target proteome on phage particles. One of the main applications of such proteomic libraries has been the elucidation of antibody epitopes. This review is focused on the use of proteomic phage display to uncover protein-protein interactions of potential relevance for cellular function. The method is particularly suited for the discovery of interactions between peptide binding domains and their targets. We discuss the largely unexplored potential of this method in the discovery of domain-motif interactions of potential biological relevance. PMID:25295249
A multi-center study benchmarks software tools for label-free proteome quantification
Gillet, Ludovic C; Bernhardt, Oliver M.; MacLean, Brendan; Röst, Hannes L.; Tate, Stephen A.; Tsou, Chih-Chiang; Reiter, Lukas; Distler, Ute; Rosenberger, George; Perez-Riverol, Yasset; Nesvizhskii, Alexey I.; Aebersold, Ruedi; Tenzer, Stefan
2016-01-01
The consistent and accurate quantification of proteins by mass spectrometry (MS)-based proteomics depends on the performance of instruments, acquisition methods and data analysis software. In collaboration with the software developers, we evaluated OpenSWATH, SWATH2.0, Skyline, Spectronaut and DIA-Umpire, five of the most widely used software methods for processing data from SWATH-MS (sequential window acquisition of all theoretical fragment ion spectra), a method that uses data-independent acquisition (DIA) for label-free protein quantification. We analyzed high-complexity test datasets from hybrid proteome samples of defined quantitative composition acquired on two different MS instruments using different SWATH isolation windows setups. For consistent evaluation we developed LFQbench, an R-package to calculate metrics of precision and accuracy in label-free quantitative MS, and report the identification performance, robustness and specificity of each software tool. Our reference datasets enabled developers to improve their software tools. After optimization, all tools provided highly convergent identification and reliable quantification performance, underscoring their robustness for label-free quantitative proteomics. PMID:27701404
A multicenter study benchmarks software tools for label-free proteome quantification.
Navarro, Pedro; Kuharev, Jörg; Gillet, Ludovic C; Bernhardt, Oliver M; MacLean, Brendan; Röst, Hannes L; Tate, Stephen A; Tsou, Chih-Chiang; Reiter, Lukas; Distler, Ute; Rosenberger, George; Perez-Riverol, Yasset; Nesvizhskii, Alexey I; Aebersold, Ruedi; Tenzer, Stefan
2016-11-01
Consistent and accurate quantification of proteins by mass spectrometry (MS)-based proteomics depends on the performance of instruments, acquisition methods and data analysis software. In collaboration with the software developers, we evaluated OpenSWATH, SWATH 2.0, Skyline, Spectronaut and DIA-Umpire, five of the most widely used software methods for processing data from sequential window acquisition of all theoretical fragment-ion spectra (SWATH)-MS, which uses data-independent acquisition (DIA) for label-free protein quantification. We analyzed high-complexity test data sets from hybrid proteome samples of defined quantitative composition acquired on two different MS instruments using different SWATH isolation-window setups. For consistent evaluation, we developed LFQbench, an R package, to calculate metrics of precision and accuracy in label-free quantitative MS and report the identification performance, robustness and specificity of each software tool. Our reference data sets enabled developers to improve their software tools. After optimization, all tools provided highly convergent identification and reliable quantification performance, underscoring their robustness for label-free quantitative proteomics.
Arbuscular mycorrhizal symbiosis affects the grain proteome of Zea mays: a field study.
Bona, Elisa; Scarafoni, Alessio; Marsano, Francesco; Boatti, Lara; Copetta, Andrea; Massa, Nadia; Gamalero, Elisa; D'Agostino, Giovanni; Cesaro, Patrizia; Cavaletto, Maria; Berta, Graziella
2016-05-24
Maize is one of the most important crops worldwide and is strongly dependent on arbuscular mycorrhiza (AM) fungi, organisms that form a mutualistic association with land plants. In maize, AM symbiosis enhances spike dry weight, spike length, spike circumference, and the dry weight and dimensions of the grain. Notwithstanding its ubiquitous nature, the detailed relationship between AM fungal colonization and plant development is not completely understood. To facilitate a better understanding of the effects of AM fungi on plants, the work reported here assessed the effects of a consortium of AM fungi on the kernel proteome of maize, cultivated in open-field conditions. To our knowledge, this is the first report of the modulation of a plant seed proteome following AM fungal inoculation in the field. Here, it was found that AM fungi modify the maize seed proteome by up-regulating enzymes involved in energetic metabolism, embryo development, nucleotide metabolism, seed storage and stress responses.
Analysis of high accuracy, quantitative proteomics data in the MaxQB database.
Schaab, Christoph; Geiger, Tamar; Stoehr, Gabriele; Cox, Juergen; Mann, Matthias
2012-03-01
MS-based proteomics generates rapidly increasing amounts of precise and quantitative information. Analysis of individual proteomic experiments has made great strides, but the crucial ability to compare and store information across different proteome measurements still presents many challenges. For example, it has been difficult to avoid contamination of databases with low quality peptide identifications, to control for the inflation in false positive identifications when combining data sets, and to integrate quantitative data. Although, for example, the contamination with low quality identifications has been addressed by joint analysis of deposited raw data in some public repositories, we reasoned that there should be a role for a database specifically designed for high resolution and quantitative data. Here we describe a novel database termed MaxQB that stores and displays collections of large proteomics projects and allows joint analysis and comparison. We demonstrate the analysis tools of MaxQB using proteome data of 11 different human cell lines and 28 mouse tissues. The database-wide false discovery rate is controlled by adjusting the project specific cutoff scores for the combined data sets. The 11 cell line proteomes together identify proteins expressed from more than half of all human genes. For each protein of interest, expression levels estimated by label-free quantification can be visualized across the cell lines. Similarly, the expression rank order and estimated amount of each protein within each proteome are plotted. We used MaxQB to calculate the signal reproducibility of the detected peptides for the same proteins across different proteomes. Spearman rank correlation between peptide intensity and detection probability of identified proteins was greater than 0.8 for 64% of the proteome, whereas a minority of proteins have negative correlation. This information can be used to pinpoint false protein identifications, independently of peptide database scores. The information contained in MaxQB, including high resolution fragment spectra, is accessible to the community via a user-friendly web interface at http://www.biochem.mpg.de/maxqb.
50 CFR 660.332 - Open access daily trip limit (DTL) fishery for sablefish.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access daily trip limit (DTL) fishery... COAST STATES West Coast Groundfish-Open Access Fisheries § 660.332 Open access daily trip limit (DTL) fishery for sablefish. (a) Open access DTL fisheries both north and south of 36° N. lat. Open access...
Sethi, Manveen K; Thaysen-Andersen, Morten; Kim, Hoguen; Park, Cheol Keun; Baker, Mark S; Packer, Nicolle H; Paik, Young-Ki; Hancock, William S; Fanayan, Susan
2015-08-03
Modern proteomics has proven instrumental in our understanding of the molecular deregulations associated with the development and progression of cancer. Herein, we profile membrane-enriched proteome of tumor and adjacent normal tissues from eight CRC patients using label-free nanoLC-MS/MS-based quantitative proteomics and advanced pathway analysis. Of the 948 identified proteins, 184 proteins were differentially expressed (P<0.05, fold change>1.5) between the tumor and non-tumor tissue (69 up-regulated and 115 down-regulated in tumor tissues). The CRC tumor and non-tumor tissues clustered tightly in separate groups using hierarchical cluster analysis of the differentially expressed proteins, indicating a strong CRC-association of this proteome subset. Specifically, cancer associated proteins such as FN1, TNC, DEFA1, ITGB2, MLEC, CDH17, EZR and pathways including actin cytoskeleton and RhoGDI signaling were deregulated. Stage-specific proteome signatures were identified including up-regulated ribosomal proteins and down-regulated annexin proteins in early stage CRC. Finally, EGFR(+) CRC tissues showed an EGFR-dependent down-regulation of cell adhesion molecules, relative to EGFR(-) tissues. Taken together, this study provides a detailed map of the altered proteome and associated protein pathways in CRC, which enhances our mechanistic understanding of CRC biology and opens avenues for a knowledge-driven search for candidate CRC protein markers. Copyright © 2015 Elsevier B.V. All rights reserved.
Blattmann, Peter; Heusel, Moritz; Aebersold, Ruedi
2016-01-01
SWATH-MS is an acquisition and analysis technique of targeted proteomics that enables measuring several thousand proteins with high reproducibility and accuracy across many samples. OpenSWATH is popular open-source software for peptide identification and quantification from SWATH-MS data. For downstream statistical and quantitative analysis there exist different tools such as MSstats, mapDIA and aLFQ. However, the transfer of data from OpenSWATH to the downstream statistical tools is currently technically challenging. Here we introduce the R/Bioconductor package SWATH2stats, which allows convenient processing of the data into a format directly readable by the downstream analysis tools. In addition, SWATH2stats allows annotation, analyzing the variation and the reproducibility of the measurements, FDR estimation, and advanced filtering before submitting the processed data to downstream tools. These functionalities are important to quickly analyze the quality of the SWATH-MS data. Hence, SWATH2stats is a new open-source tool that summarizes several practical functionalities for analyzing, processing, and converting SWATH-MS data and thus facilitates the efficient analysis of large-scale SWATH/DIA datasets.
Challenges for proteomics core facilities.
Lilley, Kathryn S; Deery, Michael J; Gatto, Laurent
2011-03-01
Many analytical techniques have been executed by core facilities established within academic, pharmaceutical and other industrial institutions. The centralization of such facilities ensures a level of expertise and hardware which often cannot be supported by individual laboratories. The establishment of a core facility thus makes the technology available for multiple researchers in the same institution. Often, the services within the core facility are also opened out to researchers from other institutions, frequently with a fee being levied for the service provided. In the 1990s, with the onset of the age of genomics, there was an abundance of DNA analysis facilities, many of which have since disappeared from institutions and are now available through commercial sources. Ten years on, as proteomics was beginning to be utilized by many researchers, this technology found itself an ideal candidate for being placed within a core facility. We discuss what in our view are the daily challenges of proteomics core facilities. We also examine the potential unmet needs of the proteomics core facility that may also be applicable to proteomics laboratories which do not function as core facilities. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Shteynberg, David; Deutsch, Eric W.; Lam, Henry; Eng, Jimmy K.; Sun, Zhi; Tasman, Natalie; Mendoza, Luis; Moritz, Robert L.; Aebersold, Ruedi; Nesvizhskii, Alexey I.
2011-01-01
The combination of tandem mass spectrometry and sequence database searching is the method of choice for the identification of peptides and the mapping of proteomes. Over the last several years, the volume of data generated in proteomic studies has increased dramatically, which challenges the computational approaches previously developed for these data. Furthermore, a multitude of search engines have been developed that identify different, overlapping subsets of the sample peptides from a particular set of tandem mass spectrometry spectra. We present iProphet, the new addition to the widely used open-source suite of proteomic data analysis tools Trans-Proteomics Pipeline. Applied in tandem with PeptideProphet, it provides more accurate representation of the multilevel nature of shotgun proteomic data. iProphet combines the evidence from multiple identifications of the same peptide sequences across different spectra, experiments, precursor ion charge states, and modified states. It also allows accurate and effective integration of the results from multiple database search engines applied to the same data. The use of iProphet in the Trans-Proteomics Pipeline increases the number of correctly identified peptides at a constant false discovery rate as compared with both PeptideProphet and another state-of-the-art tool Percolator. As the main outcome, iProphet permits the calculation of accurate posterior probabilities and false discovery rate estimates at the level of sequence identical peptide identifications, which in turn leads to more accurate probability estimates at the protein level. Fully integrated with the Trans-Proteomics Pipeline, it supports all commonly used MS instruments, search engines, and computer platforms. The performance of iProphet is demonstrated on two publicly available data sets: data from a human whole cell lysate proteome profiling experiment representative of typical proteomic data sets, and from a set of Streptococcus pyogenes experiments more representative of organism-specific composite data sets. PMID:21876204
Yang, Yongxin; Bu, Dengpan; Zhao, Xiaowei; Sun, Peng; Wang, Jiaqi; Zhou, Lingyun
2013-04-05
To aid in unraveling diverse genetic and biological unknowns, a proteomic approach was used to analyze the whey proteome in cow, yak, buffalo, goat, and camel milk based on the isobaric tag for relative and absolute quantification (iTRAQ) techniques. This analysis is the first to produce proteomic data for the milk from the above-mentioned animal species: 211 proteins have been identified and 113 proteins have been categorized according to molecular function, cellular components, and biological processes based on gene ontology annotation. The results of principal component analysis showed significant differences in proteomic patterns among goat, camel, cow, buffalo, and yak milk. Furthermore, 177 differentially expressed proteins were submitted to advanced hierarchical clustering. The resulting clustering pattern included three major sample clusters: (1) cow, buffalo, and yak milk; (2) goat, cow, buffalo, and yak milk; and (3) camel milk. Certain proteins were chosen as characterization traits for a given species: whey acidic protein and quinone oxidoreductase for camel milk, biglycan for goat milk, uncharacterized protein (Accession Number: F1MK50 ) for yak milk, clusterin for buffalo milk, and primary amine oxidase for cow milk. These results help reveal the quantitative milk whey proteome pattern for analyzed species. This provides information for evaluating adulteration of specific specie milk and may provide potential directions for application of specific milk protein production based on physiological differences among animal species.
Recent advances and opportunities in proteomic analyses of tumour heterogeneity.
Bateman, Nicholas W; Conrads, Thomas P
2018-04-01
Solid tumour malignancies comprise a highly variable admixture of tumour and non-tumour cellular populations, forming a complex cellular ecosystem and tumour microenvironment. This tumour heterogeneity is not incidental, and is known to correlate with poor patient prognosis for many cancer types. Indeed, non-malignant cell populations, such as vascular endothelial and immune cells, are known to play key roles supporting and, in some cases, driving aggressive tumour biology, and represent targets of emerging therapeutics, such as antiangiogenesis and immune checkpoint inhibitors. The biochemical interplay between these cellular populations and how they contribute to molecular tumour heterogeneity remains enigmatic, particularly from the perspective of the tumour proteome. This review focuses on recent advances in proteomic methods, namely imaging mass spectrometry, single-cell proteomic techniques, and preanalytical sample processing, that are uniquely positioned to enable detailed analysis of discrete cellular populations within tumours to improve our understanding of tumour proteomic heterogeneity. This review further emphasizes the opportunity afforded by the application of these techniques to the analysis of tumour heterogeneity in formalin-fixed paraffin-embedded archival tumour tissues, as these represent an invaluable resource for retrospective analyses that is now routinely accessible, owing to recent technological and methodological advances in tumour tissue proteomics. Copyright © 2018 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd. Copyright © 2018 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.
Kitata, Reta Birhanu; Dimayacyac-Esleta, Baby Rorielyn T.; Choong, Wai-Kok; Tsai, Chia-Feng; Lin, Tai-Du; Tsou, Chih-Chiang; Weng, Shao-Hsing; Chen, Yi-Ju; Yang, Pan-Chyr; Arco, Susan D.; Nesvizhskii, Alexey I.; Sung, Ting-Yi; Chen, Yu-Ju
2016-01-01
Despite significant efforts in the past decade towards complete mapping of the human proteome, 3564 proteins (neXtProt, 09-2014) are still “missing proteins”. Over one-third of these missing proteins are annotated as membrane proteins, owing to their relatively challenging accessibility with standard shotgun proteomics. Using non-small cell lung cancer (NSCLC) as a model study, we aim to mine missing proteins from disease-associated membrane proteome, which may be still largely under-represented. To increase identification coverage, we employed Hp-RP StageTip pre-fractionation of membrane-enriched samples from 11 NSCLC cell lines. Analysis of membrane samples from 20 pairs of tumor and adjacent normal lung tissue were incorporated to include physiologically expressed membrane proteins. Using multiple search engines (X!Tandem, Comet and Mascot) and stringent evaluation of FDR (MAYU and PeptideShaker), we identified 7702 proteins (66% membrane proteins) and 178 missing proteins (74 membrane proteins) with PSM-, peptide-, and protein-level FDR of 1%. Through multiple reaction monitoring (MRM) using synthetic peptides, we provided additional evidences for 8 missing proteins including 7 with transmembrane helix domains (TMH). This study demonstrates that mining missing proteins focused on cancer membrane sub-proteome can greatly contribute to map the whole human proteome. All data were deposited into ProteomeXchange with the identifier PXD002224. PMID:26202522
Simats, Alba; García-Berrocoso, Teresa; Ramiro, Laura; Giralt, Dolors; Gill, Natalia; Penalba, Anna; Bustamante, Alejandro; Rosell, Anna; Montaner, Joan
2018-05-21
The limited accessibility to the brain has turned the cerebrospinal fluid (CSF) into a valuable source that may contribute to the complete understanding of the stroke pathophysiology. Here we have described the CSF proteome in the hyper-acute phase of cerebral ischemia by performing an aptamer-based proteomic assay (SOMAscan) in CSF samples collected before and 30 min after male Wistar rats had undergone a 90 min Middle Cerebral Artery Occlusion (MCAO) or sham-surgery. Proteomic results indicated that cerebral ischemia acutely increased the CSF levels of 716 proteins, mostly overrepresented in leukocyte chemotaxis and neuronal death processes. Seven promising candidates were further evaluated in rat plasma and brain (CKB, CaMK2A, CaMK2B, CaMK2D, PDXP, AREG, CMPK). The 3 CaMK2 family-members and CMPK early decreased in the infarcted brain area and, together with AREG, co-localized with neurons. Conversely, CKB levels remained consistent after the insult and specifically matched with astrocytes. Further exploration of these candidates in human plasma revealed the potential of CKB and CMPK to diagnose stroke, while CaMK2B and CMPK resulted feasible biomarkers of functional stroke outcome. Our findings provided insights into the CSF proteome following cerebral ischemia and identified new outstanding proteins that might be further considered as potential biomarkers of stroke.
Lereim, Ragnhild Reehorst; Oveland, Eystein; Xiao, Yichuan; Torkildsen, Øivind; Wergeland, Stig; Myhr, Kjell-Morten; Sun, Shao-Cong; Berven, Frode S
2016-09-01
The ubiquitin ligase Peli1 has previously been suggested as a potential treatment target in multiple sclerosis. In the multiple sclerosis disease model, experimental autoimmune encephalomyelitis, Peli1 knock-out led to less activated microglia and less inflammation in the central nervous system. Despite being important in microglia, Peli1 expression has also been detected in glial and neuronal cells. In the present study the overall brain proteomes of Peli1 knock-out mice and wild-type mice were compared prior to experimental autoimmune encephalomyelitis induction, at onset of the disease and at disease peak. Brain samples from the frontal hemisphere, peripheral from the extensive inflammatory foci, were analyzed using TMT-labeling of sample pools, and the discovered proteins were verified in individual mice using label-free proteomics. The greatest proteomic differences between Peli1 knock-out and wild-type mice were observed at the disease peak. In Peli1 knock-out a higher degree of antigen presentation, increased activity of adaptive and innate immune cells and alterations to proteins involved in iron metabolism were observed during experimental autoimmune encephalomyelitis. These results unravel global effects to the brain proteome when abrogating Peli1 expression, underlining the importance of Peli1 as a regulator of the immune response also peripheral to inflammatory foci during experimental autoimmune encephalomyelitis. The proteomics data is available in PRIDE with accession PXD003710.
Opportunities and Challenges for Nutritional Proteomics in Cancer Prevention12
Romagnolo, Donato F.; Milner, John A.
2012-01-01
Knowledge gaps persist about the efficacy of cancer prevention strategies based on dietary food components. Adaptations to nutrient supply are executed through tuning of multiple protein networks that include transcription factors, histones, modifying enzymes, translation factors, membrane and nuclear receptors, and secreted proteins. However, the simultaneous quantitative and qualitative measurement of all proteins that regulate cancer processes is not practical using traditional protein methodologies. Proteomics offers an attractive opportunity to fill this knowledge gap and unravel the effects of dietary components on protein networks that impinge on cancer. The articles presented in this supplement are from talks proffered in the “Nutrition Proteomics and Cancer Prevention” session at the American Institute for Cancer Research Annual Research Conference on Food, Nutrition, Physical Activity and Cancer held in Washington, DC on October 21 and 22, 2010. Recent advances in MS technologies suggest that studies in nutrition and cancer prevention may benefit from the adoption of proteomic tools to elucidate the impact on biological processes that govern the transition from normal to malignant phenotype; to identify protein changes that determine both positive and negative responses to food components; to assess how protein networks mediate dose-, time-, and tissue-dependent responses to food components; and, finally, for predicting responders and nonresponders. However, both the limited accessibility to proteomic technologies and research funding appear to be hampering the routine adoption of proteomic tools in nutrition and cancer prevention research. PMID:22649262
2011-01-01
Background Since its inception, proteomics has essentially operated in a discovery mode with the goal of identifying and quantifying the maximal number of proteins in a sample. Increasingly, proteomic measurements are also supporting hypothesis-driven studies, in which a predetermined set of proteins is consistently detected and quantified in multiple samples. Selected reaction monitoring (SRM) is a targeted mass spectrometric technique that supports the detection and quantification of specific proteins in complex samples at high sensitivity and reproducibility. Here, we describe ATAQS, an integrated software platform that supports all stages of targeted, SRM-based proteomics experiments including target selection, transition optimization and post acquisition data analysis. This software will significantly facilitate the use of targeted proteomic techniques and contribute to the generation of highly sensitive, reproducible and complete datasets that are particularly critical for the discovery and validation of targets in hypothesis-driven studies in systems biology. Result We introduce a new open source software pipeline, ATAQS (Automated and Targeted Analysis with Quantitative SRM), which consists of a number of modules that collectively support the SRM assay development workflow for targeted proteomic experiments (project management and generation of protein, peptide and transitions and the validation of peptide detection by SRM). ATAQS provides a flexible pipeline for end-users by allowing the workflow to start or end at any point of the pipeline, and for computational biologists, by enabling the easy extension of java algorithm classes for their own algorithm plug-in or connection via an external web site. This integrated system supports all steps in a SRM-based experiment and provides a user-friendly GUI that can be run by any operating system that allows the installation of the Mozilla Firefox web browser. Conclusions Targeted proteomics via SRM is a powerful new technique that enables the reproducible and accurate identification and quantification of sets of proteins of interest. ATAQS is the first open-source software that supports all steps of the targeted proteomics workflow. ATAQS also provides software API (Application Program Interface) documentation that enables the addition of new algorithms to each of the workflow steps. The software, installation guide and sample dataset can be found in http://tools.proteomecenter.org/ATAQS/ATAQS.html PMID:21414234
Reisinger, Florian; Krishna, Ritesh; Ghali, Fawaz; Ríos, Daniel; Hermjakob, Henning; Vizcaíno, Juan Antonio; Jones, Andrew R
2012-03-01
We present a Java application programming interface (API), jmzIdentML, for the Human Proteome Organisation (HUPO) Proteomics Standards Initiative (PSI) mzIdentML standard for peptide and protein identification data. The API combines the power of Java Architecture of XML Binding (JAXB) and an XPath-based random-access indexer to allow a fast and efficient mapping of extensible markup language (XML) elements to Java objects. The internal references in the mzIdentML files are resolved in an on-demand manner, where the whole file is accessed as a random-access swap file, and only the relevant piece of XMLis selected for mapping to its corresponding Java object. The APIis highly efficient in its memory usage and can handle files of arbitrary sizes. The APIfollows the official release of the mzIdentML (version 1.1) specifications and is available in the public domain under a permissive licence at http://www.code.google.com/p/jmzidentml/. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
50 CFR 660.313 - Open access fishery-recordkeeping and reporting.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 11 2011-10-01 2011-10-01 false Open access fishery-recordkeeping and... West Coast Groundfish-Open Access Fisheries § 660.313 Open access fishery—recordkeeping and reporting... to open access fisheries. (b) Declaration reports for vessels using nontrawl gear. Declaration...
50 CFR 660.313 - Open access fishery-recordkeeping and reporting.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery-recordkeeping and... West Coast Groundfish-Open Access Fisheries § 660.313 Open access fishery—recordkeeping and reporting... to open access fisheries. (b) Declaration reports for vessels using nontrawl gear. Declaration...
50 CFR 660.313 - Open access fishery-recordkeeping and reporting.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 13 2014-10-01 2014-10-01 false Open access fishery-recordkeeping and... West Coast Groundfish-Open Access Fisheries § 660.313 Open access fishery—recordkeeping and reporting... to open access fisheries. (b) Declaration reports for vessels using nontrawl gear. Declaration...
50 CFR 660.313 - Open access fishery-recordkeeping and reporting.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 13 2012-10-01 2012-10-01 false Open access fishery-recordkeeping and... West Coast Groundfish-Open Access Fisheries § 660.313 Open access fishery—recordkeeping and reporting... to open access fisheries. (b) Declaration reports for vessels using nontrawl gear. Declaration...
50 CFR 660.313 - Open access fishery-recordkeeping and reporting.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery-recordkeeping and... West Coast Groundfish-Open Access Fisheries § 660.313 Open access fishery—recordkeeping and reporting... to open access fisheries. (b) Declaration reports for vessels using nontrawl gear. Declaration...
50 CFR 660.330 - Open access fishery-management measures.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 13 2014-10-01 2014-10-01 false Open access fishery-management measures... West Coast Groundfish-Open Access Fisheries § 660.330 Open access fishery—management measures. (a) General. Groundfish species taken in open access fisheries will be managed with cumulative trip limits...
50 CFR 660.330 - Open access fishery-management measures.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery-management measures... West Coast Groundfish-Open Access Fisheries § 660.330 Open access fishery—management measures. (a) General. Groundfish species taken in open access fisheries will be managed with cumulative trip limits...
50 CFR 660.330 - Open access fishery-management measures.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery-management measures... West Coast Groundfish-Open Access Fisheries § 660.330 Open access fishery—management measures. (a) General. Groundfish species taken in open access fisheries will be managed with cumulative trip limits...
50 CFR 660.320 - Open access fishery-crossover provisions.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery-crossover provisions... West Coast Groundfish-Open Access Fisheries § 660.320 Open access fishery—crossover provisions. (a) Operating in both limited entry and open access fisheries. See provisions at § 660.60, subpart C. (b...
50 CFR 660.320 - Open access fishery-crossover provisions.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 11 2011-10-01 2011-10-01 false Open access fishery-crossover provisions... West Coast Groundfish-Open Access Fisheries § 660.320 Open access fishery—crossover provisions. (a) Operating in both limited entry and open access fisheries. See provisions at § 660.60, subpart C. (b...
50 CFR 660.312 - Open access fishery-prohibitions.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 13 2012-10-01 2012-10-01 false Open access fishery-prohibitions. 660.312... Groundfish-Open Access Fisheries § 660.312 Open access fishery—prohibitions. General groundfish prohibitions..., possess, or land groundfish in excess of the landing limit for the open access fishery without having a...
50 CFR 660.330 - Open access fishery-management measures.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 13 2012-10-01 2012-10-01 false Open access fishery-management measures... West Coast Groundfish-Open Access Fisheries § 660.330 Open access fishery—management measures. (a) General. Groundfish species taken in open access fisheries will be managed with cumulative trip limits...
50 CFR 660.312 - Open access fishery-prohibitions.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 13 2014-10-01 2014-10-01 false Open access fishery-prohibitions. 660.312... Groundfish-Open Access Fisheries § 660.312 Open access fishery—prohibitions. General groundfish prohibitions..., possess, or land groundfish in excess of the landing limit for the open access fishery without having a...
50 CFR 660.312 - Open access fishery-prohibitions.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery-prohibitions. 660.312... Groundfish-Open Access Fisheries § 660.312 Open access fishery—prohibitions. General groundfish prohibitions..., possess, or land groundfish in excess of the landing limit for the open access fishery without having a...
50 CFR 660.312 - Open access fishery-prohibitions.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery-prohibitions. 660.312... Groundfish-Open Access Fisheries § 660.312 Open access fishery—prohibitions. General groundfish prohibitions..., possess, or land groundfish in excess of the landing limit for the open access fishery without having a...
50 CFR 660.312 - Open access fishery-prohibitions.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 11 2011-10-01 2011-10-01 false Open access fishery-prohibitions. 660.312... Groundfish-Open Access Fisheries § 660.312 Open access fishery—prohibitions. General groundfish prohibitions..., possess, or land groundfish in excess of the landing limit for the open access fishery without having a...
50 CFR 660.330 - Open access fishery-management measures.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 11 2011-10-01 2011-10-01 false Open access fishery-management measures... West Coast Groundfish-Open Access Fisheries § 660.330 Open access fishery—management measures. (a) General. Groundfish species taken in open access fisheries will be managed with cumulative trip limits...
Medical education and information literacy in the era of open access.
Brower, Stewart M
2010-01-01
The Open Access movement in scholarly communications poses new issues and concerns for medical education in general and information literacy education specifically. For medical educators, Open Access can affect the availability of new information, instructional materials, and scholarship in medical education. For students, Open Access materials continue to be available to them post-graduation, regardless of affiliation. Libraries and information literacy librarians are challenged in their responses to the Open Access publishing movement in how best to support Open Access endeavors within their own institutions, and how best to educate their user base about Open Access in general.
Chakraborty, Sandeep; Phu, My; de Morais, Tâmara Prado; Nascimento, Rafael; Goulart, Luiz Ricardo; Rao, Basuthkar J; Asgeirsson, Bjarni; Dandekar, Abhaya M
2014-01-01
The therapeutic potential of α-helical anti-microbial peptides (AH-AMP) to combat pathogens is fast gaining prominence. Based on recently published open access software for characterizing α-helical peptides (PAGAL), we elucidate a search methodology (SCALPEL) that leverages the massive structural data pre-existing in the PDB database to obtain AH-AMPs belonging to the host proteome. We provide in vitro validation of SCALPEL on plant pathogens ( Xylella fastidiosa, Xanthomonas arboricola and Liberibacter crescens) by identifying AH-AMPs that mirror the function and properties of cecropin B, a well-studied AH-AMP. The identified peptides include a linear AH-AMP present within the existing structure of phosphoenolpyruvate carboxylase (PPC20), and an AH-AMP mimicing the properties of the two α-helices of cecropin B from chitinase (CHITI25). The minimum inhibitory concentration of these peptides are comparable to that of cecropin B, while anionic peptides used as control failed to show any inhibitory effect on these pathogens. Substitute therapies in place of conventional chemotherapies using membrane permeabilizing peptides like these might also prove effective to target cancer cells. The use of native structures from the same organism could possibly ensure that administration of such peptides will be better tolerated and not elicit an adverse immune response. We suggest a similar approach to target Ebola epitopes, enumerated using PAGAL recently, by selecting suitable peptides from the human proteome, especially in wake of recent reports of cationic amphiphiles inhibiting virus entry and infection.
Metabolomics and Proteomics of Brassica napus Guard Cells in Response to Low CO2
Geng, Sisi; Yu, Bing; Zhu, Ning; Dufresne, Craig; Chen, Sixue
2017-01-01
Stomatal guard cell response to various stimuli is an important process that balances plant carbon dioxide (CO2) uptake and water transpiration. Elevated CO2 induces stomatal closure, while low CO2 promotes stomatal opening. The signaling process of elevated CO2 induced stomatal closure has been extensively studied in recent years. However, the mechanism of low CO2 induced stomatal opening is not fully understood. Here we report metabolomic and proteomic responses of Brassica napus guard cells to low CO2 using hyphenated mass spectrometry technologies. A total of 411 metabolites and 1397 proteins were quantified in a time-course study of low CO2 effects. Metabolites and proteins that exhibited significant changes are overrepresented in fatty acid metabolism, starch and sucrose metabolism, glycolysis and redox regulation. Concomitantly, multiple hormones that promote stomatal opening increased in response to low CO2. Interestingly, jasmonic acid precursors were diverted to a branch pathway of traumatic acid biosynthesis. These results indicate that the low CO2 response is mediated by a complex crosstalk between different phytohormones. PMID:28791296
Metabolomics and Proteomics of Brassica napus Guard Cells in Response to Low CO2.
Geng, Sisi; Yu, Bing; Zhu, Ning; Dufresne, Craig; Chen, Sixue
2017-01-01
Stomatal guard cell response to various stimuli is an important process that balances plant carbon dioxide (CO 2 ) uptake and water transpiration. Elevated CO 2 induces stomatal closure, while low CO 2 promotes stomatal opening. The signaling process of elevated CO 2 induced stomatal closure has been extensively studied in recent years. However, the mechanism of low CO 2 induced stomatal opening is not fully understood. Here we report metabolomic and proteomic responses of Brassica napus guard cells to low CO 2 using hyphenated mass spectrometry technologies. A total of 411 metabolites and 1397 proteins were quantified in a time-course study of low CO 2 effects. Metabolites and proteins that exhibited significant changes are overrepresented in fatty acid metabolism, starch and sucrose metabolism, glycolysis and redox regulation. Concomitantly, multiple hormones that promote stomatal opening increased in response to low CO 2 . Interestingly, jasmonic acid precursors were diverted to a branch pathway of traumatic acid biosynthesis. These results indicate that the low CO 2 response is mediated by a complex crosstalk between different phytohormones.
Kremer, Lukas P M; Leufken, Johannes; Oyunchimeg, Purevdulam; Schulze, Stefan; Fufezan, Christian
2016-03-04
Proteomics data integration has become a broad field with a variety of programs offering innovative algorithms to analyze increasing amounts of data. Unfortunately, this software diversity leads to many problems as soon as the data is analyzed using more than one algorithm for the same task. Although it was shown that the combination of multiple peptide identification algorithms yields more robust results, it is only recently that unified approaches are emerging; however, workflows that, for example, aim to optimize search parameters or that employ cascaded style searches can only be made accessible if data analysis becomes not only unified but also and most importantly scriptable. Here we introduce Ursgal, a Python interface to many commonly used bottom-up proteomics tools and to additional auxiliary programs. Complex workflows can thus be composed using the Python scripting language using a few lines of code. Ursgal is easily extensible, and we have made several database search engines (X!Tandem, OMSSA, MS-GF+, Myrimatch, MS Amanda), statistical postprocessing algorithms (qvality, Percolator), and one algorithm that combines statistically postprocessed outputs from multiple search engines ("combined FDR") accessible as an interface in Python. Furthermore, we have implemented a new algorithm ("combined PEP") that combines multiple search engines employing elements of "combined FDR", PeptideShaker, and Bayes' theorem.
Publishing in open access era: focus on respiratory journals
Xu, Dingyao; Zhong, Xiyao; Li, Li; Ling, Qibo; Bu, Zhaode
2014-01-01
We have entered an open access publishing era. The impact and significance of open access is still under debate after two decades of evolution. Open access journals benefit researchers and the general public by promoting visibility, sharing and communicating. Non-mainstream journals should turn the challenge of open access into opportunity of presenting best research articles to the global readership. Open access journals need to optimize their business models to promote the healthy and continuous development. PMID:24822120
Publishing in open access era: focus on respiratory journals.
Dai, Ni; Xu, Dingyao; Zhong, Xiyao; Li, Li; Ling, Qibo; Bu, Zhaode
2014-05-01
We have entered an open access publishing era. The impact and significance of open access is still under debate after two decades of evolution. Open access journals benefit researchers and the general public by promoting visibility, sharing and communicating. Non-mainstream journals should turn the challenge of open access into opportunity of presenting best research articles to the global readership. Open access journals need to optimize their business models to promote the healthy and continuous development.
Publishing in Open Access Education Journals: The Authors' Perspectives
ERIC Educational Resources Information Center
Coonin, Bryna; Younce, Leigh M.
2010-01-01
Open access publishing is now an accepted method of scholarly communication. However, the greatest traction for open access publishing thus far has been in the sciences. Penetration of open access publishing has been much slower among the social sciences. This study surveys 309 authors from recent issues of open access journals in education to…
50 CFR 660.311 - Open access fishery-definitions.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 13 2012-10-01 2012-10-01 false Open access fishery-definitions. 660.311... Groundfish-Open Access Fisheries § 660.311 Open access fishery—definitions. General definitions for the... specific to the open access fishery covered in this subpart and are in addition to those specified at § 660...
50 CFR 660.311 - Open access fishery-definitions.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 11 2011-10-01 2011-10-01 false Open access fishery-definitions. 660.311... Groundfish-Open Access Fisheries § 660.311 Open access fishery—definitions. General definitions for the... specific to the open access fishery covered in this subpart and are in addition to those specified at § 660...
50 CFR 660.320 - Open access fishery-crossover provisions.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 13 2014-10-01 2014-10-01 false Open access fishery-crossover provisions... West Coast Groundfish-Open Access Fisheries § 660.320 Open access fishery—crossover provisions. The crossover provisions listed at § 660.60(h)(7), apply to vessels fishing in the open access fishery. [76 FR...
50 CFR 660.320 - Open access fishery-crossover provisions.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery-crossover provisions... West Coast Groundfish-Open Access Fisheries § 660.320 Open access fishery—crossover provisions. The crossover provisions listed at § 660.60(h)(7), apply to vessels fishing in the open access fishery. [76 FR...
50 CFR 660.319 - Open access fishery gear identification and marking.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery gear identification... COAST STATES West Coast Groundfish-Open Access Fisheries § 660.319 Open access fishery gear identification and marking. (a) Gear identification. (1) Open access fixed gear (longline, trap or pot, set net...
50 CFR 660.311 - Open access fishery-definitions.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery-definitions. 660.311... Groundfish-Open Access Fisheries § 660.311 Open access fishery—definitions. General definitions for the... specific to the open access fishery covered in this subpart and are in addition to those specified at § 660...
50 CFR 660.320 - Open access fishery-crossover provisions.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 13 2012-10-01 2012-10-01 false Open access fishery-crossover provisions... West Coast Groundfish-Open Access Fisheries § 660.320 Open access fishery—crossover provisions. The crossover provisions listed at § 660.60(h)(7), apply to vessels fishing in the open access fishery. [76 FR...
50 CFR 660.311 - Open access fishery-definitions.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery-definitions. 660.311... Groundfish-Open Access Fisheries § 660.311 Open access fishery—definitions. General definitions for the... specific to the open access fishery covered in this subpart and are in addition to those specified at § 660...
50 CFR 660.311 - Open access fishery-definitions.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 13 2014-10-01 2014-10-01 false Open access fishery-definitions. 660.311... Groundfish-Open Access Fisheries § 660.311 Open access fishery—definitions. General definitions for the... specific to the open access fishery covered in this subpart and are in addition to those specified at § 660...
Education Scholars' Perceptions and Practices toward Open Access Publishing
ERIC Educational Resources Information Center
Ellingford, Lori Michelle
2012-01-01
Although open access publishing has been available since 1998, we know little regarding scholars' perceptions and practices toward publishing in open access outlets, especially in the social science community. Open access publishing has been slow to penetrate the field of education, yet the potential impact of open access could make this…
pyQms enables universal and accurate quantification of mass spectrometry data.
Leufken, Johannes; Niehues, Anna; Sarin, L Peter; Wessel, Florian; Hippler, Michael; Leidel, Sebastian A; Fufezan, Christian
2017-10-01
Quantitative mass spectrometry (MS) is a key technique in many research areas (1), including proteomics, metabolomics, glycomics, and lipidomics. Because all of the corresponding molecules can be described by chemical formulas, universal quantification tools are highly desirable. Here, we present pyQms, an open-source software for accurate quantification of all types of molecules measurable by MS. pyQms uses isotope pattern matching that offers an accurate quality assessment of all quantifications and the ability to directly incorporate mass spectrometer accuracy. pyQms is, due to its universal design, applicable to every research field, labeling strategy, and acquisition technique. This opens ultimate flexibility for researchers to design experiments employing innovative and hitherto unexplored labeling strategies. Importantly, pyQms performs very well to accurately quantify partially labeled proteomes in large scale and high throughput, the most challenging task for a quantification algorithm. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Quantitative proteomics in Giardia duodenalis-Achievements and challenges.
Emery, Samantha J; Lacey, Ernest; Haynes, Paul A
2016-08-01
Giardia duodenalis (syn. G. lamblia and G. intestinalis) is a protozoan parasite of vertebrates and a major contributor to the global burden of diarrheal diseases and gastroenteritis. The publication of multiple genome sequences in the G. duodenalis species complex has provided important insights into parasite biology, and made post-genomic technologies, including proteomics, significantly more accessible. The aims of proteomics are to identify and quantify proteins present in a cell, and assign functions to them within the context of dynamic biological systems. In Giardia, proteomics in the post-genomic era has transitioned from reliance on gel-based systems to utilisation of a diverse array of techniques based on bottom-up LC-MS/MS technologies. Together, these have generated crucial foundations for subcellular proteomes, elucidated intra- and inter-assemblage isolate variation, and identified pathways and markers in differentiation, host-parasite interactions and drug resistance. However, in Giardia, proteomics remains an emerging field, with considerable shortcomings evident from the published research. These include a bias towards assemblage A, a lack of emphasis on quantitative analytical techniques, and limited information on post-translational protein modifications. Additionally, there are multiple areas of research for which proteomic data is not available to add value to published transcriptomic data. The challenge of amalgamating data in the systems biology paradigm necessitates the further generation of large, high-quality quantitative datasets to accurately model parasite biology. This review surveys the current proteomic research available for Giardia and evaluates their technical and quantitative approaches, while contextualising their biological insights into parasite pathology, isolate variation and eukaryotic evolution. Finally, we propose areas of priority for the generation of future proteomic data to explore fundamental questions in Giardia, including the analysis of post-translational modifications, and the design of MS-based assays for validation of differentially expressed proteins in large datasets. Copyright © 2016 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Krishnamurthy, M.
2008-01-01
Purpose: The purpose of this paper is to describe the open access and open source movement in the digital library world. Design/methodology/approach: A review of key developments in the open access and open source movement is provided. Findings: Open source software and open access to research findings are of great use to scholars in developing…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weckwerth, Wolfram; Baginsky, Sacha; Van Wijk, Klass
2009-12-01
In the past 10 years, we have witnessed remarkable advances in the field of plant molecular biology. The rapid development of proteomic technologies and the speed with which these techniques have been applied to the field have altered our perception of how we can analyze proteins in complex systems. At nearly the same time, the availability of the complete genome for the model plant Arabidopsis thaliana was released; this effort provides an unsurpassed resource for the identification of proteins when researchers use MS to analyze plant samples. Recognizing the growth in this area, the Multinational Arabidopsis Steering Committee (MASC) establishedmore » a subcommittee for A. thaliana proteomics in 2006 with the objective of consolidating databases, technique standards, and experimentally validated candidate genes and functions. Since the establishment of the Multinational Arabidopsis Steering Subcommittee for Proteomics (MASCP), many new approaches and resources have become available. Recently, the subcommittee established a webpage to consolidate this information (www.masc-proteomics.org). It includes links to plant proteomic databases, general information about proteomic techniques, meeting information, a summary of proteomic standards, and other relevant resources. Altogether, this website provides a useful resource for the Arabidopsis proteomics community. In the future, the website will host discussions and investigate the cross-linking of databases. The subcommittee members have extensive experience in arabidopsis proteomics and collectively have produced some of the most extensive proteomics data sets for this model plant (Table S1 in the Supporting Information has a list of resources). The largest collection of proteomics data from a single study in A. thaliana was assembled into an accessible database (AtProteome; http://fgcz-atproteome.unizh.ch/index.php) and was recently published by the Baginsky lab.1 The database provides links to major Arabidopsis online resources, and raw data have been deposited in PRIDE and PRIDE BioMart. Included in this database is an Arabidopsis proteome map that provides evidence for the expression of {approx}50% of all predicted gene models, including several alternative gene models that are not represented in The Arabidopsis Information Resource (TAIR) protein database. A set of organ-specific biomarkers is provided, as well as organ-specific proteotypic peptides for 4105 proteins that can be used to facilitate targeted quantitative proteomic surveys. In the future, the AtProteome database will be linked to additional existing resources developed by MASCP members, such as PPDB, ProMEX, and SUBA. The most comprehensive study on the Arabidopsis chloroplast proteome, which includes information on chloroplast sorting signals, posttranslational modifications (PTMs), and protein abundances (analyzed by high-accuracy MS [Orbitrap]), was recently published by the van Wijk lab.2 These and previous data are available via the plant proteome database (PPDB; http://ppdb.tc.cornell.edu) for A. thaliana and maize. PPDB provides genome-wide experimental and functional characterization of the A. thaliana and maize proteomes, including PTMs and subcellular localization information, with an emphasis on leaf and plastid proteins. Maize and Arabidopsis proteome entries are directly linked via internal BLAST alignments within PPDB. Direct links for each protein to TAIR, SUBA, ProMEX, and other resources are also provided.« less
Hegedűs, Tamás; Chaubey, Pururawa Mayank; Várady, György; Szabó, Edit; Sarankó, Hajnalka; Hofstetter, Lia; Roschitzki, Bernd; Sarkadi, Balázs
2015-01-01
Based on recent results, the determination of the easily accessible red blood cell (RBC) membrane proteins may provide new diagnostic possibilities for assessing mutations, polymorphisms or regulatory alterations in diseases. However, the analysis of the current mass spectrometry-based proteomics datasets and other major databases indicates inconsistencies—the results show large scattering and only a limited overlap for the identified RBC membrane proteins. Here, we applied membrane-specific proteomics studies in human RBC, compared these results with the data in the literature, and generated a comprehensive and expandable database using all available data sources. The integrated web database now refers to proteomic, genetic and medical databases as well, and contains an unexpected large number of validated membrane proteins previously thought to be specific for other tissues and/or related to major human diseases. Since the determination of protein expression in RBC provides a method to indicate pathological alterations, our database should facilitate the development of RBC membrane biomarker platforms and provide a unique resource to aid related further research and diagnostics. Database URL: http://rbcc.hegelab.org PMID:26078478
The jmzQuantML programming interface and validator for the mzQuantML data standard.
Qi, Da; Krishna, Ritesh; Jones, Andrew R
2014-03-01
The mzQuantML standard from the HUPO Proteomics Standards Initiative has recently been released, capturing quantitative data about peptides and proteins, following analysis of MS data. We present a Java application programming interface (API) for mzQuantML called jmzQuantML. The API provides robust bridges between Java classes and elements in mzQuantML files and allows random access to any part of the file. The API provides read and write capabilities, and is designed to be embedded in other software packages, enabling mzQuantML support to be added to proteomics software tools (http://code.google.com/p/jmzquantml/). The mzQuantML standard is designed around a multilevel validation system to ensure that files are structurally and semantically correct for different proteomics quantitative techniques. In this article, we also describe a Java software tool (http://code.google.com/p/mzquantml-validator/) for validating mzQuantML files, which is a formal part of the data standard. © 2014 The Authors. Proteomics published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
pyGeno: A Python package for precision medicine and proteogenomics.
Daouda, Tariq; Perreault, Claude; Lemieux, Sébastien
2016-01-01
pyGeno is a Python package mainly intended for precision medicine applications that revolve around genomics and proteomics. It integrates reference sequences and annotations from Ensembl, genomic polymorphisms from the dbSNP database and data from next-gen sequencing into an easy to use, memory-efficient and fast framework, therefore allowing the user to easily explore subject-specific genomes and proteomes. Compared to a standalone program, pyGeno gives the user access to the complete expressivity of Python, a general programming language. Its range of application therefore encompasses both short scripts and large scale genome-wide studies.
pyGeno: A Python package for precision medicine and proteogenomics
Daouda, Tariq; Perreault, Claude; Lemieux, Sébastien
2016-01-01
pyGeno is a Python package mainly intended for precision medicine applications that revolve around genomics and proteomics. It integrates reference sequences and annotations from Ensembl, genomic polymorphisms from the dbSNP database and data from next-gen sequencing into an easy to use, memory-efficient and fast framework, therefore allowing the user to easily explore subject-specific genomes and proteomes. Compared to a standalone program, pyGeno gives the user access to the complete expressivity of Python, a general programming language. Its range of application therefore encompasses both short scripts and large scale genome-wide studies. PMID:27785359
David, Matthieu; Fertin, Guillaume; Rogniaux, Hélène; Tessier, Dominique
2017-08-04
The analysis of discovery proteomics experiments relies on algorithms that identify peptides from their tandem mass spectra. The almost exhaustive interpretation of these spectra remains an unresolved issue. At present, an important number of missing interpretations is probably due to peptides displaying post-translational modifications and variants that yield spectra that are particularly difficult to interpret. However, the emergence of a new generation of mass spectrometers that provide high fragment ion accuracy has paved the way for more efficient algorithms. We present a new software, SpecOMS, that can handle the computational complexity of pairwise comparisons of spectra in the context of large volumes. SpecOMS can compare a whole set of experimental spectra generated by a discovery proteomics experiment to a whole set of theoretical spectra deduced from a protein database in a few minutes on a standard workstation. SpecOMS can ingeniously exploit those capabilities to improve the peptide identification process, allowing strong competition between all possible peptides for spectrum interpretation. Remarkably, this software resolves the drawbacks (i.e., efficiency problems and decreased sensitivity) that usually accompany open modification searches. We highlight this promising approach using results obtained from the analysis of a public human data set downloaded from the PRIDE (PRoteomics IDEntification) database.
Proteomic Analysis of the Cell Cycle of Procylic Form Trypanosoma brucei.
Crozier, Thomas W M; Tinti, Michele; Wheeler, Richard J; Ly, Tony; Ferguson, Michael A J; Lamond, Angus I
2018-06-01
We describe a single-step centrifugal elutriation method to produce synchronous Gap1 (G1)-phase procyclic trypanosomes at a scale amenable for proteomic analysis of the cell cycle. Using ten-plex tandem mass tag (TMT) labeling and mass spectrometry (MS)-based proteomics technology, the expression levels of 5325 proteins were quantified across the cell cycle in this parasite. Of these, 384 proteins were classified as cell-cycle regulated and subdivided into nine clusters with distinct temporal regulation. These groups included many known cell cycle regulators in trypanosomes, which validates the approach. In addition, we identify 40 novel cell cycle regulated proteins that are essential for trypanosome survival and thus represent potential future drug targets for the prevention of trypanosomiasis. Through cross-comparison to the TrypTag endogenous tagging microscopy database, we were able to validate the cell-cycle regulated patterns of expression for many of the proteins of unknown function detected in our proteomic analysis. A convenient interface to access and interrogate these data is also presented, providing a useful resource for the scientific community. Data are available via ProteomeXchange with identifier PXD008741 (https://www.ebi.ac.uk/pride/archive/). © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Kitata, Reta Birhanu; Dimayacyac-Esleta, Baby Rorielyn T; Choong, Wai-Kok; Tsai, Chia-Feng; Lin, Tai-Du; Tsou, Chih-Chiang; Weng, Shao-Hsing; Chen, Yi-Ju; Yang, Pan-Chyr; Arco, Susan D; Nesvizhskii, Alexey I; Sung, Ting-Yi; Chen, Yu-Ju
2015-09-04
Despite significant efforts in the past decade toward complete mapping of the human proteome, 3564 proteins (neXtProt, 09-2014) are still "missing proteins". Over one-third of these missing proteins are annotated as membrane proteins, owing to their relatively challenging accessibility with standard shotgun proteomics. Using nonsmall cell lung cancer (NSCLC) as a model study, we aim to mine missing proteins from disease-associated membrane proteome, which may be still largely under-represented. To increase identification coverage, we employed Hp-RP StageTip prefractionation of membrane-enriched samples from 11 NSCLC cell lines. Analysis of membrane samples from 20 pairs of tumor and adjacent normal lung tissue was incorporated to include physiologically expressed membrane proteins. Using multiple search engines (X!Tandem, Comet, and Mascot) and stringent evaluation of FDR (MAYU and PeptideShaker), we identified 7702 proteins (66% membrane proteins) and 178 missing proteins (74 membrane proteins) with PSM-, peptide-, and protein-level FDR of 1%. Through multiple reaction monitoring using synthetic peptides, we provided additional evidence of eight missing proteins including seven with transmembrane helix domains. This study demonstrates that mining missing proteins focused on cancer membrane subproteome can greatly contribute to map the whole human proteome. All data were deposited into ProteomeXchange with the identifier PXD002224.
Sanchez-Niño, Maria Dolores; Sanz, Ana B; Ramos, Adrian M; Fernandez-Fernandez, Beatriz; Ortiz, Alberto
2017-04-01
Exponential technologies double in power or processing speed every year, whereas their cost halves. Deception and disruption are two key stages in the development of exponential technologies. Deception occurs when, after initial introduction, technologies are dismissed as irrelevant, while they continue to progress, perhaps not as fast or with so many immediate practical applications as initially thought. Twenty years after the first publications, clinical proteomics is still not available in most hospitals and some clinicians have felt deception at unfulfilled promises. However, there are indications that clinical proteomics may be entering the disruptive phase, where, once refined, technologies disrupt established industries or procedures. In this regard, recent manuscripts in CKJ illustrate how proteomics is entering the clinical realm, with applications ranging from the identification of amyloid proteins in the pathology lab, to a new generation of urinary biomarkers for chronic kidney disease (CKD) assessment and outcome prediction. Indeed, one such panel of urinary peptidomics biomarkers, CKD273, recently received a Food and Drug Administration letter of support, the first ever in the CKD field. In addition, a must-read resource providing information on kidney disease-related proteomics and systems biology databases and how to access and use them in clinical decision-making was also recently published in CKJ .
The “Dark Side” of Food Stuff Proteomics: The CPLL-Marshals Investigate
Righetti, Pier Giorgio; Fasoli, Elisa; D’Amato, Alfonsina; Boschetti, Egisto
2014-01-01
The present review deals with analysis of the proteome of animal and plant-derived food stuff, as well as of non-alcoholic and alcoholic beverages. The survey is limited to those systems investigated with the help of combinatorial peptide ligand libraries, a most powerful technique allowing access to low- to very-low-abundance proteins, i.e., to those proteins that might characterize univocally a given biological system and, in the case of commercial food preparations, attest their genuineness or adulteration. Among animal foods the analysis of cow’s and donkey’s milk is reported, together with the proteomic composition of egg white and yolk, as well as of honey, considered as a hybrid between floral and animal origin. In terms of plant and fruits, a survey is offered of spinach, artichoke, banana, avocado, mango and lemon proteomics, considered as recalcitrant tissues in that small amounts of proteins are dispersed into a large body of plant polymers and metabolites. As examples of non-alcoholic beverages, ginger ale, coconut milk, a cola drink, almond milk and orgeat syrup are analyzed. Finally, the trace proteome of white and red wines, beer and aperitifs is reported, with the aim of tracing the industrial manipulations and herbal usage prior to their commercialization. PMID:28234315
Open access for operational research publications from low- and middle-income countries: who pays?
Kumar, A. M. V.; Reid, A. J.; Van den Bergh, R.; Isaakidis, P.; Draguez, B.; Delaunois, P.; Nagaraja, S. B.; Ramsay, A.; Reeder, J. C.; Denisiuk, O.; Ali, E.; Khogali, M.; Hinderaker, S. G.; Kosgei, R. J.; van Griensven, J.; Quaglio, G. L.; Maher, D.; Billo, N. E.; Terry, R. F.; Harries, A. D.
2014-01-01
Open-access journal publications aim to ensure that new knowledge is widely disseminated and made freely accessible in a timely manner so that it can be used to improve people's health, particularly those in low- and middle-income countries. In this paper, we briefly explain the differences between closed- and open-access journals, including the evolving idea of the ‘open-access spectrum’. We highlight the potential benefits of supporting open access for operational research, and discuss the conundrum and ways forward as regards who pays for open access. PMID:26400799
The continued movement for open access to peer-reviewed literature.
Liesegang, Thomas J
2013-09-01
To provide a current overview of the movement for open access to the peer review literature. Perspective. Literature review of recent advances in the open access movement with a personal viewpoint of the nuances of the movement. The open access movement is complex, with many different constituents. The idealists for the open access movement are seeking open access to the literature but also to the data that constitute the research within the manuscript. The business model of the traditional subscription journal is being scrutinized in relation to the surge in the number of open access journals. Within this environment authors should beware predatory practices. More government and funding agencies are mandating open access to their funded research. This open access movement will continue to be disruptive until a business model ensures continuity of the scientific record. A flood of open access articles that might enrich, but also might pollute or confuse, the medical literature has altered the filtering mechanism provided by the traditional peer review system. At some point there may be a shake-out, with some literature being lost in cyberspace. The open access movement is maturing and must be embraced in some format. The challenge is to establish a sustainable financial business model that will permit the use of digital technology but yet not endanger the decades-old traditional publication model and peer review system. Authors seem to be slower in adopting open access than the idealists in the movement. Copyright © 2013 Elsevier Inc. All rights reserved.
Open Access Journal Policies: A Systematic Analysis of Radiology Journals.
Narayan, Anand; Lobner, Katie; Fritz, Jan
2018-02-01
The open access movement has pushed for greater access to scientific knowledge by expanding access to scientific journal articles. There is limited information about the extent to which open access policies have been adopted by radiology journals. We performed a systematic analysis to ascertain the proportion of radiology journals with open access options. A search was performed with the assistance of a clinical informationist. Full and mixed English-language diagnostic and interventional radiology Web of Science journals (impact factors > 1.0) were included. Nuclear medicine, radiation oncology, physics, and solicitation-only journals were excluded. Primary outcome was open access option (yes or no) with additional outcomes including presence or absence of embargo, complete or partial copyright transfer, publication fees, and self-archiving policies. Secondary outcomes included journal citations, journal impact factors, immediacy, Eigenfactor, and article influence scores. Independent double readings were performed with differences resolved by consensus, supplemented by contacting editorial staff at each journal. In all, 125 journals were identified; review yielded 49 journals (39%, mean impact factor of 2.61). Thirty-six of the journals had open access options (73.4%), and four journals were exclusively open access (8.2%). Twelve-month embargoes were most commonly cited (90.6%) with 28.6% of journals stating that they did not require a complete transfer of copyright. Prices for open access options ranged from $750 to $4,000 (median $3,000). No statistically significant differences were found in journal impact measures comparing journals with open access options to journals without open access options. Diagnostic and interventional radiology journals have widely adopted open access options with a few radiology journals being exclusively open access. Copyright © 2017 American College of Radiology. Published by Elsevier Inc. All rights reserved.
Salisbury, Joseph P; Sîrbulescu, Ruxandra F; Moran, Benjamin M; Auclair, Jared R; Zupanc, Günther K H; Agar, Jeffrey N
2015-03-11
The brown ghost knifefish (Apteronotus leptorhynchus) is a weakly electric teleost fish of particular interest as a versatile model system for a variety of research areas in neuroscience and biology. The comprehensive information available on the neurophysiology and neuroanatomy of this organism has enabled significant advances in such areas as the study of the neural basis of behavior, the development of adult-born neurons in the central nervous system and their involvement in the regeneration of nervous tissue, as well as brain aging and senescence. Despite substantial scientific interest in this species, no genomic resources are currently available. Here, we report the de novo assembly and annotation of the A. leptorhynchus transcriptome. After evaluating several trimming and transcript reconstruction strategies, de novo assembly using Trinity uncovered 42,459 unique contigs containing at least a partial protein-coding sequence based on alignment to a reference set of known Actinopterygii sequences. As many as 11,847 of these contigs contained full or near-full length protein sequences, providing broad coverage of the proteome. A variety of non-coding RNA sequences were also identified and annotated, including conserved long intergenic non-coding RNA and other long non-coding RNA observed previously to be expressed in adult zebrafish (Danio rerio) brain, as well as a variety of miRNA, snRNA, and snoRNA. Shotgun proteomics confirmed translation of open reading frames from over 2,000 transcripts, including alternative splice variants. Assignment of tandem mass spectra was greatly improved by use of the assembly compared to databases of sequences from closely related organisms. The assembly and raw reads have been deposited at DDBJ/EMBL/GenBank under the accession number GBKR00000000. Tandem mass spectrometry data is available via ProteomeXchange with identifier PXD001285. Presented here is the first release of an annotated de novo transcriptome assembly from Apteronotus leptorhynchus, providing a broad overview of RNA expressed in central nervous system tissue. The assembly, which includes substantial coverage of a wide variety of both protein coding and non-coding transcripts, will allow the development of better tools to understand the mechanisms underlying unique characteristics of the knifefish model system, such as their tremendous regenerative capacity and negligible brain senescence.
Quantitative proteome analysis using isobaric peptide termini labeling (IPTL).
Arntzen, Magnus O; Koehler, Christian J; Treumann, Achim; Thiede, Bernd
2011-01-01
The quantitative comparison of proteome level changes across biological samples has become an essential feature in proteomics that remains challenging. We have recently introduced isobaric peptide termini labeling (IPTL), a novel strategy for isobaric quantification based on the derivatization of peptide termini with complementary isotopically labeled reagents. Unlike non-isobaric quantification methods, sample complexity at the MS level is not increased, providing improved sensitivity and protein coverage. The distinguishing feature of IPTL when comparing it to more established isobaric labeling methods (iTRAQ and TMT) is the presence of quantification signatures in all sequence-determining ions in MS/MS spectra, not only in the low mass reporter ion region. This makes IPTL a quantification method that is accessible to mass spectrometers with limited capabilities in the low mass range. Also, the presence of several quantification points in each MS/MS spectrum increases the robustness of the quantification procedure.
CPTAC Assay Portal: a repository of targeted proteomic assays
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whiteaker, Jeffrey R.; Halusa, Goran; Hoofnagle, Andrew N.
2014-06-27
To address these issues, the Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI) has launched an Assay Portal (http://assays.cancer.gov) to serve as a public repository of well-characterized quantitative, MS-based, targeted proteomic assays. The purpose of the CPTAC Assay Portal is to facilitate widespread adoption of targeted MS assays by disseminating SOPs, reagents, and assay characterization data for highly characterized assays. A primary aim of the NCI-supported portal is to bring together clinicians or biologists and analytical chemists to answer hypothesis-driven questions using targeted, MS-based assays. Assay content is easily accessed through queries and filters, enabling investigatorsmore » to find assays to proteins relevant to their areas of interest. Detailed characterization data are available for each assay, enabling researchers to evaluate assay performance prior to launching the assay in their own laboratory.« less
Assessing the clinical utility of cancer genomic and proteomic data across tumor types.
Yuan, Yuan; Van Allen, Eliezer M; Omberg, Larsson; Wagle, Nikhil; Amin-Mansour, Ali; Sokolov, Artem; Byers, Lauren A; Xu, Yanxun; Hess, Kenneth R; Diao, Lixia; Han, Leng; Huang, Xuelin; Lawrence, Michael S; Weinstein, John N; Stuart, Josh M; Mills, Gordon B; Garraway, Levi A; Margolin, Adam A; Getz, Gad; Liang, Han
2014-07-01
Molecular profiling of tumors promises to advance the clinical management of cancer, but the benefits of integrating molecular data with traditional clinical variables have not been systematically studied. Here we retrospectively predict patient survival using diverse molecular data (somatic copy-number alteration, DNA methylation and mRNA, microRNA and protein expression) from 953 samples of four cancer types from The Cancer Genome Atlas project. We find that incorporating molecular data with clinical variables yields statistically significantly improved predictions (FDR < 0.05) for three cancers but those quantitative gains were limited (2.2-23.9%). Additional analyses revealed little predictive power across tumor types except for one case. In clinically relevant genes, we identified 10,281 somatic alterations across 12 cancer types in 2,928 of 3,277 patients (89.4%), many of which would not be revealed in single-tumor analyses. Our study provides a starting point and resources, including an open-access model evaluation platform, for building reliable prognostic and therapeutic strategies that incorporate molecular data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Xusheng; Pandey, Ashutosh K.; Mulligan, Megan K.
Phenome-wide association is a novel reverse genetic strategy to analyze genome-to-phenome relations in human clinical cohorts. Here we test this approach using a large murine population segregating for ~5 million sequence variants, and we compare our results to those extracted from a matched analysis of gene variants in a large human cohort. For the mouse cohort, we amassed a deep and broad open-access phenome consisting of ~4,500 metabolic, physiological, pharmacological and behavioural traits, and more than 90 independent expression quantitative trait locus (QTL), transcriptome, proteome, metagenome and metabolome data sets-by far the largest coherent phenome for any experimental cohort (www.genenetwork.org).more » Here, we tested downstream effects of subsets of variants and discovered several novel associations, including a missense mutation in fumarate hydratase that controls variation in the mitochondrial unfolded protein response in both mouse and Caenorhabditis elegans, and missense mutations in Col6a5 that underlies variation in bone mineral density in both mouse and human.« less
Joint mouse–human phenome-wide association to test gene function and disease risk
Wang, Xusheng; Pandey, Ashutosh K.; Mulligan, Megan K.; ...
2016-02-02
Phenome-wide association is a novel reverse genetic strategy to analyze genome-to-phenome relations in human clinical cohorts. Here we test this approach using a large murine population segregating for ~5 million sequence variants, and we compare our results to those extracted from a matched analysis of gene variants in a large human cohort. For the mouse cohort, we amassed a deep and broad open-access phenome consisting of ~4,500 metabolic, physiological, pharmacological and behavioural traits, and more than 90 independent expression quantitative trait locus (QTL), transcriptome, proteome, metagenome and metabolome data sets-by far the largest coherent phenome for any experimental cohort (www.genenetwork.org).more » Here, we tested downstream effects of subsets of variants and discovered several novel associations, including a missense mutation in fumarate hydratase that controls variation in the mitochondrial unfolded protein response in both mouse and Caenorhabditis elegans, and missense mutations in Col6a5 that underlies variation in bone mineral density in both mouse and human.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Ying; Zhao, Rui; Piehowski, Paul D.
One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-µm-i.d. columns increase signal intensity by >3-fold relative to those using 75-µm-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos mass spectrometer significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap), leading tomore » a ~3× increase in peptide identifications and 1.7× increase in identified protein groups for 2 ng tryptic digests of bacterial lysate. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by ~ 95% for 0.5 ng samples and by ~42% for 2 ng samples. The present platform is capable of identifying >3000 protein groups from tryptic digestion of cell lysates equivalent to 50 HeLa cells and 100 THP-1 cells (~10 ng total proteins), respectively, and >950 proteins from subnanogram bacterial and archaeal cell lysates. The present ultrasensitive LC-MS platform is expected to enable deep proteome coverage for subnanogram samples, including single mammalian cells.« less
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.
Afgan, Enis; Baker, Dannon; Batut, Bérénice; van den Beek, Marius; Bouvier, Dave; Cech, Martin; Chilton, John; Clements, Dave; Coraor, Nate; Grüning, Björn A; Guerler, Aysam; Hillman-Jackson, Jennifer; Hiltemann, Saskia; Jalili, Vahid; Rasche, Helena; Soranzo, Nicola; Goecks, Jeremy; Taylor, James; Nekrutenko, Anton; Blankenberg, Daniel
2018-05-22
Galaxy (homepage: https://galaxyproject.org, main public server: https://usegalaxy.org) is a web-based scientific analysis platform used by tens of thousands of scientists across the world to analyze large biomedical datasets such as those found in genomics, proteomics, metabolomics and imaging. Started in 2005, Galaxy continues to focus on three key challenges of data-driven biomedical science: making analyses accessible to all researchers, ensuring analyses are completely reproducible, and making it simple to communicate analyses so that they can be reused and extended. During the last two years, the Galaxy team and the open-source community around Galaxy have made substantial improvements to Galaxy's core framework, user interface, tools, and training materials. Framework and user interface improvements now enable Galaxy to be used for analyzing tens of thousands of datasets, and >5500 tools are now available from the Galaxy ToolShed. The Galaxy community has led an effort to create numerous high-quality tutorials focused on common types of genomic analyses. The Galaxy developer and user communities continue to grow and be integral to Galaxy's development. The number of Galaxy public servers, developers contributing to the Galaxy framework and its tools, and users of the main Galaxy server have all increased substantially.
ERIC Educational Resources Information Center
Armbruster, Chris
2008-01-01
Online, open access is the superior model for scholarly communication. A variety of scientific communities in physics, the life sciences and economics have gone furthest in innovating their scholarly communication through open access, enhancing accessibility for scientists, students and the interested public. Open access enjoys a comparative…
Bond, A R; Iacobazzi, D; Abdul-Ghani, S; Ghorbel, M T; Heesom, K J; George, S J; Caputo, M; Suleiman, M-S; Tulloh, R M
2018-03-20
Right ventricle (RV) remodelling occurs in neonatal patients born with ventricular septal defect (VSD). The presence of a defect between the two ventricles allows for shunting of blood from the left to right side. The resulting RV hypertrophy leads to molecular remodelling which has thus far been largely investigated using right atrial (RA) tissue. In this study we used proteomic and phosphoproteomic analysis in order to determine any difference between the proteomes for RA and RV. Samples were therefore taken from the RA and RV of five infants (0.34 ± 0.05 years, mean ± SEM) with VSD who were undergoing cardiac surgery to repair the defect. Significant differences in protein expression between RV and RA were seen. 150 protein accession numbers were identified which were significantly lower in the atria, whereas none were significantly higher in the atria compared to the ventricle. 19 phosphorylation sites (representing 19 phosphoproteins) were also lower in RA. This work has identified differences in the proteome between RA and RV which reflect differences in contractile activity and metabolism. As such, caution should be used when drawing conclusions based on analysis of the RA and extrapolating to the hypertrophied RV. RV hypertrophy occurs in neonatal patients born with VSD. Very little is known about how the atria responds to RV hypertrophy, especially at the protein level. Access to tissue from age-matched groups of patients is very rare, and we are in the unique position of being able to get tissue from both the atria and ventricle during reparative surgery of these infants. Our findings will be beneficial to future research into heart chamber malformations in congenital heart defects. Copyright © 2018. Published by Elsevier B.V.
Griss, Johannes; Jones, Andrew R; Sachsenberg, Timo; Walzer, Mathias; Gatto, Laurent; Hartler, Jürgen; Thallinger, Gerhard G; Salek, Reza M; Steinbeck, Christoph; Neuhauser, Nadin; Cox, Jürgen; Neumann, Steffen; Fan, Jun; Reisinger, Florian; Xu, Qing-Wei; Del Toro, Noemi; Pérez-Riverol, Yasset; Ghali, Fawaz; Bandeira, Nuno; Xenarios, Ioannis; Kohlbacher, Oliver; Vizcaíno, Juan Antonio; Hermjakob, Henning
2014-10-01
The HUPO Proteomics Standards Initiative has developed several standardized data formats to facilitate data sharing in mass spectrometry (MS)-based proteomics. These allow researchers to report their complete results in a unified way. However, at present, there is no format to describe the final qualitative and quantitative results for proteomics and metabolomics experiments in a simple tabular format. Many downstream analysis use cases are only concerned with the final results of an experiment and require an easily accessible format, compatible with tools such as Microsoft Excel or R. We developed the mzTab file format for MS-based proteomics and metabolomics results to meet this need. mzTab is intended as a lightweight supplement to the existing standard XML-based file formats (mzML, mzIdentML, mzQuantML), providing a comprehensive summary, similar in concept to the supplemental material of a scientific publication. mzTab files can contain protein, peptide, and small molecule identifications together with experimental metadata and basic quantitative information. The format is not intended to store the complete experimental evidence but provides mechanisms to report results at different levels of detail. These range from a simple summary of the final results to a representation of the results including the experimental design. This format is ideally suited to make MS-based proteomics and metabolomics results available to a wider biological community outside the field of MS. Several software tools for proteomics and metabolomics have already adapted the format as an output format. The comprehensive mzTab specification document and extensive additional documentation can be found online. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Griss, Johannes; Jones, Andrew R.; Sachsenberg, Timo; Walzer, Mathias; Gatto, Laurent; Hartler, Jürgen; Thallinger, Gerhard G.; Salek, Reza M.; Steinbeck, Christoph; Neuhauser, Nadin; Cox, Jürgen; Neumann, Steffen; Fan, Jun; Reisinger, Florian; Xu, Qing-Wei; del Toro, Noemi; Pérez-Riverol, Yasset; Ghali, Fawaz; Bandeira, Nuno; Xenarios, Ioannis; Kohlbacher, Oliver; Vizcaíno, Juan Antonio; Hermjakob, Henning
2014-01-01
The HUPO Proteomics Standards Initiative has developed several standardized data formats to facilitate data sharing in mass spectrometry (MS)-based proteomics. These allow researchers to report their complete results in a unified way. However, at present, there is no format to describe the final qualitative and quantitative results for proteomics and metabolomics experiments in a simple tabular format. Many downstream analysis use cases are only concerned with the final results of an experiment and require an easily accessible format, compatible with tools such as Microsoft Excel or R. We developed the mzTab file format for MS-based proteomics and metabolomics results to meet this need. mzTab is intended as a lightweight supplement to the existing standard XML-based file formats (mzML, mzIdentML, mzQuantML), providing a comprehensive summary, similar in concept to the supplemental material of a scientific publication. mzTab files can contain protein, peptide, and small molecule identifications together with experimental metadata and basic quantitative information. The format is not intended to store the complete experimental evidence but provides mechanisms to report results at different levels of detail. These range from a simple summary of the final results to a representation of the results including the experimental design. This format is ideally suited to make MS-based proteomics and metabolomics results available to a wider biological community outside the field of MS. Several software tools for proteomics and metabolomics have already adapted the format as an output format. The comprehensive mzTab specification document and extensive additional documentation can be found online. PMID:24980485
30 CFR 291.113 - What actions may MMS take to remedy denial of open and nondiscriminatory access?
Code of Federal Regulations, 2010 CFR
2010-07-01
... open and nondiscriminatory access? 291.113 Section 291.113 Mineral Resources MINERALS MANAGEMENT SERVICE, DEPARTMENT OF THE INTERIOR APPEALS OPEN AND NONDISCRIMINATORY ACCESS TO OIL AND GAS PIPELINES... grantee or transporter has not provided open access or nondiscriminatory access, then the decision will...
Giglia, E
2010-09-01
This contribution is aimed at presenting a sort of "state of the art" of Open Access on the occasion of the 2010 international Open Access Week, to be held from October 18 to October 24. We shall see facts and figures about open archives and the mandates to deposit; about Open Access journals; about impact and citation advantages for the researchers, and about economic sustainability.
Open access: changing global science publishing.
Gasparyan, Armen Yuri; Ayvazyan, Lilit; Kitas, George D
2013-08-01
The article reflects on open access as a strategy of changing the quality of science communication globally. Successful examples of open-access journals are presented to highlight implications of archiving in open digital repositories for the quality and citability of research output. Advantages and downsides of gold, green, and hybrid models of open access operating in diverse scientific environments are described. It is assumed that open access is a global trend which influences the workflow in scholarly journals, changing their quality, credibility, and indexability.
Candidate-based proteomics in the search for biomarkers of cardiovascular disease
Anderson, Leigh
2005-01-01
The key concept of proteomics (looking at many proteins at once) opens new avenues in the search for clinically useful biomarkers of disease, treatment response and ageing. As the number of proteins that can be detected in plasma or serum (the primary clinical diagnostic samples) increases towards 1000, a paradoxical decline has occurred in the number of new protein markers approved for diagnostic use in clinical laboratories. This review explores the limitations of current proteomics protein discovery platforms, and proposes an alternative approach, applicable to a range of biological/physiological problems, in which quantitative mass spectrometric methods developed for analytical chemistry are employed to measure limited sets of candidate markers in large sets of clinical samples. A set of 177 candidate biomarker proteins with reported associations to cardiovascular disease and stroke are presented as a starting point for such a ‘directed proteomics’ approach. PMID:15611012
A statistical framework for protein quantitation in bottom-up MS-based proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karpievitch, Yuliya; Stanley, Jeffrey R.; Taverner, Thomas
2009-08-15
ABSTRACT Motivation: Quantitative mass spectrometry-based proteomics requires protein-level estimates and confidence measures. Challenges include the presence of low-quality or incorrectly identified peptides and widespread, informative, missing data. Furthermore, models are required for rolling peptide-level information up to the protein level. Results: We present a statistical model for protein abundance in terms of peptide peak intensities, applicable to both label-based and label-free quantitation experiments. The model allows for both random and censoring missingness mechanisms and provides naturally for protein-level estimates and confidence measures. The model is also used to derive automated filtering and imputation routines. Three LC-MS datasets are used tomore » illustrate the methods. Availability: The software has been made available in the open-source proteomics platform DAnTE (Polpitiya et al. (2008)) (http://omics.pnl.gov/software/). Contact: adabney@stat.tamu.edu« less
Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.
Dowsey, Andrew W; Dunn, Michael J; Yang, Guang-Zhong
2008-04-01
The quest for high-throughput proteomics has revealed a number of challenges in recent years. Whilst substantial improvements in automated protein separation with liquid chromatography and mass spectrometry (LC/MS), aka 'shotgun' proteomics, have been achieved, large-scale open initiatives such as the Human Proteome Organization (HUPO) Brain Proteome Project have shown that maximal proteome coverage is only possible when LC/MS is complemented by 2D gel electrophoresis (2-DE) studies. Moreover, both separation methods require automated alignment and differential analysis to relieve the bioinformatics bottleneck and so make high-throughput protein biomarker discovery a reality. The purpose of this article is to describe a fully automatic image alignment framework for the integration of 2-DE into a high-throughput differential expression proteomics pipeline. The proposed method is based on robust automated image normalization (RAIN) to circumvent the drawbacks of traditional approaches. These use symbolic representation at the very early stages of the analysis, which introduces persistent errors due to inaccuracies in modelling and alignment. In RAIN, a third-order volume-invariant B-spline model is incorporated into a multi-resolution schema to correct for geometric and expression inhomogeneity at multiple scales. The normalized images can then be compared directly in the image domain for quantitative differential analysis. Through evaluation against an existing state-of-the-art method on real and synthetically warped 2D gels, the proposed analysis framework demonstrates substantial improvements in matching accuracy and differential sensitivity. High-throughput analysis is established through an accelerated GPGPU (general purpose computation on graphics cards) implementation. Supplementary material, software and images used in the validation are available at http://www.proteomegrid.org/rain/.
NASA Astrophysics Data System (ADS)
Schuback, N.; Hippmann, A.; Maldonado, M. T.; Allen, A. E.; McCrow, J.; Foster, L. J.; Green, B. R.; Alami, M.
2016-02-01
Iron plays a significant role in controlling marine primary productivity. Despite that extremely low dissolved iron (Fe) concentrations are found in Fe-limited regions, some phytoplankton are able to survive and thrive. Two strains of the model oceanic diatom Thalassiosira oceanica, TO 1003 and TO 1005, have both been used in previous studies to characterize adaptations to iron limitation. These studies have shown that T. oceanica has lowered its Fe requirements and increased its Fe acquisition efficiency compared to coastal counterparts. Both strategies may impose a higher cellular copper (Cu) demand. However, the underlying biochemical adaptations in these oceanic diatoms remain unknown. Recently, the genome, as well as the first proteomic and transcriptomic analyses of T. oceanica 1005 grown under different Fe levels, were published. To further our understanding of the interplay between Fe- and Cu- physiology in open ocean diatoms, we examined an array of physiological responses to varying degrees of Fe-, Cu- and Fe/Cu co-limitation in both strains. We also determined the differential expression of proteins using stable isotope labeling and LC-MS/MS proteomic analysis. The two strains, TO 1003 and TO 1005, need markedly different metal concentrations in the media. TO1003 requires 30% less Cu to sustain its optimal growth and less than 1/10th of the minimum Cu that is needed by TO 1005 to survive. In contrast, TO 1005 is able to grow with less Fe available in the media. The physiological and proteomic responses of these two strains when acclimated to low Fe and/or Cu concentrations will be presented. The evolutionary implications will be discussed.
Open access publishing, article downloads, and citations: randomised controlled trial
Lewenstein, Bruce V; Simon, Daniel H; Booth, James G; Connolly, Mathew J L
2008-01-01
Objective To measure the effect of free access to the scientific literature on article downloads and citations. Design Randomised controlled trial. Setting 11 journals published by the American Physiological Society. Participants 1619 research articles and reviews. Main outcome measures Article readership (measured as downloads of full text, PDFs, and abstracts) and number of unique visitors (internet protocol addresses). Citations to articles were gathered from the Institute for Scientific Information after one year. Interventions Random assignment on online publication of articles published in 11 scientific journals to open access (treatment) or subscription access (control). Results Articles assigned to open access were associated with 89% more full text downloads (95% confidence interval 76% to 103%), 42% more PDF downloads (32% to 52%), and 23% more unique visitors (16% to 30%), but 24% fewer abstract downloads (−29% to −19%) than subscription access articles in the first six months after publication. Open access articles were no more likely to be cited than subscription access articles in the first year after publication. Fifty nine per cent of open access articles (146 of 247) were cited nine to 12 months after publication compared with 63% (859 of 1372) of subscription access articles. Logistic and negative binomial regression analysis of article citation counts confirmed no citation advantage for open access articles. Conclusions Open access publishing may reach more readers than subscription access publishing. No evidence was found of a citation advantage for open access articles in the first year after publication. The citation advantage from open access reported widely in the literature may be an artefact of other causes. PMID:18669565
Riera, M; Aibar, E
2013-05-01
Some studies suggest that open access articles are more often cited than non-open access articles. However, the relationship between open access and citations count in a discipline such as intensive care medicine has not been studied to date. The present article analyzes the effect of open access publishing of scientific articles in intensive care medicine journals in terms of citations count. We evaluated a total of 161 articles (76% being non-open access articles) published in Intensive Care Medicine in the year 2008. Citation data were compared between the two groups up until April 30, 2011. Potentially confounding variables for citation counts were adjusted for in a linear multiple regression model. The median number (interquartile range) of citations of non-open access articles was 8 (4-12) versus 9 (6-18) in the case of open access articles (p=0.084). In the highest citation range (>8), the citation count was 13 (10-16) and 18 (13-21) (p=0.008), respectively. The mean follow-up was 37.5 ± 3 months in both groups. In the 30-35 months after publication, the average number (mean ± standard deviation) of citations per article per month of non-open access articles was 0.28 ± 0.6 versus 0.38 ± 0.7 in the case of open access articles (p=0.043). Independent factors for citation advantage were the Hirsch index of the first signing author (β=0.207; p=0.015) and open access status (β=3.618; p=0.006). Open access publishing and the Hirsch index of the first signing author increase the impact of scientific articles. The open access advantage is greater for the more highly cited articles, and appears in the 30-35 months after publication. Copyright © 2012 Elsevier España, S.L. and SEMICYUC. All rights reserved.
The academic, economic and societal impacts of Open Access: an evidence-based review.
Tennant, Jonathan P; Waldner, François; Jacques, Damien C; Masuzzo, Paola; Collister, Lauren B; Hartgerink, Chris H J
2016-01-01
Ongoing debates surrounding Open Access to the scholarly literature are multifaceted and complicated by disparate and often polarised viewpoints from engaged stakeholders. At the current stage, Open Access has become such a global issue that it is critical for all involved in scholarly publishing, including policymakers, publishers, research funders, governments, learned societies, librarians, and academic communities, to be well-informed on the history, benefits, and pitfalls of Open Access. In spite of this, there is a general lack of consensus regarding the potential pros and cons of Open Access at multiple levels. This review aims to be a resource for current knowledge on the impacts of Open Access by synthesizing important research in three major areas: academic, economic and societal. While there is clearly much scope for additional research, several key trends are identified, including a broad citation advantage for researchers who publish openly, as well as additional benefits to the non-academic dissemination of their work. The economic impact of Open Access is less well-understood, although it is clear that access to the research literature is key for innovative enterprises, and a range of governmental and non-governmental services. Furthermore, Open Access has the potential to save both publishers and research funders considerable amounts of financial resources, and can provide some economic benefits to traditionally subscription-based journals. The societal impact of Open Access is strong, in particular for advancing citizen science initiatives, and leveling the playing field for researchers in developing countries. Open Access supersedes all potential alternative modes of access to the scholarly literature through enabling unrestricted re-use, and long-term stability independent of financial constraints of traditional publishers that impede knowledge sharing. However, Open Access has the potential to become unsustainable for research communities if high-cost options are allowed to continue to prevail in a widely unregulated scholarly publishing market. Open Access remains only one of the multiple challenges that the scholarly publishing system is currently facing. Yet, it provides one foundation for increasing engagement with researchers regarding ethical standards of publishing and the broader implications of 'Open Research'.
The academic, economic and societal impacts of Open Access: an evidence-based review
Tennant, Jonathan P.; Waldner, François; Jacques, Damien C.; Masuzzo, Paola; Collister, Lauren B.; Hartgerink, Chris. H. J.
2016-01-01
Ongoing debates surrounding Open Access to the scholarly literature are multifaceted and complicated by disparate and often polarised viewpoints from engaged stakeholders. At the current stage, Open Access has become such a global issue that it is critical for all involved in scholarly publishing, including policymakers, publishers, research funders, governments, learned societies, librarians, and academic communities, to be well-informed on the history, benefits, and pitfalls of Open Access. In spite of this, there is a general lack of consensus regarding the potential pros and cons of Open Access at multiple levels. This review aims to be a resource for current knowledge on the impacts of Open Access by synthesizing important research in three major areas: academic, economic and societal. While there is clearly much scope for additional research, several key trends are identified, including a broad citation advantage for researchers who publish openly, as well as additional benefits to the non-academic dissemination of their work. The economic impact of Open Access is less well-understood, although it is clear that access to the research literature is key for innovative enterprises, and a range of governmental and non-governmental services. Furthermore, Open Access has the potential to save both publishers and research funders considerable amounts of financial resources, and can provide some economic benefits to traditionally subscription-based journals. The societal impact of Open Access is strong, in particular for advancing citizen science initiatives, and leveling the playing field for researchers in developing countries. Open Access supersedes all potential alternative modes of access to the scholarly literature through enabling unrestricted re-use, and long-term stability independent of financial constraints of traditional publishers that impede knowledge sharing. However, Open Access has the potential to become unsustainable for research communities if high-cost options are allowed to continue to prevail in a widely unregulated scholarly publishing market. Open Access remains only one of the multiple challenges that the scholarly publishing system is currently facing. Yet, it provides one foundation for increasing engagement with researchers regarding ethical standards of publishing and the broader implications of 'Open Research'. PMID:27158456
Reasons to temper enthusiasm about open access nursing journals.
de Jong, Gideon
2017-04-01
Open access is a relatively new phenomenon within nursing science. Several papers from various nursing journals have been published recently on the disadvantages of the traditional model of purchasing proprietary fee-based databases to access scholarly information. Just few nursing scholars are less optimistic about the possible benefits of open access nursing journals. A critical reflection on the merits and pitfalls of open access journals along insights from the literature and personal opinion. Two arguments are discussed, providing justification for tempering enthusiasm about open access journals. First, only research groups with sufficient financial resources can publish in open access journals. Second, open access has conflicting incentives, where the aim is to expand production at the expense of publishing quality articles; a business model that fits well into a neoliberal discourse. There are valid reasons to criticise the traditional publishers for the excessive costs of a single article, therefore preventing the dissemination of scholarly nursing information. On the contrary, the business model of open access publishers is no less imbued with the neoliberal tendency of lining the pockets.
Squires, R. Burke; Noronha, Jyothi; Hunt, Victoria; García‐Sastre, Adolfo; Macken, Catherine; Baumgarth, Nicole; Suarez, David; Pickett, Brett E.; Zhang, Yun; Larsen, Christopher N.; Ramsey, Alvin; Zhou, Liwei; Zaremba, Sam; Kumar, Sanjeev; Deitrich, Jon; Klem, Edward; Scheuermann, Richard H.
2012-01-01
Please cite this paper as: Squires et al. (2012) Influenza research database: an integrated bioinformatics resource for influenza research and surveillance. Influenza and Other Respiratory Viruses 6(6), 404–416. Background The recent emergence of the 2009 pandemic influenza A/H1N1 virus has highlighted the value of free and open access to influenza virus genome sequence data integrated with information about other important virus characteristics. Design The Influenza Research Database (IRD, http://www.fludb.org) is a free, open, publicly‐accessible resource funded by the U.S. National Institute of Allergy and Infectious Diseases through the Bioinformatics Resource Centers program. IRD provides a comprehensive, integrated database and analysis resource for influenza sequence, surveillance, and research data, including user‐friendly interfaces for data retrieval, visualization and comparative genomics analysis, together with personal log in‐protected ‘workbench’ spaces for saving data sets and analysis results. IRD integrates genomic, proteomic, immune epitope, and surveillance data from a variety of sources, including public databases, computational algorithms, external research groups, and the scientific literature. Results To demonstrate the utility of the data and analysis tools available in IRD, two scientific use cases are presented. A comparison of hemagglutinin sequence conservation and epitope coverage information revealed highly conserved protein regions that can be recognized by the human adaptive immune system as possible targets for inducing cross‐protective immunity. Phylogenetic and geospatial analysis of sequences from wild bird surveillance samples revealed a possible evolutionary connection between influenza virus from Delaware Bay shorebirds and Alberta ducks. Conclusions The IRD provides a wealth of integrated data and information about influenza virus to support research of the genetic determinants dictating virus pathogenicity, host range restriction and transmission, and to facilitate development of vaccines, diagnostics, and therapeutics. PMID:22260278
Public Access and Open Access: Is There a Difference? | Poster
By Robin Meckley, Contributing Writer, and Tracie Frederick, Guest Writer Open access and public access—are they different concepts or are they the same? What do they mean for the researchers at NCI at Frederick? “Open-access (OA) literature is digital, online, free of charge, and free of most copyright and licensing restrictions. What makes it possible is the Internet and the consent of the author or copyright-holder,” according to an open access website maintained by Peter Suber, director, Harvard Open Access Project.
Identification of species- and tissue-specific proteins using proteomic strategy
NASA Astrophysics Data System (ADS)
Chernukha, I. M.; Vostrikova, N. L.; Kovalev, L. I.; Shishkin, S. S.; Kovaleva, M. A.; Manukhin, Y. S.
2017-09-01
Proteomic technologies have proven to be very effective for detecting biochemical changes in meat products, such as changes in tissue- and species-specific proteins. In the tissues of cattle, pig, horse and camel M. longissimus dorsi both tissue- and species specific proteins were detected using two dimensional electrophoresis. Species-specific isoforms of several muscle proteins were also identified. The identified and described proteins of cattle, pig, horse and camel skeletal muscles (including mass spectra of the tryptic peptides) were added to the national free access database “Muscle organ proteomics”. This research has enabled the development of new highly sensitive technologies for meat product quality control against food fraud.
DIGE compatible labelling of surface proteins on vital cells in vitro and in vivo.
Mayrhofer, Corina; Krieger, Sigurd; Allmaier, Günter; Kerjaschki, Dontscho
2006-01-01
Efficient methods for profiling of the cell surface proteome are desirable to get a deeper insight in basic biological processes, to localise proteins and to uncover proteins differentially expressed in diseases. Here we present a strategy to target cell surface exposed proteins via fluorescence labelling using CyDye DIGE fluors. This method has been applied to human cell lines in vitro as well as to a complex biological system in vivo. It allows detection of fluorophore-tagged cell surface proteins and visualisation of the accessible proteome within a single 2-D gel, simplifying subsequent UV MALDI-MS analysis.
Open Access Publishing - Strengths and Strategies
NASA Astrophysics Data System (ADS)
Rasmussen, Martin
2010-05-01
The journal crisis and the demand for free accessibility to the results of publicly funded research were the main drivers of the Open Access movement since the late 1990's. Besides many academic institutions that support the different ways of Open Access publishing, there is a growing number of publishing houses that are specialized on this new access and business model of scholarly literature. The lecture provides an overview of the different kinds of Open Access publishing, discusses the variety of underlying business models, names the advantages and potentials for researches and the public, and overcomes some objections against Open Access. Besides the increased visibility and information supply, the topic of copyrights and exploitation rights will be discussed. Furthermore, it is a central aim of the presentation to show that Open Access does not only support full peer-review, but also provides the potential for even enhanced quality assurance. The financing of business models based on open accessible literature is another important part to be outlined in the lecture.
50 CFR 660.316 - Open access fishery-observer requirements.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 11 2011-10-01 2011-10-01 false Open access fishery-observer requirements. 660.316 Section 660.316 Wildlife and Fisheries FISHERY CONSERVATION AND MANAGEMENT, NATIONAL OCEANIC... West Coast Groundfish-Open Access Fisheries § 660.316 Open access fishery—observer requirements. (a...
50 CFR 660.316 - Open access fishery-observer requirements.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery-observer requirements. 660.316 Section 660.316 Wildlife and Fisheries FISHERY CONSERVATION AND MANAGEMENT, NATIONAL OCEANIC... West Coast Groundfish-Open Access Fisheries § 660.316 Open access fishery—observer requirements. (a...
50 CFR 660.316 - Open access fishery-observer requirements.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Open access fishery-observer requirements. 660.316 Section 660.316 Wildlife and Fisheries FISHERY CONSERVATION AND MANAGEMENT, NATIONAL OCEANIC... West Coast Groundfish-Open Access Fisheries § 660.316 Open access fishery—observer requirements. (a...
50 CFR 660.316 - Open access fishery-observer requirements.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 13 2012-10-01 2012-10-01 false Open access fishery-observer requirements. 660.316 Section 660.316 Wildlife and Fisheries FISHERY CONSERVATION AND MANAGEMENT, NATIONAL OCEANIC... West Coast Groundfish-Open Access Fisheries § 660.316 Open access fishery—observer requirements. (a...
50 CFR 660.316 - Open access fishery-observer requirements.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 13 2014-10-01 2014-10-01 false Open access fishery-observer requirements. 660.316 Section 660.316 Wildlife and Fisheries FISHERY CONSERVATION AND MANAGEMENT, NATIONAL OCEANIC... West Coast Groundfish-Open Access Fisheries § 660.316 Open access fishery—observer requirements. (a...
An evaluation of emergency medicine investigators' views on open access to medical literature.
Rodriguez, R M; Wong, J; Hardy, J; Frankel, E
2006-12-01
Scientists and governmental agencies have called for free universal access to research publications via the internet--open access. To examine the current medical literature reading practices of emergency medicine investigators (EMIs) and their views towards open access. Surveys were mailed to the 212 corresponding authors of all original research articles published in years 2002 and 2003 in the Annals of Emergency Medicine, Academic Emergency Medicine and The Journal of Emergency Medicine. The most commonly read forms of medical literature reported by the 129 (61%) EMI respondents were hard-copy medical journals and online literature review services. 59% of EMIs were in favour of open access; 58% stated they would read a wider variety of medical literature; 21% believed open access would improve the quality of publications and 39% thought it would decrease the quality. When asked how a US 1500 dollars fee for open access would affect their ability to publish research, 69% said it would greatly impede and 19% said it would slightly impede their research. Despite concerns that open access may impede their ability to publish research and decrease the quality of publications, most EMIs surveyed favoured open access. They believed open access would increase and broaden their medical literature reading.
Microarray, proteomic, and metabonomic technologies are becoming increasingly accessible as tools for ecotoxicology research. Effective use of these technologies will depend, at least in part, on the ability to apply these techniques within a paradigm of hypothesis driven researc...
ExPASy: SIB bioinformatics resource portal.
Artimo, Panu; Jonnalagedda, Manohar; Arnold, Konstantin; Baratin, Delphine; Csardi, Gabor; de Castro, Edouard; Duvaud, Séverine; Flegel, Volker; Fortier, Arnaud; Gasteiger, Elisabeth; Grosdidier, Aurélien; Hernandez, Céline; Ioannidis, Vassilios; Kuznetsov, Dmitry; Liechti, Robin; Moretti, Sébastien; Mostaguir, Khaled; Redaschi, Nicole; Rossier, Grégoire; Xenarios, Ioannis; Stockinger, Heinz
2012-07-01
ExPASy (http://www.expasy.org) has worldwide reputation as one of the main bioinformatics resources for proteomics. It has now evolved, becoming an extensible and integrative portal accessing many scientific resources, databases and software tools in different areas of life sciences. Scientists can henceforth access seamlessly a wide range of resources in many different domains, such as proteomics, genomics, phylogeny/evolution, systems biology, population genetics, transcriptomics, etc. The individual resources (databases, web-based and downloadable software tools) are hosted in a 'decentralized' way by different groups of the SIB Swiss Institute of Bioinformatics and partner institutions. Specifically, a single web portal provides a common entry point to a wide range of resources developed and operated by different SIB groups and external institutions. The portal features a search function across 'selected' resources. Additionally, the availability and usage of resources are monitored. The portal is aimed for both expert users and people who are not familiar with a specific domain in life sciences. The new web interface provides, in particular, visual guidance for newcomers to ExPASy.
ERIC Educational Resources Information Center
Tenopir, Carol
2004-01-01
Open access publishing is a hot topic today. But open access publishing can have many different definitions, and pros and cons vary with the definitions. Open access publishing is especially attractive to companies and small colleges or universities that are likely to have many more readers than authors. A downside is that a membership fee sounds…
50 CFR 648.88 - Multispecies open access permit restrictions.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 50 Wildlife and Fisheries 10 2011-10-01 2011-10-01 false Multispecies open access permit... Management Measures for the NE Multispecies and Monkfish Fisheries § 648.88 Multispecies open access permit restrictions. (a) Handgear permit. A vessel issued a valid open access NE multispecies Handgear permit is...
50 CFR 648.88 - Multispecies open access permit restrictions.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 8 2010-10-01 2010-10-01 false Multispecies open access permit... Management Measures for the NE Multispecies and Monkfish Fisheries § 648.88 Multispecies open access permit restrictions. (a) Handgear permit. A vessel issued a valid open access NE multispecies Handgear permit is...
50 CFR 660.383 - Open access fishery management measures.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 50 Wildlife and Fisheries 9 2010-10-01 2010-10-01 false Open access fishery management measures... West Coast Groundfish Fisheries § 660.383 Open access fishery management measures. (a) General. Groundfish species taken in open access fisheries will be managed with cumulative trip limits (see trip...
50 CFR 648.88 - Multispecies open access permit restrictions.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 12 2013-10-01 2013-10-01 false Multispecies open access permit... Management Measures for the NE Multispecies and Monkfish Fisheries § 648.88 Multispecies open access permit restrictions. (a) Handgear permit. A vessel issued a valid open access NE multispecies Handgear permit is...
50 CFR 648.88 - Multispecies open access permit restrictions.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 50 Wildlife and Fisheries 12 2014-10-01 2014-10-01 false Multispecies open access permit... Management Measures for the NE Multispecies and Monkfish Fisheries § 648.88 Multispecies open access permit restrictions. (a) Handgear permit. A vessel issued a valid open access NE multispecies Handgear permit is...
50 CFR 648.88 - Multispecies open access permit restrictions.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 50 Wildlife and Fisheries 12 2012-10-01 2012-10-01 false Multispecies open access permit... Management Measures for the NE Multispecies and Monkfish Fisheries § 648.88 Multispecies open access permit restrictions. (a) Handgear permit. A vessel issued a valid open access NE multispecies Handgear permit is...
50 CFR 648.15 - Facilitation of enforcement.
Code of Federal Regulations, 2011 CFR
2011-10-01
... surfclam and ocean quahog vessel owners and operators. (1) Surfclam and ocean quahog open access permitted vessels. Vessel owners or operators issued an open access surfclam or ocean quahog open access permit for.../or an Open Access Herring Permit that fished with midwater trawl gear pursuant to § 648.80(d). Such...
The Human Proteome Project: Unlocking the Mysteries of Human Life and Unleashing Its Potential
2011-02-16
Australasian Genetics Resource Book. June 2007. Accessed September 27, 2010. www.genetics.com.au/pdf/factsheets/fs24.pdf. 2 White House, Office of...Project and Beyond." The Australasian Genetics Resource Book. June 2007. Accessed September 27, 2010. www.genetics.com.au/pdf/factsheets/fs24.pdf...9 Centre for Genetics Education. "The Human Genetic Code – The Human Genome Project and Beyond." The Australasian Genetics Resource Book. June
Open tubular lab-on-column/mass spectrometry for targeted proteomics of nanogram sample amounts.
Hustoft, Hanne Kolsrud; Vehus, Tore; Brandtzaeg, Ole Kristian; Krauss, Stefan; Greibrokk, Tyge; Wilson, Steven Ray; Lundanes, Elsa
2014-01-01
A novel open tubular nanoproteomic platform featuring accelerated on-line protein digestion and high-resolution nano liquid chromatography mass spectrometry (LC-MS) has been developed. The platform features very narrow open tubular columns, and is hence particularly suited for limited sample amounts. For enzymatic digestion of proteins, samples are passed through a 20 µm inner diameter (ID) trypsin + endoproteinase Lys-C immobilized open tubular enzyme reactor (OTER). Resulting peptides are subsequently trapped on a monolithic pre-column and transferred on-line to a 10 µm ID porous layer open tubular (PLOT) liquid chromatography LC separation column. Wnt/ß-catenein signaling pathway (Wnt-pathway) proteins of potentially diagnostic value were digested+detected in targeted-MS/MS mode in small cell samples and tumor tissues within 120 minutes. For example, a potential biomarker Axin1 was identifiable in just 10 ng of sample (protein extract of ∼1,000 HCT15 colon cancer cells). In comprehensive mode, the current OTER-PLOT set-up could be used to identify approximately 1500 proteins in HCT15 cells using a relatively short digestion+detection cycle (240 minutes), outperforming previously reported on-line digestion/separation systems. The platform is fully automated utilizing common commercial instrumentation and parts, while the reactor and columns are simple to produce and have low carry-over. These initial results point to automated solutions for fast and very sensitive MS based proteomics, especially for samples of limited size.
Imaging HIV-1 Tat Trafficking and Interactions by Engineered Green-Fluorescent-Protein Tagging
NASA Astrophysics Data System (ADS)
Beltram, Fabio
2002-03-01
The direct monitoring of protein function in live cells under physiologically relevant conditions is one of the most powerful and innovative methodologies for proteomics. Efficient florescent probes fully compatible with human-cell expression are the fundamental tools for these studies and their optimization opens the way to resolution at the single-protein level. Biological events involving protein pairs are also directly accessible thanks to tuning of protein-tag spectral properties and production of complementary pairs. Such pairs are characterized by overlapping absorption (for the acceptor tag) and emission (for the donor tag) spectra. By tagging the proteins of interest with acceptor and donor molecules, protein interaction can be directly visualized by FRET, fluorescent resonant energy transfer. In this talk we shall present the design by molecular dynamics calculations and the application of optimized green fluorescent proteins to the study of the human immunodeficiency virus HIV-1 proteomics. In particular trafficking and cellular interactions of HIV-1 transactivator protein Tat in live human cells will be presented. Tat localization and complex internalization pathways of exogenous molecules will be presented thanks to the peculiar optical properties of mutated GFPs. Cellular protein partners and subcellular interaction sites will be identified and directly visualized. The relevance of such results and of advanced spectroscopic and imaging techniques for a new level of understanding of biological processes and its significance for advancement in molecular biology will be underlined. A. Marcello et al., J. Biol. Chem. 276, 39220 (2001). R. Cinelli et al., Appl. Phys. Lett. 79, 3353 (2001).
Davidi, Lital; Levin, Yishai; Ben-Dor, Shifra; Pick, Uri
2015-01-01
The halotolerant green alga Dunaliella bardawil is unique in that it accumulates under stress two types of lipid droplets: cytoplasmatic lipid droplets (CLD) and β-carotene-rich (βC) plastoglobuli. Recently, we isolated and analyzed the lipid and pigment compositions of these lipid droplets. Here, we describe their proteome analysis. A contamination filter and an enrichment filter were utilized to define core proteins. A proteome database of Dunaliella salina/D. bardawil was constructed to aid the identification of lipid droplet proteins. A total of 124 and 42 core proteins were identified in βC-plastoglobuli and CLD, respectively, with only eight common proteins. Dunaliella spp. CLD resemble cytoplasmic droplets from Chlamydomonas reinhardtii and contain major lipid droplet-associated protein and enzymes involved in lipid and sterol metabolism. The βC-plastoglobuli proteome resembles the C. reinhardtii eyespot and Arabidopsis (Arabidopsis thaliana) plastoglobule proteomes and contains carotene-globule-associated protein, plastid-lipid-associated protein-fibrillins, SOUL heme-binding proteins, phytyl ester synthases, β-carotene biosynthesis enzymes, and proteins involved in membrane remodeling/lipid droplet biogenesis: VESICLE-INDUCING PLASTID PROTEIN1, synaptotagmin, and the eyespot assembly proteins EYE3 and SOUL3. Based on these and previous results, we propose models for the biogenesis of βC-plastoglobuli and the biosynthesis of β-carotene within βC-plastoglobuli and hypothesize that βC-plastoglobuli evolved from eyespot lipid droplets. PMID:25404729
The Hemolymph Proteome of Fed and Starved Drosophila Larvae
Goetze, Sandra; Ahrens, Christian H.; Omasits, Ulrich; Marty, Florian; Simigdala, Nikiana; Meyer, Imke; Wollscheid, Bernd; Brunner, Erich; Hafen, Ernst; Lehner, Christian F.
2013-01-01
The co-operation of specialized organ systems in complex multicellular organisms depends on effective chemical communication. Thus, body fluids (like blood, lymph or intraspinal fluid) contain myriads of signaling mediators apart from metabolites. Moreover, these fluids are also of crucial importance for immune and wound responses. Compositional analyses of human body fluids are therefore of paramount diagnostic importance. Further improving their comprehensiveness should increase our understanding of inter-organ communication. In arthropods, which have trachea for gas exchange and an open circulatory system, the single dominating interstitial fluid is the hemolymph. Accordingly, a detailed analysis of hemolymph composition should provide an especially comprehensive picture of chemical communication and defense in animals. Therefore we used an extensive protein fractionation workflow in combination with a discovery-driven proteomic approach to map out the detectable protein composition of hemolymph isolated from Drosophila larvae. Combined mass spectrometric analysis revealed more than 700 proteins extending far beyond the previously known Drosophila hemolymph proteome. Moreover, by comparing hemolymph isolated from either fed or starved larvae, we provide initial provisional insights concerning compositional changes in response to nutritional state. Storage proteins in particular were observed to be strongly reduced by starvation. Our hemolymph proteome catalog provides a rich basis for data mining, as exemplified by our identification of potential novel cytokines, as well as for future quantitative analyses by targeted proteomics. PMID:23840627
Learning from Heterogeneous Data Sources: An Application in Spatial Proteomics
Breckels, Lisa M.; Holden, Sean B.; Wojnar, David; Mulvey, Claire M.; Christoforou, Andy; Groen, Arnoud; Trotter, Matthew W. B.; Kohlbacher, Oliver; Lilley, Kathryn S.; Gatto, Laurent
2016-01-01
Sub-cellular localisation of proteins is an essential post-translational regulatory mechanism that can be assayed using high-throughput mass spectrometry (MS). These MS-based spatial proteomics experiments enable us to pinpoint the sub-cellular distribution of thousands of proteins in a specific system under controlled conditions. Recent advances in high-throughput MS methods have yielded a plethora of experimental spatial proteomics data for the cell biology community. Yet, there are many third-party data sources, such as immunofluorescence microscopy or protein annotations and sequences, which represent a rich and vast source of complementary information. We present a unique transfer learning classification framework that utilises a nearest-neighbour or support vector machine system, to integrate heterogeneous data sources to considerably improve on the quantity and quality of sub-cellular protein assignment. We demonstrate the utility of our algorithms through evaluation of five experimental datasets, from four different species in conjunction with four different auxiliary data sources to classify proteins to tens of sub-cellular compartments with high generalisation accuracy. We further apply the method to an experiment on pluripotent mouse embryonic stem cells to classify a set of previously unknown proteins, and validate our findings against a recent high resolution map of the mouse stem cell proteome. The methodology is distributed as part of the open-source Bioconductor pRoloc suite for spatial proteomics data analysis. PMID:27175778
The hemolymph proteome of fed and starved Drosophila larvae.
Handke, Björn; Poernbacher, Ingrid; Goetze, Sandra; Ahrens, Christian H; Omasits, Ulrich; Marty, Florian; Simigdala, Nikiana; Meyer, Imke; Wollscheid, Bernd; Brunner, Erich; Hafen, Ernst; Lehner, Christian F
2013-01-01
The co-operation of specialized organ systems in complex multicellular organisms depends on effective chemical communication. Thus, body fluids (like blood, lymph or intraspinal fluid) contain myriads of signaling mediators apart from metabolites. Moreover, these fluids are also of crucial importance for immune and wound responses. Compositional analyses of human body fluids are therefore of paramount diagnostic importance. Further improving their comprehensiveness should increase our understanding of inter-organ communication. In arthropods, which have trachea for gas exchange and an open circulatory system, the single dominating interstitial fluid is the hemolymph. Accordingly, a detailed analysis of hemolymph composition should provide an especially comprehensive picture of chemical communication and defense in animals. Therefore we used an extensive protein fractionation workflow in combination with a discovery-driven proteomic approach to map out the detectable protein composition of hemolymph isolated from Drosophila larvae. Combined mass spectrometric analysis revealed more than 700 proteins extending far beyond the previously known Drosophila hemolymph proteome. Moreover, by comparing hemolymph isolated from either fed or starved larvae, we provide initial provisional insights concerning compositional changes in response to nutritional state. Storage proteins in particular were observed to be strongly reduced by starvation. Our hemolymph proteome catalog provides a rich basis for data mining, as exemplified by our identification of potential novel cytokines, as well as for future quantitative analyses by targeted proteomics.
Lee, Hye Min; Gupta, Ravi; Kim, Sun Hyung; Wang, Yiming; Rakwal, Randeep; Agrawal, Ganesh Kumar; Kim, Sun Tae
2015-05-01
High-abundance proteins (HAPs) hamper in-depth proteome study necessitating development of a HAPs depletion method. Here, we report a novel ethanol precipitation method (EPM) for HAPs depletion from total tuber proteins. Ethanol showed a dose-dependent effect on depletion of sporamin from sweet potato and patatin from potato tubers, respectively. The 50% ethanol was an optimal concentration. 2DE analysis of EPM-prepared sweet potato proteins also revealed enrichment of storage proteins (SPs) in ethanol supernatant (ES) resulting in detection of new low-abundance proteins in ethanol pellet (EP), compared to total fraction. The ES fraction showed even higher trypsin inhibitor activity than total proteins, further showing the efficacy of EPM in enrichment of sporamin in ES fraction. Application of this method was demonstrated for comparative proteomics of two sweet potato cultivars (Hwang-geum and Ho-bac) and purification of SP (sporamin) in its native form, as examples. Comparative proteomics identified many cultivar specific protein spots and selected spots were confidently assigned for their protein identity using MALDI-TOF-TOF analysis. Overall, the EPM is simple, reproducible, and economical for depletion of SPs and is suitable for downstream proteomics study. This study opens a door for its potential application to other tuber crops or fruits rich in carbohydrates. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zhu, Ying; Zhao, Rui; Piehowski, Paul D.; ...
2017-09-01
One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here in this study, we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-μm-i.d. columns increase signal intensity by >3-fold relative to those using 75-μm-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos MS significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap),more » leading to a ~3-fold increase in peptide identifications and 1.7-fold increase in identified protein groups for 2 ng tryptic digests of the bacterium S. oneidensis. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by ~95% for 0.5 ng samples and by ~42% for 2 ng samples. Using the best combination of the above variables, we were able to identify >3,000 proteins from 10 ng tryptic digests from both HeLa and THP-1 mammalian cell lines. We also identified >950 proteins from subnanogram archaeal/bacterial cocultures. Finally, the present ultrasensitive LC-MS platform achieves a level of proteome coverage not previously realized for ultra-small sample loadings, and is expected to facilitate the analysis of subnanogram samples, including single mammalian cells.« less
Perilous terra incognita--open-access journals.
Balon, Richard
2014-04-01
The author focuses on a new rapidly spreading practice of publication in open-access journals. The pros and cons of open-access journals are discussed. Publishing in these journals may be cost prohibitive for educators and junior faculty members. Some authors may be lured by the ease of publishing in open-access journals (and their, at times, inflated self-description, e.g., "international", "scientific"), and their possibly valuable contributions will escape the attention of Academic Psychiatry readership in the vast sea of open-access journals. The readership may be flooded with a large number of low-quality articles (maybe not even properly peer-reviewed) from open-access journals. It may take some time to sort out what is and what is not relevant and useful. Open-access publishing represents a problematic and controversial practice and may be associated with a conflict of interest for the editors and publishers of these journals.
ERIC Educational Resources Information Center
Stanton, Kate Valentine; Liew, Chern Li
2011-01-01
Introduction: We examine doctoral students' awareness of and attitudes to open access forms of publication. Levels of awareness of open access and the concept of institutional repositories, publishing behaviour and perceptions of benefits and risks of open access publishing were explored. Method: Qualitative and quantitative data were collected…
Open Access Publishing: What Authors Want
ERIC Educational Resources Information Center
Nariani, Rajiv; Fernandez, Leila
2012-01-01
Campus-based open access author funds are being considered by many academic libraries as a way to support authors publishing in open access journals. Article processing fees for open access have been introduced recently by publishers and have not yet been widely accepted by authors. Few studies have surveyed authors on their reasons for publishing…
Almost Halfway There: An Analysis of the Open Access Behaviors of Academic Librarians
ERIC Educational Resources Information Center
Mercer, Holly
2011-01-01
Academic librarians are increasingly expected to advocate for scholarly communications reforms such as open access to scholarly publications, yet librarians do not always practice what they preach. Previous research examined librarian attitudes toward open access, whereas this article presents results of a study of open access publishing and…
30 CFR 291.113 - What actions may MMS take to remedy denial of open and nondiscriminatory access?
Code of Federal Regulations, 2011 CFR
2011-07-01
... open and nondiscriminatory access? 291.113 Section 291.113 Mineral Resources BUREAU OF OCEAN ENERGY MANAGEMENT, REGULATION, AND ENFORCEMENT, DEPARTMENT OF THE INTERIOR APPEALS OPEN AND NONDISCRIMINATORY ACCESS... take to remedy denial of open and nondiscriminatory access? If the MMS Director's decision under § 291...
A tutorial in displaying mass spectrometry-based proteomic data using heat maps.
Key, Melissa
2012-01-01
Data visualization plays a critical role in interpreting experimental results of proteomic experiments. Heat maps are particularly useful for this task, as they allow us to find quantitative patterns across proteins and biological samples simultaneously. The quality of a heat map can be vastly improved by understanding the options available to display and organize the data in the heat map. This tutorial illustrates how to optimize heat maps for proteomics data by incorporating known characteristics of the data into the image. First, the concepts used to guide the creating of heat maps are demonstrated. Then, these concepts are applied to two types of analysis: visualizing spectral features across biological samples, and presenting the results of tests of statistical significance. For all examples we provide details of computer code in the open-source statistical programming language R, which can be used for biologists and clinicians with little statistical background. Heat maps are a useful tool for presenting quantitative proteomic data organized in a matrix format. Understanding and optimizing the parameters used to create the heat map can vastly improve both the appearance and the interoperation of heat map data.
Development of a Targeted Urine Proteome Assay for kidney diseases.
Cantley, Lloyd G; Colangelo, Christopher M; Stone, Kathryn L; Chung, Lisa; Belcher, Justin; Abbott, Thomas; Cantley, Jennifer L; Williams, Kenneth R; Parikh, Chirag R
2016-01-01
Since human urine is the most readily available biofluid whose proteome changes in response to disease, it is a logical sample for identifying protein biomarkers for kidney diseases. Potential biomarkers were identified by using a multiproteomics workflow to compare urine proteomes of kidney transplant patients with immediate and delayed graft function. Differentially expressed proteins were identified, and corresponding stable isotope labeled internal peptide standards were synthesized for scheduled MRM. The Targeted Urine Proteome Assay (TUPA) was then developed by identifying those peptides for which there were at least two transitions for which interference in a urine matrix across 156 MRM runs was <30%. This resulted in an assay that monitors 224 peptides from 167 quantifiable proteins. TUPA opens the way for using a robust mass spectrometric technology, MRM, for quantifying and validating biomarkers from among 167 urinary proteins. This approach, while developed using differentially expressed urinary proteins from patients with delayed versus immediate graft function after kidney transplant, can be expanded to include differentially expressed urinary proteins in multiple kidney diseases. Thus, TUPA could provide a single assay to help diagnose, prognose, and manage many kidney diseases. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Contribution of proteomics to the study of plant pathogenic fungi.
Gonzalez-Fernandez, Raquel; Jorrin-Novo, Jesus V
2012-01-01
Phytopathogenic fungi are one of the most damaging plant parasitic organisms, and can cause serious diseases and important yield losses in crops. The study of the biology of these microorganisms and the interaction with their hosts has experienced great advances in recent years due to the development of moderm, holistic and high-throughput -omic techniques, together with the increasing number of genome sequencing projects and the development of mutants and reverse genetics tools. We highlight among these -omic techniques the importance of proteomics, which has become a relevant tool in plant-fungus pathosystem research. Proteomics intends to identify gene products with a key role in pathogenicity and virulence. These studies would help in the search of key protein targets and in the development of agrochemicals, which may open new ways for crop disease diagnosis and protection. In this review, we made an overview on the contribution of proteomics to the knowledge of life cycle, infection mechanisms, and virulence of the plant pathogenic fungi. Data from current, innovative literature, according to both methodological and experimental systems, were summarized and discussed. Specific sections were devoted to the most studied fungal phytopathogens: Botrytis cinerea, Sclerotinia sclerotiorum, and Fusarium graminearum.
Digital Scholarship and Open Access
ERIC Educational Resources Information Center
Losoff, Barbara; Pence, Harry E.
2010-01-01
Open access publications provide scholars with unrestricted access to the "conversation" that is the basis for the advancement of knowledge. The large number of open access journals, archives, and depositories already in existence demonstrates the technical and economic viability of providing unrestricted access to the literature that is the…
Open Access Publishing in the Field of Medical Informatics.
Kuballa, Stefanie
2017-05-01
The open access paradigm has become an important approach in today's information and communication society. Funders and governments in different countries stipulate open access publications of funded research results. Medical informatics as part of the science, technology and medicine disciplines benefits from many research funds, such as National Institutes of Health in the US, Wellcome Trust in UK, German Research Foundation in Germany and many more. In this study an overview of the current open access programs and conditions of major journals in the field of medical informatics is presented. It was investigated whether there are suitable options and how they are shaped. Therefore all journals in Thomson Reuters Web of Science that were listed in the subject category "Medical Informatics" in 2014 were examined. An Internet research was conducted by investigating the journals' websites. It was reviewed whether journals offer an open access option with a subsequent check of conditions as for example the type of open access, the fees and the licensing. As a result all journals in the field of medical informatics that had an impact factor in 2014 offer an open access option. A predominantly consistent pricing range was determined with an average fee of 2.248 € and a median fee of 2.207 €. The height of a journals' open access fee did not correlate with the height of its Impact Factor. Hence, medical informatics journals have recognized the trend of open access publishing, though the vast majority of them are working with the hybrid method. Hybrid open access may however lead to problems in questions of double dipping and the often stipulated gold open access.
Schroter, Sara; Tite, Leanne
2006-01-01
Objectives: We aimed to assess journal authors' current knowledge and perceptions of open access and author-pays publishing. Design: An electronic survey. Setting: Authors of research papers submitted to BMJ, Archives of Disease in Childhood, and Journal of Medical Genetics in 2004. Main outcome measures: Familiarity with and perceptions of open access and author-pays publishing. Results: 468/1113 (42%) responded. Prior to definitions being provided, 47% (222/468) and 38% (176/468) reported they were familiar with the terms `open access' and `author-pays' publishing, respectively. Some who did not at first recognize the terms, did claim to recognize them when they were defined. Only 10% (49/468) had submitted to an author-pays journal. Compared with non-open access subscription-based journals, 35% agreed that open access author-pays journals have a greater capacity to publish more content making it easier to get published, 27% thought they had lower impact factors, 31% thought they had faster and more timely publicaitons, and 46% agreed that people will think anyone can pay to get published. 55% (256/468) thought they would not continue to submit to their respective journal if it became open access and charged, largely because of the reputaiton of the journals. Half (54%, 255/468) said open access has `no impact' or was `low priority' in their submission decisions. Two-thirds (66%, 308/468) said they would prefer to submit to a non-open access subscription-based journal than an open access author-pays journal. Over half thought they would have to make a contribution or pay the full cost of an author charge (56%, 262/468). Conclusions: The survey yielded useful information about respondents' knowledge and perceptions of these publishing models. Authors have limited familiarity with the concept of open-access publishing and surrounding issues. Currently, open access policies have little impact on authors' decision of where to submit papers. PMID:16508053
Achieving open access to conservation science.
Fuller, Richard A; Lee, Jasmine R; Watson, James E M
2014-12-01
Conservation science is a crisis discipline in which the results of scientific enquiry must be made available quickly to those implementing management. We assessed the extent to which scientific research published since the year 2000 in 20 conservation science journals is publicly available. Of the 19,207 papers published, 1,667 (8.68%) are freely downloadable from an official repository. Moreover, only 938 papers (4.88%) meet the standard definition of open access in which material can be freely reused providing attribution to the authors is given. This compares poorly with a comparable set of 20 evolutionary biology journals, where 31.93% of papers are freely downloadable and 7.49% are open access. Seventeen of the 20 conservation journals offer an open access option, but fewer than 5% of the papers are available through open access. The cost of accessing the full body of conservation science runs into tens of thousands of dollars per year for institutional subscribers, and many conservation practitioners cannot access pay-per-view science through their workplace. However, important initiatives such as Research4Life are making science available to organizations in developing countries. We urge authors of conservation science to pay for open access on a per-article basis or to choose publication in open access journals, taking care to ensure the license allows reuse for any purpose providing attribution is given. Currently, it would cost $51 million to make all conservation science published since 2000 freely available by paying the open access fees currently levied to authors. Publishers of conservation journals might consider more cost effective models for open access and conservation-oriented organizations running journals could consider a broader range of options for open access to nonmembers such as sponsorship of open access via membership fees. © 2014 The Authors. Conservation Biology published by Wiley Periodicals, Inc., on behalf of the Society for Conservation Biology.
Assembling proteomics data as a prerequisite for the analysis of large scale experiments
Schmidt, Frank; Schmid, Monika; Thiede, Bernd; Pleißner, Klaus-Peter; Böhme, Martina; Jungblut, Peter R
2009-01-01
Background Despite the complete determination of the genome sequence of a huge number of bacteria, their proteomes remain relatively poorly defined. Beside new methods to increase the number of identified proteins new database applications are necessary to store and present results of large- scale proteomics experiments. Results In the present study, a database concept has been developed to address these issues and to offer complete information via a web interface. In our concept, the Oracle based data repository system SQL-LIMS plays the central role in the proteomics workflow and was applied to the proteomes of Mycobacterium tuberculosis, Helicobacter pylori, Salmonella typhimurium and protein complexes such as 20S proteasome. Technical operations of our proteomics labs were used as the standard for SQL-LIMS template creation. By means of a Java based data parser, post-processed data of different approaches, such as LC/ESI-MS, MALDI-MS and 2-D gel electrophoresis (2-DE), were stored in SQL-LIMS. A minimum set of the proteomics data were transferred in our public 2D-PAGE database using a Java based interface (Data Transfer Tool) with the requirements of the PEDRo standardization. Furthermore, the stored proteomics data were extractable out of SQL-LIMS via XML. Conclusion The Oracle based data repository system SQL-LIMS played the central role in the proteomics workflow concept. Technical operations of our proteomics labs were used as standards for SQL-LIMS templates. Using a Java based parser, post-processed data of different approaches such as LC/ESI-MS, MALDI-MS and 1-DE and 2-DE were stored in SQL-LIMS. Thus, unique data formats of different instruments were unified and stored in SQL-LIMS tables. Moreover, a unique submission identifier allowed fast access to all experimental data. This was the main advantage compared to multi software solutions, especially if personnel fluctuations are high. Moreover, large scale and high-throughput experiments must be managed in a comprehensive repository system such as SQL-LIMS, to query results in a systematic manner. On the other hand, these database systems are expensive and require at least one full time administrator and specialized lab manager. Moreover, the high technical dynamics in proteomics may cause problems to adjust new data formats. To summarize, SQL-LIMS met the requirements of proteomics data handling especially in skilled processes such as gel-electrophoresis or mass spectrometry and fulfilled the PSI standardization criteria. The data transfer into a public domain via DTT facilitated validation of proteomics data. Additionally, evaluation of mass spectra by post-processing using MS-Screener improved the reliability of mass analysis and prevented storage of data junk. PMID:19166578
ERIC Educational Resources Information Center
Armbruster, Chris
2008-01-01
Open source, open content and open access are set to fundamentally alter the conditions of knowledge production and distribution. Open source, open content and open access are also the most tangible result of the shift towards e-science and digital networking. Yet, widespread misperceptions exist about the impact of this shift on knowledge…
SearchGUI: An open-source graphical user interface for simultaneous OMSSA and X!Tandem searches.
Vaudel, Marc; Barsnes, Harald; Berven, Frode S; Sickmann, Albert; Martens, Lennart
2011-03-01
The identification of proteins by mass spectrometry is a standard technique in the field of proteomics, relying on search engines to perform the identifications of the acquired spectra. Here, we present a user-friendly, lightweight and open-source graphical user interface called SearchGUI (http://searchgui.googlecode.com), for configuring and running the freely available OMSSA (open mass spectrometry search algorithm) and X!Tandem search engines simultaneously. Freely available under the permissible Apache2 license, SearchGUI is supported on Windows, Linux and OSX. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A Matter of Discipline: Open Access, the Humanities, and Art History
ERIC Educational Resources Information Center
Tomlin, Patrick
2009-01-01
Recent events suggest that open access has gained new momentum in the humanities, but the slow and uneven development of open-access initiatives in humanist fields continues to hinder the consolidation of efforts across the university. Although various studies have traced the general origins of the humanities' reticence to embrace open access, few…
Antibody Portal | Office of Cancer Clinical Proteomics Research
Central to reproducibility in biomedical research is being able to use well-characterized and defined reagents. The CPTAC Antibody Portal serves as a National Cancer Institute (NCI) community resource that provides access to a large number of standardized renewable affinity reagents (to cancer-associated targets) and accompanying characterization data.
The SciELO Open Access: A Gold Way from the South
ERIC Educational Resources Information Center
Packer, Abel L.
2009-01-01
Open access has long emphasized access to scholarly materials. However, open access can also mean access to the means of producing visible and recognized journals. This issue is particularly important in developing and emergent countries. The SciELO (Scientific Electronic Library On-line) project, first started in Brazil and, shortly afterward, in…
Databases and Electronic Resources - Betty Petersen Memorial Library
of NOAA-Wide and Open Access Databases on the NOAA Central Library website. American Meteorological to a nonfederal website. Open Science Directory Open Science Directory contains collections of Open Access Journals (e.g. Directory of Open Access Journals) and journals in the special programs (Hinari
Oh, Phil; Borgström, Per; Witkiewicz, Halina; Li, Yan; Borgström, Bengt J; Chrastina, Adrian; Iwata, Koji; Zinn, Kurt R; Baldwin, Richard; Testa, Jacqueline E; Schnitzer, Jan E
2007-03-01
How effectively and quickly endothelial caveolae can transcytose in vivo is unknown, yet critical for understanding their function and potential clinical utility. Here we use quantitative proteomics to identify aminopeptidase P (APP) concentrated in caveolae of lung endothelium. Electron microscopy confirms this and shows that APP antibody targets nanoparticles to caveolae. Dynamic intravital fluorescence microscopy reveals that targeted caveolae operate effectively as pumps, moving antibody within seconds from blood across endothelium into lung tissue, even against a concentration gradient. This active transcytosis requires normal caveolin-1 expression. Whole body gamma-scintigraphic imaging shows rapid, specific delivery into lung well beyond that achieved by standard vascular targeting. This caveolar trafficking in vivo may underscore a key physiological mechanism for selective transvascular exchange and may provide an enhanced delivery system for imaging agents, drugs, gene-therapy vectors and nanomedicines. 'In vivo proteomic imaging' as described here integrates organellar proteomics with multiple imaging techniques to identify an accessible target space that includes the transvascular pumping space of the caveola.
ERIC Educational Resources Information Center
Grandbois, Jennifer; Beheshti, Jamshid
2014-01-01
Introduction: This study aims to gain a greater understanding of the development of open access practices amongst library and information science authors, since their role is integral to the success of the broader open access movement. Method: Data were collected from scholarly articles about open access by library and information science authors…
Generation of comprehensive thoracic oncology database--tool for translational research.
Surati, Mosmi; Robinson, Matthew; Nandi, Suvobroto; Faoro, Leonardo; Demchuk, Carley; Kanteti, Rajani; Ferguson, Benjamin; Gangadhar, Tara; Hensing, Thomas; Hasina, Rifat; Husain, Aliya; Ferguson, Mark; Karrison, Theodore; Salgia, Ravi
2011-01-22
The Thoracic Oncology Program Database Project was created to serve as a comprehensive, verified, and accessible repository for well-annotated cancer specimens and clinical data to be available to researchers within the Thoracic Oncology Research Program. This database also captures a large volume of genomic and proteomic data obtained from various tumor tissue studies. A team of clinical and basic science researchers, a biostatistician, and a bioinformatics expert was convened to design the database. Variables of interest were clearly defined and their descriptions were written within a standard operating manual to ensure consistency of data annotation. Using a protocol for prospective tissue banking and another protocol for retrospective banking, tumor and normal tissue samples from patients consented to these protocols were collected. Clinical information such as demographics, cancer characterization, and treatment plans for these patients were abstracted and entered into an Access database. Proteomic and genomic data have been included in the database and have been linked to clinical information for patients described within the database. The data from each table were linked using the relationships function in Microsoft Access to allow the database manager to connect clinical and laboratory information during a query. The queried data can then be exported for statistical analysis and hypothesis generation.
An emerging cyberinfrastructure for biodefense pathogen and pathogen-host data.
Zhang, C; Crasta, O; Cammer, S; Will, R; Kenyon, R; Sullivan, D; Yu, Q; Sun, W; Jha, R; Liu, D; Xue, T; Zhang, Y; Moore, M; McGarvey, P; Huang, H; Chen, Y; Zhang, J; Mazumder, R; Wu, C; Sobral, B
2008-01-01
The NIAID-funded Biodefense Proteomics Resource Center (RC) provides storage, dissemination, visualization and analysis capabilities for the experimental data deposited by seven Proteomics Research Centers (PRCs). The data and its publication is to support researchers working to discover candidates for the next generation of vaccines, therapeutics and diagnostics against NIAID's Category A, B and C priority pathogens. The data includes transcriptional profiles, protein profiles, protein structural data and host-pathogen protein interactions, in the context of the pathogen life cycle in vivo and in vitro. The database has stored and supported host or pathogen data derived from Bacillus, Brucella, Cryptosporidium, Salmonella, SARS, Toxoplasma, Vibrio and Yersinia, human tissue libraries, and mouse macrophages. These publicly available data cover diverse data types such as mass spectrometry, yeast two-hybrid (Y2H), gene expression profiles, X-ray and NMR determined protein structures and protein expression clones. The growing database covers over 23 000 unique genes/proteins from different experiments and organisms. All of the genes/proteins are annotated and integrated across experiments using UniProt Knowledgebase (UniProtKB) accession numbers. The web-interface for the database enables searching, querying and downloading at the level of experiment, group and individual gene(s)/protein(s) via UniProtKB accession numbers or protein function keywords. The system is accessible at http://www.proteomicsresource.org/.
The Reactome pathway knowledgebase
Croft, David; Mundo, Antonio Fabregat; Haw, Robin; Milacic, Marija; Weiser, Joel; Wu, Guanming; Caudy, Michael; Garapati, Phani; Gillespie, Marc; Kamdar, Maulik R.; Jassal, Bijay; Jupe, Steven; Matthews, Lisa; May, Bruce; Palatnik, Stanislav; Rothfels, Karen; Shamovsky, Veronica; Song, Heeyeon; Williams, Mark; Birney, Ewan; Hermjakob, Henning; Stein, Lincoln; D'Eustachio, Peter
2014-01-01
Reactome (http://www.reactome.org) is a manually curated open-source open-data resource of human pathways and reactions. The current version 46 describes 7088 human proteins (34% of the predicted human proteome), participating in 6744 reactions based on data extracted from 15 107 research publications with PubMed links. The Reactome Web site and analysis tool set have been completely redesigned to increase speed, flexibility and user friendliness. The data model has been extended to support annotation of disease processes due to infectious agents and to mutation. PMID:24243840
The Reactome pathway knowledgebase.
Croft, David; Mundo, Antonio Fabregat; Haw, Robin; Milacic, Marija; Weiser, Joel; Wu, Guanming; Caudy, Michael; Garapati, Phani; Gillespie, Marc; Kamdar, Maulik R; Jassal, Bijay; Jupe, Steven; Matthews, Lisa; May, Bruce; Palatnik, Stanislav; Rothfels, Karen; Shamovsky, Veronica; Song, Heeyeon; Williams, Mark; Birney, Ewan; Hermjakob, Henning; Stein, Lincoln; D'Eustachio, Peter
2014-01-01
Reactome (http://www.reactome.org) is a manually curated open-source open-data resource of human pathways and reactions. The current version 46 describes 7088 human proteins (34% of the predicted human proteome), participating in 6744 reactions based on data extracted from 15 107 research publications with PubMed links. The Reactome Web site and analysis tool set have been completely redesigned to increase speed, flexibility and user friendliness. The data model has been extended to support annotation of disease processes due to infectious agents and to mutation.
Grobei, Monica A.; Qeli, Ermir; Brunner, Erich; Rehrauer, Hubert; Zhang, Runxuan; Roschitzki, Bernd; Basler, Konrad; Ahrens, Christian H.; Grossniklaus, Ueli
2009-01-01
Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch requires relevant proteins to be stored in the mature pollen, where they have to retain functionality in a desiccated environment. Using a shotgun proteomics approach, we unambiguously identified ∼3500 proteins in Arabidopsis pollen, including 537 proteins that were not identified in genetic or transcriptomic studies. To generate this comprehensive reference data set, which extends the previously reported pollen proteome by a factor of 13, we developed a novel deterministic peptide classification scheme for protein inference. This generally applicable approach considers the gene model–protein sequence–protein accession relationships. It allowed us to classify and eliminate ambiguities inherently associated with any shotgun proteomics data set, to report a conservative list of protein identifications, and to seamlessly integrate data from previous transcriptomics studies. Manual validation of proteins unambiguously identified by a single, information-rich peptide enabled us to significantly reduce the false discovery rate, while keeping valuable identifications of shorter and lower abundant proteins. Bioinformatic analyses revealed a higher stability of pollen proteins compared to those of other tissues and implied a protein family of previously unknown function in vesicle trafficking. Interestingly, the pollen proteome is most similar to that of seeds, indicating physiological similarities between these developmentally distinct tissues. PMID:19546170
Cuschieri, Sarah
2018-06-01
Academics have a duty towards peers and scholars alike to engage in research work and to publish their findings. This also assists in establishing personal academic success as well as the attainment of research grants. In the past, authors used to publish their research articles for free but access to these articles was restricted to subscription users only. Recently, open access publishing has gained momentum, whereby such articles are made freely accessible online. However open access publishing comes with a price tag for the author through article processing charges. Open access may also question a journal's credibility within the academic world if improperly implemented. This is particularly so following the unsolicited bombardment of researchers' email accounts with invitations for submissions to predatory open access journals. For these reasons, authors needs to rigorously weigh the pros and cons of whether to choose a subscription based or an open access journal for publication. Copyright © 2018 Elsevier B.V. All rights reserved.
Automated selected reaction monitoring software for accurate label-free protein quantification.
Teleman, Johan; Karlsson, Christofer; Waldemarson, Sofia; Hansson, Karin; James, Peter; Malmström, Johan; Levander, Fredrik
2012-07-06
Selected reaction monitoring (SRM) is a mass spectrometry method with documented ability to quantify proteins accurately and reproducibly using labeled reference peptides. However, the use of labeled reference peptides becomes impractical if large numbers of peptides are targeted and when high flexibility is desired when selecting peptides. We have developed a label-free quantitative SRM workflow that relies on a new automated algorithm, Anubis, for accurate peak detection. Anubis efficiently removes interfering signals from contaminating peptides to estimate the true signal of the targeted peptides. We evaluated the algorithm on a published multisite data set and achieved results in line with manual data analysis. In complex peptide mixtures from whole proteome digests of Streptococcus pyogenes we achieved a technical variability across the entire proteome abundance range of 6.5-19.2%, which was considerably below the total variation across biological samples. Our results show that the label-free SRM workflow with automated data analysis is feasible for large-scale biological studies, opening up new possibilities for quantitative proteomics and systems biology.
Building high-quality assay libraries for targeted analysis of SWATH MS data.
Schubert, Olga T; Gillet, Ludovic C; Collins, Ben C; Navarro, Pedro; Rosenberger, George; Wolski, Witold E; Lam, Henry; Amodei, Dario; Mallick, Parag; MacLean, Brendan; Aebersold, Ruedi
2015-03-01
Targeted proteomics by selected/multiple reaction monitoring (S/MRM) or, on a larger scale, by SWATH (sequential window acquisition of all theoretical spectra) MS (mass spectrometry) typically relies on spectral reference libraries for peptide identification. Quality and coverage of these libraries are therefore of crucial importance for the performance of the methods. Here we present a detailed protocol that has been successfully used to build high-quality, extensive reference libraries supporting targeted proteomics by SWATH MS. We describe each step of the process, including data acquisition by discovery proteomics, assertion of peptide-spectrum matches (PSMs), generation of consensus spectra and compilation of MS coordinates that uniquely define each targeted peptide. Crucial steps such as false discovery rate (FDR) control, retention time normalization and handling of post-translationally modified peptides are detailed. Finally, we show how to use the library to extract SWATH data with the open-source software Skyline. The protocol takes 2-3 d to complete, depending on the extent of the library and the computational resources available.
jmzTab: a java interface to the mzTab data standard.
Xu, Qing-Wei; Griss, Johannes; Wang, Rui; Jones, Andrew R; Hermjakob, Henning; Vizcaíno, Juan Antonio
2014-06-01
mzTab is the most recent standard format developed by the Proteomics Standards Initiative. mzTab is a flexible tab-delimited file that can capture identification and quantification results coming from MS-based proteomics and metabolomics approaches. We here present an open-source Java application programming interface for mzTab called jmzTab. The software allows the efficient processing of mzTab files, providing read and write capabilities, and is designed to be embedded in other software packages. The second key feature of the jmzTab model is that it provides a flexible framework to maintain the logical integrity between the metadata and the table-based sections in the mzTab files. In this article, as two example implementations, we also describe two stand-alone tools that can be used to validate mzTab files and to convert PRIDE XML files to mzTab. The library is freely available at http://mztab.googlecode.com. © 2014 The Authors PROTEOMICS Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
A TRPV2 interactome-based signature for prognosis in glioblastoma patients.
Doñate-Macián, Pau; Gómez, Antonio; Dégano, Irene R; Perálvarez-Marín, Alex
2018-04-06
Proteomics aids to the discovery and expansion of protein-protein interaction networks, which are key to understand molecular mechanisms in physiology and physiopathology, but also to infer protein function in a guilt-by-association fashion. In this study we use a systematic protein-protein interaction membrane yeast two-hybrid method to expand the interactome of TRPV2, a cation channel related to nervous system development. After validation of the interactome in silico , we define a TRPV2-interactome signature combining proteomics with the available physio-pathological data in Disgenet to find interactome-disease associations, highlighting nervous system disorders and neoplasms. The TRPV2-interactome signature against available experimental data is capable of discriminating overall risk in glioblastoma multiforme prognosis, progression, recurrence, and chemotherapy resistance. Beyond the impact on glioblastoma physiopathology, this study shows that combining systematic proteomics with in silico methods and available experimental data is key to open new perspectives to define novel biomarkers for diagnosis, prognosis and therapeutics in disease.
A TRPV2 interactome-based signature for prognosis in glioblastoma patients
Dégano, Irene R.; Perálvarez-Marín, Alex
2018-01-01
Proteomics aids to the discovery and expansion of protein-protein interaction networks, which are key to understand molecular mechanisms in physiology and physiopathology, but also to infer protein function in a guilt-by-association fashion. In this study we use a systematic protein-protein interaction membrane yeast two-hybrid method to expand the interactome of TRPV2, a cation channel related to nervous system development. After validation of the interactome in silico, we define a TRPV2-interactome signature combining proteomics with the available physio-pathological data in Disgenet to find interactome-disease associations, highlighting nervous system disorders and neoplasms. The TRPV2-interactome signature against available experimental data is capable of discriminating overall risk in glioblastoma multiforme prognosis, progression, recurrence, and chemotherapy resistance. Beyond the impact on glioblastoma physiopathology, this study shows that combining systematic proteomics with in silico methods and available experimental data is key to open new perspectives to define novel biomarkers for diagnosis, prognosis and therapeutics in disease. PMID:29719613
Bilić, Petra; Guillemin, Nicolas; Kovačević, Alan; Beer Ljubić, Blanka; Jović, Ines; Galan, Asier; Eckersall, Peter David; Burchmore, Richard; Mrljak, Vladimir
2018-05-15
Idiopathic dilated cardiomyopathy (iDCM) is a primary myocardial disorder with an unknown aetiology, characterized by reduced contractility and ventricular dilation of the left or both ventricles. Naturally occurring canine iDCM was used herein to identify serum proteomic signature of the disease compared to the healthy state, providing an insight into underlying mechanisms and revealing proteins with biomarker potential. To achieve this, we used high-throughput label-based quantitative LC-MS/MS proteomics approach and bioinformatics analysis of the in silico inferred interactome protein network created from the initial list of differential proteins. To complement the proteomic analysis, serum biochemical parameters and levels of know biomarkers of cardiac function were measured. Several proteins with biomarker potential were identified, such as inter-alpha-trypsin inhibitor heavy chain H4, microfibril-associated glycoprotein 4 and apolipoprotein A-IV, which were validated using an independent method (Western blotting) and showed high specificity and sensitivity according to the receiver operating characteristic curve analysis. Bioinformatics analysis revealed involvement of different pathways in iDCM, such as complement cascade activation, lipoprotein particles dynamics, elastic fibre formation, GPCR signalling and respiratory electron transport chain. Idiopathic dilated cardiomyopathy is a severe primary myocardial disease of unknown cause, affecting both humans and dogs. This study is a contribution to the canine heart disease research by means of proteomic and bioinformatic state of the art analyses, following similar approach in human iDCM research. Importantly, we used serum as non-invasive and easily accessible biological source of information and contributed to the scarce data on biofluid proteome research on this topic. Bioinformatics analysis revealed biological pathways modulated in canine iDCM with potential of further targeted research. Also, several proteins with biomarker potential have been identified and successfully validated. Copyright © 2018 Elsevier B.V. All rights reserved.
Albar, Juan Pablo; Binz, Pierre-Alain; Eisenacher, Martin; Jones, Andrew R; Mayer, Gerhard; Omenn, Gilbert S; Orchard, Sandra; Vizcaíno, Juan Antonio; Hermjakob, Henning
2015-01-01
Objective To describe the goals of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization, the methods that the PSI has employed to create data standards, the resulting output of the PSI, lessons learned from the PSI’s evolution, and future directions and synergies for the group. Materials and Methods The PSI has 5 categories of deliverables that have guided the group. These are minimum information guidelines, data formats, controlled vocabularies, resources and software tools, and dissemination activities. These deliverables are produced via the leadership and working group organization of the initiative, driven by frequent workshops and ongoing communication within the working groups. Official standards are subjected to a rigorous document process that includes several levels of peer review prior to release. Results We have produced and published minimum information guidelines describing what information should be provided when making data public, either via public repositories or other means. The PSI has produced a series of standard formats covering mass spectrometer input, mass spectrometer output, results of informatics analysis (both qualitative and quantitative analyses), reports of molecular interaction data, and gel electrophoresis analyses. We have produced controlled vocabularies that ensure that concepts are uniformly annotated in the formats and engaged in extensive software development and dissemination efforts so that the standards can efficiently be used by the community. Conclusion In its first dozen years of operation, the PSI has produced many standards that have accelerated the field of proteomics by facilitating data exchange and deposition to data repositories. We look to the future to continue developing standards for new proteomics technologies and workflows and mechanisms for integration with other omics data types. Our products facilitate the translation of genomics and proteomics findings to clinical and biological phenotypes. The PSI website can be accessed at http://www.psidev.info. PMID:25726569
Usage Trends of Open Access and Local Journals: A Korean Case Study.
Seo, Jeong-Wook; Chung, Hosik; Yun, Jungmin; Park, Jin Young; Park, Eunsun; Ahn, Yuri
2016-01-01
Articles from open access and local journals are important resources for research in Korea and the usage trends of these articles are important indicators for the assessment of the current research practice. We analyzed an institutional collection of published papers from 1998 to 2014 authored by researchers from Seoul National University, and their references from papers published between 1998 and 2011. The published papers were collected from Web of Science or Scopus and were analyzed according to the proportion of articles from open access journals. Their cited references from published papers in Web of Science were analyzed according to the proportion of local (South Korean) or open access journals. The proportion of open access papers was relatively stable until 2006 (2.5 ~ 5.2% in Web of Science and 2.7 ~ 4.2% in Scopus), but then increased to 15.9% (Web of Science) or 18.5% (Scopus) in 2014. We analyzed 2,750,485 cited references from 52,295 published papers. We found that the overall proportion of cited articles from local journals was 1.8% and that for open access journals was 3.0%. Citations of open access articles have increased since 2006 to 4.1% in 2011, although the increase in open access article citations was less than for open access publications. The proportion of citations from local journals was even lower. We think that the publishing / citing mismatch is a term to describe this difference, which is an issue at Seoul National University, where the number of published papers at open access or local journals is increasing but the number of citations is not. The cause of this discrepancy is multi-factorial but the governmental / institutional policies, social / cultural issues and authors' citing behaviors will explain the mismatch. However, additional measures are also necessary, such as the development of an institutional citation database and improved search capabilities with respect to local and open access documents.
Usage Trends of Open Access and Local Journals: A Korean Case Study
Chung, Hosik; Yun, Jungmin; Park, Jin Young; Park, Eunsun; Ahn, Yuri
2016-01-01
Articles from open access and local journals are important resources for research in Korea and the usage trends of these articles are important indicators for the assessment of the current research practice. We analyzed an institutional collection of published papers from 1998 to 2014 authored by researchers from Seoul National University, and their references from papers published between 1998 and 2011. The published papers were collected from Web of Science or Scopus and were analyzed according to the proportion of articles from open access journals. Their cited references from published papers in Web of Science were analyzed according to the proportion of local (South Korean) or open access journals. The proportion of open access papers was relatively stable until 2006 (2.5 ~ 5.2% in Web of Science and 2.7 ~ 4.2% in Scopus), but then increased to 15.9% (Web of Science) or 18.5% (Scopus) in 2014. We analyzed 2,750,485 cited references from 52,295 published papers. We found that the overall proportion of cited articles from local journals was 1.8% and that for open access journals was 3.0%. Citations of open access articles have increased since 2006 to 4.1% in 2011, although the increase in open access article citations was less than for open access publications. The proportion of citations from local journals was even lower. We think that the publishing / citing mismatch is a term to describe this difference, which is an issue at Seoul National University, where the number of published papers at open access or local journals is increasing but the number of citations is not. The cause of this discrepancy is multi-factorial but the governmental / institutional policies, social / cultural issues and authors' citing behaviors will explain the mismatch. However, additional measures are also necessary, such as the development of an institutional citation database and improved search capabilities with respect to local and open access documents. PMID:27195948
18 CFR 35.28 - Non-discriminatory open access transmission tariff.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 18 Conservation of Power and Water Resources 1 2012-04-01 2012-04-01 false Non-discriminatory open... AND TARIFFS Other Filing Requirements § 35.28 Non-discriminatory open access transmission tariff. (a... concerns regarding unnecessary market inefficiencies. (c) Non-discriminatory open access transmission...
Wagner, Mary Ann; Eschenbrenner, Michel; Horn, Troy A; Kraycer, Jo Ann; Mujer, Cesar V; Hagius, Sue; Elzer, Philip; DelVecchio, Vito G
2002-08-01
Brucella melitensis is a facultative intracellular bacterial pathogen that causes brucellosis, a zoonotic disease primarily infecting sheep and goats, characterized by undulant fever, arthritic pain and other neurological disorders in humans. A comprehensive proteomic study of strain 16M was conducted to identify and characterize the proteins expressed in laboratory-grown culture. Using overlapping narrow range immobilized pH gradient strips for two-dimensional gel electrophoresis, 883 protein spots were detected between pH 3.5 and 11. The average isoelectric point and molecular weight values of the detected spots were 5.22 and 46.5 kDa, respectively. Of the 883 observed protein spots, 440 have been identified by matrix-assisted laser desorption/ionization-mass spectrometry. These proteins represent 187 discrete open reading frames (ORFs) or 6% of the predicted 3197 ORFs contained in the genome. The corresponding ORFs of the identified proteins are distributed evenly between each of the two circular B. melitensis chromosomes, indicating that both replicons are functionally active. The presented proteome map lists those protein spots identified to date in this study. This map may serve as a baseline reference for future proteomic studies aimed at the definition of biochemical pathways associated with stress responses, host specificity, pathogenicity and virulence. It will also assist in characterization of global proteomic effects in gene-knockout mutants. Ultimately, it may aid in our overall understanding of the cell biology of B. melitensis, an important bacterial pathogen.
A comprehensive and scalable database search system for metaproteomics.
Chatterjee, Sandip; Stupp, Gregory S; Park, Sung Kyu Robin; Ducom, Jean-Christophe; Yates, John R; Su, Andrew I; Wolan, Dennis W
2016-08-16
Mass spectrometry-based shotgun proteomics experiments rely on accurate matching of experimental spectra against a database of protein sequences. Existing computational analysis methods are limited in the size of their sequence databases, which severely restricts the proteomic sequencing depth and functional analysis of highly complex samples. The growing amount of public high-throughput sequencing data will only exacerbate this problem. We designed a broadly applicable metaproteomic analysis method (ComPIL) that addresses protein database size limitations. Our approach to overcome this significant limitation in metaproteomics was to design a scalable set of sequence databases assembled for optimal library querying speeds. ComPIL was integrated with a modified version of the search engine ProLuCID (termed "Blazmass") to permit rapid matching of experimental spectra. Proof-of-principle analysis of human HEK293 lysate with a ComPIL database derived from high-quality genomic libraries was able to detect nearly all of the same peptides as a search with a human database (~500x fewer peptides in the database), with a small reduction in sensitivity. We were also able to detect proteins from the adenovirus used to immortalize these cells. We applied our method to a set of healthy human gut microbiome proteomic samples and showed a substantial increase in the number of identified peptides and proteins compared to previous metaproteomic analyses, while retaining a high degree of protein identification accuracy and allowing for a more in-depth characterization of the functional landscape of the samples. The combination of ComPIL with Blazmass allows proteomic searches to be performed with database sizes much larger than previously possible. These large database searches can be applied to complex meta-samples with unknown composition or proteomic samples where unexpected proteins may be identified. The protein database, proteomic search engine, and the proteomic data files for the 5 microbiome samples characterized and discussed herein are open source and available for use and additional analysis.
Thiele, H.; Glandorf, J.; Koerting, G.; Reidegeld, K.; Blüggel, M.; Meyer, H.; Stephan, C.
2007-01-01
In today’s proteomics research, various techniques and instrumentation bioinformatics tools are necessary to manage the large amount of heterogeneous data with an automatic quality control to produce reliable and comparable results. Therefore a data-processing pipeline is mandatory for data validation and comparison in a data-warehousing system. The proteome bioinformatics platform ProteinScape has been proven to cover these needs. The reprocessing of HUPO BPP participants’ MS data was done within ProteinScape. The reprocessed information was transferred into the global data repository PRIDE. ProteinScape as a data-warehousing system covers two main aspects: archiving relevant data of the proteomics workflow and information extraction functionality (protein identification, quantification and generation of biological knowledge). As a strategy for automatic data validation, different protein search engines are integrated. Result analysis is performed using a decoy database search strategy, which allows the measurement of the false-positive identification rate. Peptide identifications across different workflows, different MS techniques, and different search engines are merged to obtain a quality-controlled protein list. The proteomics identifications database (PRIDE), as a public data repository, is an archiving system where data are finally stored and no longer changed by further processing steps. Data submission to PRIDE is open to proteomics laboratories generating protein and peptide identifications. An export tool has been developed for transferring all relevant HUPO BPP data from ProteinScape into PRIDE using the PRIDE.xml format. The EU-funded ProDac project will coordinate the development of software tools covering international standards for the representation of proteomics data. The implementation of data submission pipelines and systematic data collection in public standards–compliant repositories will cover all aspects, from the generation of MS data in each laboratory to the conversion of all the annotating information and identifications to a standardized format. Such datasets can be used in the course of publishing in scientific journals.
Jones, K.; Kim, K.; Patel, B.; Kelsen, S.; Braverman, A.; Swinton, D.; Gafken, P.; Jones, L.; Lane, W.; Neveu, J.; Leung, H.; Shaffer, S.; Leszyk, J.; Stanley, B.; Fox, T.; Stanley, A.; Yeung, Anthony
2013-01-01
Proteomic research can benefit from simultaneous access to multiple cutting-edge mass spectrometers. 18 core facilities responded to our investigators seeking service through the ABRF Discussion Forum. Five of the facilities selected completed four plasma proteomics experiments as routine fee-for-service. Each biological experiment entailed an iTRAQ 4-plex proteome comparison of immunodepleted plasma provided as 30 labeled-peptide fractions. Identical samples were analyzed by two AB SCIEX TripleTOF 5600 and three Thermo Orbitrap (Elite/Velos Pro/Q Exactive) instruments. 480 LC-MS/MS runs delivered >250 GB of data over two months. We compare herein routine service analyses of three peptide fractions of different peptide abundance. Data files from each instrument were studied to develop optimal analysis parameters to compare with default parameters in Mascot Distiller 2.4, ProteinPilot 4.5 beta, AB Sciex MS Data Converter 1.3 beta, and Proteome Discover 1.3. Peak-picking for TripleTOFs was best by ProteinPilot 4.5 beta while Mascot Distiller and Proteome Discoverer were comparable for the Orbitraps. We compared protein identification and quantitation in SwissProt 2012_07 database by Mascot Server 2.4.01 versus ProteinPilot. By all search methods, more proteins, up to two fold, were identified using the Q Exactive than others. Q Exactive excelled also at the number of unique significant peptide ion sequences. However, software-dependent impact on subsequent interpretation, due to peptide modifications, can be critical. These findings may have special implications for iTRAQ plasma proteomics. For the low abundance peptide ions, the slope of the dynamic range drop-off in the plasma proteome is uniquely sharp compared with cell lysates. Our study provides data for testable improvements in the operation of these mass spectrometers. More importantly, we have demonstrated a new affordable expedient workflow for investigators to perform proteomic experiments through the ABRF infrastructure. (We acknowledge John Cottrell for optimizing the peak-picking parameters for Mascot Distiller).
75 FR 32937 - Combined Notice of Filings #1
Federal Register 2010, 2011, 2012, 2013, 2014
2010-06-10
... proposed amendments to its Open Access Transmission, Energy and Operating Reserve Markets Tariff. Filed... Interconnection, LLC submits the revised Open Access Tariff. Filed Date: 05/27/2010. Accession Number: 20100527... proposed revisions to its FERC Open Access Transmission Tariff to be effective 6/1/10. Filed Date: 05/27...
50 CFR 660.310 - Purpose and scope.
Code of Federal Regulations, 2011 CFR
2011-10-01
... Access Fisheries § 660.310 Purpose and scope. This subpart covers the Pacific Coast Groundfish open access fishery. The open access fishery, as defined at § 660.11, Subpart C, is the fishery composed of commercial vessels using open access gear fished pursuant to the harvest guidelines, quotas, and other...
50 CFR 648.15 - Facilitation of enforcement.
Code of Federal Regulations, 2010 CFR
2010-10-01
... ocean quahog open access permitted vessels. Vessel owners or operators issued an open access surfclam or ocean quahog open access permit for fishing in the ITQ Program, as specified at § 648.70, are required... limited access permitted vessels. Beginning January 1, 2009, vessel owners or operators issued a limited...
50 CFR 660.24 - Limited entry and open access fisheries.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 50 Wildlife and Fisheries 13 2013-10-01 2013-10-01 false Limited entry and open access fisheries... Groundfish Fisheries § 660.24 Limited entry and open access fisheries. (a) General. All commercial fishing for groundfish must be conducted in accordance with the regulations governing limited entry and open...
Boja, Emily S; Rodriguez, Henry
2012-04-01
Traditional shotgun proteomics used to detect a mixture of hundreds to thousands of proteins through mass spectrometric analysis, has been the standard approach in research to profile protein content in a biological sample which could lead to the discovery of new (and all) protein candidates with diagnostic, prognostic, and therapeutic values. In practice, this approach requires significant resources and time, and does not necessarily represent the goal of the researcher who would rather study a subset of such discovered proteins (including their variations or posttranslational modifications) under different biological conditions. In this context, targeted proteomics is playing an increasingly important role in the accurate measurement of protein targets in biological samples in the hope of elucidating the molecular mechanism of cellular function via the understanding of intricate protein networks and pathways. One such (targeted) approach, selected reaction monitoring (or multiple reaction monitoring) mass spectrometry (MRM-MS), offers the capability of measuring multiple proteins with higher sensitivity and throughput than shotgun proteomics. Developing and validating MRM-MS-based assays, however, is an extensive and iterative process, requiring a coordinated and collaborative effort by the scientific community through the sharing of publicly accessible data and datasets, bioinformatic tools, standard operating procedures, and well characterized reagents. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic profiling of developing cotton fibers from wild and domesticated Gossypium barbadense.
Hu, Guanjing; Koh, Jin; Yoo, Mi-Jeong; Grupp, Kara; Chen, Sixue; Wendel, Jonathan F
2013-10-01
Pima cotton (Gossypium barbadense) is widely cultivated because of its long, strong seed trichomes ('fibers') used for premium textiles. These agronomically advanced fibers were derived following domestication and thousands of years of human-mediated crop improvement. To gain an insight into fiber development and evolution, we conducted comparative proteomic and transcriptomic profiling of developing fiber from an elite cultivar and a wild accession. Analyses using isobaric tag for relative and absolute quantification (iTRAQ) LC-MS/MS technology identified 1317 proteins in fiber. Of these, 205 were differentially expressed across developmental stages, and 190 showed differential expression between wild and cultivated forms, 14.4% of the proteome sampled. Human selection may have shifted the timing of developmental modules, such that some occur earlier in domesticated than in wild cotton. A novel approach was used to detect possible biased expression of homoeologous copies of proteins. Results indicate a significant partitioning of duplicate gene expression at the protein level, but an approximately equal degree of bias for each of the two constituent genomes of allopolyploid cotton. Our results demonstrate the power of complementary transcriptomic and proteomic approaches for the study of the domestication process. They also provide a rich database for mining for functional analyses of cotton improvement or evolution. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Liu, Ming-Qi; Zeng, Wen-Feng; Fang, Pan; Cao, Wei-Qian; Liu, Chao; Yan, Guo-Quan; Zhang, Yang; Peng, Chao; Wu, Jian-Qiang; Zhang, Xiao-Jin; Tu, Hui-Jun; Chi, Hao; Sun, Rui-Xiang; Cao, Yong; Dong, Meng-Qiu; Jiang, Bi-Yun; Huang, Jiang-Ming; Shen, Hua-Li; Wong, Catherine C L; He, Si-Min; Yang, Peng-Yuan
2017-09-05
The precise and large-scale identification of intact glycopeptides is a critical step in glycoproteomics. Owing to the complexity of glycosylation, the current overall throughput, data quality and accessibility of intact glycopeptide identification lack behind those in routine proteomic analyses. Here, we propose a workflow for the precise high-throughput identification of intact N-glycopeptides at the proteome scale using stepped-energy fragmentation and a dedicated search engine. pGlyco 2.0 conducts comprehensive quality control including false discovery rate evaluation at all three levels of matches to glycans, peptides and glycopeptides, improving the current level of accuracy of intact glycopeptide identification. The N-glycoproteome of samples metabolically labeled with 15 N/ 13 C were analyzed quantitatively and utilized to validate the glycopeptide identification, which could be used as a novel benchmark pipeline to compare different search engines. Finally, we report a large-scale glycoproteome dataset consisting of 10,009 distinct site-specific N-glycans on 1988 glycosylation sites from 955 glycoproteins in five mouse tissues.Protein glycosylation is a heterogeneous post-translational modification that generates greater proteomic diversity that is difficult to analyze. Here the authors describe pGlyco 2.0, a workflow for the precise one step identification of intact N-glycopeptides at the proteome scale.
Due diligence in the open-access explosion era: choosing a reputable journal for publication.
Masten, Yondell; Ashcraft, Alyce
2017-11-15
Faculty are required to publish. Naïve and "in-a-hurry-to-publish" authors seek to publish in journals where manuscripts are rapidly accepted. Others may innocently submit to one of an increasing number of questionable/predatory journals, where predatory is defined as practices of publishing journals for exploitation of author-pays, open-access publication model by charging authors publication fees for publisher profit without provision of expected services (expert peer review, editing, archiving, and indexing published manuscripts) and promising almost instant publication. Authors may intentionally submit manuscripts to predatory journals for rapid publication without concern for journal quality. A brief summary of the open access "movement," suggestions for selecting reputable open access journals, and suggestion for avoiding predatory publishers/journals are described. The purpose is to alert junior and seasoned faculty about predatory publishers included among available open access journal listings. Brief review of open access publication, predatory/questionable journal characteristics, suggestions for selecting reputable open access journals and avoiding predatory publishers/journals are described. Time is required for intentionally performing due diligence in open access journal selection, based on publisher/journal quality, prior to manuscript submission or authors must be able to successfully withdraw manuscripts when submission to a questionable or predatory journal is discovered. © FEMS 2017.
Due diligence in the open-access explosion era: choosing a reputable journal for publication
Ashcraft, Alyce
2017-01-01
Abstract Faculty are required to publish. Naïve and “in-a-hurry-to-publish” authors seek to publish in journals where manuscripts are rapidly accepted. Others may innocently submit to one of an increasing number of questionable/predatory journals, where predatory is defined as practices of publishing journals for exploitation of author-pays, open-access publication model by charging authors publication fees for publisher profit without provision of expected services (expert peer review, editing, archiving, and indexing published manuscripts) and promising almost instant publication. Authors may intentionally submit manuscripts to predatory journals for rapid publication without concern for journal quality. A brief summary of the open access “movement,” suggestions for selecting reputable open access journals, and suggestion for avoiding predatory publishers/journals are described. The purpose is to alert junior and seasoned faculty about predatory publishers included among available open access journal listings. Brief review of open access publication, predatory/questionable journal characteristics, suggestions for selecting reputable open access journals and avoiding predatory publishers/journals are described. Time is required for intentionally performing due diligence in open access journal selection, based on publisher/journal quality, prior to manuscript submission or authors must be able to successfully withdraw manuscripts when submission to a questionable or predatory journal is discovered. PMID:29040536
Maringer, Kevin; Yousuf, Amjad; Heesom, Kate J; Fan, Jun; Lee, David; Fernandez-Sesma, Ana; Bessant, Conrad; Matthews, David A; Davidson, Andrew D
2017-01-19
Aedes aegypti is a vector for the (re-)emerging human pathogens dengue, chikungunya, yellow fever and Zika viruses. Almost half of the Ae. aegypti genome is comprised of transposable elements (TEs). Transposons have been linked to diverse cellular processes, including the establishment of viral persistence in insects, an essential step in the transmission of vector-borne viruses. However, up until now it has not been possible to study the overall proteome derived from an organism's mobile genetic elements, partly due to the highly divergent nature of TEs. Furthermore, as for many non-model organisms, incomplete genome annotation has hampered proteomic studies on Ae. aegypti. We analysed the Ae. aegypti proteome using our new proteomics informed by transcriptomics (PIT) technique, which bypasses the need for genome annotation by identifying proteins through matched transcriptomic (rather than genomic) data. Our data vastly increase the number of experimentally confirmed Ae. aegypti proteins. The PIT analysis also identified hotspots of incomplete genome annotation, and showed that poor sequence and assembly quality do not explain all annotation gaps. Finally, in a proof-of-principle study, we developed criteria for the characterisation of proteomically active TEs. Protein expression did not correlate with a TE's genomic abundance at different levels of classification. Most notably, long terminal repeat (LTR) retrotransposons were markedly enriched compared to other elements. PIT was superior to 'conventional' proteomic approaches in both our transposon and genome annotation analyses. We present the first proteomic characterisation of an organism's repertoire of mobile genetic elements, which will open new avenues of research into the function of transposon proteins in health and disease. Furthermore, our study provides a proof-of-concept that PIT can be used to evaluate a genome's annotation to guide annotation efforts which has the potential to improve the efficiency of annotation projects in non-model organisms. PIT therefore represents a valuable new tool to study the biology of the important vector species Ae. aegypti, including its role in transmitting emerging viruses of global public health concern.
Mühlhaus, Timo; Weiss, Julia; Hemme, Dorothea; Sommer, Frederik; Schroda, Michael
2011-01-01
Crop-plant-yield safety is jeopardized by temperature stress caused by the global climate change. To take countermeasures by breeding and/or transgenic approaches it is essential to understand the mechanisms underlying plant acclimation to heat stress. To this end proteomics approaches are most promising, as acclimation is largely mediated by proteins. Accordingly, several proteomics studies, mainly based on two-dimensional gel-tandem MS approaches, were conducted in the past. However, results often were inconsistent, presumably attributable to artifacts inherent to the display of complex proteomes via two-dimensional-gels. We describe here a new approach to monitor proteome dynamics in time course experiments. This approach involves full 15N metabolic labeling and mass spectrometry based quantitative shotgun proteomics using a uniform 15N standard over all time points. It comprises a software framework, IOMIQS, that features batch job mediated automated peptide identification by four parallelized search engines, peptide quantification and data assembly for the processing of large numbers of samples. We have applied this approach to monitor proteome dynamics in a heat stress time course using the unicellular green alga Chlamydomonas reinhardtii as model system. We were able to identify 3433 Chlamydomonas proteins, of which 1116 were quantified in at least three of five time points of the time course. Statistical analyses revealed that levels of 38 proteins significantly increased, whereas levels of 206 proteins significantly decreased during heat stress. The increasing proteins comprise 25 (co-)chaperones and 13 proteins involved in chromatin remodeling, signal transduction, apoptosis, photosynthetic light reactions, and yet unknown functions. Proteins decreasing during heat stress were significantly enriched in functional categories that mediate carbon flux from CO2 and external acetate into protein biosynthesis, which also correlated with a rapid, but fully reversible cell cycle arrest after onset of stress. Our approach opens up new perspectives for plant systems biology and provides novel insights into plant stress acclimation. PMID:21610104
Access control mechanism of wireless gateway based on open flow
NASA Astrophysics Data System (ADS)
Peng, Rong; Ding, Lei
2017-08-01
In order to realize the access control of wireless gateway and improve the access control of wireless gateway devices, an access control mechanism of SDN architecture which is based on Open vSwitch is proposed. The mechanism utilizes the features of the controller--centralized control and programmable. Controller send access control flow table based on the business logic. Open vSwitch helps achieve a specific access control strategy based on the flow table.
Software Tools | Office of Cancer Clinical Proteomics Research
The CPTAC program develops new approaches to elucidate aspects of the molecular complexity of cancer made from large-scale proteogenomic datasets, and advance them toward precision medicine. Part of the CPTAC mission is to make data and tools available and accessible to the greater research community to accelerate the discovery process.
USDA-ARS?s Scientific Manuscript database
The modern cultivated common bean (Phaseolus vulgaris) has evolved from wild common beans distributed in Central America, Mexico and the Andean region of South America. It has been reported that wild common bean accessions have higher levels of protein content than the domesticated dry bean cultiva...
A cool tool for hot and sour Archaea: proteomics of Sulfolobus solfataricus.
Kort, Julia Christin; Esser, Dominik; Pham, Trong Khoa; Noirel, Josselin; Wright, Phillip C; Siebers, Bettina
2013-10-01
In recent years, much progress has been made in proteomic studies to unravel metabolic pathways and basic cellular processes. This is especially interesting for members of the Archaea, the third domain of life. Archaea exhibit extraordinary features and many of their cultivable representatives are adaptable to extreme environments. Archaea harbor many unique traits besides bacterial attributes, such as size, shape, and DNA structure and eukaryal characteristics like information processing. Sulfolobus solfataricus P2, a thermoacidophilic archaeal representative, is a well-established model organism adapted to low-pH environments (pH 2-3) and high temperatures (80°C). The genome has a size of 3 Mbp and its sequence has been deciphered. Approximately 3033 predicted open reading frames have been identified and the genome is characterized by a great number of diverse insertion sequence elements. In unraveling the organisms' metabolism and lifestyle, proteomic analyses have played a major role. Much effort has been directed at this organism and is reviewed here. With the help of proteomics, unique metabolic pathways were resolved in S. solfataricus, targets for regulatory protein phosphorylation identified, and cellular responses upon virus infection as well as oxidative stress analyzed. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Viglio, Simona; Stolk, Jan; Iadarola, Paolo; Giuliano, Serena; Luisetti, Maurizio; Salvini, Roberta; Fumagalli, Marco; Bardoni, Anna
2014-01-22
To improve the knowledge on a variety of severe disorders, research has moved from the analysis of individual proteins to the investigation of all proteins expressed by a tissue/organism. This global proteomic approach could prove very useful: (i) for investigating the biochemical pathways involved in disease; (ii) for generating hypotheses; or (iii) as a tool for the identification of proteins differentially expressed in response to the disease state. Proteomics has not been used yet in the field of respiratory research as extensively as in other fields, only a few reproducible and clinically applicable molecular markers, which can assist in diagnosis, having been currently identified. The continuous advances in both instrumentation and methodology, which enable sensitive and quantitative proteomic analyses in much smaller amounts of biological material than before, will hopefully promote the identification of new candidate biomarkers in this area. The aim of this report is to critically review the application over the decade 2004-2013 of very sophisticated technologies to the study of respiratory disorders. The observed changes in protein expression profiles from tissues/fluids of patients affected by pulmonary disorders opens the route for the identification of novel pathological mediators of these disorders.
Proteomic Cinderella: Customized analysis of bulky MS/MS data in one night.
Kiseleva, Olga; Poverennaya, Ekaterina; Shargunov, Alexander; Lisitsa, Andrey
2018-02-01
Proteomic challenges, stirred up by the advent of high-throughput technologies, produce large amount of MS data. Nowadays, the routine manual search does not satisfy the "speed" of modern science any longer. In our work, the necessity of single-thread analysis of bulky data emerged during interpretation of HepG2 proteome profiling results for proteoforms searching. We compared the contribution of each of the eight search engines (X!Tandem, MS-GF[Formula: see text], MS Amanda, MyriMatch, Comet, Tide, Andromeda, and OMSSA) integrated in an open-source graphical user interface SearchGUI ( http://searchgui.googlecode.com ) into total result of proteoforms identification and optimized set of engines working simultaneously. We also compared the results of our search combination with Mascot results using protein kit UPS2, containing 48 human proteins. We selected combination of X!Tandem, MS-GF[Formula: see text] and OMMSA as the most time-efficient and productive combination of search. We added homemade java-script to automatize pipeline from file picking to report generation. These settings resulted in rise of the efficiency of our customized pipeline unobtainable by manual scouting: the analysis of 192 files searched against human proteome (42153 entries) downloaded from UniProt took 11[Formula: see text]h.
Cloud CPFP: a shotgun proteomics data analysis pipeline using cloud and high performance computing.
Trudgian, David C; Mirzaei, Hamid
2012-12-07
We have extended the functionality of the Central Proteomics Facilities Pipeline (CPFP) to allow use of remote cloud and high performance computing (HPC) resources for shotgun proteomics data processing. CPFP has been modified to include modular local and remote scheduling for data processing jobs. The pipeline can now be run on a single PC or server, a local cluster, a remote HPC cluster, and/or the Amazon Web Services (AWS) cloud. We provide public images that allow easy deployment of CPFP in its entirety in the AWS cloud. This significantly reduces the effort necessary to use the software, and allows proteomics laboratories to pay for compute time ad hoc, rather than obtaining and maintaining expensive local server clusters. Alternatively the Amazon cloud can be used to increase the throughput of a local installation of CPFP as necessary. We demonstrate that cloud CPFP allows users to process data at higher speed than local installations but with similar cost and lower staff requirements. In addition to the computational improvements, the web interface to CPFP is simplified, and other functionalities are enhanced. The software is under active development at two leading institutions and continues to be released under an open-source license at http://cpfp.sourceforge.net.
Quantitative Proteomics by Metabolic Labeling of Model Organisms*
Gouw, Joost W.; Krijgsveld, Jeroen; Heck, Albert J. R.
2010-01-01
In the biological sciences, model organisms have been used for many decades and have enabled the gathering of a large proportion of our present day knowledge of basic biological processes and their derailments in disease. Although in many of these studies using model organisms, the focus has primarily been on genetics and genomics approaches, it is important that methods become available to extend this to the relevant protein level. Mass spectrometry-based proteomics is increasingly becoming the standard to comprehensively analyze proteomes. An important transition has been made recently by moving from charting static proteomes to monitoring their dynamics by simultaneously quantifying multiple proteins obtained from differently treated samples. Especially the labeling with stable isotopes has proved an effective means to accurately determine differential expression levels of proteins. Among these, metabolic incorporation of stable isotopes in vivo in whole organisms is one of the favored strategies. In this perspective, we will focus on methodologies to stable isotope label a variety of model organisms in vivo, ranging from relatively simple organisms such as bacteria and yeast to Caenorhabditis elegans, Drosophila, and Arabidopsis up to mammals such as rats and mice. We also summarize how this has opened up ways to investigate biological processes at the protein level in health and disease, revealing conservation and variation across the evolutionary tree of life. PMID:19955089
The maxillary molar endodontic access opening: A microscope-based approach
Mamoun, John Sami
2016-01-01
This article reviews the basic clinical techniques of performing a maxillary molar endodontic access opening, starting from the initial access opening into the pulp chamber, to the point where a size #10 file has been advanced to the apices of all three or four (or more) canals. The article explains how the use of the dental surgical operating microscope or microscope-level loupes magnification of ×6–8 or greater, combined with head-mounted or coaxial illumination, improve the ability of a dentist to identify microscopic root canal orifices, which facilitates the efficient creation of conservative access openings with adequate straight-line access in maxillary molars. Magnified photos illustrate various microscopic anatomical structures or landmarks of the initial access opening. Techniques are explored for implementing an access opening for teeth with vital versus necrotic pulpal tissues. The article also explores the use of piezoelectric or ultrasonic instruments for revealing root canal orifices and for removing pulp stones or calcified pulpal tissue inside the pulp chamber. PMID:27403069
MetaboLights: towards a new COSMOS of metabolomics data management.
Steinbeck, Christoph; Conesa, Pablo; Haug, Kenneth; Mahendraker, Tejasvi; Williams, Mark; Maguire, Eamonn; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Salek, Reza M; Griffin, Julian L
2012-10-01
Exciting funding initiatives are emerging in Europe and the US for metabolomics data production, storage, dissemination and analysis. This is based on a rich ecosystem of resources around the world, which has been build during the past ten years, including but not limited to resources such as MassBank in Japan and the Human Metabolome Database in Canada. Now, the European Bioinformatics Institute has launched MetaboLights, a database for metabolomics experiments and the associated metadata (http://www.ebi.ac.uk/metabolights). It is the first comprehensive, cross-species, cross-platform metabolomics database maintained by one of the major open access data providers in molecular biology. In October, the European COSMOS consortium will start its work on Metabolomics data standardization, publication and dissemination workflows. The NIH in the US is establishing 6-8 metabolomics services cores as well as a national metabolomics repository. This communication reports about MetaboLights as a new resource for Metabolomics research, summarises the related developments and outlines how they may consolidate the knowledge management in this third large omics field next to proteomics and genomics.
Open versus Controlled-Access Data | Office of Cancer Genomics
OCG employs stringent human subjects’ protection and data access policies to protect the privacy and confidentiality of the research participants. Depending on the risk of patient identification, OCG programs data are available to the scientific community in two tiers: open or controlled access. Both types of data can be accessed through its corresponding OCG program-specific data matrix or portal. Open-access Data
[Open access :an opportunity for biomedical research].
Duchange, Nathalie; Autard, Delphine; Pinhas, Nicole
2008-01-01
Open access within the scientific community depends on the scientific context and the practices of the field. In the biomedical domain, the communication of research results is characterised by the importance of the peer reviewing process, the existence of a hierarchy among journals and the transfer of copyright to the editor. Biomedical publishing has become a lucrative market and the growth of electronic journals has not helped lower the costs. Indeed, it is difficult for today's public institutions to gain access to all the scientific literature. Open access is thus imperative, as demonstrated through the positions taken by a growing number of research funding bodies, the development of open access journals and efforts made in promoting open archives. This article describes the setting up of an Inserm portal for publication in the context of the French national protocol for open-access self-archiving and in an international context.
Supporting Access to Open Online Courses for Learners of Developing Countries
ERIC Educational Resources Information Center
Nti, Kwame
2015-01-01
This paper examines how access to, and use of, open online courses may be enhanced for learners of developing countries from a learner perspective. Using analysis of the open education concept, factors that affect access to open educational resources content, and universal standards for delivering online learning, the author demonstrates that the…
Code of Federal Regulations, 2010 CFR
2010-07-01
... resolve an allegation that open and nondiscriminatory access was denied? 291.102 Section 291.102 Mineral Resources MINERALS MANAGEMENT SERVICE, DEPARTMENT OF THE INTERIOR APPEALS OPEN AND NONDISCRIMINATORY ACCESS... Hotline to informally resolve an allegation that open and nondiscriminatory access was denied? Before...
Open Governance in Higher Education: Extending the Past to the Future
ERIC Educational Resources Information Center
Masson, Patrick
2011-01-01
Open educational resources, open content, open access, open research, open courseware--all of these open initiatives share, and benefit from, a vision of access and a collaborative framework that often result in improved outcomes. Many of these open initiatives have gained adoption within higher education and are now serving in mission-critical…
Protocols | Office of Cancer Clinical Proteomics Research
Each reagent on the Antibody Portal has been characterized by a combination of methods specific for that antibody. To view the customized antibody methods and protocols (Standard Operating Procedures) used to generate and characterize each reagent, select an antibody of interest and open the protocols associated with their respective characterization methods along with characterization data.
Open Access: "à consommer avec modération"
NASA Astrophysics Data System (ADS)
Mahoney, Terence J.
There is increasing pressure on academics and researchers to publish the results of their investigations in open access journals. Indeed, some funding agencies make open access publishing a basic requirement for funding projects, and the EU is considering taking firm steps in this direction. I argue that astronomy is already one of the most open of disciplines, and that access - both to the general public (in terms of a significantly growing outreach effort) and to developing countries (through efforts to provide computing facilities and Internet access, as well as schemes to provide research centres of limited resources with journals) - is becoming more and more open in a genuine and lasting way. I further argue that sudden switches to more formal kinds of open access schemes could cause irreparable harm to astronomical publishing. Several of the most prestigious astronomical research journals (e.g. MN, ApJ, AJ) have for more than a century met the publishing needs of the research community and continue to adapt successfully to changing demands on the part of that community. The after-effects of abrupt changes in publishing practices - implemented through primarily political concerns - are hard to predict and could be severely damaging. I conclude that open access, in its current acceptation, should be studied with great care and with sufficient time before any consideration is given to its implementation. If forced on the publishing and research communities, open access could well result in much more limited access to properly vetted research results.
Measuring, Rating, Supporting, and Strengthening Open Access Scholarly Publishing in Brazil
ERIC Educational Resources Information Center
Neto, Silvio Carvalho; Willinsky, John; Alperin, Juan Pablo
2016-01-01
This study assesses the extent and nature of open access scholarly publishing in Brazil, one of the world's leaders in providing universal access to its research and scholarship. It utilizes Brazil's Qualis journal evaluation system, along with other relevant data bases to address the association between scholarly quality and open access in the…
ERIC Educational Resources Information Center
Haggerty, Kevin D.
2008-01-01
Introduction: Presents a personal account of the transfer to open access of the leading Canadian journal of sociology. Background: The Canadian Journal of Sociology had established a strong position, internationally, among sociology journals. However, subscriptions were falling as readers increasingly accessed the resource through libraries and a…
A Comparison of Student Confidence Levels in Open Access and Undergraduate University Courses
ERIC Educational Resources Information Center
Atherton, Mirella
2017-01-01
Confidence levels of students enrolled in open access programs and undergraduate courses were measured at the University of Newcastle. The open access science students aimed to gain access to undergraduate studies in various disciplines at University. The undergraduate students were enrolled in a variety of degrees and were surveyed during their…
The future of academic publishing: what is open access?
Collins, Jannette
2005-04-01
For more than 200 years, publishers have been charging users (i.e., subscribers) for access to scientific information to make a profit. Authors have been required to grant copyright ownership to the publisher. This system was not questioned until the Internet popularized electronic publishing. The Internet allows for rapid dissemination of information to millions of readers. Some people have seen this as an opportunity to revolutionize the system of scientific publishing and to make it one that provides free, open access to all scientific information to all persons everywhere in the world. Such systems have been launched and have instigated a wave of dialogue among proponents and opponents alike. At the center of the controversy is the issue of who will pay for the costs of publishing, because an open-access system is not free, and this threatens the backbone of the traditional publishing industry. Currently, open-access publishers charge authors a fee to have their articles published. Because of this and the uncertainty of the sustainability of the open-access system, some authors are hesitant to participate in the new system. This article reviews the events that led to the creation of open-access publishing, the arguments for and against it, and the implications of open access for the future of academic publishing.
Schmidlin, Thierry; Garrigues, Luc; Lane, Catherine S; Mulder, T Celine; van Doorn, Sander; Post, Harm; de Graaf, Erik L; Lemeer, Simone; Heck, Albert J R; Altelaar, A F Maarten
2016-08-01
Hypothesis-driven MS-based targeted proteomics has gained great popularity in a relatively short timespan. Next to the widely established selected reaction monitoring (SRM) workflow, data-independent acquisition (DIA), also referred to as sequential window acquisition of all theoretical spectra (SWATH) was introduced as a high-throughput targeted proteomics method. DIA facilitates increased proteome coverage, however, does not yet reach the sensitivity obtained with SRM. Therefore, a well-informed method selection is crucial for designing a successful targeted proteomics experiment. This is especially the case when targeting less conventional peptides such as those that contain PTMs, as these peptides do not always adhere to the optimal fragmentation considerations for targeted assays. Here, we provide insight into the performance of DIA, SRM, and MRM cubed (MRM(3) ) in the analysis of phosphorylation dynamics throughout the phosphoinositide 3-kinase mechanistic target of rapamycin (PI3K-mTOR) and mitogen-activated protein kinase (MAPK) signaling network. We observe indeed that DIA is less sensitive when compared to SRM, however demonstrates increased flexibility, by postanalysis selection of alternative phosphopeptide precursors. Additionally, we demonstrate the added benefit of MRM(3) , allowing the quantification of two poorly accessible phosphosites. In total, targeted proteomics enabled the quantification of 42 PI3K-mTOR and MAPK phosphosites, gaining a so far unachieved in-depth view mTOR signaling events linked to tyrosine kinase inhibitor resistance in non-small cell lung cancer. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic analysis of the Theileria annulata schizont
Witschi, M.; Xia, D.; Sanderson, S.; Baumgartner, M.; Wastling, J.M.; Dobbelaere, D.A.E.
2013-01-01
The apicomplexan parasite, Theileria annulata, is the causative agent of tropical theileriosis, a devastating lymphoproliferative disease of cattle. The schizont stage transforms bovine leukocytes and provides an intriguing model to study host/pathogen interactions. The genome of T. annulata has been sequenced and transcriptomic data are rapidly accumulating. In contrast, little is known about the proteome of the schizont, the pathogenic, transforming life cycle stage of the parasite. Using one-dimensional (1-D) gel LC-MS/MS, a proteomic analysis of purified T. annulata schizonts was carried out. In whole parasite lysates, 645 proteins were identified. Proteins with transmembrane domains (TMDs) were under-represented and no proteins with more than four TMDs could be detected. To tackle this problem, Triton X-114 treatment was applied, which facilitates the extraction of membrane proteins, followed by 1-D gel LC-MS/MS. This resulted in the identification of an additional 153 proteins. Half of those had one or more TMD and 30 proteins with more than four TMDs were identified. This demonstrates that Triton X-114 treatment can provide a valuable additional tool for the identification of new membrane proteins in proteomic studies. With two exceptions, all proteins involved in glycolysis and the citric acid cycle were identified. For at least 29% of identified proteins, the corresponding transcripts were not present in the existing expressed sequence tag databases. The proteomics data were integrated into the publicly accessible database resource at EuPathDB (www.eupathdb.org) so that mass spectrometry-based protein expression evidence for T. annulata can be queried alongside transcriptional and other genomics data available for these parasites. PMID:23178997
Sprenger, Adrian; Weber, Sebastian; Zarai, Mostafa; Engelke, Rudolf; Nascimento, Juliana M.; Gretzmeier, Christine; Hilpert, Martin; Boerries, Melanie; Has, Cristina; Busch, Hauke; Bruckner-Tuderman, Leena; Dengjel, Jörn
2013-01-01
Keratinocytes account for 95% of all cells of the epidermis, the stratified squamous epithelium forming the outer layer of the skin, in which a significant number of skin diseases takes root. Immortalized keratinocyte cell lines are often used as research model systems providing standardized, reproducible, and homogenous biological material. Apart from that, primary human keratinocytes are frequently used for medical studies because the skin provides an important route for drug administration and is readily accessible for biopsies. However, comparability of these cell systems is not known. Cell lines may undergo phenotypic shifts and may differ from the in vivo situation in important aspects. Primary cells, on the other hand, may vary in biological functions depending on gender and age of the donor and localization of the biopsy specimen. Here we employed metabolic labeling in combination with quantitative mass spectrometry-based proteomics to assess A431 and HaCaT cell lines for their suitability as model systems. Compared with cell lines, comprehensive profiling of the primary human keratinocyte proteome with respect to gender, age, and skin localization identified an unexpected high proteomic consistency. The data were analyzed by an improved ontology enrichment analysis workflow designed for the study of global proteomics experiments. It enables a quick, comprehensive and unbiased overview of altered biological phenomena and links experimental data to literature. We guide through our workflow, point out its advantages compared with other methods and apply it to visualize differences of cell lines compared with primary human keratinocytes. PMID:23722187
CEBS object model for systems biology data, SysBio-OM.
Xirasagar, Sandhya; Gustafson, Scott; Merrick, B Alex; Tomer, Kenneth B; Stasiewicz, Stanley; Chan, Denny D; Yost, Kenneth J; Yates, John R; Sumner, Susan; Xiao, Nianqing; Waters, Michael D
2004-09-01
To promote a systems biology approach to understanding the biological effects of environmental stressors, the Chemical Effects in Biological Systems (CEBS) knowledge base is being developed to house data from multiple complex data streams in a systems friendly manner that will accommodate extensive querying from users. Unified data representation via a single object model will greatly aid in integrating data storage and management, and facilitate reuse of software to analyze and display data resulting from diverse differential expression or differential profile technologies. Data streams include, but are not limited to, gene expression analysis (transcriptomics), protein expression and protein-protein interaction analysis (proteomics) and changes in low molecular weight metabolite levels (metabolomics). To enable the integration of microarray gene expression, proteomics and metabolomics data in the CEBS system, we designed an object model, Systems Biology Object Model (SysBio-OM). The model is comprehensive and leverages other open source efforts, namely the MicroArray Gene Expression Object Model (MAGE-OM) and the Proteomics Experiment Data Repository (PEDRo) object model. SysBio-OM is designed by extending MAGE-OM to represent protein expression data elements (including those from PEDRo), protein-protein interaction and metabolomics data. SysBio-OM promotes the standardization of data representation and data quality by facilitating the capture of the minimum annotation required for an experiment. Such standardization refines the accuracy of data mining and interpretation. The open source SysBio-OM model, which can be implemented on varied computing platforms is presented here. A universal modeling language depiction of the entire SysBio-OM is available at http://cebs.niehs.nih.gov/SysBioOM/. The Rational Rose object model package is distributed under an open source license that permits unrestricted academic and commercial use and is available at http://cebs.niehs.nih.gov/cebsdownloads. The database and interface are being built to implement the model and will be available for public use at http://cebs.niehs.nih.gov.
Goloborodko, Anton A; Levitsky, Lev I; Ivanov, Mark V; Gorshkov, Mikhail V
2013-02-01
Pyteomics is a cross-platform, open-source Python library providing a rich set of tools for MS-based proteomics. It provides modules for reading LC-MS/MS data, search engine output, protein sequence databases, theoretical prediction of retention times, electrochemical properties of polypeptides, mass and m/z calculations, and sequence parsing. Pyteomics is available under Apache license; release versions are available at the Python Package Index http://pypi.python.org/pyteomics, the source code repository at http://hg.theorchromo.ru/pyteomics, documentation at http://packages.python.org/pyteomics. Pyteomics.biolccc documentation is available at http://packages.python.org/pyteomics.biolccc/. Questions on installation and usage can be addressed to pyteomics mailing list: pyteomics@googlegroups.com.
Milk Bottom-Up Proteomics: Method Optimization
Vincent, Delphine; Ezernieks, Vilnis; Elkins, Aaron; Nguyen, Nga; Moate, Peter J.; Cocks, Benjamin G.; Rochfort, Simone
2016-01-01
Milk is a complex fluid whose proteome displays a diverse set of proteins of high abundance such as caseins and medium to low abundance whey proteins such as ß-lactoglobulin, lactoferrin, immunoglobulins, glycoproteins, peptide hormones, and enzymes. A sample preparation method that enables high reproducibility and throughput is key in reliably identifying proteins present or proteins responding to conditions such as a diet, health or genetics. Using skim milk samples from Jersey and Holstein-Friesian cows, we compared three extraction procedures which have not previously been applied to samples of cows' milk. Method A (urea) involved a simple dilution of the milk in a urea-based buffer, method B (TCA/acetone) involved a trichloroacetic acid (TCA)/acetone precipitation, and method C (methanol/chloroform) involved a tri-phasic partition method in chloroform/methanol solution. Protein assays, SDS-PAGE profiling, and trypsin digestion followed by nanoHPLC-electrospray ionization-tandem mass spectrometry (nLC-ESI-MS/MS) analyses were performed to assess their efficiency. Replicates were used at each analytical step (extraction, digestion, injection) to assess reproducibility. Mass spectrometry (MS) data are available via ProteomeXchange with identifier PXD002529. Overall 186 unique accessions, major and minor proteins, were identified with a combination of methods. Method C (methanol/chloroform) yielded the best resolved SDS-patterns and highest protein recovery rates, method A (urea) yielded the greatest number of accessions, and, of the three procedures, method B (TCA/acetone) was the least compatible of all with a wide range of downstream analytical procedures. Our results also highlighted breed differences between the proteins in milk of Jersey and Holstein-Friesian cows. PMID:26793233
An Open Access future? Report from the eurocancercoms project
Kenney, R; Warden, R
2011-01-01
In March 2011, as part of the background research to the FP7 Eurocancercoms project, the European Association for Cancer Research (EACR) conducted an online survey of its members working in Europe to discover their experiences of and attitudes to the issues surrounding academic publishing and Open Access. This paper presents the results from this survey and compares them to the results from a much larger survey on the same topic from the Study of Open Access Publishing (SOAP). The responses from both surveys show very positive attitudes to the Open Access publishing route; perhaps the most challenging statistic from the EACR survey is that 88% of respondents believe that publicly funded research should be made available to be read and used without access barriers As a conclusion and invitation to further discussion, this paper also contributes to the debate around subscription and Open Access publishing, supporting the case for accelerating the progress towards Open Access publishing of cancer research articles as a particularly supportive way of assisting all researchers to make unhindered progress with their work. PMID:22276063
ERIC Educational Resources Information Center
Mathuews, Katy; Pulcini, Brad
2017-01-01
For the purposes of this article, open access universities are defined as bachelor's degree-granting institutions that do not restrict admission on the basis of ACT/SAT scores, high school grade point average, and the like. Typically, the mission of an open access university is to provide all students with the opportunity to pursue a degree. The…
Code of Federal Regulations, 2011 CFR
2011-07-01
... resolve an allegation that open and nondiscriminatory access was denied? 291.102 Section 291.102 Mineral... OPEN AND NONDISCRIMINATORY ACCESS TO OIL AND GAS PIPELINES UNDER THE OUTER CONTINENTAL SHELF LANDS ACT... allegation concerning open and nondiscriminatory access by calling the toll-free MMS Hotline at 1-888-232...
Code of Federal Regulations, 2014 CFR
2014-07-01
... 30 Mineral Resources 2 2014-07-01 2014-07-01 false May I call the BSEE Hotline to informally... the BSEE Hotline to informally resolve an allegation that open and nondiscriminatory access was denied... open and nondiscriminatory access by calling the toll-free BSEE Pipeline Open Access Hotline at 1-888...
Code of Federal Regulations, 2013 CFR
2013-07-01
... 30 Mineral Resources 2 2013-07-01 2013-07-01 false May I call the BSEE Hotline to informally... the BSEE Hotline to informally resolve an allegation that open and nondiscriminatory access was denied... open and nondiscriminatory access by calling the toll-free BSEE Pipeline Open Access Hotline at 1-888...
Code of Federal Regulations, 2012 CFR
2012-07-01
... 30 Mineral Resources 2 2012-07-01 2012-07-01 false May I call the BSEE Hotline to informally... the BSEE Hotline to informally resolve an allegation that open and nondiscriminatory access was denied... open and nondiscriminatory access by calling the toll-free BSEE Pipeline Open Access Hotline at 1-888...
Parsing and Quantification of Raw Orbitrap Mass Spectrometer Data Using RawQuant.
Kovalchik, Kevin A; Moggridge, Sophie; Chen, David D Y; Morin, Gregg B; Hughes, Christopher S
2018-06-01
Effective analysis of protein samples by mass spectrometry (MS) requires careful selection and optimization of a range of experimental parameters. As the output from the primary detection device, the "raw" MS data file can be used to gauge the success of a given sample analysis. However, the closed-source nature of the standard raw MS file can complicate effective parsing of the data contained within. To ease and increase the range of analyses possible, the RawQuant tool was developed to enable parsing of raw MS files derived from Thermo Orbitrap instruments to yield meta and scan data in an openly readable text format. RawQuant can be commanded to export user-friendly files containing MS 1 , MS 2 , and MS 3 metadata as well as matrices of quantification values based on isobaric tagging approaches. In this study, the utility of RawQuant is demonstrated in several scenarios: (1) reanalysis of shotgun proteomics data for the identification of the human proteome, (2) reanalysis of experiments utilizing isobaric tagging for whole-proteome quantification, and (3) analysis of a novel bacterial proteome and synthetic peptide mixture for assessing quantification accuracy when using isobaric tags. Together, these analyses successfully demonstrate RawQuant for the efficient parsing and quantification of data from raw Thermo Orbitrap MS files acquired in a range of common proteomics experiments. In addition, the individual analyses using RawQuant highlights parametric considerations in the different experimental sets and suggests targetable areas to improve depth of coverage in identification-focused studies and quantification accuracy when using isobaric tags.
Proteomic changes in rice leaves grown under open field high temperature stress conditions.
Das, Smruti; Krishnan, P; Mishra, Vagish; Kumar, Ritesh; Ramakrishnan, B; Singh, N K
2015-11-01
The interactive effect of temperature with other climatic and soil factors has profound influences on the growth and development of rice. The responses of rice to high temperatures under field conditions are more important than those under the controlled conditions. To understand the genes associated with high temperature stress response in general and tolerance in particular, the expression of all those genes associated with adaptation and tolerance in rice requires proteomic analysis. High temperature stress-tolerant cv. N22 was subjected to 28/18 °C (control) and 42/32 °C (high temperature stress) at flowering stage. The plants were grown in the field under the free air temperature increment condition. The proteomic changes in rice leaves due to high temperature stress were discussed. The proteomes of leaves had about 3000 protein spots, reproducibly detected on 2-dimensional electrophoretic gels with 573 proteins differentially expressed between the control and the high temperature treatments. Putative physiological functions suggested five categories such as growth (15.4%), heat shock proteins (7.7%), regulatory proteins (26.9%), redox homeostasis proteins (11.5%) and energy and metabolism (38.5%) related proteins. The results of the present study suggest that cv. N22, an agronomically recognized temperature tolerant rice cultivar copes with high temperature stress in a complex manner. Several functional proteins play important roles in its responses. The predicted climate change events necessitate more studies using this cultivar under different simulated ecological conditions to identify proteomic changes and the associated genes to be used as biomarkers and to gain a better understanding on the biochemical pathways involved in tolerance.
Mining the human urine proteome for monitoring renal transplant injury
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sigdel, Tara K.; Gao, Yuqian; He, Jintang
The human urinary proteome reflects systemic and inherent renal injury perturbations and can be analyzed to harness specific biomarkers for different kidney transplant injury states. 396 unique urine samples were collected contemporaneously with an allograft biopsy from 396 unique kidney transplant recipients. Centralized, blinded histology on the graft was used to classify matched urine samples into categories of acute rejection (AR), chronic allograft nephropathy (CAN), BK virus nephritis (BKVN), and stable graft (STA). Liquid chromatography–mass spectrometry (LC-MS) based proteomics using iTRAQ based discovery (n=108) and global label-free LC-MS analyses of individual samples (n=137) for quantitative proteome assessment were used inmore » the discovery step. Selected reaction monitoring (SRM) was applied to identify and validate minimal urine protein/peptide biomarkers to accurately segregate organ injury causation and pathology on unique urine samples (n=151). A total of 958 proteins were initially quantified by iTRAQ, 87% of which were also identified among 1574 urine proteins detected in LC-MS validation. 103 urine proteins were significantly (p<0.05) perturbed in injury and enriched for humoral immunity, complement activation, and lymphocyte trafficking. A set of 131 peptides corresponding to 78 proteins were assessed by SRM for their significance in an independent sample cohort. A minimal set of 35 peptides mapping to 33 proteins, were modeled to segregate different injury groups (AUC =93% for AR, 99% for CAN, 83% for BKVN). Urinary proteome discovery and targeted validation identified urine protein fingerprints for non-invasive differentiation of kidney transplant injuries, thus opening the door for personalized immune risk assessment and therapy.« less
2011-01-01
Background During development, the branchial mesoderm of Torpedo californica transdifferentiates into an electric organ capable of generating high voltage discharges to stun fish. The organ contains a high density of cholinergic synapses and has served as a biochemical model for the membrane specialization of myofibers, the neuromuscular junction (NMJ). We studied the genome and proteome of the electric organ to gain insight into its composition, to determine if there is concordance with skeletal muscle and the NMJ, and to identify novel synaptic proteins. Results Of 435 proteins identified, 300 mapped to Torpedo cDNA sequences with ≥2 peptides. We identified 14 uncharacterized proteins in the electric organ that are known to play a role in acetylcholine receptor clustering or signal transduction. In addition, two human open reading frames, C1orf123 and C6orf130, showed high sequence similarity to electric organ proteins. Our profile lists several proteins that are highly expressed in skeletal muscle or are muscle specific. Synaptic proteins such as acetylcholinesterase, acetylcholine receptor subunits, and rapsyn were present in the electric organ proteome but absent in the skeletal muscle proteome. Conclusions Our integrated genomic and proteomic analysis supports research describing a muscle-like profile of the organ. We show that it is a repository of NMJ proteins but we present limitations on its use as a comprehensive model of the NMJ. Finally, we identified several proteins that may become candidates for signaling proteins not previously characterized as components of the NMJ. PMID:21798097
An emerging cyberinfrastructure for biodefense pathogen and pathogen–host data
Zhang, C.; Crasta, O.; Cammer, S.; Will, R.; Kenyon, R.; Sullivan, D.; Yu, Q.; Sun, W.; Jha, R.; Liu, D.; Xue, T.; Zhang, Y.; Moore, M.; McGarvey, P.; Huang, H.; Chen, Y.; Zhang, J.; Mazumder, R.; Wu, C.; Sobral, B.
2008-01-01
The NIAID-funded Biodefense Proteomics Resource Center (RC) provides storage, dissemination, visualization and analysis capabilities for the experimental data deposited by seven Proteomics Research Centers (PRCs). The data and its publication is to support researchers working to discover candidates for the next generation of vaccines, therapeutics and diagnostics against NIAID's Category A, B and C priority pathogens. The data includes transcriptional profiles, protein profiles, protein structural data and host–pathogen protein interactions, in the context of the pathogen life cycle in vivo and in vitro. The database has stored and supported host or pathogen data derived from Bacillus, Brucella, Cryptosporidium, Salmonella, SARS, Toxoplasma, Vibrio and Yersinia, human tissue libraries, and mouse macrophages. These publicly available data cover diverse data types such as mass spectrometry, yeast two-hybrid (Y2H), gene expression profiles, X-ray and NMR determined protein structures and protein expression clones. The growing database covers over 23 000 unique genes/proteins from different experiments and organisms. All of the genes/proteins are annotated and integrated across experiments using UniProt Knowledgebase (UniProtKB) accession numbers. The web-interface for the database enables searching, querying and downloading at the level of experiment, group and individual gene(s)/protein(s) via UniProtKB accession numbers or protein function keywords. The system is accessible at http://www.proteomicsresource.org/. PMID:17984082
Opening our science: Open science and cyanobacterial research at the US EPA
In this blog post we introduce the idea of Open Science and discuss multiple ways we are implementing these concepts in our cyanobacteria research. We give examples of our open access publications, open source code that support our research, and provide open access to our resear...
Imin, Nijat; De Jong, Femke; Mathesius, Ulrike; van Noorden, Giel; Saeed, Nasir A; Wang, Xin-Ding; Rose, Ray J; Rolfe, Barry G
2004-07-01
Using a combination of two-dimensional gel electrophoresis (2-DE) protein mapping and mass spectrometry (MS) analysis, we have established proteome reference maps of Medicago truncatula embryogenic tissue culture cells. The cultures were generated from single protoplasts, which provided a relatively homogeneous cell population. We used these to analyze protein expression at the globular stages of somatic embryogenesis, which is the earliest morphogenetic embryonic stage. Over 3000 proteins could reproducibly be resolved over a pI range of 4-11. Three hundred and twelve protein spots were extracted from colloidal Coomassie Blue-stained 2-DE gels and analyzed by matrix-assisted laser desorption/ionization-time of flight MS analysis and tandem MS sequencing. This enabled the identification of 169 protein spots representing 128 unique gene products using a publicly available expressed sequence tag database and the MASCOT search engine. These reference maps will be valuable for the investigation of the molecular events which occur during somatic embryogenesis in M. truncatula. The proteome reference maps and supplementary materials will be available and updated for public access at http://semele.anu.edu.au/.
Andromeda: a peptide search engine integrated into the MaxQuant environment.
Cox, Jürgen; Neuhauser, Nadin; Michalski, Annette; Scheltema, Richard A; Olsen, Jesper V; Mann, Matthias
2011-04-01
A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
Marino, Daniel; Ariz, Idoia; Lasa, Berta; Santamaría, Enrique; Fernández-Irigoyen, Joaquín; González-Murua, Carmen; Aparicio Tejo, Pedro M.
2016-01-01
Accessing different nitrogen (N) sources involves a profound adaptation of plant metabolism. In this study, a quantitative proteomic approach was used to further understand how the model plant Arabidopsis thaliana adjusts to different N sources when grown exclusively under nitrate or ammonium nutrition. Proteome data evidenced that glucosinolate metabolism was differentially regulated by the N source and that both TGG1 and TGG2 myrosinases were more abundant under ammonium nutrition, which is generally considered to be a stressful situation. Moreover, Arabidopsis plants displayed glucosinolate accumulation and induced myrosinase activity under ammonium nutrition. Interestingly, these results were also confirmed in the economically important crop broccoli (Brassica oleracea var. italica). Moreover, these metabolic changes were correlated in Arabidopsis with the differential expression of genes from the aliphatic glucosinolate metabolic pathway. This study underlines the importance of nitrogen nutrition and the potential of using ammonium as the N source in order to stimulate glucosinolate metabolism, which may have important applications not only in terms of reducing pesticide use, but also for increasing plants’ nutritional value. PMID:27085186
Runs [ Open Access : Password Protected ] CESM Development CESM Runs [ Open Access : Password Protected ] WRF Development WRF Runs [ Open Access : Password Protected ] Climate Modeling Home Projects Links Literature Manuscripts Publications Polar Group Meeting (2012) ASGC Home ASGC Jobs Web Calendar Wiki Internal
Open-access databases as unprecedented resources and drivers of cultural change in fisheries science
DOE Office of Scientific and Technical Information (OSTI.GOV)
McManamay, Ryan A; Utz, Ryan
2014-01-01
Open-access databases with utility in fisheries science have grown exponentially in quantity and scope over the past decade, with profound impacts to our discipline. The management, distillation, and sharing of an exponentially growing stream of open-access data represents several fundamental challenges in fisheries science. Many of the currently available open-access resources may not be universally known among fisheries scientists. We therefore introduce many national- and global-scale open-access databases with applications in fisheries science and provide an example of how they can be harnessed to perform valuable analyses without additional field efforts. We also discuss how the development, maintenance, and utilizationmore » of open-access data are likely to pose technical, financial, and educational challenges to fisheries scientists. Such cultural implications that will coincide with the rapidly increasing availability of free data should compel the American Fisheries Society to actively address these problems now to help ease the forthcoming cultural transition.« less
Open Access Publishing in High-Energy Physics: the SCOAP3 Initiative
NASA Astrophysics Data System (ADS)
Mele, S.
2010-10-01
Scholarly communication in High-Energy Physics (HEP) shows traits very similar to Astronomy and Astrophysics: pervasiveness of Open Access to preprints through community-based services; a culture of openness and sharing among its researchers; a compact number of yearly articles published by a relatively small number of journals which are dear to the community. These aspects have led HEP to spearhead an innovative model for the transition of its scholarly publishing to Open Access. The Sponsoring Consortium for Open Access Publishing in Particle Physics (SCOAP) aims to be a central body to finance peer-review service rather than the purchase of access to information as in the traditional subscription model, with all articles in the discipline eventually available in Open Access. Sustainable funding to SCOAP would come from libraries, library consortia and HEP funding agencies, through a re-direction of funds currently spent for subscriptions to HEP journals. This paper presents the cultural and bibliometric factors at the roots of SCOAP and the current status of this worldwide initiative.
False gold: Safely navigating open access publishing to avoid predatory publishers and journals.
McCann, Terence V; Polacsek, Meg
2018-04-01
The aim of this study was to review and discuss predatory open access publishing in the context of nursing and midwifery and develop a set of guidelines that serve as a framework to help clinicians, educators and researchers avoid predatory publishers. Open access publishing is increasingly common across all academic disciplines. However, this publishing model is vulnerable to exploitation by predatory publishers, posing a threat to nursing and midwifery scholarship and practice. Guidelines are needed to help researchers recognize predatory journals and publishers and understand the negative consequences of publishing in them. Discussion paper. A literature search of BioMed Central, CINAHL, MEDLINE with Full Text and PubMed for terms related to predatory publishing, published in the period 2007-2017. Lack of awareness of the risks and pressure to publish in international journals, may result in nursing and midwifery researchers publishing their work in dubious open access journals. Caution should be taken prior to writing and submitting a paper, to avoid predatory publishers. The advantage of open access publishing is that it provides readers with access to peer-reviewed research as soon as it is published online. However, predatory publishers use deceptive methods to exploit open access publishing for their own profit. Clear guidelines are needed to help researchers navigate safely open access publishing. A deeper understanding of the risks of predatory publishing is needed. Clear guidelines should be followed by nursing and midwifery researchers seeking to publish their work in open access journals. © 2017 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Bradford, Rick
2013-01-01
Your December 2012 issue contains both a short news article on the progress of the open-access movement ("UK open access gains ground", p11) and a lengthy feature about the visionary Irish physicist Edward Hutchinson Synge ("Unknown genius", pp26-29). I find the combination of these articles ironic.
Code of Federal Regulations, 2010 CFR
2010-04-01
... ENERGY REGULATIONS UNDER THE FEDERAL POWER ACT OPEN ACCESS SAME-TIME INFORMATION SYSTEMS § 37.2 Purpose. (a) The purpose of this part is to ensure that potential customers of open access transmission... Transmission Provider (or its agent) to create and operate an Open Access Same-time Information System (OASIS...
Code of Federal Regulations, 2011 CFR
2011-04-01
... ENERGY REGULATIONS UNDER THE FEDERAL POWER ACT OPEN ACCESS SAME-TIME INFORMATION SYSTEMS § 37.2 Purpose. (a) The purpose of this part is to ensure that potential customers of open access transmission... Transmission Provider (or its agent) to create and operate an Open Access Same-time Information System (OASIS...
Public Access and Open Access: Is There a Difference? | Poster
By Robin Meckley, Contributing Writer, and Tracie Frederick, Guest Writer Open access and public access—are they different concepts or are they the same? What do they mean for the researchers at NCI at Frederick? “Open-access (OA) literature is digital, online, free of charge, and free of most copyright and licensing restrictions. What makes it possible is the Internet and the
Dyrlund, Thomas F; Poulsen, Ebbe T; Scavenius, Carsten; Sanggaard, Kristian W; Enghild, Jan J
2012-09-01
Data processing and analysis of proteomics data are challenging and time consuming. In this paper, we present MS Data Miner (MDM) (http://sourceforge.net/p/msdataminer), a freely available web-based software solution aimed at minimizing the time required for the analysis, validation, data comparison, and presentation of data files generated in MS software, including Mascot (Matrix Science), Mascot Distiller (Matrix Science), and ProteinPilot (AB Sciex). The program was developed to significantly decrease the time required to process large proteomic data sets for publication. This open sourced system includes a spectra validation system and an automatic screenshot generation tool for Mascot-assigned spectra. In addition, a Gene Ontology term analysis function and a tool for generating comparative Excel data reports are included. We illustrate the benefits of MDM during a proteomics study comprised of more than 200 LC-MS/MS analyses recorded on an AB Sciex TripleTOF 5600, identifying more than 3000 unique proteins and 3.5 million peptides. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
[The Open Access Initiative (OAI) in the scientific literature].
Sánchez-Martín, Francisco M; Millán Rodríguez, Félix; Villavicencio Mavrich, Humberto
2009-01-01
According to the declaration of the Budapest Open Access Initiative (OAI) is defined as a editorial model in which access to scientific journal literature and his use are free. Free flow of information allowed by Internet has been the basis of this initiative. The Bethesda and the Berlin declarations, supported by some international agencies, proposes to require researchers to deposit copies of all articles published in a self-archive or an Open Access repository, and encourage researchers to publish their research papers in journals Open Access. This paper reviews the keys of the OAI, with their strengths and controversial aspects; and it discusses the position of databases, search engines and repositories of biomedical information, as well as the attitude of the scientists, publishers and journals. So far the journal Actas Urológicas Españolas (Act Urol Esp) offer their contents on Open Access as On Line in Spanish and English.
Pencina, Michael J; Louzao, Darcy M; McCourt, Brian J; Adams, Monique R; Tayyabkhan, Rehbar H; Ronco, Peter; Peterson, Eric D
2016-02-01
There are growing calls for sponsors to increase transparency by providing access to clinical trial data. In response, Bristol-Myers Squibb and the Duke Clinical Research Institute have collaborated on a new initiative, Supporting Open Access to Researchers. The aim is to facilitate open sharing of Bristol-Myers Squibb trial data with interested researchers. Key features of the Supporting Open Access to Researchers data sharing model include an independent review committee that ensures expert consideration of each proposal, stringent data deidentification/anonymization and protection of patient privacy, requirement of prespecified statistical analysis plans, and independent review of manuscripts before submission for publication. We believe that these approaches will promote open science by allowing investigators to verify trial results as well as to pursue interesting secondary uses of trial data without compromising scientific integrity. Copyright © 2015 Elsevier Inc. All rights reserved.
The Pfam protein families database: towards a more sustainable future.
Finn, Robert D; Coggill, Penelope; Eberhardt, Ruth Y; Eddy, Sean R; Mistry, Jaina; Mitchell, Alex L; Potter, Simon C; Punta, Marco; Qureshi, Matloob; Sangrador-Vegas, Amaia; Salazar, Gustavo A; Tate, John; Bateman, Alex
2016-01-04
In the last two years the Pfam database (http://pfam.xfam.org) has undergone a substantial reorganisation to reduce the effort involved in making a release, thereby permitting more frequent releases. Arguably the most significant of these changes is that Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set. Building families on reference proteomes sequences brings greater stability, which decreases the amount of manual curation required to maintain them. It also reduces the number of sequences displayed on the website, whilst still providing access to many important model organisms. Matches to the full UniProtKB database are, however, still available and Pfam annotations for individual UniProtKB sequences can still be retrieved. Some Pfam entries (1.6%) which have no matches to reference proteomes remain; we are working with UniProt to see if sequences from them can be incorporated into reference proteomes. Pfam-B, the automatically-generated supplement to Pfam, has been removed. The current release (Pfam 29.0) includes 16 295 entries and 559 clans. The facility to view the relationship between families within a clan has been improved by the introduction of a new tool. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
The Functional Human C-Terminome
Hedden, Michael; Lyon, Kenneth F.; Brooks, Steven B.; David, Roxanne P.; Limtong, Justin; Newsome, Jacklyn M.; Novakovic, Nemanja; Rajasekaran, Sanguthevar; Thapar, Vishal; Williams, Sean R.; Schiller, Martin R.
2016-01-01
All translated proteins end with a carboxylic acid commonly called the C-terminus. Many short functional sequences (minimotifs) are located on or immediately proximal to the C-terminus. However, information about the function of protein C-termini has not been consolidated into a single source. Here, we built a new “C-terminome” database and web system focused on human proteins. Approximately 3,600 C-termini in the human proteome have a minimotif with an established molecular function. To help evaluate the function of the remaining C-termini in the human proteome, we inferred minimotifs identified by experimentation in rodent cells, predicted minimotifs based upon consensus sequence matches, and predicted novel highly repetitive sequences in C-termini. Predictions can be ranked by enrichment scores or Gene Evolutionary Rate Profiling (GERP) scores, a measurement of evolutionary constraint. By searching for new anchored sequences on the last 10 amino acids of proteins in the human proteome with lengths between 3–10 residues and up to 5 degenerate positions in the consensus sequences, we have identified new consensus sequences that predict instances in the majority of human genes. All of this information is consolidated into a database that can be accessed through a C-terminome web system with search and browse functions for minimotifs and human proteins. A known consensus sequence-based predicted function is assigned to nearly half the proteins in the human proteome. Weblink: http://cterminome.bio-toolkit.com. PMID:27050421
Hempel, Kristina; Herbst, Florian-Alexander; Moche, Martin; Hecker, Michael; Becher, Dörte
2011-04-01
Staphylococcus aureus is capable of colonizing and infecting humans by its arsenal of surface-exposed and secreted proteins. Iron-limited conditions in mammalian body fluids serve as a major environmental signal to bacteria to express virulence determinants. Here we present a comprehensive, gel-free, and GeLC-MS/MS-based quantitative proteome profiling of S. aureus under this infection-relevant situation. (14)N(15)N metabolic labeling and three complementing approaches were combined for relative quantitative analyses of surface-associated proteins. The surface-exposed and secreted proteome profiling approaches comprise trypsin shaving, biotinylation, and precipitation of the supernatant. By analysis of the outer subproteomic and cytoplasmic protein fraction, 1210 proteins could be identified including 221 surface-associated proteins. Thus, access was enabled to 70% of the predicted cell wall-associated proteins, 80% of the predicted sortase substrates, two/thirds of lipoproteins and more than 50% of secreted and cytoplasmic proteins. For iron-deficiency, 158 surface-associated proteins were quantified. Twenty-nine proteins were found in altered amounts showing particularly surface-exposed proteins strongly induced, such as the iron-regulated surface determinant proteins IsdA, IsdB, IsdC and IsdD as well as lipid-anchored iron compound-binding proteins. The work presents a crucial subject for understanding S. aureus pathophysiology by the use of methods that allow quantitative surface proteome profiling.
Bini, Andressa Peres; Regiani, Thais; Franceschini, Lívia Maria; Budzinski, Ilara Gabriela Frasson; Marques, Felipe Garbelini; Labate, Mônica Teresa Veneziano; Guidetti-Gonzalez, Simone; Moon, David Henry; Labate, Carlos Alberto
2016-01-01
Puccinia psidii sensu lato (s.l.) is the causal agent of eucalyptus and guava rust, but it also attacks a wide range of plant species from the myrtle family, resulting in a significant genetic and physiological variability among populations accessed from different hosts. The uredospores are crucial to P. psidii dissemination in the field. Although they are important for the fungal pathogenesis, their molecular characterization has been poorly studied. In this work, we report the first in-depth proteomic analysis of P. psidii s.l. uredospores from two contrasting populations: guava fruits (PpGuava) and eucalyptus leaves (PpEucalyptus). NanoUPLC-MSE was used to generate peptide spectra that were matched to the UniProt Puccinia genera sequences (UniProt database) resulting in the first proteomic analysis of the phytopathogenic fungus P. psidii. Three hundred and fourty proteins were detected and quantified using Label free proteomics. A significant number of unique proteins were found for each sample, others were significantly more or less abundant, according to the fungal populations. In PpGuava population, many proteins correlated with fungal virulence, such as malate dehydrogenase, proteossomes subunits, enolases and others were increased. On the other hand, PpEucalyptus proteins involved in biogenesis, protein folding and translocation were increased, supporting the physiological variability of the fungal populations according to their protein reservoirs and specific host interaction strategies. PMID:26731728
Quecine, Maria Carolina; Leite, Thiago Falda; Bini, Andressa Peres; Regiani, Thais; Franceschini, Lívia Maria; Budzinski, Ilara Gabriela Frasson; Marques, Felipe Garbelini; Labate, Mônica Teresa Veneziano; Guidetti-Gonzalez, Simone; Moon, David Henry; Labate, Carlos Alberto
2016-01-01
Puccinia psidii sensu lato (s.l.) is the causal agent of eucalyptus and guava rust, but it also attacks a wide range of plant species from the myrtle family, resulting in a significant genetic and physiological variability among populations accessed from different hosts. The uredospores are crucial to P. psidii dissemination in the field. Although they are important for the fungal pathogenesis, their molecular characterization has been poorly studied. In this work, we report the first in-depth proteomic analysis of P. psidii s.l. uredospores from two contrasting populations: guava fruits (PpGuava) and eucalyptus leaves (PpEucalyptus). NanoUPLC-MSE was used to generate peptide spectra that were matched to the UniProt Puccinia genera sequences (UniProt database) resulting in the first proteomic analysis of the phytopathogenic fungus P. psidii. Three hundred and fourty proteins were detected and quantified using Label free proteomics. A significant number of unique proteins were found for each sample, others were significantly more or less abundant, according to the fungal populations. In PpGuava population, many proteins correlated with fungal virulence, such as malate dehydrogenase, proteossomes subunits, enolases and others were increased. On the other hand, PpEucalyptus proteins involved in biogenesis, protein folding and translocation were increased, supporting the physiological variability of the fungal populations according to their protein reservoirs and specific host interaction strategies.
Heidler, Juliana; Hardt, Stefanie; Wittig, Ilka; Tegeder, Irmgard
2016-12-01
Progranulin deficiency is associated with neurodegeneration in humans and in mice. The mechanisms likely involve progranulin-promoted removal of protein waste via autophagy. We performed a deep proteomic screen of the pre-frontal cortex in aged (13-15 months) female progranulin-deficient mice (GRN -/- ) and mice with inducible neuron-specific overexpression of progranulin (SLICK-GRN-OE) versus the respective control mice. Proteins were extracted and analyzed per liquid chromatography/mass spectrometry (LC/MS) on a Thermo Scientific™ Q Exactive Plus equipped with an ultra-high performance liquid chromatography unit and a Nanospray Flex Ion-Source. Full Scan MS-data were acquired using Xcalibur and raw files were analyzed using the proteomics software Max Quant. The mouse reference proteome set from uniprot (June 2015) was used to identify peptides and proteins. The DiB data file is a reduced MaxQuant output and includes peptide and protein identification, accession numbers, protein and gene names, sequence coverage and label free quantification (LFQ) values of each sample. Differences in protein expression in genotypes are presented in "Progranulin overexpression in sensory neurons attenuates neuropathic pain in mice: Role of autophagy" (C. Altmann, S. Hardt, C. Fischer, J. Heidler, H.Y. Lim, A. Haussler, B. Albuquerque, B. Zimmer, C. Moser, C. Behrends, F. Koentgen, I. Wittig, M.H. Schmidt, A.M. Clement, T. Deller, I. Tegeder, 2016) [1].
Empowering Learners with Mobile Open-Access Learning Initiatives
ERIC Educational Resources Information Center
Mills, Michael, Ed.; Wake, Donna, Ed.
2017-01-01
Education has been progressing at a rapid pace ever since educators have been able to harness the power of mobile technology. Open-access learning techniques provide more students with the opportunity to engage in educational opportunities that may have been previously restricted. "Empowering Learners with Mobile Open-Access Learning…
ERIC Educational Resources Information Center
Suber, Peter
2012-01-01
The Internet lets us share perfect copies of our work with a worldwide audience at virtually no cost. We take advantage of this revolutionary opportunity when we make our work "open access": digital, online, free of charge, and free of most copyright and licensing restrictions. Open access is made possible by the Internet and copyright-holder…
Open Access Publishing in Indian Premier Research Institutions
ERIC Educational Resources Information Center
Bhat, Mohammad Hanief
2009-01-01
Introduction: Publishing research findings in open access journals is a means of enhancing visibility and consequently increasing the impact of publications. This study provides an overview of open access publishing in premier research institutes of India. Method: The publication output of each institution from 2003 to 2007 was ascertained through…
Analysis of FERC's Final EIS for Electricity Open Access & Recovery of Stranded Costs
1996-01-01
Reviews the Final Environmental Impact Statement (FEIS) prepared by the Federal Energy Regulatory Commission for its electricity transmission system open access prepared in April 1996 and uses the National Energy Modeling System (NEMS) to analyze the open access rule (Orders 888 and 889).
Prospect for Development of Open Access in Argentina
ERIC Educational Resources Information Center
Miguel, Sandra; Bongiovani, Paola C.; Gomez, Nancy D.; Bueno-de-la-Fuente, Gema
2013-01-01
This perspective article presents an overview of the Open Access movement in Argentina, from a global and regional (Latin American) context. The article describes the evolution and current state of initiatives by examining two principal approaches to Open Access in Argentina: "golden" and "green roads". The article will then…
Open Access and Civic Scientific Information Literacy
ERIC Educational Resources Information Center
Zuccala, Alesia
2010-01-01
Introduction: We examine how residents and citizens of The Netherlands perceive open access to acquire preliminary insight into the role it might play in cultivating civic scientific literacy. Open access refers to scientific or scholarly research literature available on the Web to scholars and the general public in free online journals and…
50 CFR 660.310 - Purpose and scope.
Code of Federal Regulations, 2010 CFR
2010-10-01
..., DEPARTMENT OF COMMERCE (CONTINUED) FISHERIES OFF WEST COAST STATES West Coast Groundfish-Open Access Fisheries § 660.310 Purpose and scope. This subpart covers the Pacific Coast Groundfish open access fishery. The open access fishery, as defined at § 660.11, Subpart C, is the fishery composed of commercial...
Open Access, Education Research, and Discovery
ERIC Educational Resources Information Center
Furlough, Michael
2010-01-01
Background/Context: The open access movement has successfully drawn attention to economic and political aspects of scholarly communication through a significant body of commentary that debates the merits of open access and the potential damage it may do to scholarly publishing. Researchers within the field of education research, notably John…
Open-Access Electronic Textbooks: An Overview
ERIC Educational Resources Information Center
Ovadia, Steven
2011-01-01
Given the challenging economic climate in the United States, many academics are looking to open-access electronic textbooks as a way to provide students with traditional textbook content at a more financially advantageous price. Open access refers to "the free and widely available information throughout the World Wide Web. Once an article's…
A study of institutional spending on open access publication fees in Germany.
Jahn, Najko; Tullney, Marco
2016-01-01
Publication fees as a revenue source for open access publishing hold a prominent place on the agendas of researchers, policy makers, and academic publishers. This study contributes to the evolving empirical basis for funding these charges and examines how much German universities and research organisations spent on open access publication fees. Using self-reported cost data from the Open APC initiative, the analysis focused on the amount that was being spent on publication fees, and compared these expenditure with data from related Austrian (FWF) and UK (Wellcome Trust, Jisc) initiatives, in terms of both size and the proportion of articles being published in fully and hybrid open access journals. We also investigated how thoroughly self-reported articles were indexed in Crossref, a DOI minting agency for scholarly literature, and analysed how the institutional spending was distributed across publishers and journal titles. According to self-reported data from 30 German universities and research organisations between 2005 and 2015, expenditures on open access publication fees increased over the years in Germany and amounted to € 9,627,537 for 7,417 open access journal articles. The average payment was € 1,298, and the median was € 1,231. A total of 94% of the total article volume included in the study was supported in accordance with the price cap of € 2,000, a limit imposed by the Deutsche Forschungsgemeinschaft (DFG) as part of its funding activities for open access funding at German universities. Expenditures varied considerably at the institutional level. There were also differences in how much the institutions spent per journal and publisher. These differences reflect, at least in part, the varying pricing schemes in place including discounted publication fees. With an indexing coverage of 99%, Crossref thoroughly indexed the open access journals articles included in the study. A comparison with the related openly available cost data from Austria and the UK revealed that German universities and research organisations primarily funded articles in fully open access journals. By contrast, articles in hybrid journal accounted for the largest share of spending according to the Austrian and UK data. Fees paid for hybrid journals were on average more expensive than those paid for fully open access journals.
A study of institutional spending on open access publication fees in Germany
Tullney, Marco
2016-01-01
Publication fees as a revenue source for open access publishing hold a prominent place on the agendas of researchers, policy makers, and academic publishers. This study contributes to the evolving empirical basis for funding these charges and examines how much German universities and research organisations spent on open access publication fees. Using self-reported cost data from the Open APC initiative, the analysis focused on the amount that was being spent on publication fees, and compared these expenditure with data from related Austrian (FWF) and UK (Wellcome Trust, Jisc) initiatives, in terms of both size and the proportion of articles being published in fully and hybrid open access journals. We also investigated how thoroughly self-reported articles were indexed in Crossref, a DOI minting agency for scholarly literature, and analysed how the institutional spending was distributed across publishers and journal titles. According to self-reported data from 30 German universities and research organisations between 2005 and 2015, expenditures on open access publication fees increased over the years in Germany and amounted to € 9,627,537 for 7,417 open access journal articles. The average payment was € 1,298, and the median was € 1,231. A total of 94% of the total article volume included in the study was supported in accordance with the price cap of € 2,000, a limit imposed by the Deutsche Forschungsgemeinschaft (DFG) as part of its funding activities for open access funding at German universities. Expenditures varied considerably at the institutional level. There were also differences in how much the institutions spent per journal and publisher. These differences reflect, at least in part, the varying pricing schemes in place including discounted publication fees. With an indexing coverage of 99%, Crossref thoroughly indexed the open access journals articles included in the study. A comparison with the related openly available cost data from Austria and the UK revealed that German universities and research organisations primarily funded articles in fully open access journals. By contrast, articles in hybrid journal accounted for the largest share of spending according to the Austrian and UK data. Fees paid for hybrid journals were on average more expensive than those paid for fully open access journals. PMID:27602289
Quan, Sheng; Yang, Pingfang; Cassin-Ross, Gaëlle; Kaur, Navneet; Switzenberg, Robert; Aung, Kyaw; Li, Jiying; Hu, Jianping
2013-01-01
Plant peroxisomes are highly dynamic organelles that mediate a suite of metabolic processes crucial to development. Peroxisomes in seeds/dark-grown seedlings and in photosynthetic tissues constitute two major subtypes of plant peroxisomes, which had been postulated to contain distinct primary biochemical properties. Multiple in-depth proteomic analyses had been performed on leaf peroxisomes, yet the major makeup of peroxisomes in seeds or dark-grown seedlings remained unclear. To compare the metabolic pathways of the two dominant plant peroxisomal subtypes and discover new peroxisomal proteins that function specifically during seed germination, we performed proteomic analysis of peroxisomes from etiolated Arabidopsis (Arabidopsis thaliana) seedlings. The detection of 77 peroxisomal proteins allowed us to perform comparative analysis with the peroxisomal proteome of green leaves, which revealed a large overlap between these two primary peroxisomal variants. Subcellular targeting analysis by fluorescence microscopy validated around 10 new peroxisomal proteins in Arabidopsis. Mutant analysis suggested the role of the cysteine protease RESPONSE TO DROUGHT21A-LIKE1 in β-oxidation, seed germination, and growth. This work provides a much-needed road map of a major type of plant peroxisome and has established a basis for future investigations of peroxisomal proteolytic processes to understand their roles in development and in plant interaction with the environment. PMID:24130194
SwissPalm: Protein Palmitoylation database.
Blanc, Mathieu; David, Fabrice; Abrami, Laurence; Migliozzi, Daniel; Armand, Florence; Bürgi, Jérôme; van der Goot, Françoise Gisou
2015-01-01
Protein S-palmitoylation is a reversible post-translational modification that regulates many key biological processes, although the full extent and functions of protein S-palmitoylation remain largely unexplored. Recent developments of new chemical methods have allowed the establishment of palmitoyl-proteomes of a variety of cell lines and tissues from different species. As the amount of information generated by these high-throughput studies is increasing, the field requires centralization and comparison of this information. Here we present SwissPalm ( http://swisspalm.epfl.ch), our open, comprehensive, manually curated resource to study protein S-palmitoylation. It currently encompasses more than 5000 S-palmitoylated protein hits from seven species, and contains more than 500 specific sites of S-palmitoylation. SwissPalm also provides curated information and filters that increase the confidence in true positive hits, and integrates predictions of S-palmitoylated cysteine scores, orthologs and isoform multiple alignments. Systems analysis of the palmitoyl-proteome screens indicate that 10% or more of the human proteome is susceptible to S-palmitoylation. Moreover, ontology and pathway analyses of the human palmitoyl-proteome reveal that key biological functions involve this reversible lipid modification. Comparative analysis finally shows a strong crosstalk between S-palmitoylation and other post-translational modifications. Through the compilation of data and continuous updates, SwissPalm will provide a powerful tool to unravel the global importance of protein S-palmitoylation.
SwissPalm: Protein Palmitoylation database
Abrami, Laurence; Migliozzi, Daniel; Armand, Florence; Bürgi, Jérôme; van der Goot, Françoise Gisou
2015-01-01
Protein S-palmitoylation is a reversible post-translational modification that regulates many key biological processes, although the full extent and functions of protein S-palmitoylation remain largely unexplored. Recent developments of new chemical methods have allowed the establishment of palmitoyl-proteomes of a variety of cell lines and tissues from different species. As the amount of information generated by these high-throughput studies is increasing, the field requires centralization and comparison of this information. Here we present SwissPalm ( http://swisspalm.epfl.ch), our open, comprehensive, manually curated resource to study protein S-palmitoylation. It currently encompasses more than 5000 S-palmitoylated protein hits from seven species, and contains more than 500 specific sites of S-palmitoylation. SwissPalm also provides curated information and filters that increase the confidence in true positive hits, and integrates predictions of S-palmitoylated cysteine scores, orthologs and isoform multiple alignments. Systems analysis of the palmitoyl-proteome screens indicate that 10% or more of the human proteome is susceptible to S-palmitoylation. Moreover, ontology and pathway analyses of the human palmitoyl-proteome reveal that key biological functions involve this reversible lipid modification. Comparative analysis finally shows a strong crosstalk between S-palmitoylation and other post-translational modifications. Through the compilation of data and continuous updates, SwissPalm will provide a powerful tool to unravel the global importance of protein S-palmitoylation. PMID:26339475
Douché, Thibaut; San Clemente, Hélène; Burlat, Vincent; Roujol, David; Valot, Benoît; Zivy, Michel; Pont-Lezica, Rafael; Jamet, Elisabeth
2013-08-01
Polysaccharides make up about 75% of plant cell walls and can be broken down to produce sugar substrates (saccharification) from which a whole range of products can be obtained, including bioethanol. Cell walls also contain 5-10% of proteins, which could be used to tailor them for agroindustrial uses. Here we present cell wall proteomics data of Brachypodium distachyon, a model plant for temperate grasses. Leaves and culms were analyzed during active growth and at mature stage. Altogether, 559 proteins were identified by LC-MS/MS and bioinformatics, among which 314 have predicted signal peptides. Sixty-three proteins were shared by two organs at two developmental stages where they could play housekeeping functions. Differences were observed between organs and stages of development, especially at the level of glycoside hydrolases and oxidoreductases. Differences were also found between the known cell wall proteomes of B. distachyon, Oryza sativa, and the Arabidopsis thaliana dicot. Three glycoside hydrolases could be immunolocalized in cell walls using polyclonal antibodies against proteotypic peptides. Organ-specific expression consistent with proteomics results could be observed as well as cell-specific localization. Moreover, the high number of proteins of unknown function in B. distachyon cell wall proteomes opens new fields of research for monocot cell walls. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Technical Reports Server (NTRS)
Reinsch, S. S.; Galazka, J..; Berrios, D. C; Chakravarty, K.; Fogle, H.; Lai, S.; Bokyo, V.; Timucin, L. R.; Tran, P.; Skidmore, M.
2016-01-01
NASA's mission includes expanding our understanding of biological systems to improve life on Earth and to enable long-duration human exploration of space. The GeneLab Data System (GLDS) is NASA's premier open-access omics data platform for biological experiments. GLDS houses standards-compliant, high-throughput sequencing and other omics data from spaceflight-relevant experiments. The GeneLab project at NASA-Ames Research Center is developing the database, and also partnering with spaceflight projects through sharing or augmentation of experiment samples to expand omics analyses on precious spaceflight samples. The partnerships ensure that the maximum amount of data is garnered from spaceflight experiments and made publically available as rapidly as possible via the GLDS. GLDS Version 1.0, went online in April 2015. Software updates and new data releases occur at least quarterly. As of October 2016, the GLDS contains 80 datasets and has search and download capabilities. Version 2.0 is slated for release in September of 2017 and will have expanded, integrated search capabilities leveraging other public omics databases (NCBI GEO, PRIDE, MG-RAST). Future versions in this multi-phase project will provide a collaborative platform for omics data analysis. Data from experiments that explore the biological effects of the spaceflight environment on a wide variety of model organisms are housed in the GLDS including data from rodents, invertebrates, plants and microbes. Human datasets are currently limited to those with anonymized data (e.g., from cultured cell lines). GeneLab ensures prompt release and open access to high-throughput genomics, transcriptomics, proteomics, and metabolomics data from spaceflight and ground-based simulations of microgravity, radiation or other space environment factors. The data are meticulously curated to assure that accurate experimental and sample processing metadata are included with each data set. GLDS download volumes indicate strong interest of the scientific community in these data. To date GeneLab has partnered with multiple experiments including two plant (Arabidopsis thaliana) experiments, two mice experiments, and several microbe experiments. GeneLab optimized protocols in the rodent partnerships for maximum yield of RNA, DNA and protein from tissues harvested and preserved during the SpaceX-4 mission, as well as from tissues from mice that were frozen intact during spaceflight and later dissected on the ground. Analysis of GeneLab data will contribute fundamental knowledge of how the space environment affects biological systems, and as well as yield terrestrial benefits resulting from mitigation strategies to prevent effects observed during exposure to space environments.
AccessMRS: integrating OpenMRS with smart forms on Android.
Fazen, Louis E; Chemwolo, Benjamin T; Songok, Julia J; Ruhl, Laura J; Kipkoech, Carolyne; Green, James M; Ikemeri, Justus E; Christoffersen-Deb, Astrid
2013-01-01
We present a new open-source Android application, AccessMRS, for interfacing with an electronic medical record system (OpenMRS) and loading 'Smart Forms' on a mobile device. AccessMRS functions as a patient-centered interface for viewing OpenMRS data; managing patient information in reminders, task lists, and previous encounters; and launching patient-specific 'Smart Forms' for electronic data collection and dissemination of health information. We present AccessMRS in the context of related software applications we developed to serve Community Health Workers, including AccessInfo, AccessAdmin, AccessMaps, and AccessForms. The specific features and design of AccessMRS are detailed in relationship to the requirements that drove development: the workflows of the Kenyan Ministry of Health Community Health Volunteers (CHVs) supported by the AMPATH Primary Health Care Program. Specifically, AccessMRS was designed to improve the quality of community-based Maternal and Child Health services delivered by CHVs in Kosirai Division. AccessMRS is currently in use by more than 80 CHVs in Kenya and undergoing formal assessment of acceptability, effectiveness, and cost.
Ramos, H.; Shannon, P.; Aebersold, R.
2008-01-01
Motivation: Mass spectrometry experiments in the field of proteomics produce lists containing tens to thousands of identified proteins. With the protein information and property explorer (PIPE), the biologist can acquire functional annotations for these proteins and explore the enrichment of the list, or fraction thereof, with respect to functional classes. These protein lists may be saved for access at a later time or different location. The PIPE is interoperable with the Firegoose and the Gaggle, permitting wide-ranging data exploration and analysis. The PIPE is a rich-client web application which uses AJAX capabilities provided by the Google Web Toolkit, and server-side data storage using Hibernate. Availability: http://pipe.systemsbiology.net Contact: pshannon@systemsbiology.org PMID:18635572
Basic design of MRM assays for peptide quantification.
James, Andrew; Jorgensen, Claus
2010-01-01
With the recent availability and accessibility of mass spectrometry for basic and clinical research, the requirement for stable, sensitive, and reproducible assays to specifically detect proteins of interest has increased. Multiple reaction monitoring (MRM) or selective reaction monitoring (SRM) is a highly selective, sensitive, and robust assay to monitor the presence and amount of biomolecules. Until recently, MRM was typically used for the detection of drugs and other biomolecules from body fluids. With increased focus on biomarkers and systems biology approaches, researchers in the proteomics field have taken advantage of this approach. In this chapter, we will introduce the reader to the basic principle of designing and optimizing an MRM workflow. We provide examples of MRM workflows for standard proteomic samples and provide suggestions for the reader who is interested in using MRM for quantification.
Brusniak, Mi-Youn; Bodenmiller, Bernd; Campbell, David; Cooke, Kelly; Eddes, James; Garbutt, Andrew; Lau, Hollis; Letarte, Simon; Mueller, Lukas N; Sharma, Vagisha; Vitek, Olga; Zhang, Ning; Aebersold, Ruedi; Watts, Julian D
2008-01-01
Background Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. Conclusion The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field. PMID:19087345
Sandia National Laboratories: Livermore Valley Open Campus (LVOC)
Visiting the LVOC Locations Livermore Valley Open Campus (LVOC) Open engagement Expanding opportunities for open engagement of the broader scientific community. Building on success Sandia's Combustion Research Facility pioneered open collaboration over 30 years ago. Access to DOE-funded capabilities Expanding access
Common bean proteomics: Present status and future strategies.
Zargar, Sajad Majeed; Mahajan, Reetika; Nazir, Muslima; Nagar, Preeti; Kim, Sun Tae; Rai, Vandna; Masi, Antonio; Ahmad, Syed Mudasir; Shah, Riaz Ahmad; Ganai, Nazir Ahmad; Agrawal, Ganesh K; Rakwal, Randeep
2017-10-03
Common bean (Phaseolus vulgaris L.) is a legume of appreciable importance and usefulness worldwide to the human population providing food and feed. It is rich in high-quality protein, energy, fiber and micronutrients especially iron, zinc, and pro-vitamin A; and possesses potentially disease-preventing and health-promoting compounds. The recently published genome sequence of common bean is an important landmark in common bean research, opening new avenues for understanding its genetics in depth. This legume crop is affected by diverse biotic and abiotic stresses severely limiting its productivity. Looking at the trend of increasing world population and the need for food crops best suited to the health of humankind, the legumes will be in great demand, including the common bean mostly for its nutritive values. Hence the need for new research in understanding the biology of this crop brings us to utilize and apply high-throughput omics approaches. In this mini-review our focus will be on the need for proteomics studies in common bean, potential of proteomics for understanding genetic regulation under abiotic and biotic stresses and how proteogenomics will lead to nutritional improvement. We will also discuss future proteomics-based strategies that must be adopted to mine new genomic resources by identifying molecular switches regulating various biological processes. Common bean is regarded as "grain of hope" for the poor, being rich in high-quality protein, energy, fiber and micronutrients (iron, zinc, pro-vitamin A); and possesses potentially disease-preventing and health-promoting compounds. Increasing world population and the need for food crops best suited to the health of humankind, puts legumes into great demand, which includes the common bean mostly. An important landmark in common bean research was the recent publication of its genome sequence, opening new avenues for understanding its genetics in depth. This legume crop is affected by diverse biotic and abiotic stresses severely limiting its productivity. Therefore, the need for new research in understanding the biology of this crop brings us to utilize and apply high-throughput omics approaches. Proteomics can be used to track all the candidate proteins/genes responsible for a biological process under specific conditions in a particular tissue. The potential of proteomics will not only help in determining the functions of a large number of genes in a single experiment but will also be a useful tool to mine new genes that can provide solution to various problems (abiotic stress, biotic stress, nutritional improvement, etc). We believe that a combined approach including breeding along with omics tools will lead towards attaining sustainability in legumes, including common bean. Copyright © 2017 Elsevier B.V. All rights reserved.
Open access, open education resources and open data in Uganda.
Salvo, Ivana Di; Mwoka, Meggie; Kwaga, Teddy; Rukundo, Priscilla Aceng; Ernest, Dennis Ssesanga; Osaheni, Louis Aikoriogie; John, Kasibante; Shafik, Kasirye; de Sousa, Agostinho Moreira
2015-01-01
As a follow up to OpenCon 2014, International Federation of Medical Students' Associations (IFMSA) students organized a 3 day workshop Open Access, Open Education Resources and Open Data in Kampala from 15-18 December 2014. One of the aims of the workshop was to engage the Open Access movement in Uganda which encompasses the scientific community, librarians, academia, researchers and students. The IFMSA students held the workshop with the support of: Consortium for Uganda University Libraries (CUUL), The Right to Research Coalition, Electronic Information for Libraries (EIFL), Makerere University, International Health Sciences University (IHSU), Pan African Medical Journal (PAMJ) and the Centre for Health Human Rights and Development (CEHURD). All these organizations are based or have offices in Kampala. The event culminated in a meeting with the Science and Technology Committee of Parliament of Uganda in order to receive the support of the Ugandan Members of Parliament and to make a concrete change for Open Access in the country.
ERIC Educational Resources Information Center
Elliott, Colin; Fabbro, Elaine
2015-01-01
To address challenges that learners, course creators, librarians and academics involved with OER and MOOCs are facing when looking for scholarly materials, Athabasca University Library has initiated the development of "the Open Library at AU." This open library is a full library website that provides easy access to open and free…
Shotgun proteomics reveals physiological response to ocean acidification in Crassostrea gigas.
Timmins-Schiffman, Emma; Coffey, William D; Hua, Wilber; Nunn, Brook L; Dickinson, Gary H; Roberts, Steven B
2014-11-03
Ocean acidification as a result of increased anthropogenic CO2 emissions is occurring in marine and estuarine environments worldwide. The coastal ocean experiences additional daily and seasonal fluctuations in pH that can be lower than projected end-of-century open ocean pH reductions. In order to assess the impact of ocean acidification on marine invertebrates, Pacific oysters (Crassostrea gigas) were exposed to one of four different p CO2 levels for four weeks: 400 μatm (pH 8.0), 800 μatm (pH 7.7), 1000 μatm (pH 7.6), or 2800 μatm (pH 7.3). At the end of the four week exposure period, oysters in all four p CO2 environments deposited new shell, but growth rate was not different among the treatments. However, micromechanical properties of the new shell were compromised by elevated p CO2. Elevated p CO2 affected neither whole body fatty acid composition, nor glycogen content, nor mortality rate associated with acute heat shock. Shotgun proteomics revealed that several physiological pathways were significantly affected by ocean acidification, including antioxidant response, carbohydrate metabolism, and transcription and translation. Additionally, the proteomic response to a second stress differed with p CO2, with numerous processes significantly affected by mechanical stimulation at high versus low p CO2 (all proteomics data are available in the ProteomeXchange under the identifier PXD000835). Oyster physiology is significantly altered by exposure to elevated p CO2, indicating changes in energy resource use. This is especially apparent in the assessment of the effects of p CO2 on the proteomic response to a second stress. The altered stress response illustrates that ocean acidification may impact how oysters respond to other changes in their environment. These data contribute to an integrative view of the effects of ocean acidification on oysters as well as physiological trade-offs during environmental stress.
The Oxford Probe: an open access five-hole probe for aerodynamic measurements
NASA Astrophysics Data System (ADS)
Hall, B. F.; Povey, T.
2017-03-01
The Oxford Probe is an open access five-hole probe designed for experimental aerodynamic measurements. The open access probe can be manufactured by the end user via additive manufacturing (metal or plastic). The probe geometry, drawings, calibration maps, and software are available under a creative commons license. The purpose is to widen access to aerodynamic measurement techniques in education and research environments. There are many situations in which the open access probe will allow results of comparable accuracy to a well-calibrated commercial probe. We discuss the applications and limitations of the probe, and compare the calibration maps for 16 probes manufactured in different materials and at different scales, but with the same geometrical design.
Looking for Cancer Clues in Publicly Accessible Databases
Lemkin, Peter F.; Smythers, Gary W.; Munroe, David J.
2004-01-01
What started out as a mere attempt to tentatively identify proteins in experimental cancer-related 2D-PAGE maps developed into VIRTUAL2D, a web-accessible repository for theoretical pI/MW charts for 92 organisms. Using publicly available expression data, we developed a collection of tissue-specific plots based on differential gene expression between normal and diseased states. We use this comparative cancer proteomics knowledge base, known as the tissue molecular anatomy project (TMAP), to uncover threads of cancer markers common to several types of cancer and to relate this information to established biological pathways. PMID:18629065
Looking for cancer clues in publicly accessible databases.
Medjahed, Djamel; Lemkin, Peter F; Smythers, Gary W; Munroe, David J
2004-01-01
What started out as a mere attempt to tentatively identify proteins in experimental cancer-related 2D-PAGE maps developed into VIRTUAL2D, a web-accessible repository for theoretical pI/MW charts for 92 organisms. Using publicly available expression data, we developed a collection of tissue-specific plots based on differential gene expression between normal and diseased states. We use this comparative cancer proteomics knowledge base, known as the tissue molecular anatomy project (TMAP), to uncover threads of cancer markers common to several types of cancer and to relate this information to established biological pathways.
Strategies for Success: Open Access Policies at North American Educational Institutions
ERIC Educational Resources Information Center
Fruin, Christine; Sutton, Shan
2016-01-01
Recognizing the paucity of quantitative and qualitative data from North American educational institutions that have pursued open access policies, the authors devised a survey to collect information on the characteristics of these institutions, as well as the elements of the open access policies, the methods of promoting these policies, faculty…
Students' Experiences with Community in an Open Access Course
ERIC Educational Resources Information Center
Blackmon, Stephanie J.; Cullen, Theresa A.
2016-01-01
Online open access courses have become regular offerings of many universities. Building community and connectedness is an important part of branding and success of such offerings. Our goal was to investigate students' experiences with community in an open access course. Therefore, in this study, we explored the sense of community of 342…
EUA's Open Access Checklist for Universities: A Practical Guide on Implementation
ERIC Educational Resources Information Center
Morais, Rita; Lourenço, Joana; Smith, John H.; Borrell-Damian, Lidia
2015-01-01
Open Access (OA) to research publications has received increased attention from the academic community, scientific publishers, research funding agencies and governments. This movement has been growing exponentially in recent years, both in terms of the increasing number of Open Access journals and the proliferation of policies on this topic. The…
Perspective on Open-Access Publishing: An Interview with Peter Suber
ERIC Educational Resources Information Center
Cornwell, Reid; Suber, Peter
2008-01-01
In this edition of Perspectives, Reid Cornwell discusses open-access publishing with Peter Suber, senior researcher at the Scholarly Publishing and Academic Resources Coalition, senior research professor of philosophy at Earlham College, and currently visiting fellow at Yale Law School. Open access means that scholarly work is freely and openly…
Frame of Reference: Open Access Starts with You
ERIC Educational Resources Information Center
Goetsch, Lori A.
2010-01-01
Federal legislation now requires the deposit of some taxpayer-funded research in "open-access" repositories--that is, sites where scholarship and research are made freely available over the Internet. The institutions whose faculty produce the research have begun to see the benefit of open-access publication as well. From the perspective of faculty…
The Air Force In Silico -- Computational Biology in 2025
2007-11-01
and chromosome) these new fields are commonly referred to as “~omics.” Proteomics , transcriptomics, metabolomics , epigenomics, physiomics... Bioinformatics , 2006, http://journal.imbio.de/ http://www-bm.ipk-gatersleben.de/stable/php/ journal /articles/pdf/jib-22.pdf (accessed 30 September...Chirino, G. Tansley and I. Dryden, “The implications for Bioinformatics of integration across physical scales,” Journal of Integrative Bioinformatics
Vanhove, Anne-Catherine; Vermaelen, Wesley; Panis, Bart; Swennen, Rony; Carpentier, Sebastien C
2012-01-01
There is a great need for research aimed at understanding drought tolerance, screening for drought tolerant varieties and breeding crops with an improved water use efficiency. Bananas and plantains are a major staple food and export product with a worldwide production of over 135 million tonnes per year. Water however is the most limiting abiotic factor in banana production. A screening of the Musa biodiversity has not yet been performed. We at KU Leuven host the Musa International Germplasm collection with over 1200 accessions. To screen the Musa biodiversity for drought tolerant varieties, we developed a screening test for in vitro plants. Five varieties representing different genomic constitutions in banana (AAAh, AAA, AAB, AABp, and ABB) were selected and subjected to a mild osmotic stress. The ABB variety showed the smallest stress induced growth reduction. To get an insight into the acclimation and the accomplishment of homeostasis, the leaf proteome of this variety was characterized via 2D DIGE. After extraction of the leaf proteome of six control and six stressed plants, 2600 spots could be distinguished. A PCA analysis indicates that control and stressed plants can blindly be classified based on their proteome. One hundred and twelve proteins were significantly more abundant in the stressed plants and 18 proteins were significantly more abundant in control plants (FDR α 0.05). Twenty four differential proteins could be identified. The proteome analysis clearly shows that there is a new balance in the stressed plants and that the respiration, metabolism of ROS and several dehydrogenases involved in NAD/NADH homeostasis play an important role.
Identifier mapping performance for integrating transcriptomics and proteomics experimental results
2011-01-01
Background Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. Results We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. Conclusions The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging. PMID:21619611
Lehmann, Roland; Schmidt, André; Pastuschek, Jana; Müller, Mario M; Fritzsche, Andreas; Dieterle, Stefan; Greb, Robert R; Markert, Udo R; Slevogt, Hortense
2018-06-25
The proteomic analysis of complex body fluids by liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis requires the selection of suitable sample preparation techniques and optimal parameter settings in data analysis software packages to obtain reliable results. Proteomic analysis of follicular fluid, as a representative of a complex body fluid similar to serum or plasma, is difficult as it contains a vast amount of high abundant proteins and a variety of proteins with different concentrations. However, the accessibility of this complex body fluid for LC-MS/MS analysis is an opportunity to gain insights into the status, the composition of fertility-relevant proteins including immunological factors or for the discovery of new diagnostic and prognostic markers for, for example, the treatment of infertility. In this study, we compared different sample preparation methods (FASP, eFASP and in-solution digestion) and three different data analysis software packages (Proteome Discoverer with SEQUEST, Mascot and MaxQuant with Andromeda) combined with semi- and full-tryptic databank search options to obtain a maximum coverage of the follicular fluid proteome. We found that the most comprehensive proteome coverage is achieved by the eFASP sample preparation method using SDS in the initial denaturing step and the SEQUEST-based semi-tryptic data analysis. In conclusion, we have developed a fractionation-free methodical workflow for in depth LC-MS/MS-based analysis for the standardized investigation of human follicle fluid as an important representative of a complex body fluid. Taken together, we were able to identify a total of 1392 proteins in follicular fluid. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Tear film proteome in age-related macular degeneration.
Winiarczyk, Mateusz; Kaarniranta, Kai; Winiarczyk, Stanisław; Adaszek, Łukasz; Winiarczyk, Dagmara; Mackiewicz, Jerzy
2018-06-01
Age-related macular degeneration (AMD) is the main reason for blindness in elderly people in the developed countries. Current screening protocols have limitations in detecting the early signs of retinal degeneration. Therefore, it would be desirable to find novel biomarkers for early detection of AMD. Development of novel biomarkers would help in the prevention, diagnostics, and treatment of AMD. Proteomic analysis of tear film has shown promise in this research area. If an optimal set of biomarkers could be obtained from accessible body fluids, it would represent a reliable way to monitor disease progression and response to novel therapies. Tear films were collected on Schirmer strips from a total of 22 patients (8 with wet AMD, 6 with dry AMD, and 8 control individuals). 2D electrophoresis was used to separate tear film proteins prior to their identification with matrix-assisted laser desorption/ionization time of flight spectrometer (MALDI-TOF/TOF) and matching with functional databases. A total of 342 proteins were identified. Most of them were previously described in various proteomic studies concerning AMD. Shootin-1, histatin-3, fidgetin-like protein 1, SRC kinase signaling inhibitor, Graves disease carrier protein, actin cytoplasmic 1, prolactin-inducible protein 1, and protein S100-A7A were upregulated in the tear film samples isolated from AMD patients and were not previously linked with this disease in any proteomic analysis. The upregulated proteins supplement our current knowledge of AMD pathogenesis, providing evidence that certain specific proteins are expressed into the tear film in AMD. As far we are aware, this is the first study to have undertaken a comprehensive in-depth analysis of the human tear film proteome in AMD patients.
[Targeted therapy: toward a clean and effective war against cancer].
Castronovo, V; Waltregny, D; Detry, O; Coimbra Marques, C; De Roover, A; Honoré, P; De Pauw, E; Turtoi, A
2009-01-01
One promising avenue towards the development of more selective, better anticancer drugs consists in the targeted delivery of bioactive compounds to the tumor environment by means of binding molecules specific for tumor-associated biomarkers. Eligibility of such markers for therapeutic use implies ideally three criteria : (i) accessibility from the bloodstream, (ii) expression at sufficient level and (iii) no (or much lower) expression in normal tissues. Most current discovery strategies (such as biomarker searching into body fluids) provide no clue as to whether proteins of interest are accessible, in human tissues, to suitable high-affinity ligands, such as systemically delivered monoclonal antibodies. Innovative proteomic technologies are able to identify such accessible biomarkers and represent a key step in the clinical development of such target therapies.
Publish or perish, and pay--the new paradigm of open-access journals.
Tzarnas, Stephanie; Tzarnas, Chris D
2015-01-01
The new open-access journal business model is changing the publication landscape and residents and junior faculty should be aware of these changes. A national survey of surgery program directors and residents was performed. Open-access journals have been growing over the past decade, and many traditional printed journals are also sponsoring open-access options (the hybrid model) for accepted articles. Authors need to be aware of the new publishing paradigm and potential costs involved in publishing their work. Copyright © 2014 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Herzog, Rebecca; Boehm, Michael; Unterwurzacher, Markus; Wagner, Anja; Parapatics, Katja; Májek, Peter; Mueller, André C.; Lichtenauer, Anton; Bennett, Keiryn L.; Alper, Seth L.; Vychytil, Andreas; Aufricht, Christoph; Kratochwill, Klaus
2018-01-01
Peritoneal dialysis (PD) is a modality of renal replacement therapy in which the high volumes of available PD effluent (PDE) represents a rich source of biomarkers for monitoring disease and therapy. Although this information could help guide the management of PD patients, little is known about the potential of PDE to define pathomechanism-associated molecular signatures in PD. We therefore subjected PDE to a high-performance multiplex proteomic analysis after depletion of highly-abundant plasma proteins and enrichment of low-abundance proteins. A combination of label-free and isobaric labeling strategies was applied to PDE samples from PD patients (n = 20) treated in an open-label, randomized, two-period, cross-over clinical trial with standard PD fluid or with a novel PD fluid supplemented with alanyl-glutamine (AlaGln). With this workflow we identified 2506 unique proteins in the PDE proteome, greatly increasing coverage beyond the 171 previously-reported proteins. The proteins identified range from high abundance plasma proteins to low abundance cellular proteins, and are linked to larger numbers of biological processes and pathways, some of which are novel for PDE. Interestingly, proteins linked to membrane remodeling and fibrosis are overrepresented in PDE compared with plasma, whereas the proteins underrepresented in PDE suggest decreases in host defense, immune-competence and response to stress. Treatment with AlaGln-supplemented PD fluid is associated with reduced activity of membrane injury-associated mechanisms and with restoration of biological processes involved in stress responses and host defense. Our study represents the first application of the PDE proteome in a randomized controlled prospective clinical trial of PD. This novel proteomic workflow allowed detection of low abundance biomarkers to define pathomechanism-associated molecular signatures in PD and their alterations by a novel therapeutic intervention. PMID:29208752
Feng, Xin; Xu, Jian; Liang, Yu; Chen, Guo-Li; Fan, Xian-Wei; Li, You-Zhi
2017-08-01
Filamentous fungi-copper (Cu) interactions are very important in the formation of natural ecosystems and the bioremediation of heavy metal pollution. However, important issues at the proteome level remain unclear. We compared six proteomes from Cu-resistant wild-type (WT) Penicillium janthinellum strain GXCR and a Cu-sensitive mutant (EC-6) under 0, 0.5, and 3 mmol/L Cu treatments using iTRAQ. A total of 495 known proteins were identified, and the following conclusions were drawn from the results: Cu tolerance depends on ATP generation and supply, which is relevant to glycolysis pathway activity; oxidative phosphorylation, the TCA cycle, gluconeogenesis, fatty acid synthesis, and metabolism are also affected by Cu; high Cu sensitivity is primarily due to an ATP energy deficit; among ATP generation pathways, Cu-sensitive and Cu-insensitive metabolic steps exist; gluconeogenesis pathway is crucial to the survival of fungi in Cu-containing and sugar-scarce environments; fungi change their proteomes via two routes (from ATP, ATP-dependent RNA helicases (ADRHs), and ribosome biogenesis to proteasomes and from ATP, ADRHs to spliceosomes and/or stress-adapted RNA degradosomes) to cope with changes in Cu concentrations; and unique routes exist through which fungi respond to high environmental Cu. Further, a general diagram of Cu-responsive paths and a model theory of high Cu are proposed at the proteome level. Our work not only provides the potential protein biomarkers that indicate Cu pollution and targets metabolic steps for engineering Cu-tolerant fungi during bioremediation but also presents clues for further insight into the heavy metal tolerance mechanisms of other eukaryotes. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Lee, HoJoon; Palm, Jennifer; Grimes, Susan M; Ji, Hanlee P
2015-10-27
The Cancer Genome Atlas (TCGA) project has generated genomic data sets covering over 20 malignancies. These data provide valuable insights into the underlying genetic and genomic basis of cancer. However, exploring the relationship among TCGA genomic results and clinical phenotype remains a challenge, particularly for individuals lacking formal bioinformatics training. Overcoming this hurdle is an important step toward the wider clinical translation of cancer genomic/proteomic data and implementation of precision cancer medicine. Several websites such as the cBio portal or University of California Santa Cruz genome browser make TCGA data accessible but lack interactive features for querying clinically relevant phenotypic associations with cancer drivers. To enable exploration of the clinical-genomic driver associations from TCGA data, we developed the Cancer Genome Atlas Clinical Explorer. The Cancer Genome Atlas Clinical Explorer interface provides a straightforward platform to query TCGA data using one of the following methods: (1) searching for clinically relevant genes, micro RNAs, and proteins by name, cancer types, or clinical parameters; (2) searching for genomic/proteomic profile changes by clinical parameters in a cancer type; or (3) testing two-hit hypotheses. SQL queries run in the background and results are displayed on our portal in an easy-to-navigate interface according to user's input. To derive these associations, we relied on elastic-net estimates of optimal multiple linear regularized regression and clinical parameters in the space of multiple genomic/proteomic features provided by TCGA data. Moreover, we identified and ranked gene/micro RNA/protein predictors of each clinical parameter for each cancer. The robustness of the results was estimated by bootstrapping. Overall, we identify associations of potential clinical relevance among genes/micro RNAs/proteins using our statistical analysis from 25 cancer types and 18 clinical parameters that include clinical stage or smoking history. The Cancer Genome Atlas Clinical Explorer enables the cancer research community and others to explore clinically relevant associations inferred from TCGA data. With its accessible web and mobile interface, users can examine queries and test hypothesis regarding genomic/proteomic alterations across a broad spectrum of malignancies.
ERIC Educational Resources Information Center
Sambe, Manasseh Tyungu; Raphael, Gabriel Okplogidi
2015-01-01
This study examines the kinds of open access scholarly publication or information resources accepted and adopted by federal university libraries in South East Nigeria. The purpose was to determine the factors that affect open access scholarly publication or information resources acceptance and adoption in university libraries. The study adopted…
The Open Access Availability of Library and Information Science Literature
ERIC Educational Resources Information Center
Way, Doug
2010-01-01
To examine the open access availability of Library and Information Science (LIS) research, a study was conducted using Google Scholar to search for articles from 20 top LIS journals. The study examined whether Google Scholar was able to find any links to full text, if open access versions of the articles were available and where these articles…
Code of Federal Regulations, 2011 CFR
2011-07-01
... decision on whether a grantee or transporter has provided open and nondiscriminatory access? 291.112..., DEPARTMENT OF THE INTERIOR APPEALS OPEN AND NONDISCRIMINATORY ACCESS TO OIL AND GAS PIPELINES UNDER THE OUTER... grantee or transporter has provided open and nondiscriminatory access? MMS will begin processing a...
BMC Medicine editorial board members on open access publishing.
Carmont, Michael R; Lawn, Stephen D; Stray-Pedersen, Babill; Shoenfeld, Yehuda; Meier, Pascal
2014-01-21
In recognition of Open Access week (21st-27th October 2013), we asked some BMC Medicine Editorial Board Members to share their views and experiences on open access publishing. In this short video, they highlight the benefits of visibility and dissemination of their research, and discuss the future directions for this model of publishing.
BMC medicine editorial board members on open access publishing
2014-01-01
In recognition of Open Access week (21st-27th October 2013), we asked some BMC Medicine Editorial Board Members to share their views and experiences on open access publishing. In this short video, they highlight the benefits of visibility and dissemination of their research, and discuss the future directions for this model of publishing. PMID:24447778
jqcML: an open-source java API for mass spectrometry quality control data in the qcML format.
Bittremieux, Wout; Kelchtermans, Pieter; Valkenborg, Dirk; Martens, Lennart; Laukens, Kris
2014-07-03
The awareness that systematic quality control is an essential factor to enable the growth of proteomics into a mature analytical discipline has increased over the past few years. To this aim, a controlled vocabulary and document structure have recently been proposed by Walzer et al. to store and disseminate quality-control metrics for mass-spectrometry-based proteomics experiments, called qcML. To facilitate the adoption of this standardized quality control routine, we introduce jqcML, a Java application programming interface (API) for the qcML data format. First, jqcML provides a complete object model to represent qcML data. Second, jqcML provides the ability to read, write, and work in a uniform manner with qcML data from different sources, including the XML-based qcML file format and the relational database qcDB. Interaction with the XML-based file format is obtained through the Java Architecture for XML Binding (JAXB), while generic database functionality is obtained by the Java Persistence API (JPA). jqcML is released as open-source software under the permissive Apache 2.0 license and can be downloaded from https://bitbucket.org/proteinspector/jqcml .
Panorama: A Targeted Proteomics Knowledge Base
2015-01-01
Panorama is a web application for storing, sharing, analyzing, and reusing targeted assays created and refined with Skyline,1 an increasingly popular Windows client software tool for targeted proteomics experiments. Panorama allows laboratories to store and organize curated results contained in Skyline documents with fine-grained permissions, which facilitates distributed collaboration and secure sharing of published and unpublished data via a web-browser interface. It is fully integrated with the Skyline workflow and supports publishing a document directly to a Panorama server from the Skyline user interface. Panorama captures the complete Skyline document information content in a relational database schema. Curated results published to Panorama can be aggregated and exported as chromatogram libraries. These libraries can be used in Skyline to pick optimal targets in new experiments and to validate peak identification of target peptides. Panorama is open-source and freely available. It is distributed as part of LabKey Server,2 an open source biomedical research data management system. Laboratories and organizations can set up Panorama locally by downloading and installing the software on their own servers. They can also request freely hosted projects on https://panoramaweb.org, a Panorama server maintained by the Department of Genome Sciences at the University of Washington. PMID:25102069
Radioactive hot cell access hole decontamination machine
Simpson, William E.
1982-01-01
Radioactive hot cell access hole decontamination machine. A mobile housing has an opening large enough to encircle the access hole and has a shielding door, with a door opening and closing mechanism, for uncovering and covering the opening. The housing contains a shaft which has an apparatus for rotating the shaft and a device for independently translating the shaft from the housing through the opening and access hole into the hot cell chamber. A properly sized cylindrical pig containing wire brushes and cloth or other disks, with an arrangement for releasably attaching it to the end of the shaft, circumferentially cleans the access hole wall of radioactive contamination and thereafter detaches from the shaft to fall into the hot cell chamber.
[Open availability of articles and raw research data in Spanish pediatrics journals].
Aleixandre-Benavent, R; Vidal-Infer, A; Alonso-Arroyo, A; González de Dios, J; Ferrer-Sapena, A; Peset, F
2015-01-01
The open Access to publications and the raw data allows its re-use and enhances the advancement of science. The aim of this paper is to identify these practices in Spanish pediatrics journals. We reviewed the author's instructions in 13 Spanish pediatrics journals, identifying their open access and deposit policy. Eight journals allow open access without restriction, and 5 provide information on the ability to re-use and depositing data in repositories or websites. Most of the journals have open access, but do not promote the deposit of additional material or articles in repositories or websites. Copyright © 2013 Asociación Española de Pediatría. Published by Elsevier Espana. All rights reserved.
Positioning Your Library in an Open-Access Environment
ERIC Educational Resources Information Center
Bhatt, Anjana H.
2010-01-01
This paper is a summary of the project that the author completed at Florida Gulf Coast University (FGCU) library for providing online access to 80 open access E-journals and digital collections. Although FGCU uses SerialsSolutions products to establish online access, any one can provide access to these collections as they are free for all. Paper…
Is open access sufficient? A review of the quality of open-access nursing journals.
Crowe, Marie; Carlyle, Dave
2015-02-01
The present study aims to review the quality of open-access nursing journals listed in the Directory of Open Access Journals that published papers in 2013 with a nursing focus, written in English, and were freely accessible. Each journal was reviewed in relation to their publisher, year of commencement, number of papers published in 2013, fee for publication, indexing, impact factor, and evidence of requirements for ethics and disclosure statements. The quality of the journals was assessed by impact factors and the requirements for indexing in PubMed. A total of 552 were published in 2013 in the 19 open-access nursing journals that met the inclusion criteria. No journals had impact factors listed in Web of Knowledge, but three had low Scopus impact factors. Only five journals were indexed with PubMed. The quality of the 19 journals included in the review was evaluated as inferior to most subscription-fee journals. Mental health nursing has some responsibility to the general public, and in particular, consumers of mental health services and their families, for the quality of papers published in open-access journals. The way forward might involve dual-platform publication or a process that enables assessment of how research has improved clinical outcomes. © 2014 Australian College of Mental Health Nurses Inc.
Web accessibility and open source software.
Obrenović, Zeljko
2009-07-01
A Web browser provides a uniform user interface to different types of information. Making this interface universally accessible and more interactive is a long-term goal still far from being achieved. Universally accessible browsers require novel interaction modalities and additional functionalities, for which existing browsers tend to provide only partial solutions. Although functionality for Web accessibility can be found as open source and free software components, their reuse and integration is complex because they were developed in diverse implementation environments, following standards and conventions incompatible with the Web. To address these problems, we have started several activities that aim at exploiting the potential of open-source software for Web accessibility. The first of these activities is the development of Adaptable Multi-Interface COmmunicator (AMICO):WEB, an infrastructure that facilitates efficient reuse and integration of open source software components into the Web environment. The main contribution of AMICO:WEB is in enabling the syntactic and semantic interoperability between Web extension mechanisms and a variety of integration mechanisms used by open source and free software components. Its design is based on our experiences in solving practical problems where we have used open source components to improve accessibility of rich media Web applications. The second of our activities involves improving education, where we have used our platform to teach students how to build advanced accessibility solutions from diverse open-source software. We are also partially involved in the recently started Eclipse projects called Accessibility Tools Framework (ACTF), the aim of which is development of extensible infrastructure, upon which developers can build a variety of utilities that help to evaluate and enhance the accessibility of applications and content for people with disabilities. In this article we briefly report on these activities.
Proteomic Alterations in Aqueous Humor From Patients With Primary Open Angle Glaucoma.
Sharma, Shruti; Bollinger, Kathryn E; Kodeboyina, Sai Karthik; Zhi, Wenbo; Patton, Jordan; Bai, Shan; Edwards, Blake; Ulrich, Lane; Bogorad, David; Sharma, Ashok
2018-05-01
Primary open angle glaucoma (POAG) is the most prevalent form of glaucoma, accounting for approximately 90% of all cases. The aqueous humor (AH), a biological fluid in the anterior and posterior chambers of the eye, is involved in a multitude of functions including the maintenance of IOP and ocular homeostasis. This fluid is very close to the pathologic site and is also known to have a significant role in glaucoma pathogenesis. The purpose of this study was to identify proteomic alterations in AH from patients with POAG. AH samples were extracted from 47 patients undergoing cataract surgery (controls: n = 32; POAG: n = 15). Proteomic analysis of the digested samples was accomplished by liquid-chromatography-mass spectrometry. The identified proteins were evaluated using a variety of statistical and bioinformatics methods. A total of 33 proteins were significantly altered in POAG subjects compared with the controls. The most abundant proteins in POAG subjects are IGKC (13.56-fold), ITIH4 (4.1-fold), APOC3 (3.36-fold), IDH3A (3.11-fold), LOC105369216 (2.98-fold). SERPINF2 (2.94-fold), NPC2 (2.88-fold), SUCLG2 (2.70-fold), KIAA0100 (2.29-fold), CNOT4 (2.23-fold), AQP4 (2.11-fold), COL18A1 (2.08-fold), NWD1 (2.07-fold), and TMEM120B (2.06-fold). A significant increasing trend in the odds ratios of having POAG was observed with increased levels of these proteins. Proteins identified in this study are implicated in signaling, glycosylation, immune response, molecular transport, and lipid metabolism. The identified candidate proteins may be potential biomarkers associated with POAG development and may lead to more insight in understanding the mechanisms underlying the pathogenesis of this disease.
Code of Federal Regulations, 2010 CFR
2010-04-01
... Wholesale Electric Quadrant, which are incorporated herein by reference: (1) Open Access Same-Time....13, 001-1.0, 001-9.7, 001-14.1.3, and 001-15.1.2); (2) Open Access Same-Time Information Systems... minor corrections applied May 29, 2009 and September 8, 2009); (3) Open Access Same-Time Information...
Code of Federal Regulations, 2012 CFR
2012-04-01
... Wholesale Electric Quadrant, which are incorporated herein by reference: (1) Open Access Same-Time....13, 001-1.0, 001-9.7, 001-14.1.3, and 001-15.1.2); (2) Open Access Same-Time Information Systems... minor corrections applied May 29, 2009 and September 8, 2009); (3) Open Access Same-Time Information...
"I've Never Heard of It Before": Awareness of Open Access at a Small Liberal Arts University
ERIC Educational Resources Information Center
Kocken, Gregory J.; Wical, Stephanie H.
2013-01-01
Small colleges and universities, often late adopters of institutional repositories and open access initiatives, face challenges that have not fully been explored in the professional literature. In an effort to gauge the level of awareness of open access and institutional repositories at the University of Wisconsin-Eau Claire (UWEC), the authors of…
The Impact Factor: Implications of Open Access on Quality
ERIC Educational Resources Information Center
Grozanick, Sara E.
2010-01-01
There has been debate about the extent to which open access affects the quality of scholarly work. At the same time, researchers have begun to look for ways to evaluate the quality of open access publications. Dating back to the growth of citation indexes during the 1960s and 1970s, citation analysis--examining citation statistics--has since been…
Federal Register 2010, 2011, 2012, 2013, 2014
2010-02-10
... Fisheries Act Catcher Vessels in the Inshore Open Access Fishery in the Bering Sea and Aleutian Islands... allowable catch (TAC) allocated to the inshore open access fishery in the BSAI. DATES: Effective 1200 hrs... open access fishery in the BSAI is 2,762 metric tons (mt) as established by the final 2009 and 2010...
The Future of Open Access Publishing in the Netherlands: Constant Dripping Wears Away the Stone
ERIC Educational Resources Information Center
Woutersen-Windhouwer, Saskia
2013-01-01
At present, about 20% of the scientific publications worldwide are freely (open-access) available (Bjork, Welling, Laakso, Majlender, Hedlund, & Guonason, 2010) and this percentage is constantly on the rise. In the Netherlands, a similar trend is visible (see Fig. 1). Why is open-access (OA) publishing important, and why will it become even…
Transcriptome of the adult female malaria mosquito vector Anopheles albimanus.
Martínez-Barnetche, Jesús; Gómez-Barreto, Rosa E; Ovilla-Muñoz, Marbella; Téllez-Sosa, Juan; García López, David E; Dinglasan, Rhoel R; Ubaida Mohien, Ceereena; MacCallum, Robert M; Redmond, Seth N; Gibbons, John G; Rokas, Antonis; Machado, Carlos A; Cazares-Raga, Febe E; González-Cerón, Lilia; Hernández-Martínez, Salvador; Rodríguez López, Mario H
2012-05-30
Human Malaria is transmitted by mosquitoes of the genus Anopheles. Transmission is a complex phenomenon involving biological and environmental factors of humans, parasites and mosquitoes. Among more than 500 anopheline species, only a few species from different branches of the mosquito evolutionary tree transmit malaria, suggesting that their vectorial capacity has evolved independently. Anopheles albimanus (subgenus Nyssorhynchus) is an important malaria vector in the Americas. The divergence time between Anopheles gambiae, the main malaria vector in Africa, and the Neotropical vectors has been estimated to be 100 My. To better understand the biological basis of malaria transmission and to develop novel and effective means of vector control, there is a need to explore the mosquito biology beyond the An. gambiae complex. We sequenced the transcriptome of the An. albimanus adult female. By combining Sanger, 454 and Illumina sequences from cDNA libraries derived from the midgut, cuticular fat body, dorsal vessel, salivary gland and whole body, we generated a single, high-quality assembly containing 16,669 transcripts, 92% of which mapped to the An. darlingi genome and covered 90% of the core eukaryotic genome. Bidirectional comparisons between the An. gambiae, An. darlingi and An. albimanus predicted proteomes allowed the identification of 3,772 putative orthologs. More than half of the transcripts had a match to proteins in other insect vectors and had an InterPro annotation. We identified several protein families that may be relevant to the study of Plasmodium-mosquito interaction. An open source transcript annotation browser called GDAV (Genome-Delinked Annotation Viewer) was developed to facilitate public access to the data generated by this and future transcriptome projects. We have explored the adult female transcriptome of one important New World malaria vector, An. albimanus. We identified protein-coding transcripts involved in biological processes that may be relevant to the Plasmodium lifecycle and can serve as the starting point for searching targets for novel control strategies. Our data increase the available genomic information regarding An. albimanus several hundred-fold, and will facilitate molecular research in medical entomology, evolutionary biology, genomics and proteomics of anopheline mosquito vectors. The data reported in this manuscript is accessible to the community via the VectorBase website (http://www.vectorbase.org/Other/AdditionalOrganisms/).
NASA Astrophysics Data System (ADS)
Arunachalam, S.
2010-10-01
Open access brings greater visibility and impact to the work of scientists as is evidenced in the examples discussed in this paper. Researchers are often reluctant and afraid to deposit their works in Institutional Repositories. However, as is shown here, once they do so, they do not regret it. Open access will shortly become the norm and will be accepted by the vast majority of scientists. Seen through the lens of the philosophy of Bertrand Russell, the moral, economic and philosophical imperatives for open access are indeed strong.
Efficient Site-Specific Labeling of Proteins via Cysteines
Kim, Younggyu; Ho, Sam O.; Gassman, Natalie R.; Korlann, You; Landorf, Elizabeth V.; Collart, Frank R.; Weiss, Shimon
2011-01-01
Methods for chemical modifications of proteins have been crucial for the advancement of proteomics. In particular, site-specific covalent labeling of proteins with fluorophores and other moieties has permitted the development of a multitude of assays for proteome analysis. A common approach for such a modification is solvent-accessible cysteine labeling using thiol-reactive dyes. Cysteine is very attractive for site-specific conjugation due to its relative rarity throughout the proteome and the ease of its introduction into a specific site along the protein's amino acid chain. This is achieved by site-directed mutagenesis, most often without perturbing the protein's function. Bottlenecks in this reaction, however, include the maintenance of reactive thiol groups without oxidation before the reaction, and the effective removal of unreacted molecules prior to fluorescence studies. Here, we describe an efficient, specific, and rapid procedure for cysteine labeling starting from well-reduced proteins in the solid state. The efficacy and specificity of the improved procedure are estimated using a variety of single-cysteine proteins and thiol-reactive dyes. Based on UV/vis absorbance spectra, coupling efficiencies are typically in the range 70–90%, and specificities are better than ~95%. The labeled proteins are evaluated using fluorescence assays, proving that the covalent modification does not alter their function. In addition to maleimide-based conjugation, this improved procedure may be used for other thiol-reactive conjugations such as haloacetyl, alkyl halide, and disulfide interchange derivatives. This facile and rapid procedure is well suited for high throughput proteome analysis. PMID:18275130
Efficient site-specific labeling of proteins via cysteines.
Kim, Younggyu; Ho, Sam O; Gassman, Natalie R; Korlann, You; Landorf, Elizabeth V; Collart, Frank R; Weiss, Shimon
2008-03-01
Methods for chemical modifications of proteins have been crucial for the advancement of proteomics. In particular, site-specific covalent labeling of proteins with fluorophores and other moieties has permitted the development of a multitude of assays for proteome analysis. A common approach for such a modification is solvent-accessible cysteine labeling using thiol-reactive dyes. Cysteine is very attractive for site-specific conjugation due to its relative rarity throughout the proteome and the ease of its introduction into a specific site along the protein's amino acid chain. This is achieved by site-directed mutagenesis, most often without perturbing the protein's function. Bottlenecks in this reaction, however, include the maintenance of reactive thiol groups without oxidation before the reaction, and the effective removal of unreacted molecules prior to fluorescence studies. Here, we describe an efficient, specific, and rapid procedure for cysteine labeling starting from well-reduced proteins in the solid state. The efficacy and specificity of the improved procedure are estimated using a variety of single-cysteine proteins and thiol-reactive dyes. Based on UV/vis absorbance spectra, coupling efficiencies are typically in the range 70-90%, and specificities are better than approximately 95%. The labeled proteins are evaluated using fluorescence assays, proving that the covalent modification does not alter their function. In addition to maleimide-based conjugation, this improved procedure may be used for other thiol-reactive conjugations such as haloacetyl, alkyl halide, and disulfide interchange derivatives. This facile and rapid procedure is well suited for high throughput proteome analysis.
Petyuk, Vladislav A.; Qian, Wei-Jun; Hinault, Charlotte; Gritsenko, Marina A.; Singhal, Mudita; Monroe, Matthew E.; Camp, David G.; Kulkarni, Rohit N.; Smith, Richard D.
2009-01-01
The pancreatic islets of Langerhans, and especially the insulin-producing beta cells, play a central role in the maintenance of glucose homeostasis. Alterations in the expression of multiple proteins in the islets that contribute to the maintenance of islet function are likely to underlie the pathogenesis of type 2 diabetes. To identify proteins that constitute the islet proteome, we provide the first comprehensive proteomic characterization of pancreatic islets for mouse, the most commonly used animal model in diabetes research. Using strong cation exchange fractionation coupled with reversed phase LC-MS/MS we report the confident identification of 17,350 different tryptic peptides covering 2,612 proteins having at least two unique peptides per protein. The dataset also identified ~60 post-translationally modified peptides including oxidative modifications and phosphorylation. While many of the identified phosphorylation sites corroborate those previously known, the oxidative modifications observed on cysteinyl residues reveal potentially novel information suggesting a role for oxidative stress in islet function. Comparative analysis with 15 available proteomic datasets from other mouse tissues and cells revealed a set of 133 proteins predominantly expressed in pancreatic islets. This unique set of proteins, in addition to those with known functions such as peptide hormones secreted from the islets, contains several proteins with as yet unknown functions. The mouse islet protein and peptide database accessible at http://ncrr.pnl.gov, provides an important reference resource for the research community to facilitate research in the diabetes and metabolism fields. PMID:18570455
Van Coillie, Samya; Liang, Lunxi; Zhang, Yao; Wang, Huanbin; Fang, Jing-Yuan; Xu, Jie
2016-04-05
High-throughput methods such as co-immunoprecipitationmass spectrometry (coIP-MS) and yeast 2 hybridization (Y2H) have suggested a broad range of unannotated protein-protein interactions (PPIs), and interpretation of these PPIs remains a challenging task. The advancements in cancer genomic researches allow for the inference of "coactivation pairs" in cancer, which may facilitate the identification of PPIs involved in cancer. Here we present OncoBinder as a tool for the assessment of proteomic interaction data based on the functional synergy of oncoproteins in cancer. This decision tree-based method combines gene mutation, copy number and mRNA expression information to infer the functional status of protein-coding genes. We applied OncoBinder to evaluate the potential binders of EGFR and ERK2 proteins based on the gastric cancer dataset of The Cancer Genome Atlas (TCGA). As a result, OncoBinder identified high confidence interactions (annotated by Kyoto Encyclopedia of Genes and Genomes (KEGG) or validated by low-throughput assays) more efficiently than co-expression based method. Taken together, our results suggest that evaluation of gene functional synergy in cancer may facilitate the interpretation of proteomic interaction data. The OncoBinder toolbox for Matlab is freely accessible online.
Libraries of Peptide Fragmentation Mass Spectra Database
National Institute of Standards and Technology Data Gateway
SRD 1C NIST Libraries of Peptide Fragmentation Mass Spectra Database (Web, free access) The purpose of the library is to provide peptide reference data for laboratories employing mass spectrometry-based proteomics methods for protein analysis. Mass spectral libraries identify these compounds in a more sensitive and robust manner than alternative methods. These databases are freely available for testing and development of new applications.
BIG: a large-scale data integration tool for renal physiology.
Zhao, Yue; Yang, Chin-Rang; Raghuram, Viswanathan; Parulekar, Jaya; Knepper, Mark A
2016-10-01
Due to recent advances in high-throughput techniques, we and others have generated multiple proteomic and transcriptomic databases to describe and quantify gene expression, protein abundance, or cellular signaling on the scale of the whole genome/proteome in kidney cells. The existence of so much data from diverse sources raises the following question: "How can researchers find information efficiently for a given gene product over all of these data sets without searching each data set individually?" This is the type of problem that has motivated the "Big-Data" revolution in Data Science, which has driven progress in fields such as marketing. Here we present an online Big-Data tool called BIG (Biological Information Gatherer) that allows users to submit a single online query to obtain all relevant information from all indexed databases. BIG is accessible at http://big.nhlbi.nih.gov/.
Integrating Mass Spectrometry of Intact Protein Complexes into Structural Proteomics
Hyung, Suk-Joon; Ruotolo, Brandon T.
2013-01-01
Summary Mass spectrometry analysis of intact protein complexes has emerged as an established technology for assessing the composition and connectivity within dynamic, heterogeneous multiprotein complexes at low concentrations and in the context of mixtures. As this technology continues to move forward, one of the main challenges is to integrate the information content of such intact protein complex measurements with other mass spectrometry approaches in structural biology. Methods such as H/D exchange, oxidative foot-printing, chemical cross-linking, affinity purification, and ion mobility separation add complementary information that allows access to every level of protein structure and organization. Here, we survey the structural information that can be retrieved by such experiments, demonstrate the applicability of integrative mass spectrometry approaches in structural proteomics, and look to the future to explore upcoming innovations in this rapidly-advancing area. PMID:22611037
An Integrated Systems Genetics and Omics Toolkit to Probe Gene Function.
Li, Hao; Wang, Xu; Rukina, Daria; Huang, Qingyao; Lin, Tao; Sorrentino, Vincenzo; Zhang, Hongbo; Bou Sleiman, Maroun; Arends, Danny; McDaid, Aaron; Luan, Peiling; Ziari, Naveed; Velázquez-Villegas, Laura A; Gariani, Karim; Kutalik, Zoltan; Schoonjans, Kristina; Radcliffe, Richard A; Prins, Pjotr; Morgenthaler, Stephan; Williams, Robert W; Auwerx, Johan
2018-01-24
Identifying genetic and environmental factors that impact complex traits and common diseases is a high biomedical priority. Here, we developed, validated, and implemented a series of multi-layered systems approaches, including (expression-based) phenome-wide association, transcriptome-/proteome-wide association, and (reverse-) mediation analysis, in an open-access web server (systems-genetics.org) to expedite the systems dissection of gene function. We applied these approaches to multi-omics datasets from the BXD mouse genetic reference population, and identified and validated associations between genes and clinical and molecular phenotypes, including previously unreported links between Rpl26 and body weight, and Cpt1a and lipid metabolism. Furthermore, through mediation and reverse-mediation analysis we established regulatory relations between genes, such as the co-regulation of BCKDHA and BCKDHB protein levels, and identified targets of transcription factors E2F6, ZFP277, and ZKSCAN1. Our multifaceted toolkit enabled the identification of gene-gene and gene-phenotype links that are robust and that translate well across populations and species, and can be universally applied to any populations with multi-omics datasets. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Respiratory cancer database: An open access database of respiratory cancer gene and miRNA.
Choubey, Jyotsna; Choudhari, Jyoti Kant; Patel, Ashish; Verma, Mukesh Kumar
2017-01-01
Respiratory cancer database (RespCanDB) is a genomic and proteomic database of cancer of respiratory organ. It also includes the information of medicinal plants used for the treatment of various respiratory cancers with structure of its active constituents as well as pharmacological and chemical information of drug associated with various respiratory cancers. Data in RespCanDB has been manually collected from published research article and from other databases. Data has been integrated using MySQL an object-relational database management system. MySQL manages all data in the back-end and provides commands to retrieve and store the data into the database. The web interface of database has been built in ASP. RespCanDB is expected to contribute to the understanding of scientific community regarding respiratory cancer biology as well as developments of new way of diagnosing and treating respiratory cancer. Currently, the database consist the oncogenomic information of lung cancer, laryngeal cancer, and nasopharyngeal cancer. Data for other cancers, such as oral and tracheal cancers, will be added in the near future. The URL of RespCanDB is http://ridb.subdic-bioinformatics-nitrr.in/.
Newt-omics: a comprehensive repository for omics data from the newt Notophthalmus viridescens
Bruckskotten, Marc; Looso, Mario; Reinhardt, Richard; Braun, Thomas; Borchardt, Thilo
2012-01-01
Notophthalmus viridescens, a member of the salamander family is an excellent model organism to study regenerative processes due to its unique ability to replace lost appendages and to repair internal organs. Molecular insights into regenerative events have been severely hampered by the lack of genomic, transcriptomic and proteomic data, as well as an appropriate database to store such novel information. Here, we describe ‘Newt-omics’ (http://newt-omics.mpi-bn.mpg.de), a database, which enables researchers to locate, retrieve and store data sets dedicated to the molecular characterization of newts. Newt-omics is a transcript-centred database, based on an Expressed Sequence Tag (EST) data set from the newt, covering ∼50 000 Sanger sequenced transcripts and a set of high-density microarray data, generated from regenerating hearts. Newt-omics also contains a large set of peptides identified by mass spectrometry, which was used to validate 13 810 ESTs as true protein coding. Newt-omics is open to implement additional high-throughput data sets without changing the database structure. Via a user-friendly interface Newt-omics allows access to a huge set of molecular data without the need for prior bioinformatical expertise. PMID:22039101
Mobilising Open Access to Research Data: Recommendations from the RECODE project
NASA Astrophysics Data System (ADS)
Finn, Rachel; Sveinsdottir, Thordis
2015-04-01
This paper will introduce the findings and policy recommendations from the FP7 project RECODE (Policy RECommendations for Open Access to Research Data in Europe) which aims to leverage existing networks, communities and projects to address challenges within the open access and data dissemination and preservation sector. We will introduce the key recommendations, which provide solutions relevant to opening access to PSI. The project is built on case study research of five scientific disciplines with the aim of recognizing and working with disciplinary fragmentation associated with open access to research data. The RECODE findings revealed that the mobilisation of open access to research data requires a partnership approach for developing a coherent and flexible ecosystem that is easy and transparent to embed in research practice and process. As such, the development of open access to research data needs to be: • Informed by research practices and processes in different fields • Supported by an integrated institutional and technological data infrastructure and guided by ethical and regulatory frameworks • Underpinned by infrastructure and guiding frameworks that allow for differences in disciplinary research and data management practices • Characterised by a partnership approach involving the key stakeholders, researchers, and institutions The proposed presentation will examine each of these aspects in detail and use information and good practices from the RECODE project to consider how stakeholders within the PSI movement might action each of these points. It will also highlight areas where RECODE findings and good practice recommendations have clear relevance for the PSI sector.
30 CFR 291.110 - Who may MMS require to produce information?
Code of Federal Regulations, 2010 CFR
2010-07-01
... OPEN AND NONDISCRIMINATORY ACCESS TO OIL AND GAS PIPELINES UNDER THE OUTER CONTINENTAL SHELF LANDS ACT... make a decision on whether open access or nondiscriminatory access was denied. (b) If you are a party...
30 CFR 291.110 - Who may MMS require to produce information?
Code of Federal Regulations, 2011 CFR
2011-07-01
..., DEPARTMENT OF THE INTERIOR APPEALS OPEN AND NONDISCRIMINATORY ACCESS TO OIL AND GAS PIPELINES UNDER THE OUTER... believes is necessary to make a decision on whether open access or nondiscriminatory access was denied. (b...
30 CFR 291.111 - How does MMS treat the confidential information I provide?
Code of Federal Regulations, 2011 CFR
2011-07-01
..., AND ENFORCEMENT, DEPARTMENT OF THE INTERIOR APPEALS OPEN AND NONDISCRIMINATORY ACCESS TO OIL AND GAS... to inform a decision on whether open access or nondiscriminatory access was denied may claim that...
Moorhead, Laura L; Holzmeyer, Cheryl; Maggio, Lauren A; Steinberg, Ryan M; Willinsky, John
2015-01-01
Through funding agency and publisher policies, an increasing proportion of the health sciences literature is being made open access. Such an increase in access raises questions about the awareness and potential utilization of this literature by those working in health fields. A sample of physicians (N=336) and public health non-governmental organization (NGO) staff (N=92) were provided with relatively complete access to the research literature indexed in PubMed, as well as access to the point-of-care service UpToDate, for up to one year, with their usage monitored through the tracking of web-log data. The physicians also participated in a one-month trial of relatively complete or limited access. The study found that participants' research interests were not satisfied by article abstracts alone nor, in the case of the physicians, by a clinical summary service such as UpToDate. On average, a third of the physicians viewed research a little more frequently than once a week, while two-thirds of the public health NGO staff viewed more than three articles a week. Those articles were published since the 2008 adoption of the NIH Public Access Policy, as well as prior to 2008 and during the maximum 12-month embargo period. A portion of the articles in each period was already open access, but complete access encouraged a viewing of more research articles. Those working in health fields will utilize more research in the course of their work as a result of (a) increasing open access to research, (b) improving awareness of and preparation for this access, and (c) adjusting public and open access policies to maximize the extent of potential access, through reduction in embargo periods and access to pre-policy literature.
Tsay, Ming-Yueh; Wu, Tai-Luan; Tseng, Ling-Li
2017-01-01
This study examines the completeness and overlap of coverage in physics of six open access scholarly communication systems, including two search engines (Google Scholar and Microsoft Academic), two aggregate institutional repositories (OAIster and OpenDOAR), and two physics-related open sources (arXiv.org and Astrophysics Data System). The 2001-2013 Nobel Laureates in Physics served as the sample. Bibliographic records of their publications were retrieved and downloaded from each system, and a computer program was developed to perform the analytical tasks of sorting, comparison, elimination, aggregation and statistical calculations. Quantitative analyses and cross-referencing were performed to determine the completeness and overlap of the system coverage of the six open access systems. The results may enable scholars to select an appropriate open access system as an efficient scholarly communication channel, and academic institutions may build institutional repositories or independently create citation index systems in the future. Suggestions on indicators and tools for academic assessment are presented based on the comprehensiveness assessment of each system.
An automated method for detecting alternatively spliced protein domains.
Coelho, Vitor; Sammeth, Michael
2018-06-01
Alternative splicing (AS) has been demonstrated to play a role in shaping eukaryotic gene diversity at the transcriptional level. However, the impact of AS on the proteome is still controversial. Studies that seek to explore the effect of AS at the proteomic level are hampered by technical difficulties in the cumbersome process of casting forth and back between genome, transcriptome and proteome space coordinates, and the naïve prediction of protein domains in the presence of AS suffers many redundant sequence scans that emerge from constitutively spliced regions that are shared between alternative products of a gene. We developed the AstaFunk pipeline that computes for every generic transcriptome all domains that are altered by AS events in a systematic and efficient manner. In a nutshell, our method employs Viterbi dynamic programming, which guarantees to find all score-optimal hits of the domains under consideration, while complementary optimisations at different levels avoid redundant and other irrelevant computations. We evaluate AstaFunk qualitatively and quantitatively using RNAseq in well-studied genes with AS, and on large-scale employing entire transcriptomes. Our study confirms complementary reports that the effect of most AS events on the proteome seems to be rather limited, but our results also pinpoint several cases where AS could have a major impact on the function of a protein domain. The JAVA implementation of AstaFunk is available as an open source project on http://astafunk.sammeth.net. micha@sammeth.net. Supplementary data are available at Bioinformatics online.