Science.gov

Sample records for distributed relevance ranking

  1. DOE SBIR Phase II Final Report: Distributed Relevance Ranking in Heterogeneous Document Collections

    SciTech Connect

    Abe Lederman

    2007-01-08

    This report contains the comprehensive summary of the work performed on the SBIR Phase II project (“Distributed Relevance Ranking in Heterogeneous Document Collections”) at Deep Web Technologies (http://www.deepwebtech.com). We have successfully completed all of the tasks defined in our SBIR Proposal work plan (See Table 1 - Phase II Tasks Status). The project was completed on schedule and we have successfully deployed an initial production release of the software architecture at DOE-OSTI for the Science.gov Alliance's search portal (http://www.science.gov). We have implemented a set of grid services that supports the extraction, filtering, aggregation, and presentation of search results from numerous heterogeneous document collections. Illustration 3 depicts the services required to perform QuickRank™ filtering of content as defined in our architecture documentation. Functionality that has been implemented is indicated by the services highlighted in green. We have successfully tested our implementation in a multi-node grid deployment both within the Deep Web Technologies offices, and in a heterogeneous geographically distributed grid environment. We have performed a series of load tests in which we successfully simulated 100 concurrent users submitting search requests to the system. This testing was performed on deployments of one, two, and three node grids with services distributed in a number of different configurations. The preliminary results from these tests indicate that our architecture will scale well across multi-node grid deployments, but more work will be needed, beyond the scope of this project, to perform testing and experimentation to determine scalability and resiliency requirements. We are pleased to report that a production quality version (1.4) of the science.gov Alliance's search portal based on our grid architecture was released in June of 2006. This demonstration portal is currently available at http://science.gov/search30 . The portal

  2. Rank distributions: Frequency vs. magnitude.

    PubMed

    Velarde, Carlos; Robledo, Alberto

    2017-01-01

    We examine the relationship between two different types of ranked data, frequencies and magnitudes. We consider data that can be sorted out either way, through numbers of occurrences or size of the measures, as it is the case, say, of moon craters, earthquakes, billionaires, etc. We indicate that these two types of distributions are functional inverses of each other, and specify this link, first in terms of the assumed parent probability distribution that generates the data samples, and then in terms of an analog (deterministic) nonlinear iterated map that reproduces them. For the particular case of hyperbolic decay with rank the distributions are identical, that is, the classical Zipf plot, a pure power law. But their difference is largest when one displays logarithmic decay and its counterpart shows the inverse exponential decay, as it is the case of Benford law, or viceversa. For all intermediate decay rates generic differences appear not only between the power-law exponents for the midway rank decline but also for small and large rank. We extend the theoretical framework to include thermodynamic and statistical-mechanical concepts, such as entropies and configuration.

  3. Ranking Biomedical Annotations with Annotator's Semantic Relevancy

    PubMed Central

    2014-01-01

    Biomedical annotation is a common and affective artifact for researchers to discuss, show opinion, and share discoveries. It becomes increasing popular in many online research communities, and implies much useful information. Ranking biomedical annotations is a critical problem for data user to efficiently get information. As the annotator's knowledge about the annotated entity normally determines quality of the annotations, we evaluate the knowledge, that is, semantic relationship between them, in two ways. The first is extracting relational information from credible websites by mining association rules between an annotator and a biomedical entity. The second way is frequent pattern mining from historical annotations, which reveals common features of biomedical entities that an annotator can annotate with high quality. We propose a weighted and concept-extended RDF model to represent an annotator, a biomedical entity, and their background attributes and merge information from the two ways as the context of an annotator. Based on that, we present a method to rank the annotations by evaluating their correctness according to user's vote and the semantic relevancy between the annotator and the annotated entity. The experimental results show that the approach is applicable and efficient even when data set is large. PMID:24899918

  4. Ranking biomedical annotations with annotator's semantic relevancy.

    PubMed

    Wu, Aihua

    2014-01-01

    Biomedical annotation is a common and affective artifact for researchers to discuss, show opinion, and share discoveries. It becomes increasing popular in many online research communities, and implies much useful information. Ranking biomedical annotations is a critical problem for data user to efficiently get information. As the annotator's knowledge about the annotated entity normally determines quality of the annotations, we evaluate the knowledge, that is, semantic relationship between them, in two ways. The first is extracting relational information from credible websites by mining association rules between an annotator and a biomedical entity. The second way is frequent pattern mining from historical annotations, which reveals common features of biomedical entities that an annotator can annotate with high quality. We propose a weighted and concept-extended RDF model to represent an annotator, a biomedical entity, and their background attributes and merge information from the two ways as the context of an annotator. Based on that, we present a method to rank the annotations by evaluating their correctness according to user's vote and the semantic relevancy between the annotator and the annotated entity. The experimental results show that the approach is applicable and efficient even when data set is large.

  5. Relevancy Ranking of Satellite Dataset Search Results

    NASA Technical Reports Server (NTRS)

    Lynnes, Christopher; Quinn, Patrick; Norton, James

    2017-01-01

    As the Variety of Earth science datasets increases, science researchers find it more challenging to discover and select the datasets that best fit their needs. The most common way of search providers to address this problem is to rank the datasets returned for a query by their likely relevance to the user. Large web page search engines typically use text matching supplemented with reverse link counts, semantic annotations and user intent modeling. However, this produces uneven results when applied to dataset metadata records simply externalized as a web page. Fortunately, data and search provides have decades of experience in serving data user communities, allowing them to form heuristics that leverage the structure in the metadata together with knowledge about the user community. Some of these heuristics include specific ways of matching the user input to the essential measurements in the dataset and determining overlaps of time range and spatial areas. Heuristics based on the novelty of the datasets can prioritize later, better versions of data over similar predecessors. And knowledge of how different user types and communities use data can be brought to bear in cases where characteristics of the user (discipline, expertise) or their intent (applications, research) can be divined. The Earth Observing System Data and Information System has begun implementing some of these heuristics in the relevancy algorithm of its Common Metadata Repository search engine.

  6. Incidence of q statistics in rank distributions

    PubMed Central

    Yalcin, G. Cigdem; Robledo, Alberto; Gell-Mann, Murray

    2014-01-01

    We show that size-rank distributions with power-law decay (often only over a limited extent) observed in a vast number of instances in a widespread family of systems obey Tsallis statistics. The theoretical framework for these distributions is analogous to that of a nonlinear iterated map near a tangent bifurcation for which the Lyapunov exponent is negligible or vanishes. The relevant statistical–mechanical expressions associated with these distributions are derived from a maximum entropy principle with the use of two different constraints, and the resulting duality of entropy indexes is seen to portray physically relevant information. Whereas the value of the index α fixes the distribution’s power-law exponent, that for the dual index 2 − α ensures the extensivity of the deformed entropy. PMID:25189773

  7. Relevance Preserving Projection and Ranking for Web Image Search Reranking.

    PubMed

    Ji, Zhong; Pang, Yanwei; Li, Xuelong

    2015-11-01

    An image search reranking (ISR) technique aims at refining text-based search results by mining images' visual content. Feature extraction and ranking function design are two key steps in ISR. Inspired by the idea of hypersphere in one-class classification, this paper proposes a feature extraction algorithm named hypersphere-based relevance preserving projection (HRPP) and a ranking function called hypersphere-based rank (H-Rank). Specifically, an HRPP is a spectral embedding algorithm to transform an original high-dimensional feature space into an intrinsically low-dimensional hypersphere space by preserving the manifold structure and a relevance relationship among the images. An H-Rank is a simple but effective ranking algorithm to sort the images by their distances to the hypersphere center. Moreover, to capture the user's intent with minimum human interaction, a reversed k-nearest neighbor (KNN) algorithm is proposed, which harvests enough pseudorelevant images by requiring that the user gives only one click on the initially searched images. The HRPP method with reversed KNN is named one-click-based HRPP (OC-HRPP). Finally, an OC-HRPP algorithm and the H-Rank algorithm form a new ISR method, H-reranking. Extensive experimental results on three large real-world data sets show that the proposed algorithms are effective. Moreover, the fact that only one relevant image is required to be labeled makes it has a strong practical significance.

  8. The LAILAPS search engine: relevance ranking in life science databases.

    PubMed

    Lange, Matthias; Spies, Karl; Bargsten, Joachim; Haberhauer, Gregor; Klapperstück, Matthias; Leps, Michael; Weinel, Christian; Wünschiers, Röbbe; Weissbach, Mandy; Stein, Jens; Scholz, Uwe

    2010-01-15

    Search engines and retrieval systems are popular tools at a life science desktop. The manual inspection of hundreds of database entries, that reflect a life science concept or fact, is a time intensive daily work. Hereby, not the number of query results matters, but the relevance does. In this paper, we present the LAILAPS search engine for life science databases. The concept is to combine a novel feature model for relevance ranking, a machine learning approach to model user relevance profiles, ranking improvement by user feedback tracking and an intuitive and slim web user interface, that estimates relevance rank by tracking user interactions. Queries are formulated as simple keyword lists and will be expanded by synonyms. Supporting a flexible text index and a simple data import format, LAILAPS can easily be used both as search engine for comprehensive integrated life science databases and for small in-house project databases. With a set of features, extracted from each database hit in combination with user relevance preferences, a neural network predicts user specific relevance scores. Using expert knowledge as training data for a predefined neural network or using users own relevance training sets, a reliable relevance ranking of database hits has been implemented. In this paper, we present the LAILAPS system, the concepts, benchmarks and use cases. LAILAPS is public available for SWISSPROT data at http://lailaps.ipk-gatersleben.de.

  9. Rank distributions: a panoramic macroscopic outlook.

    PubMed

    Eliazar, Iddo I; Cohen, Morrel H

    2014-01-01

    This paper presents a panoramic macroscopic outlook of rank distributions. We establish a general framework for the analysis of rank distributions, which classifies them into five macroscopic "socioeconomic" states: monarchy, oligarchy-feudalism, criticality, socialism-capitalism, and communism. Oligarchy-feudalism is shown to be characterized by discrete macroscopic rank distributions, and socialism-capitalism is shown to be characterized by continuous macroscopic size distributions. Criticality is a transition state between oligarchy-feudalism and socialism-capitalism, which can manifest allometric scaling with multifractal spectra. Monarchy and communism are extreme forms of oligarchy-feudalism and socialism-capitalism, respectively, in which the intrinsic randomness vanishes. The general framework is applied to three different models of rank distributions-top-down, bottom-up, and global-and unveils each model's macroscopic universality and versatility. The global model yields a macroscopic classification of the generalized Zipf law, an omnipresent form of rank distributions observed across the sciences. An amalgamation of the three models establishes a universal rank-distribution explanation for the macroscopic emergence of a prevalent class of continuous size distributions, ones governed by unimodal densities with both Pareto and inverse-Pareto power-law tails.

  10. Rank distributions: A panoramic macroscopic outlook

    NASA Astrophysics Data System (ADS)

    Eliazar, Iddo I.; Cohen, Morrel H.

    2014-01-01

    This paper presents a panoramic macroscopic outlook of rank distributions. We establish a general framework for the analysis of rank distributions, which classifies them into five macroscopic "socioeconomic" states: monarchy, oligarchy-feudalism, criticality, socialism-capitalism, and communism. Oligarchy-feudalism is shown to be characterized by discrete macroscopic rank distributions, and socialism-capitalism is shown to be characterized by continuous macroscopic size distributions. Criticality is a transition state between oligarchy-feudalism and socialism-capitalism, which can manifest allometric scaling with multifractal spectra. Monarchy and communism are extreme forms of oligarchy-feudalism and socialism-capitalism, respectively, in which the intrinsic randomness vanishes. The general framework is applied to three different models of rank distributions—top-down, bottom-up, and global—and unveils each model's macroscopic universality and versatility. The global model yields a macroscopic classification of the generalized Zipf law, an omnipresent form of rank distributions observed across the sciences. An amalgamation of the three models establishes a universal rank-distribution explanation for the macroscopic emergence of a prevalent class of continuous size distributions, ones governed by unimodal densities with both Pareto and inverse-Pareto power-law tails.

  11. Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval

    PubMed Central

    Yoo, Sooyoung

    2011-01-01

    Objectives The purpose of this study was to investigate the effects of query expansion algorithms for MEDLINE retrieval within a pseudo-relevance feedback framework. Methods A number of query expansion algorithms were tested using various term ranking formulas, focusing on query expansion based on pseudo-relevance feedback. The OHSUMED test collection, which is a subset of the MEDLINE database, was used as a test corpus. Various ranking algorithms were tested in combination with different term re-weighting algorithms. Results Our comprehensive evaluation showed that the local context analysis ranking algorithm, when used in combination with one of the reweighting algorithms - Rocchio, the probabilistic model, and our variants - significantly outperformed other algorithm combinations by up to 12% (paired t-test; p < 0.05). In a pseudo-relevance feedback framework, effective query expansion would be achieved by the careful consideration of term ranking and re-weighting algorithm pairs, at least in the context of the OHSUMED corpus. Conclusions Comparative experiments on term ranking algorithms were performed in the context of a subset of MEDLINE documents. With medical documents, local context analysis, which uses co-occurrence with all query terms, significantly outperformed various term ranking methods based on both frequency and distribution analyses. Furthermore, the results of the experiments demonstrated that the term rank-based re-weighting method contributed to a remarkable improvement in mean average precision. PMID:21886873

  12. Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval.

    PubMed

    Yoo, Sooyoung; Choi, Jinwook

    2011-06-01

    The purpose of this study was to investigate the effects of query expansion algorithms for MEDLINE retrieval within a pseudo-relevance feedback framework. A number of query expansion algorithms were tested using various term ranking formulas, focusing on query expansion based on pseudo-relevance feedback. The OHSUMED test collection, which is a subset of the MEDLINE database, was used as a test corpus. Various ranking algorithms were tested in combination with different term re-weighting algorithms. Our comprehensive evaluation showed that the local context analysis ranking algorithm, when used in combination with one of the reweighting algorithms - Rocchio, the probabilistic model, and our variants - significantly outperformed other algorithm combinations by up to 12% (paired t-test; p < 0.05). In a pseudo-relevance feedback framework, effective query expansion would be achieved by the careful consideration of term ranking and re-weighting algorithm pairs, at least in the context of the OHSUMED corpus. Comparative experiments on term ranking algorithms were performed in the context of a subset of MEDLINE documents. With medical documents, local context analysis, which uses co-occurrence with all query terms, significantly outperformed various term ranking methods based on both frequency and distribution analyses. Furthermore, the results of the experiments demonstrated that the term rank-based re-weighting method contributed to a remarkable improvement in mean average precision.

  13. Heuristics for Relevancy Ranking of Earth Dataset Search Results

    NASA Astrophysics Data System (ADS)

    Lynnes, C.; Quinn, P.; Norton, J.

    2016-12-01

    As the Variety of Earth science datasets increases, science researchers find it more challenging to discover and select the datasets that best fit their needs. The most common way of search providers to address this problem is to rank the datasets returned for a query by their likely relevance to the user. Large web page search engines typically use text matching supplemented with reverse link counts, semantic annotations and user intent modeling. However, this produces uneven results when applied to dataset metadata records simply externalized as a web page. Fortunately, data and search provides have decades of experience in serving data user communities, allowing them to form heuristics that leverage the structure in the metadata together with knowledge about the user community. Some of these heuristics include specific ways of matching the user input to the essential measurements in the dataset and determining overlaps of time range and spatial areas. Heuristics based on the novelty of the datasets can prioritize later, better versions of data over similar predecessors. And knowledge of how different user types and communities use data can be brought to bear in cases where characteristics of the user (discipline, expertise) or their intent (applications, research) can be divined. The Earth Observing System Data and Information System has begun implementing some of these heuristics in the relevancy algorithm of its Common Metadata Repository search engine.

  14. Heuristics for Relevancy Ranking of Earth Dataset Search Results

    NASA Technical Reports Server (NTRS)

    Lynnes, Christopher; Quinn, Patrick; Norton, James

    2016-01-01

    As the Variety of Earth science datasets increases, science researchers find it more challenging to discover and select the datasets that best fit their needs. The most common way of search providers to address this problem is to rank the datasets returned for a query by their likely relevance to the user. Large web page search engines typically use text matching supplemented with reverse link counts, semantic annotations and user intent modeling. However, this produces uneven results when applied to dataset metadata records simply externalized as a web page. Fortunately, data and search provides have decades of experience in serving data user communities, allowing them to form heuristics that leverage the structure in the metadata together with knowledge about the user community. Some of these heuristics include specific ways of matching the user input to the essential measurements in the dataset and determining overlaps of time range and spatial areas. Heuristics based on the novelty of the datasets can prioritize later, better versions of data over similar predecessors. And knowledge of how different user types and communities use data can be brought to bear in cases where characteristics of the user (discipline, expertise) or their intent (applications, research) can be divined. The Earth Observing System Data and Information System has begun implementing some of these heuristics in the relevancy algorithm of its Common Metadata Repository search engine.

  15. Development of geopolitically relevant ranking criteria for geoengineering methods

    NASA Astrophysics Data System (ADS)

    Boyd, Philip W.

    2016-11-01

    A decade has passed since Paul Crutzen published his editorial essay on the potential for stratospheric geoengineering to cool the climate in the Anthropocene. He synthesized the effects of the 1991 Pinatubo eruption on the planet's radiative budget and used this large-scale event to broaden and deepen the debate on the challenges and opportunities of large-scale geoengineering. Pinatubo had pronounced effects, both in the short and longer term (months to years), on the ocean, land, and the atmosphere. This rich set of data on how a large-scale natural event influences many regional and global facets of the Earth System provides a comprehensive viewpoint to assess the wider ramifications of geoengineering. Here, I use the Pinatubo archives to develop a range of geopolitically relevant ranking criteria for a suite of different geoengineering approaches. The criteria focus on the spatial scales needed for geoengineering and whether large-scale dispersal is a necessary requirement for a technique to deliver significant cooling or carbon dioxide reductions. These categories in turn inform whether geoengineering approaches are amenable to participation (the "democracy of geoengineering") and whether they will lead to transboundary issues that could precipitate geopolitical conflicts. The criteria provide the requisite detail to demarcate different geoengineering approaches in the context of geopolitics. Hence, they offer another tool that can be used in the development of a more holistic approach to the debate on geoengineering.

  16. The Distribution of the Sum of Signed Ranks

    ERIC Educational Resources Information Center

    Albright, Brian

    2012-01-01

    We describe the calculation of the distribution of the sum of signed ranks and develop an exact recursive algorithm for the distribution as well as an approximation of the distribution using the normal. The results have applications to the non-parametric Wilcoxon signed-rank test.

  17. Prototyping a Distributed Information Retrieval System That Uses Statistical Ranking.

    ERIC Educational Resources Information Center

    Harman, Donna; And Others

    1991-01-01

    Built using a distributed architecture, this prototype distributed information retrieval system uses statistical ranking techniques to provide better service to the end user. Distributed architecture was shown to be a feasible alternative to centralized or CD-ROM information retrieval, and user testing of the ranking methodology showed both…

  18. The Distribution of the Sum of Signed Ranks

    ERIC Educational Resources Information Center

    Albright, Brian

    2012-01-01

    We describe the calculation of the distribution of the sum of signed ranks and develop an exact recursive algorithm for the distribution as well as an approximation of the distribution using the normal. The results have applications to the non-parametric Wilcoxon signed-rank test.

  19. A multimedia retrieval framework based on semi-supervised ranking and relevance feedback.

    PubMed

    Yang, Yi; Nie, Feiping; Xu, Dong; Luo, Jiebo; Zhuang, Yueting; Pan, Yunhe

    2012-04-01

    We present a new framework for multimedia content analysis and retrieval which consists of two independent algorithms. First, we propose a new semi-supervised algorithm called ranking with Local Regression and Global Alignment (LRGA) to learn a robust Laplacian matrix for data ranking. In LRGA, for each data point, a local linear regression model is used to predict the ranking scores of its neighboring points. A unified objective function is then proposed to globally align the local models from all the data points so that an optimal ranking score can be assigned to each data point. Second, we propose a semi-supervised long-term Relevance Feedback (RF) algorithm to refine the multimedia data representation. The proposed long-term RF algorithm utilizes both the multimedia data distribution in multimedia feature space and the history RF information provided by users. A trace ratio optimization problem is then formulated and solved by an efficient algorithm. The algorithms have been applied to several content-based multimedia retrieval applications, including cross-media retrieval, image retrieval, and 3D motion/pose data retrieval. Comprehensive experiments on four data sets have demonstrated its advantages in precision, robustness, scalability, and computational efficiency.

  20. Universality in the tail of musical note rank distribution

    NASA Astrophysics Data System (ADS)

    Beltrán del Río, M.; Cocho, G.; Naumis, G. G.

    2008-09-01

    Although power laws have been used to fit rank distributions in many different contexts, they usually fail at the tails. Languages as sequences of symbols have been a popular subject for ranking distributions, and for this purpose, music can be treated as such. Here we show that more than 1800 musical compositions are very well fitted by the first kind two parameter beta distribution, which arises in the ranking of multiplicative stochastic processes. The parameters a and b are obtained for classical, jazz and rock music, revealing interesting features. Specially, we have obtained a clear trend in the values of the parameters for major and minor tonal modes. Finally, we discuss the distribution of notes for each octave and its connection with the ranking of the notes.

  1. Rank-Size Distribution of Notes in Harmonic Music: Hierarchic Shuffling of Distributions

    NASA Astrophysics Data System (ADS)

    Del Río, Manuel Beltrán; Cocho, Germinal

    We trace the rank size distribution of notes in harmonic music, which on previous works we suggested was much better represented by the Two-parameter, first class Beta distribution than the customary power law, to the ranked mixing of distributions dictated by the harmonic and instrumental nature of the piece. The same representation is shown to arise in other fields by the same type of ranked shuffling of distributions. We include the codon content of intergenic DNA sequences and the ranked distribution of sizes of trees in a determined area as examples. We show that the fittings proposed increase their accuracy with the number of distributions that are mixed and ranked.

  2. Enabling multi-level relevance feedback on PubMed by integrating rank learning into DBMS

    PubMed Central

    2010-01-01

    Background Finding relevant articles from PubMed is challenging because it is hard to express the user's specific intention in the given query interface, and a keyword query typically retrieves a large number of results. Researchers have applied machine learning techniques to find relevant articles by ranking the articles according to the learned relevance function. However, the process of learning and ranking is usually done offline without integrated with the keyword queries, and the users have to provide a large amount of training documents to get a reasonable learning accuracy. This paper proposes a novel multi-level relevance feedback system for PubMed, called RefMed, which supports both ad-hoc keyword queries and a multi-level relevance feedback in real time on PubMed. Results RefMed supports a multi-level relevance feedback by using the RankSVM as the learning method, and thus it achieves higher accuracy with less feedback. RefMed "tightly" integrates the RankSVM into RDBMS to support both keyword queries and the multi-level relevance feedback in real time; the tight coupling of the RankSVM and DBMS substantially improves the processing time. An efficient parameter selection method for the RankSVM is also proposed, which tunes the RankSVM parameter without performing validation. Thereby, RefMed achieves a high learning accuracy in real time without performing a validation process. RefMed is accessible at http://dm.postech.ac.kr/refmed. Conclusions RefMed is the first multi-level relevance feedback system for PubMed, which achieves a high accuracy with less feedback. It effectively learns an accurate relevance function from the user’s feedback and efficiently processes the function to return relevant articles in real time. PMID:20406504

  3. Relevance weighting of tier 1 endocrine screening endpoints by rank order.

    PubMed

    Borgert, Christopher J; Stuchal, Leah D; Mihaich, Ellen M; Becker, Richard A; Bentley, Karin S; Brausch, John M; Coady, Katie; Geter, David R; Gordon, Elliot; Guiney, Patrick D; Hess, Frederick; Holmes, Catherine M; LeBaron, Matthew J; Levine, Steve; Marty, Sue; Mukhi, Sandeep; Neal, Barbara H; Ortego, Lisa S; Saltmiras, David A; Snajdr, Suzanne; Staveley, Jane; Tobia, Abraham

    2014-02-01

    Weight of evidence (WoE) approaches are recommended for interpreting various toxicological data, but few systematic and transparent procedures exist. A hypothesis-based WoE framework was recently published focusing on the U.S. EPA's Tier 1 Endocrine Screening Battery (ESB) as an example. The framework recommends weighting each experimental endpoint according to its relevance for deciding eight hypotheses addressed by the ESB. Here we present detailed rationale for weighting the ESB endpoints according to three rank ordered categories and an interpretive process for using the rankings to reach WoE determinations. Rank 1 was assigned to in vivo endpoints that characterize the fundamental physiological actions for androgen, estrogen, and thyroid activities. Rank 1 endpoints are specific and sensitive for the hypothesis, interpretable without ancillary data, and rarely confounded by artifacts or nonspecific activity. Rank 2 endpoints are specific and interpretable for the hypothesis but less informative than Rank 1, often due to oversensitivity, inclusion of narrowly context-dependent components of the hormonal system (e.g., in vitro endpoints), or confounding by nonspecific activity. Rank 3 endpoints are relevant for the hypothesis but only corroborative of Ranks 1 and 2 endpoints. Rank 3 includes many apical in vivo endpoints that can be affected by systemic toxicity and nonhormonal activity. Although these relevance weight rankings (WREL ) necessarily involve professional judgment, their a priori derivation enhances transparency and renders WoE determinations amenable to methodological scrutiny according to basic scientific premises, characteristics that cannot be assured by processes in which the rationale for decisions is provided post hoc. © 2014 Wiley Periodicals, Inc.

  4. Random Texts Do Not Exhibit the Real Zipf's Law-Like Rank Distribution

    PubMed Central

    Ferrer-i-Cancho, Ramon; Elvevåg, Brita

    2010-01-01

    Background Zipf's law states that the relationship between the frequency of a word in a text and its rank (the most frequent word has rank , the 2nd most frequent word has rank ,…) is approximately linear when plotted on a double logarithmic scale. It has been argued that the law is not a relevant or useful property of language because simple random texts - constructed by concatenating random characters including blanks behaving as word delimiters - exhibit a Zipf's law-like word rank distribution. Methodology/Principal Findings In this article, we examine the flaws of such putative good fits of random texts. We demonstrate - by means of three different statistical tests - that ranks derived from random texts and ranks derived from real texts are statistically inconsistent with the parameters employed to argue for such a good fit, even when the parameters are inferred from the target real text. Our findings are valid for both the simplest random texts composed of equally likely characters as well as more elaborate and realistic versions where character probabilities are borrowed from a real text. Conclusions/Significance The good fit of random texts to real Zipf's law-like rank distributions has not yet been established. Therefore, we suggest that Zipf's law might in fact be a fundamental law in natural languages. PMID:20231884

  5. Assessing introduction risk using species’ rank-abundance distributions

    PubMed Central

    Chan, Farrah T.; Bradie, Johanna; Briski, Elizabeta; Bailey, Sarah A.; Simard, Nathalie; MacIsaac, Hugh J.

    2015-01-01

    Mixed-species assemblages are often unintentionally introduced into new ecosystems. Analysing how assemblage structure varies during transport may provide insights into how introduction risk changes before propagules are released. Characterization of introduction risk is typically based on assessments of colonization pressure (CP, the number of species transported) and total propagule pressure (total PP, the total abundance of propagules released) associated with an invasion vector. Generally, invasion potential following introduction increases with greater CP or total PP. Here, we extend these assessments using rank-abundance distributions to examine how CP : total PP relationships change temporally in ballast water of ocean-going ships. Rank-abundance distributions and CP : total PP patterns varied widely between trans-Atlantic and trans-Pacific voyages, with the latter appearing to pose a much lower risk than the former. Responses also differed by taxonomic group, with invertebrates experiencing losses mainly in total PP, while diatoms and dinoflagellates sustained losses mainly in CP. In certain cases, open-ocean ballast water exchange appeared to increase introduction risk by uptake of new species or supplementation of existing ones. Our study demonstrates that rank-abundance distributions provide new insights into the utility of CP and PP in characterizing introduction risk. PMID:25473007

  6. Assessing introduction risk using species' rank-abundance distributions.

    PubMed

    Chan, Farrah T; Bradie, Johanna; Briski, Elizabeta; Bailey, Sarah A; Simard, Nathalie; MacIsaac, Hugh J

    2015-01-22

    Mixed-species assemblages are often unintentionally introduced into new ecosystems. Analysing how assemblage structure varies during transport may provide insights into how introduction risk changes before propagules are released. Characterization of introduction risk is typically based on assessments of colonization pressure (CP, the number of species transported) and total propagule pressure (total PP, the total abundance of propagules released) associated with an invasion vector. Generally, invasion potential following introduction increases with greater CP or total PP. Here, we extend these assessments using rank-abundance distributions to examine how CP : total PP relationships change temporally in ballast water of ocean-going ships. Rank-abundance distributions and CP : total PP patterns varied widely between trans-Atlantic and trans-Pacific voyages, with the latter appearing to pose a much lower risk than the former. Responses also differed by taxonomic group, with invertebrates experiencing losses mainly in total PP, while diatoms and dinoflagellates sustained losses mainly in CP. In certain cases, open-ocean ballast water exchange appeared to increase introduction risk by uptake of new species or supplementation of existing ones. Our study demonstrates that rank-abundance distributions provide new insights into the utility of CP and PP in characterizing introduction risk.

  7. Inverted rank distributions: Macroscopic statistics, universality classes, and critical exponents

    NASA Astrophysics Data System (ADS)

    Eliazar, Iddo; Cohen, Morrel H.

    2014-01-01

    An inverted rank distribution is an infinite sequence of positive sizes ordered in a monotone increasing fashion. Interlacing together Lorenzian and oligarchic asymptotic analyses, we establish a macroscopic classification of inverted rank distributions into five “socioeconomic” universality classes: communism, socialism, criticality, feudalism, and absolute monarchy. We further establish that: (i) communism and socialism are analogous to a “disordered phase”, feudalism and absolute monarchy are analogous to an “ordered phase”, and criticality is the “phase transition” between order and disorder; (ii) the universality classes are characterized by two critical exponents, one governing the ordered phase, and the other governing the disordered phase; (iii) communism, criticality, and absolute monarchy are characterized by sharp exponent values, and are inherently deterministic; (iv) socialism is characterized by a continuous exponent range, is inherently stochastic, and is universally governed by continuous power-law statistics; (v) feudalism is characterized by a continuous exponent range, is inherently stochastic, and is universally governed by discrete exponential statistics. The results presented in this paper yield a universal macroscopic socioeconophysical perspective of inverted rank distributions.

  8. Enhancing Sketch-Based Image Retrieval by Re-Ranking and Relevance Feedback.

    PubMed

    Xueming Qian; Xianglong Tan; Yuting Zhang; Richang Hong; Meng Wang

    2016-01-01

    A sketch-based image retrieval often needs to optimize the tradeoff between efficiency and precision. Index structures are typically applied to large-scale databases to realize efficient retrievals. However, the performance can be affected by quantization errors. Moreover, the ambiguousness of user-provided examples may also degrade the performance, when compared with traditional image retrieval methods. Sketch-based image retrieval systems that preserve the index structure are challenging. In this paper, we propose an effective sketch-based image retrieval approach with re-ranking and relevance feedback schemes. Our approach makes full use of the semantics in query sketches and the top ranked images of the initial results. We also apply relevance feedback to find more relevant images for the input query sketch. The integration of the two schemes results in mutual benefits and improves the performance of the sketch-based image retrieval.

  9. Ranking initial environmental and human health risk resulting from environmentally relevant nanomaterials.

    PubMed

    O'Brien, Niall; Cummins, Enda

    2010-01-01

    As nanomaterials find increased application in commercial and industrial products and processes so too the potential for release of these novel materials into the environment increases. The characteristics of these materials also may result in novel toxicological actions related to their nanoscale, which will have implications on their ecotoxicological and toxicological limits of exposure and eventual regulation. A framework for nanomaterial risk assessment on regulatory, ecotoxicological and toxicological bases developed from recent exposure and toxicity studies is presented. The release of nanoscale TiO(2), Ag and CeO(2) to the atmosphere and surface waters is assessed against provisional toxicological bench mark doses (BMDs) and critical effect doses (CEDs) developed from best available data. Predicted levels of nanomaterial release to surface waters and the atmosphere resulted in regulatory risk rankings of moderate concern based on worst case provisional regulatory limits. Inhalation and ingestion risk rankings were of very low concern based on the provisional inhalation and ingestion toxicity BMDLs and CEDLs determined for the nanomaterials in question. More toxicological data is needed on nanoscale CeO(2) inhalation to develop a true dose response as in vitro cytotoxicity studies yielded an inhalation risk ranking of lower concern. The moderate to high ecotoxicological risk rankings posed by the release of nanoscale TiO(2) and Ag to surface waters highlights the need for guidance and restriction on the usage and disposal of commercial products containing nanomaterial. The risk rankings presented in this assessment give a first indication of the relative risks posed by the usage and release of these materials into the environment and indicate what materials require further investigation into their nano-specific toxicological actions. As more nano-relevant toxicity studies are published, end-points and risk levels related to nano-specific toxicity actions may

  10. The LAILAPS search engine: a feature model for relevance ranking in life science databases.

    PubMed

    Lange, Matthias; Spies, Karl; Colmsee, Christian; Flemming, Steffen; Klapperstück, Matthias; Scholz, Uwe

    2010-03-25

    Efficient and effective information retrieval in life sciences is one of the most pressing challenge in bioinformatics. The incredible growth of life science databases to a vast network of interconnected information systems is to the same extent a big challenge and a great chance for life science research. The knowledge found in the Web, in particular in life-science databases, are a valuable major resource. In order to bring it to the scientist desktop, it is essential to have well performing search engines. Thereby, not the response time nor the number of results is important. The most crucial factor for millions of query results is the relevance ranking. In this paper, we present a feature model for relevance ranking in life science databases and its implementation in the LAILAPS search engine. Motivated by the observation of user behavior during their inspection of search engine result, we condensed a set of 9 relevance discriminating features. These features are intuitively used by scientists, who briefly screen database entries for potential relevance. The features are both sufficient to estimate the potential relevance, and efficiently quantifiable. The derivation of a relevance prediction function that computes the relevance from this features constitutes a regression problem. To solve this problem, we used artificial neural networks that have been trained with a reference set of relevant database entries for 19 protein queries. Supporting a flexible text index and a simple data import format, this concepts are implemented in the LAILAPS search engine. It can easily be used both as search engine for comprehensive integrated life science databases and for small in-house project databases. LAILAPS is publicly available for SWISSPROT data at http://lailaps.ipk-gatersleben.de.

  11. Power-law and exponential rank distributions: A panoramic Gibbsian perspective

    SciTech Connect

    Eliazar, Iddo

    2015-04-15

    Rank distributions are collections of positive sizes ordered either increasingly or decreasingly. Many decreasing rank distributions, formed by the collective collaboration of human actions, follow an inverse power-law relation between ranks and sizes. This remarkable empirical fact is termed Zipf’s law, and one of its quintessential manifestations is the demography of human settlements — which exhibits a harmonic relation between ranks and sizes. In this paper we present a comprehensive statistical-physics analysis of rank distributions, establish that power-law and exponential rank distributions stand out as optimal in various entropy-based senses, and unveil the special role of the harmonic relation between ranks and sizes. Our results extend the contemporary entropy-maximization view of Zipf’s law to a broader, panoramic, Gibbsian perspective of increasing and decreasing power-law and exponential rank distributions — of which Zipf’s law is one out of four pillars.

  12. FUB at TREC 2008 Relevance Feedback Track: Extending Rocchio with Distributional Term Analysis

    DTIC Science & Technology

    2008-11-01

    1 FUB at TREC 2008 Relevance Feedback Track: Extending Rocchio with Distributional Term Analysis Andrea Bernardini, Claudio Carpineto Fondazione Ugo...following. - Test the effectiveness of using a combination of Rocchio and distributional term analysis on a relevance feedback task; so far, this approach has...feedback not only more effective but also more robust than pseudo-relevance feedback? 2. Our approach: combining Rocchio with term-ranking scores The

  13. Pathway Relevance Ranking for Tumor Samples through Network-Based Data Integration.

    PubMed

    Verbeke, Lieven P C; Van den Eynden, Jimmy; Fierro, Ana Carolina; Demeester, Piet; Fostier, Jan; Marchal, Kathleen

    2015-01-01

    The study of cancer, a highly heterogeneous disease with different causes and clinical outcomes, requires a multi-angle approach and the collection of large multi-omics datasets that, ideally, should be analyzed simultaneously. We present a new pathway relevance ranking method that is able to prioritize pathways according to the information contained in any combination of tumor related omics datasets. Key to the method is the conversion of all available data into a single comprehensive network representation containing not only genes but also individual patient samples. Additionally, all data are linked through a network of previously identified molecular interactions. We demonstrate the performance of the new method by applying it to breast and ovarian cancer datasets from The Cancer Genome Atlas. By integrating gene expression, copy number, mutation and methylation data, the method's potential to identify key pathways involved in breast cancer development shared by different molecular subtypes is illustrated. Interestingly, certain pathways were ranked equally important for different subtypes, even when the underlying (epi)-genetic disturbances were diverse. Next to prioritizing universally high-scoring pathways, the pathway ranking method was able to identify subtype-specific pathways. Often the score of a pathway could not be motivated by a single mutation, copy number or methylation alteration, but rather by a combination of genetic and epi-genetic disturbances, stressing the need for a network-based data integration approach. The analysis of ovarian tumors, as a function of survival-based subtypes, demonstrated the method's ability to correctly identify key pathways, irrespective of tumor subtype. A differential analysis of survival-based subtypes revealed several pathways with higher importance for the bad-outcome patient group than for the good-outcome patient group. Many of the pathways exhibiting higher importance for the bad-outcome patient group could

  14. Universality of Rank-Ordering Distributions in the Arts and Sciences

    PubMed Central

    del Río, Manuel Beltrán; Mansilla, Ricardo; Miramontes, Pedro

    2009-01-01

    Searching for generic behaviors has been one of the driving forces leading to a deep understanding and classification of diverse phenomena. Usually a starting point is the development of a phenomenology based on observations. Such is the case for power law distributions encountered in a wealth of situations coming from physics, geophysics, biology, lexicography as well as social and financial networks. This finding is however restricted to a range of values outside of which finite size corrections are often invoked. Here we uncover a universal behavior of the way in which elements of a system are distributed according to their rank with respect to a given property, valid for the full range of values, regardless of whether or not a power law has previously been suggested. We propose a two parameter functional form for these rank-ordered distributions that gives excellent fits to an impressive amount of very diverse phenomena, coming from the arts, social and natural sciences. It is a discrete version of a generalized beta distribution, given by f(r) = A(N+1-r)b/ra, where r is the rank, N its maximum value, A the normalization constant and (a, b) two fitting exponents. Prompted by our genetic sequence observations we present a growth probabilistic model incorporating mutation-duplication features that generates data complying with this distribution. The competition between permanence and change appears to be a relevant, though not necessary feature. Additionally, our observations mainly of social phenomena suggest that a multifactorial quality resulting from the convergence of several heterogeneous underlying processes is an important feature. We also explore the significance of the distribution parameters and their classifying potential. The ubiquity of our findings suggests that there must be a fundamental underlying explanation, most probably of a statistical nature, such as an appropriate central limit theorem formulation. PMID:19277122

  15. Universality of rank-ordering distributions in the arts and sciences.

    PubMed

    Martínez-Mekler, Gustavo; Alvarez Martínez, Roberto; Beltrán del Río, Manuel; Mansilla, Ricardo; Miramontes, Pedro; Cocho, Germinal

    2009-01-01

    Searching for generic behaviors has been one of the driving forces leading to a deep understanding and classification of diverse phenomena. Usually a starting point is the development of a phenomenology based on observations. Such is the case for power law distributions encountered in a wealth of situations coming from physics, geophysics, biology, lexicography as well as social and financial networks. This finding is however restricted to a range of values outside of which finite size corrections are often invoked. Here we uncover a universal behavior of the way in which elements of a system are distributed according to their rank with respect to a given property, valid for the full range of values, regardless of whether or not a power law has previously been suggested. We propose a two parameter functional form for these rank-ordered distributions that gives excellent fits to an impressive amount of very diverse phenomena, coming from the arts, social and natural sciences. It is a discrete version of a generalized beta distribution, given by f(r) = A(N+1-r)(b)/r(a), where r is the rank, N its maximum value, A the normalization constant and (a, b) two fitting exponents. Prompted by our genetic sequence observations we present a growth probabilistic model incorporating mutation-duplication features that generates data complying with this distribution. The competition between permanence and change appears to be a relevant, though not necessary feature. Additionally, our observations mainly of social phenomena suggest that a multifactorial quality resulting from the convergence of several heterogeneous underlying processes is an important feature. We also explore the significance of the distribution parameters and their classifying potential. The ubiquity of our findings suggests that there must be a fundamental underlying explanation, most probably of a statistical nature, such as an appropriate central limit theorem formulation.

  16. Development of Increasingly Autonomous Traffic Data Manager Using Pilot Relevancy and Ranking Data

    NASA Technical Reports Server (NTRS)

    Le Vie, Lisa R.; Houston, Vincent E.

    2017-01-01

    NASA's Safe Autonomous Systems Operations (SASO) project goal is to define and safely enable all future airspace operations by justifiable and optimal autonomy for advanced air, ground, and connected capabilities. This work showcases how Increasingly Autonomous Systems (IAS) could create operational transformations beneficial to the enhancement of civil aviation safety and efficiency. One such IAS under development is the Traffic Data Manager (TDM). This concept is a prototype 'intelligent party-line' system that would declutter and parse out non-relevant air traffic, displaying only relevant air traffic to the aircrew in a digital data communications (Data Comm) environment. As an initial step, over 22,000 data points were gathered from 31 Airline Transport Pilots to train the machine learning algorithms designed to mimic human experts and expertise. The test collection used an analog of the Navigation Display. Pilots were asked to rate the relevancy of the displayed traffic using an interactive tablet application. Pilots were also asked to rank the order of importance of the information given, to better weight the variables within the algorithm. They were also asked if the information given was enough data, and more importantly the "right" data to best inform the algorithm. The paper will describe the findings and their impact to the further development of the algorithm for TDM and, in general, address the issue of how can we train supervised machine learning algorithms, critical to increasingly autonomous systems, with the knowledge and expertise of expert human pilots.

  17. MememxGATE: Unearthing Latent Content Features for Improved Search and Relevancy Ranking Across Scientific Literature

    NASA Astrophysics Data System (ADS)

    Wilson, B. D.; McGibbney, L. J.; Mattmann, C. A.; Ramirez, P.; Joyce, M.; Whitehall, K. D.

    2015-12-01

    be utilized for improved search and relevancy ranking across scientific literature.

  18. Methods of computing vocabulary size for the two-parameter rank distribution

    NASA Technical Reports Server (NTRS)

    Edmundson, H. P.; Fostel, G.; Tung, I.; Underwood, W.

    1972-01-01

    A summation method is described for computing the vocabulary size for given parameter values in the 1- and 2-parameter rank distributions. Two methods of determining the asymptotes for the family of 2-parameter rank-distribution curves are also described. Tables are computed and graphs are drawn relating paris of parameter values to the vocabulary size. The partial product formula for the Riemann zeta function is investigated as an approximation to the partial sum formula for the Riemann zeta function. An error bound is established that indicates that the partial product should not be used to approximate the partial sum in calculating the vocabulary size for the 2-parameter rank distribution.

  19. The exact probability distribution of the rank product statistics for replicated experiments.

    PubMed

    Eisinga, Rob; Breitling, Rainer; Heskes, Tom

    2013-03-18

    The rank product method is a widely accepted technique for detecting differentially regulated genes in replicated microarray experiments. To approximate the sampling distribution of the rank product statistic, the original publication proposed a permutation approach, whereas recently an alternative approximation based on the continuous gamma distribution was suggested. However, both approximations are imperfect for estimating small tail probabilities. In this paper we relate the rank product statistic to number theory and provide a derivation of its exact probability distribution and the true tail probabilities.

  20. Text mixing shapes the anatomy of rank-frequency distributions

    NASA Astrophysics Data System (ADS)

    Williams, Jake Ryland; Bagrow, James P.; Danforth, Christopher M.; Dodds, Peter Sheridan

    2015-05-01

    Natural languages are full of rules and exceptions. One of the most famous quantitative rules is Zipf's law, which states that the frequency of occurrence of a word is approximately inversely proportional to its rank. Though this "law" of ranks has been found to hold across disparate texts and forms of data, analyses of increasingly large corpora since the late 1990s have revealed the existence of two scaling regimes. These regimes have thus far been explained by a hypothesis suggesting a separability of languages into core and noncore lexica. Here we present and defend an alternative hypothesis that the two scaling regimes result from the act of aggregating texts. We observe that text mixing leads to an effective decay of word introduction, which we show provides accurate predictions of the location and severity of breaks in scaling. Upon examining large corpora from 10 languages in the Project Gutenberg eBooks collection, we find emphatic empirical support for the universality of our claim.

  1. Growth, distribution and rank stability of urban settlements in Greece.

    PubMed

    Petsimeris, P

    1986-01-01

    "This paper aims at analyzing the structure of the system in Greece of urban settlements, from 1870 to 1981. It is based on a study by the author concerning the process of urbanization and the problems of the 'residential subsystem' in countries of intermediate development with special reference to Greece. The analysis takes as sole indicator of the evolution of the urban centers network, the long term variation of population of urban settlements in Greece and as tools of analysis, the Rank-Size Rule (RSR) and Hoover's Index." Distinctions are drawn between the urban settlement patterns in the pre-capitalist and capitalist periods, the latter being marked by an unbalanced hierarchy dominated by Athens and without medium-sized cities, other than Thessaloniki. excerpt

  2. Rank-size distribution and primate city characteristics in India--a temporal analysis.

    PubMed

    Das, R J; Dutt, A K

    1993-02-01

    "This paper is an analysis of the historical change in city size distribution in India....Rank-size distribution at national level and primate city-size distribution at regional levels are examined....The paper also examines, in the Indian context, the relation between rank-size distribution and an integrated urban system, and the normative nature of the latter as a spatial organization of human society. Finally, we have made a modest attempt to locate the research on city-size distribution...." excerpt

  3. Rank Regressions, Wage Distributions, and the Gender Gap.

    ERIC Educational Resources Information Center

    Fortin, Nicole M.; Lemieux, Thomas

    1998-01-01

    Current Population Survey data from 1979 and 1991 were used to decompose changes in the gender wage gap into three components: skill distribution, wage structure, and improvements in women's position. Relative wage gains by women may have been a source of increasing wage inequality among men. (SK)

  4. Environmental correlates of species rank - abundance distributions in global drylands.

    PubMed

    Ulrich, Werner; Soliveres, Santiago; Thomas, Andrew D; Dougill, Andrew J; Maestre, Fernando T

    2016-06-01

    Theoretical models predict lognormal species abundance distributions (SADs) in stable and productive environments, with log-series SADs in less stable, dispersal driven communities. We studied patterns of relative species abundances of perennial vascular plants in global dryland communities to: i) assess the influence of climatic and soil characteristics on the observed SADs, ii) infer how environmental variability influences relative abundances, and iii) evaluate how colonisation dynamics and environmental filters shape abundance distributions. We fitted lognormal and log-series SADs to 91 sites containing at least 15 species of perennial vascular plants. The dependence of species relative abundances on soil and climate variables was assessed using general linear models. Irrespective of habitat type and latitude, the majority of the SADs (70.3%) were best described by a lognormal distribution. Lognormal SADs were associated with low annual precipitation, higher aridity, high soil carbon content, and higher variability of climate variables and soil nitrate. Our results do not corroborate models predicting the prevalence of log-series SADs in dryland communities. As lognormal SADs were particularly associated with sites with drier conditions and a higher environmental variability, we reject models linking lognormality to environmental stability and high productivity conditions. Instead our results point to the prevalence of lognormal SADs in heterogeneous environments, allowing for more evenly distributed plant communities, or in stressful ecosystems, which are generally shaped by strong habitat filters and limited colonisation. This suggests that drylands may be resilient to environmental changes because the many species with intermediate relative abundances could take over ecosystem functioning if the environment becomes suboptimal for dominant species.

  5. Inheritance of Properties of Normal and Non-Normal Distributions after Transformation of Scores to Ranks

    ERIC Educational Resources Information Center

    Zimmerman, Donald W.

    2011-01-01

    This study investigated how population parameters representing heterogeneity of variance, skewness, kurtosis, bimodality, and outlier-proneness, drawn from normal and eleven non-normal distributions, also characterized the ranks corresponding to independent samples of scores. When the parameters of population distributions from which samples were…

  6. SORT-AID with RANK: Search Postprocessing Tools for Automating the Determination of Citation Relevance.

    ERIC Educational Resources Information Center

    Leigh, William; Paz, Noemi

    1986-01-01

    Reports on the design and use of a computer system designed to assist online database searchers in determining the relevance of downloaded abstracts. The system, which runs on a personal computer, includes aids for individual review of abstracts and semiautomatic determination of relevance; prospects for fully automatic use are discussed. (CDD)

  7. Document Text Characteristics Affect the Ranking of the Most Relevant Documents by Expanded Structured Queries.

    ERIC Educational Resources Information Center

    Sormunen, Eero; Kekalainen, Jaana; Koivisto, Jussi; Jarvelin, Kalervo

    2001-01-01

    Presents a new concept-based method to analyze the text characteristics of documents at varying relevance levels. Applies the results of the document analysis in an experiment on query expansion in a probabilistic information retrieval system and investigates statistical differences in textual characteristics of highly relevant and less relevant…

  8. Exploring Empirical Rank-Frequency Distributions Longitudinally through a Simple Stochastic Process

    PubMed Central

    Finley, Benjamin J.; Kilkki, Kalevi

    2014-01-01

    The frequent appearance of empirical rank-frequency laws, such as Zipf’s law, in a wide range of domains reinforces the importance of understanding and modeling these laws and rank-frequency distributions in general. In this spirit, we utilize a simple stochastic cascade process to simulate several empirical rank-frequency distributions longitudinally. We focus especially on limiting the process’s complexity to increase accessibility for non-experts in mathematics. The process provides a good fit for many empirical distributions because the stochastic multiplicative nature of the process leads to an often observed concave rank-frequency distribution (on a log-log scale) and the finiteness of the cascade replicates real-world finite size effects. Furthermore, we show that repeated trials of the process can roughly simulate the longitudinal variation of empirical ranks. However, we find that the empirical variation is often less that the average simulated process variation, likely due to longitudinal dependencies in the empirical datasets. Finally, we discuss the process limitations and practical applications. PMID:24755621

  9. Exploring empirical rank-frequency distributions longitudinally through a simple stochastic process.

    PubMed

    Finley, Benjamin J; Kilkki, Kalevi

    2014-01-01

    The frequent appearance of empirical rank-frequency laws, such as Zipf's law, in a wide range of domains reinforces the importance of understanding and modeling these laws and rank-frequency distributions in general. In this spirit, we utilize a simple stochastic cascade process to simulate several empirical rank-frequency distributions longitudinally. We focus especially on limiting the process's complexity to increase accessibility for non-experts in mathematics. The process provides a good fit for many empirical distributions because the stochastic multiplicative nature of the process leads to an often observed concave rank-frequency distribution (on a log-log scale) and the finiteness of the cascade replicates real-world finite size effects. Furthermore, we show that repeated trials of the process can roughly simulate the longitudinal variation of empirical ranks. However, we find that the empirical variation is often less that the average simulated process variation, likely due to longitudinal dependencies in the empirical datasets. Finally, we discuss the process limitations and practical applications.

  10. Degree distribution, rank-size distribution, and leadership persistence in mediation-driven attachment networks

    NASA Astrophysics Data System (ADS)

    Hassan, Md. Kamrul; Islam, Liana; Haque, Syed Arefinul

    2017-03-01

    We investigate the growth of a class of networks in which a new node first picks a mediator at random and connects with m randomly chosen neighbors of the mediator at each time step. We show that the degree distribution in such a mediation-driven attachment (MDA) network exhibits power-law P(k) ∼k - γ(m) with a spectrum of exponents depending on m. To appreciate the contrast between MDA and Barabási-Albert (BA) networks, we then discuss their rank-size distribution. To quantify how long a leader, the node with the maximum degree, persists in its leadership as the network evolves, we investigate the leadership persistence probability F(τ) i.e. the probability that a leader retains its leadership up to time τ. We find that it exhibits a power-law F(τ) ∼τ - θ(m) with persistence exponent θ(m) ≈ 1.51 ∀ m in MDA networks and θ(m) → 1.53 exponentially with m in BA networks.

  11. Accurate ranking of differentially expressed genes by a distribution-free shrinkage approach.

    PubMed

    Opgen-Rhein, Rainer; Strimmer, Korbinian

    2007-01-01

    High-dimensional case-control analysis is encountered in many different settings in genomics. In order to rank genes accordingly, many different scores have been proposed, ranging from ad hoc modifications of the ordinary t statistic to complicated hierarchical Bayesian models. Here, we introduce the "shrinkage t" statistic that is based on a novel and model-free shrinkage estimate of the variance vector across genes. This is derived in a quasi-empirical Bayes setting. The new rank score is fully automatic and requires no specification of parameters or distributions. It is computationally inexpensive and can be written analytically in closed form. Using a series of synthetic and three real expression data we studied the quality of gene rankings produced by the "shrinkage t" statistic. The new score consistently leads to highly accurate rankings for the complete range of investigated data sets and all considered scenarios for across-gene variance structures.

  12. Permutational distribution of the log-rank statistic under random censorship with applications to carcinogenicity assays.

    PubMed

    Heimann, G; Neuhaus, G

    1998-03-01

    In the random censorship model, the log-rank test is often used for comparing a control group with different dose groups. If the number of tumors is small, so-called exact methods are often applied for computing critical values from a permutational distribution. Two of these exact methods are discussed and shown to be incorrect. The correct permutational distribution is derived and studied with respect to its behavior under unequal censoring in the light of recent results proving that the permutational version and the unconditional version of the log-rank test are asymptotically equivalent even under unequal censoring. The log-rank test is studied by simulations of a realistic scenario from a bioassay with small numbers of tumors.

  13. Rank-Order Distribution of Administrative Salaries Paid, 1985-86. Nineteenth Annual Report.

    ERIC Educational Resources Information Center

    Arkansas Univ., Fayetteville. Office of Institutional Research.

    Results of a survey of salaries of full-time administrators at public, doctoral-granting institutions for 1985-1986 are presented. Rank order distributions of 12-month administrative salaries are provided for 156 state universities in 49 states and 33 university systems in 27 states. Salary data for 151 universities in 47 states are also arranged…

  14. Rank-Order Distribution of Administrative Salaries Paid, 1986-87. Twentieth Annual Report.

    ERIC Educational Resources Information Center

    Arkansas Univ., Fayetteville. Office of Institutional Research.

    Results of a survey of salaries of full-time administrators at public, doctoral-granting institutions for 1986-1987 are presented. Rank order distributions of 12-month administrative salaries are provided for 151 state universities in 49 states and 29 university systems in 23 states. Salary data for 151 universities are also arranged into the nine…

  15. Thirteenth Annual Rank-Order Distribution of Administrative Salaries Paid 1979-1980.

    ERIC Educational Resources Information Center

    Arkansas Univ., Fayetteville. Office of Institutional Research.

    Administrative salaries from a representative group of public doctorate-granting institutions are reported, including data from 128 universities in 49 states and 24 university systems in 20 states. The report is comprised of rank-order distribution tables for l2-month salaries in three categories: (1) by administrative title (president/chancellor,…

  16. GRank: a middleware search engine for ranking genes by relevance to given genes

    PubMed Central

    2013-01-01

    Background Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in order to identify semantically related genes. Others propose tools that return the set of genes that are semantically related to a given set of genes. Most of these gene similarity measures determine the semantic similarities among the genes based solely on the proximity to each other of the GO terms annotating the genes, while overlook the structural dependencies among these GO terms, which may lead to low recall and precision of results. Results We propose in this paper a search engine called GRank, which overcomes the limitations of the current gene similarity measures outlined above as follows. It employs the concept of existence dependency to determine the structural dependencies among the GO terms annotating a given set of gene. After determining the set of genes that are semantically related to input genes, GRank would use microarray experiment to rank these genes based on their degree of relativity to the input genes. We evaluated GRank experimentally and compared it with a comparable gene prediction tool called DynGO, which retrieves the genes and gene products that are relatives of input genes. Results showed marked improvement. Conclusions The experimental results demonstrated that GRank overcomes the limitations of current gene similarity measures. We attribute this performance to GRank’s use of existence dependency concept for determining the semantic relationships among gene annotations. The recall and precision values for two benchmarking datasets showed that GRank outperforms DynGO tool, which does not employ the concept of existence dependency. The demo of GRank using 11000 KEGG yeast genes and a Gene Expression Omnibus (GEO) microarray file named

  17. Sample size calculation for weighted rank tests comparing survival distributions under cluster randomization: a simulation method.

    PubMed

    Jung, Sin-Ho

    2007-01-01

    We propose a sample size calculation method for rank tests comparing two survival distributions under cluster randomization with possibly variable cluster sizes. Here, sample size refers to number of clusters. Our method is based on simulation procedure generating clustered exponential survival variables whose distribution is specified by the marginal hazard rate and the intracluster correlation coefficient. Sample size is calculated given significance level, power, marginal hazard rates (or median survival times) under the alternative hypothesis, intracluster correlation coefficient, accrual rate, follow-up period, and cluster size distribution.

  18. Rank Dynamics

    NASA Astrophysics Data System (ADS)

    Gershenson, Carlos

    Studies of rank distributions have been popular for decades, especially since the work of Zipf. For example, if we rank words of a given language by use frequency (most used word in English is 'the', rank 1; second most common word is 'of', rank 2), the distribution can be approximated roughly with a power law. The same applies for cities (most populated city in a country ranks first), earthquakes, metabolism, the Internet, and dozens of other phenomena. We recently proposed ``rank diversity'' to measure how ranks change in time, using the Google Books Ngram dataset. Studying six languages between 1800 and 2009, we found that the rank diversity curves of languages are universal, adjusted with a sigmoid on log-normal scale. We are studying several other datasets (sports, economies, social systems, urban systems, earthquakes, artificial life). Rank diversity seems to be universal, independently of the shape of the rank distribution. I will present our work in progress towards a general description of the features of rank change in time, along with simple models which reproduce it

  19. Modification of the Porter-Thomas Distribution by Rank-One Interaction

    NASA Astrophysics Data System (ADS)

    Bogomolny, E.

    2017-01-01

    The Porter-Thomas (PT) distribution of resonance widths is one of the oldest and simplest applications of statistical ideas in nuclear physics. Previous experimental data confirmed it quite well, but recent and more careful investigations show clear deviations from this distribution. To explain these discrepancies, Volya, Weidenmüller, and Zelevinsky [Phys. Rev. Lett. 115, 052501 (2015), 10.1103/PhysRevLett.115.052501] argued that to get a realistic model of nuclear resonances is not enough to consider one of the standard random matrix ensembles which leads immediately to the PT distribution, but it is necessary to add a rank-one interaction which couples resonances to decay channels. The purpose of this Letter is to solve this model analytically and to find explicitly the modifications of the PT distribution due to such an interaction. Resulting formulas are simple, in good agreement with numerics, and could explain experimental results.

  20. Co-pyrolysis of low rank coals and biomass: Product distributions

    SciTech Connect

    Soncini, Ryan M.; Means, Nicholas C.; Weiland, Nathan T.

    2013-10-01

    Pyrolysis and gasification of combined low rank coal and biomass feeds are the subject of much study in an effort to mitigate the production of green house gases from integrated gasification combined cycle (IGCC) systems. While co-feeding has the potential to reduce the net carbon footprint of commercial gasification operations, the effects of co-feeding on kinetics and product distributions requires study to ensure the success of this strategy. Southern yellow pine was pyrolyzed in a semi-batch type drop tube reactor with either Powder River Basin sub-bituminous coal or Mississippi lignite at several temperatures and feed ratios. Product gas composition of expected primary constituents (CO, CO{sub 2}, CH{sub 4}, H{sub 2}, H{sub 2}O, and C{sub 2}H{sub 4}) was determined by in-situ mass spectrometry while minor gaseous constituents were determined using a GC-MS. Product distributions are fit to linear functions of temperature, and quadratic functions of biomass fraction, for use in computational co-pyrolysis simulations. The results are shown to yield significant nonlinearities, particularly at higher temperatures and for lower ranked coals. The co-pyrolysis product distributions evolve more tar, and less char, CH{sub 4}, and C{sub 2}H{sub 4}, than an additive pyrolysis process would suggest. For lignite co-pyrolysis, CO and H{sub 2} production are also reduced. The data suggests that evolution of hydrogen from rapid pyrolysis of biomass prevents the crosslinking of fragmented aromatic structures during coal pyrolysis to produce tar, rather than secondary char and light gases. Finally, it is shown that, for the two coal types tested, co-pyrolysis synergies are more significant as coal rank decreases, likely because the initial structure in these coals contains larger pores and smaller clusters of aromatic structures which are more readily retained as tar in rapid co-pyrolysis.

  1. Order-disorder transition in conflicting dynamics leading to rank-frequency generalized beta distributions

    NASA Astrophysics Data System (ADS)

    Alvarez-Martinez, R.; Martinez-Mekler, G.; Cocho, G.

    2011-01-01

    The behavior of rank-ordered distributions of phenomena present in a variety of fields such as biology, sociology, linguistics, finance and geophysics has been a matter of intense research. Often power laws have been encountered; however, their validity tends to hold mainly for an intermediate range of rank values. In a recent publication (Martínez-Mekler et al., 2009 [7]), a generalization of the functional form of the beta distribution has been shown to give excellent fits for many systems of very diverse nature, valid for the whole range of rank values, regardless of whether or not a power law behavior has been previously suggested. Here we give some insight on the significance of the two free parameters which appear as exponents in the functional form, by looking into discrete probabilistic branching processes with conflicting dynamics. We analyze a variety of realizations of these so-called expansion-modification models first introduced by Wentian Li (1989) [10]. We focus our attention on an order-disorder transition we encounter as we vary the modification probability p. We characterize this transition by means of the fitting parameters. Our numerical studies show that one of the fitting exponents is related to the presence of long-range correlations exhibited by power spectrum scale invariance, while the other registers the effect of disordering elements leading to a breakdown of these properties. In the absence of long-range correlations, this parameter is sensitive to the occurrence of unlikely events. We also introduce an approximate calculation scheme that relates this dynamics to multinomial multiplicative processes. A better understanding through these models of the meaning of the generalized beta-fitting exponents may contribute to their potential for identifying and characterizing universality classes.

  2. Relationship between Particle Size Distribution of Low-Rank Pulverized Coal and Power Plant Performance

    DOE PAGES

    Ganguli, Rajive; Bandopadhyay, Sukumar

    2012-01-01

    Tmore » he impact of particle size distribution (PSD) of pulverized, low rank high volatile content Alaska coal on combustion related power plant performance was studied in a series of field scale tests. Performance was gauged through efficiency (ratio of megawatt generated to energy consumed as coal), emissions (SO 2 , NO x , CO), and carbon content of ash (fly ash and bottom ash).he study revealed that the tested coal could be burned at a grind as coarse as 50% passing 76 microns, with no deleterious impact on power generation and emissions.he PSD’s tested in this study were in the range of 41 to 81 percent passing 76 microns.here was negligible correlation between PSD and the followings factors: efficiency, SO 2 , NO x , and CO. Additionally, two tests where stack mercury (Hg) data was collected, did not demonstrate any real difference in Hg emissions with PSD.he results from the field tests positively impacts pulverized coal power plants that burn low rank high volatile content coals (such as Powder River Basin coal).hese plants can potentially reduce in-plant load by grinding the coal less (without impacting plant performance on emissions and efficiency) and thereby, increasing their marketability.« less

  3. Distributed Compressive Sensing of Hyperspectral Images Using Low Rank and Structure Similarity Property

    NASA Astrophysics Data System (ADS)

    Huang, Bingchao; Xu, Ke; Wan, Jianwei; Liu, Xu

    2015-11-01

    An efficient method and system for distributed compressive sensing of hyperspectral images is presented, which exploit the low rank and structure similarity property of hyperspectral imagery. In this paper, by integrating the respective characteristics of DSC and CS, a distributed compressive sensing framework is proposed to simultaneously capture and compress hyperspectral images. At the encoder, every band image is measured independently, where almost all computation burdens can be shifted to the decoder, resulting in a very low-complexity encoder. It is simple to operate and easy to hardware implementation. At the decoder, each band image is reconstructed by the method of total variation norm minimize. During each band reconstruction, the low rand structure of band images and spectrum structure similarity are used to give birth to the new regularizers. With combining the new regularizers and other regularizer, we can sufficiently exploit the spatial correlation, spectral correlation and spectral structural redundancy in hyperspectral imagery. A numerical optimization algorithm is also proposed to solve the reconstruction model by augmented Lagrangian multiplier method. Experimental results show that this method can effectively improve the reconstruction quality of hyperspectral images.

  4. Population distribution models: species distributions are better modeled using biologically relevant data partitions

    PubMed Central

    2011-01-01

    Background Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for analyses, which may mask biologically relevant spatial variation within the distribution of a single widespread species. We contrast a presence-only maximum entropy model for the widely distributed oldfield mouse (Peromyscus polionotus) that includes all available presence locations for this species, with two composite maximum entropy models. The composite models either subdivided the total species distribution into four geographic quadrants or by fifteen subspecies to capture spatially relevant variation in P. polionotus distributions. Results Despite high Area Under the ROC Curve (AUC) values for all models, the composite species distribution model of P. polionotus generated from individual subspecies models represented the known distribution of the species much better than did the models produced by partitioning data into geographic quadrants or modeling the whole species as a single unit. Conclusions Because the AUC values failed to describe the differences in the predictability of the three modeling strategies, we suggest using omission curves in addition to AUC values to assess model performance. Dividing the data of a widespread species into biologically relevant partitions greatly increased the performance of our distribution model; therefore, this approach may prove to be quite practical and informative for a wide range of modeling applications. PMID:21929792

  5. Population distribution models: species distributions are better modeled using biologically relevant data partitions.

    PubMed

    Gonzalez, Sergio C; Soto-Centeno, J Angel; Reed, David L

    2011-09-19

    Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for analyses, which may mask biologically relevant spatial variation within the distribution of a single widespread species. We contrast a presence-only maximum entropy model for the widely distributed oldfield mouse (Peromyscus polionotus) that includes all available presence locations for this species, with two composite maximum entropy models. The composite models either subdivided the total species distribution into four geographic quadrants or by fifteen subspecies to capture spatially relevant variation in P. polionotus distributions. Despite high Area Under the ROC Curve (AUC) values for all models, the composite species distribution model of P. polionotus generated from individual subspecies models represented the known distribution of the species much better than did the models produced by partitioning data into geographic quadrants or modeling the whole species as a single unit. Because the AUC values failed to describe the differences in the predictability of the three modeling strategies, we suggest using omission curves in addition to AUC values to assess model performance. Dividing the data of a widespread species into biologically relevant partitions greatly increased the performance of our distribution model; therefore, this approach may prove to be quite practical and informative for a wide range of modeling applications.

  6. Estimating individual tree mid- and understory rank-size distributions from airborne laser scanning in semi-arid forests

    Treesearch

    Tyson L. Swetnam; Donald A. Falk; Ann M. Lynch; Stephen R. Yool

    2014-01-01

    Limitations inherent to airborne laser scanning (ALS) technology and the complex sorting and packing relationships of forests complicate accurate remote sensing of mid- and understory trees, especially in denser forest stands. Self-similarities in rank-sized individual tree distributions (ITD), e.g. bole diameter or height, are a well-understood property of natural,...

  7. Evaluation of two formulations containing mineral trioxide aggregate on delayed tooth replantation: relevance of RANKL/RANK/OPG system.

    PubMed

    Vogt, Beatriz Farias; Souza, Carlos Eduardo Chrzanowski Pereira; Silva, Daniela Nascimento; Etges, Adriana; Campos, Maria Martha

    2016-05-01

    This study aimed to evaluate the effects of White MTA (WMTA) and MTA Fillapex(®) on root resorption, when used for root canal filling, in a rat model of delayed tooth replantation, with special focus on the RANKL/RANK/OPG system. Maxillary right central incisors of male rats were extracted (total N = 48), and exposed to dry environment for 30 min. The animals were allocated into four groups: (1) WMTA; (2) MTA Fillapex; (3) Calcium hydroxide; (4) Negative control. After periodontal ligament removal, root canals were filled with the corresponding material and replanted. After 10 and 60 days, qualitative and semi-quantitative histological and immunohistochemical analyses were carried out. Analysis of variance (ANOVA) with Tukey's post hoc adjustment was used, at 10 and 60 days, to compare the experimental groups in terms of the inflammatory scores and in terms of the changes in OPG, RANK and RANKL. Both WMTA and MTA Fillapex groups displayed inflammatory and replacement resorption, with the presence of dento-alveolar ankylosis, similarly to that observed for calcium hydroxide, in either 10 or 60 days. Notably, a slight increase of the inflammatory process was observed in both MTA groups. Quantitatively, inflammation score analysis showed a significant difference between the calcium hydroxide and the control group at 10 days. On 60 days, dento-alveolar ankylosis was found significantly increased in the MTA Fillapex, in comparison to the control group (p < 0.05). For immunohistochemical analysis, the expression of both RANK and RANKL was reduced in calcium hydroxide and WMTA groups, from 10 to 60 days of evaluation, an effect that was accompanied by increased OPG immunolabelling. Otherwise, the MTA Fillapex group presented a general increase of RANKL immunopositivity, similarly to that observed in the negative control group. Our data showed that none of tested materials was able to fully prevent the root resorption, although the white MTA cement presented an outcome

  8. [Climate suitable rank distribution of artemisinin content of Artemisia annua in China].

    PubMed

    Zhang, Xiao-bo; Guo, Lan-ping; Huang, Lu-qi

    2011-04-01

    At the urgent request of Artemisia annua (ART) planting, the paper gets artemisinin content (ARTC) of ART in China from literatures. The paper analyses the relationships between ARTC and ecological factors by statistical analytical methods. The paper also analyses the climate suitable rank distribution of ARTC in China by ArcGIS. The results display that first, ARTC is significantly different in China, that ART from the south regions ARTC is higher. Greatest north parts of China have not suitable climate for the growing of ART and the ARTC is lower than 0.2%, when ART grows above the 34th degree of northern latitude. ARTC is higher and ART grows well, when ART grows under the 34 degrees N and grows at the areas between 100 degrees E and 120 degrees E. Second, subtropical zone is the best suitable climate zone for the growing of ART. ART grows well and ARTC is higher than 0.5%, when ART grows in the subtropical zone. Third, temperature, sunshine duration and rainfall are the main ecological factors that affect the growth of ART and the accumulation of ARTC. That the year temperature between 13.9 degrees C and 22 degrees C, sunshine duration between 853 h and 1507 h, rainfall between 814 mm and 1518 mm, is the best climate for the accumulation of ARTC. Temperature between 13 degrees C and 29 degrees C, rainfall between 600 mm and 1300 mm is the best climate for the growth of ART. Fourth, in northwest of Guangxi, eastern of Sichuan, Guizhou and Yunnan provinces, south Chongqing and west Hunan Province, there are suitable climate for the growth of Artemisia and for the accumulating of ARTC. There are also some suitable climate areas for the growing of artemisia in the south of Hubei, Anhui and Jiangsu provinces.

  9. Unveiling the species-rank abundance distribution by generalizing the Good-Turing sample coverage theory.

    PubMed

    Chao, Anne; Hsieh, T C; Chazdon, Robin L; Colwell, Robert K; Gotelli, Nicholas J

    2015-05-01

    Based on a sample of individuals, we focus on inferring the vector of species relative abundance of an entire assemblage and propose a novel estimator of the complete species-rank abundance distribution (RAD). Nearly all previous estimators of the RAD use the conventional "plug-in" estimator Pi (sample relative abundance) of the true relative abundance pi of species i. Because most biodiversity samples are incomplete, the plug-in estimators are applied only to the subset of species that are detected in the sample. Using the concept of sample coverage and its generalization, we propose a new statistical framework to estimate the complete RAD by separately adjusting the sample relative abundances for the set of species detected in the sample and estimating the relative abundances for the set of species undetected in the sample but inferred to be present in the assemblage. We first show that P, is a positively biased estimator of pi for species detected in the sample, and that the degree of bias increases with increasing relative rarity of each species. We next derive a method to adjust the sample relative abundance to reduce the positive bias inherent in j. The adjustment method provides a nonparametric resolution to the longstanding challenge of characterizing the relationship between the true relative abundance in the entire assemblage and the observed relative abundance in a sample. Finally, we propose a method to estimate the true relative abundances of the undetected species based on a lower bound of the number of undetected species. We then combine the adjusted RAD for the detected species and the estimated RAD for the undetected species to obtain the complete RAD estimator. Simulation results show that the proposed RAD curve can unveil the true RAD and is more accurate than the empirical RAD. We also extend our method to incidence data. Our formulas and estimators are illustrated using empirical data sets from surveys of forest spiders (for abundance data) and

  10. Eighteenth Annual Rank-Order Distribution of Administrative Salaries Paid, 1984-85.

    ERIC Educational Resources Information Center

    Arkansas Univ., Fayetteville. Office of Institutional Research.

    Results of a survey of salaries of full-time administrators at public, doctoral-granting institutions for 1984-1985 are presented. A ranking of salaries paid among 151 state-supported universities representing 47 states and 33 university systems representing 27 states is given. Salary data are also arranged into the nine regions defined by the…

  11. Revisiting the destination ranking procedure in development of an Intervening Opportunities Model for public transit trip distribution

    NASA Astrophysics Data System (ADS)

    Nazem, Mohsen; Trépanier, Martin; Morency, Catherine

    2015-01-01

    An Enhanced Intervening Opportunities Model (EIOM) is developed for Public Transit (PT). This is a distribution supply dependent model, with single constraints on trip production for work trips during morning peak hours (6:00 a.m.-9:00 a.m.) within the Island of Montreal, Canada. Different data sets, including the 2008 Origin-Destination (OD) survey of the Greater Montreal Area, the 2006 Census of Canada, GTFS network data, along with the geographical data of the study area, are used. EIOM is a nonlinear model composed of socio-demographics, PT supply data and work location attributes. An enhanced destination ranking procedure is used to calculate the number of spatially cumulative opportunities, the basic variable of EIOM. For comparison, a Basic Intervening Opportunities Model (BIOM) is developed by using the basic destination ranking procedure. The main difference between EIOM and BIOM is in the destination ranking procedure: EIOM considers the maximization of a utility function composed of PT Level Of Service and number of opportunities at the destination, along with the OD trip duration, whereas BIOM is based on a destination ranking derived only from OD trip durations. Analysis confirmed that EIOM is more accurate than BIOM. This study presents a new tool for PT analysts, planners and policy makers to study the potential changes in PT trip patterns due to changes in socio-demographic characteristics, PT supply, and other factors. Also it opens new opportunities for the development of more accurate PT demand models with new emergent data such as smart card validations.

  12. Cometabolism of Monochloramine by Distribution System Relevant Mixed Culture Nitrifiers

    EPA Science Inventory

    Monochloramine (NH2Cl) is increasingly used as a residual disinfectant. A major problem related to NH2Cl is nitrification in distribution systems, leading to rapid NH2Cl residual loss. Ammonia-oxidizing bacteria (AOB), which oxidize ammonia (NH3) to nitrite, can cometabolize chem...

  13. Cometabolism of Monochloramine by Distribution System Relevant Mixed Culture Nitrifiers

    EPA Science Inventory

    Monochloramine (NH2Cl) is increasingly used as a residual disinfectant. A major problem related to NH2Cl is nitrification in distribution systems, leading to rapid NH2Cl residual loss. Ammonia-oxidizing bacteria (AOB), which oxidize ammonia (NH3) to nitrite, can cometabolize chem...

  14. Light distributions on the retina: relevance to macular pigment photoprotection.

    PubMed

    Bone, Richard A; Gibert, Jorge C; Mukherjee, Anirbaan

    2012-01-01

    Light exposure has been implicated in age-related macular degeneration (AMD). This study was designed to measure cumulative light distribution on the retina to determine whether it peaked in the macula. An eye-tracker recorded the subject's field of view and pupil size, and superimposed the gaze position. Fifteen naïve subjects formed a test group; 5 formed a control group. In phase 1, all subjects viewed a sequence of photographic images. In phase 2, the naïve subjects observed a video; in phase 3, they performed computer tasks; in phase 4, the subjects walked around freely. In phase 1, control subjects were instructed to gaze at bright features in the field of view and, in a second test, at dark features. Test group subjects were allowed to gaze freely for all phases. Using the subject's gaze coordinates, we calculated the cumulative light distribution on the retina. As expected for control subjects, cumulative retinal light distributions peaked and dipped in the fovea when they gazed at bright or dark features respectively in the field of view. The light distribution maps obtained from the test group showed a consistent tendency to peak in the macula in phase 3, a variable tendency in phase 4, but little tendency in phases 1 and 2. We conclude that a tendency for light to peak in the macula is a characteristic of some individuals and of certain tasks. In these situations, risk of AMD could be increased but, at the same time, mitigated by the presence of macular carotenoids.

  15. Creating Composite Age Groups to Smooth Percentile Rank Distributions of Small Samples

    ERIC Educational Resources Information Center

    Lopez, Francesca; Olson, Amy; Bansal, Naveen

    2011-01-01

    Individually administered tests are often normed on small samples, a process that may result in irregularities within and across various age or grade distributions. Test users often smooth distributions guided by Thurstone assumptions (normality and linearity) to result in norms that adhere to assumptions made about how the data should look. Test…

  16. Creating Composite Age Groups to Smooth Percentile Rank Distributions of Small Samples

    ERIC Educational Resources Information Center

    Lopez, Francesca; Olson, Amy; Bansal, Naveen

    2011-01-01

    Individually administered tests are often normed on small samples, a process that may result in irregularities within and across various age or grade distributions. Test users often smooth distributions guided by Thurstone assumptions (normality and linearity) to result in norms that adhere to assumptions made about how the data should look. Test…

  17. Grove Mountains meteorite recovery and relevant data distribution service

    NASA Astrophysics Data System (ADS)

    Zhou, Chunxia; Ai, Songtao; Chen, Nengcheng; Wang, Zemin; E, Dongchen

    2011-11-01

    Meteorites are extremely valuable in providing clues about the origin, evolution, and composition of the Sun, the Moon, the Earth, other planets, and asteroids. Since the first discovery of a meteorite in Antarctica, more and more meteorite concentrations on bare ice stranding sites were discovered. Antarctica is identified as a prolific source of extraterrestrial materials. The Grove Mountains area, covered by ice, snow, and nunataks, is located in the Antarctic inland area. It is about 380 km away from the Chinese Zhongshan Antarctic Research Station in East Antarctica. Since 1998, 11,452 meteorites have been collected from the Grove Mountains by the Chinese National Antarctic Research Expedition (CHINARE). It is confirmed that the Grove Mountains area is a productive search area for meteorites in Antarctica. More and more meteorite recoveries led to the recognition that unique mechanisms relating to meteorite concentrations exist in Antarctica. Besides meteorite field collections, the extraction of blue ice based on satellite images, meteorite concentration mechanisms, and meteorite data distribution service are discussed in this paper. Wide distribution of blue ice indicates the enrichment of meteorites. Based on the different spectrum characteristics and coherence of snow, blue ice, and bare rocks, blue ice areas are extracted from optical images and coherence maps. According to meteorite field collections and optical images, moraines are also identified as meteorite concentration sites in the Grove Mountains area. The meteorite concentration theories should be further analyzed by taking into account ice-flow dynamics, mountains' blocking effect, katabatic wind and ice ablation, and others. Moreover, in order to strengthen the visualization and network sharing of the valuable meteorite data, desktop software based on ArcObjects and web software based on ArcIMS are developed within this study. The desktop software also enables further analysis of the meteorite

  18. Solving the Ranking and Selection Indifference-Zone Formulation for Normal Distributions Using Computer Software

    DTIC Science & Technology

    1993-12-01

    0), a sample size (e.g. n), or a distribution (e.g. Fy). * bracketed subscripts (e.g. [i], [j], [a], [b]) - indicate order. For instance, popu...lation parameters are ordered as 0 [11 < 0[21 < ... <OJ. 2-3 * parenthesized subscripts (e.g. (i), (j), (a), (b)) - indicate association with a spe- cific... indicate neither order nor association with any specific ordered parameter. 2.2 Indifference-Zone Formulation (Integral Development) The indifference

  19. Environmental correlates of species rank – abundance distributions in global drylands

    PubMed Central

    Ulrich, Werner; Soliveres, Santiago; Thomas, Andrew D.; Dougill, Andrew J.; Maestre, Fernando T.

    2016-01-01

    Theoretical models predict lognormal species abundance distributions (SADs) in stable and productive environments, with log-series SADs in less stable, dispersal driven communities. We studied patterns of relative species abundances of perennial vascular plants in global dryland communities to: i) assess the influence of climatic and soil characteristics on the observed SADs, ii) infer how environmental variability influences relative abundances, and iii) evaluate how colonisation dynamics and environmental filters shape abundance distributions. We fitted lognormal and log-series SADs to 91 sites containing at least 15 species of perennial vascular plants. The dependence of species relative abundances on soil and climate variables was assessed using general linear models. Irrespective of habitat type and latitude, the majority of the SADs (70.3%) were best described by a lognormal distribution. Lognormal SADs were associated with low annual precipitation, higher aridity, high soil carbon content, and higher variability of climate variables and soil nitrate. Our results do not corroborate models predicting the prevalence of log-series SADs in dryland communities. As lognormal SADs were particularly associated with sites with drier conditions and a higher environmental variability, we reject models linking lognormality to environmental stability and high productivity conditions. Instead our results point to the prevalence of lognormal SADs in heterogeneous environments, allowing for more evenly distributed plant communities, or in stressful ecosystems, which are generally shaped by strong habitat filters and limited colonisation. This suggests that drylands may be resilient to environmental changes because the many species with intermediate relative abundances could take over ecosystem functioning if the environment becomes suboptimal for dominant species. PMID:27330404

  20. Functionally relevant climate variables for arid lands: Aclimatic water deficit approach for modelling desert shrub distributions

    Treesearch

    Thomas E. Dilts; Peter J. Weisberg; Camie M. Dencker; Jeanne C. Chambers

    2015-01-01

    We have three goals. (1) To develop a suite of functionally relevant climate variables for modelling vegetation distribution on arid and semi-arid landscapes of the Great Basin, USA. (2) To compare the predictive power of vegetation distribution models based on mechanistically proximate factors (water deficit variables) and factors that are more mechanistically removed...

  1. Ranking games.

    PubMed

    Osterloh, Margit; Frey, Bruno S

    2015-02-01

    Research rankings based on bibliometrics today dominate governance in academia and determine careers in universities. Analytical approach to capture the incentives by users of rankings and by suppliers of rankings, both on an individual and an aggregate level. Rankings may produce unintended negative side effects. In particular, rankings substitute the "taste for science" by a "taste for publication." We show that the usefulness of rankings rests on several important assumptions challenged by recent research. We suggest as alternatives careful socialization and selection of scholars, supplemented by periodic self-evaluations and awards. The aim is to encourage controversial discourses in order to contribute meaningful to the advancement of science. © The Author(s) 2014.

  2. A Comparison of the Power of Wilcoxon's Rank-Sum Statistic to that of Student's t Statistic under Various Nonnormal Distributions.

    ERIC Educational Resources Information Center

    Blair, R. Clifford; Higgins, James J.

    1980-01-01

    Monte Carlo techniques were used to compare the power of Wilcoxon's rank-sum test to the power of the two independent means t test for situations in which samples were drawn from (1) uniform, (2) Laplace, (3) half-normal, (4) exponential, (5) mixed-normal, and (6) mixed-uniform distributions. (Author/JKS)

  3. How to Rank Journals.

    PubMed

    Bradshaw, Corey J A; Brook, Barry W

    2016-01-01

    There are now many methods available to assess the relative citation performance of peer-reviewed journals. Regardless of their individual faults and advantages, citation-based metrics are used by researchers to maximize the citation potential of their articles, and by employers to rank academic track records. The absolute value of any particular index is arguably meaningless unless compared to other journals, and different metrics result in divergent rankings. To provide a simple yet more objective way to rank journals within and among disciplines, we developed a κ-resampled composite journal rank incorporating five popular citation indices: Impact Factor, Immediacy Index, Source-Normalized Impact Per Paper, SCImago Journal Rank and Google 5-year h-index; this approach provides an index of relative rank uncertainty. We applied the approach to six sample sets of scientific journals from Ecology (n = 100 journals), Medicine (n = 100), Multidisciplinary (n = 50); Ecology + Multidisciplinary (n = 25), Obstetrics & Gynaecology (n = 25) and Marine Biology & Fisheries (n = 25). We then cross-compared the κ-resampled ranking for the Ecology + Multidisciplinary journal set to the results of a survey of 188 publishing ecologists who were asked to rank the same journals, and found a 0.68-0.84 Spearman's ρ correlation between the two rankings datasets. Our composite index approach therefore approximates relative journal reputation, at least for that discipline. Agglomerative and divisive clustering and multi-dimensional scaling techniques applied to the Ecology + Multidisciplinary journal set identified specific clusters of similarly ranked journals, with only Nature & Science separating out from the others. When comparing a selection of journals within or among disciplines, we recommend collecting multiple citation-based metrics for a sample of relevant and realistic journals to calculate the composite rankings and their relative uncertainty windows.

  4. How to Rank Journals

    PubMed Central

    Bradshaw, Corey J. A.; Brook, Barry W.

    2016-01-01

    There are now many methods available to assess the relative citation performance of peer-reviewed journals. Regardless of their individual faults and advantages, citation-based metrics are used by researchers to maximize the citation potential of their articles, and by employers to rank academic track records. The absolute value of any particular index is arguably meaningless unless compared to other journals, and different metrics result in divergent rankings. To provide a simple yet more objective way to rank journals within and among disciplines, we developed a κ-resampled composite journal rank incorporating five popular citation indices: Impact Factor, Immediacy Index, Source-Normalized Impact Per Paper, SCImago Journal Rank and Google 5-year h-index; this approach provides an index of relative rank uncertainty. We applied the approach to six sample sets of scientific journals from Ecology (n = 100 journals), Medicine (n = 100), Multidisciplinary (n = 50); Ecology + Multidisciplinary (n = 25), Obstetrics & Gynaecology (n = 25) and Marine Biology & Fisheries (n = 25). We then cross-compared the κ-resampled ranking for the Ecology + Multidisciplinary journal set to the results of a survey of 188 publishing ecologists who were asked to rank the same journals, and found a 0.68–0.84 Spearman’s ρ correlation between the two rankings datasets. Our composite index approach therefore approximates relative journal reputation, at least for that discipline. Agglomerative and divisive clustering and multi-dimensional scaling techniques applied to the Ecology + Multidisciplinary journal set identified specific clusters of similarly ranked journals, with only Nature & Science separating out from the others. When comparing a selection of journals within or among disciplines, we recommend collecting multiple citation-based metrics for a sample of relevant and realistic journals to calculate the composite rankings and their relative uncertainty windows. PMID:26930052

  5. Profiling the Flagship University Model: An Exploratory Proposal for Changing the Paradigm from Ranking to Relevancy. Research & Occasional Paper Series: CSHE.5.14

    ERIC Educational Resources Information Center

    Douglass, John Aubrey

    2014-01-01

    It's a familiar if not fully explained paradigm. A "World Class University" (WCU) is supposed to have highly ranked research output, a culture of excellence, great facilities, and a brand name that transcends national borders. But perhaps most importantly, the particular institution needs to sit in the upper echelons of one or more…

  6. On the relevance of q-distribution functions: the return time distribution of restricted random walker

    NASA Astrophysics Data System (ADS)

    Zand, Jaleh; Tirnakli, Ugur; Jeldtoft Jensen, Henrik

    2015-10-01

    There exists a large literature on the application of q-statistics to the out-of-equilibrium non-ergodic systems in which some degree of strong correlations exists. Here we study the distribution of first return times to zero, PR (0, t), of a random walk on the set of integers {0, 1, 2,..., L} with a position dependent transition probability given by {| n/L| }a. We find that for all values of a\\in [0,2] PR(0, t) can be fitted by q-exponentials, but only for a = 1 is PR (0, t) given exactly by a q-exponential in the limit L\\to ∞ . This is a remarkable result since the exact analytical solution of the corresponding continuum model represents PR (0, t) as a sum of Bessel functions with a smooth dependence on a from which we are unable to identify a = 1 as of special significance. However, from the high precision numerical iteration of the discrete master equation, we do verify that only for a = 1 is PR(0, t) exactly a q-exponential and that a tiny departure from this parameter value makes the distribution deviate from q-exponential. Further research is certainly required to identify the reason for this result and also the applicability of q-statistics and its domain.

  7. To Overcome HITS Rank Similarity Confliction of Web Pages using Weight Calculation and Rank Improvement

    NASA Astrophysics Data System (ADS)

    Nath, Rajender; Kumar, Naresh

    2011-12-01

    Search Engine gives an ordered list of web search results in response to a user query, wherein the important pages are usually displayed at the top with less important ones afterwards. It may be possible that the user may have to look for many screen results to get the required documents. In literatures, many page ranking algorithms has been given to find the page rank of a page. For example PageRank is considered in this work. This algorithm treats all the links equally when distributing rank scores. That's why this algorithm some time gives equal importance to all the pages. But in real this can not be happen because, if two pages have same rank then how we can judge which page is more important then other. So this paper proposes another idea to organize the search results and describe which page is more important when confliction of same rank is produced by the PageRank. So that the user can get more relevant and important results easily and in a short span of time.

  8. Motivating online learners using attention, relevance, confidence, satisfaction Motivational Theory and distributed scaffolding.

    PubMed

    Gormley, Denise K; Colella, Christine; Shell, Dustin L

    2012-01-01

    Learning online requires self-regulation, intrinsic motivation, and independence. Building an online classroom environment that fosters the development of these behaviors for students is key to their success. Use of ARCS (attention, relevance, confidence, satisfaction) Motivational Theory and distributed scaffolding can assist faculty in developing intentional support to help the online student achieve learning outcomes. The authors discuss development of teaching strategies in online, distance learning courses that will enhance student motivation and learning outcomes.

  9. Ranking Profiles

    ERIC Educational Resources Information Center

    Van Der Werf, Martin

    2007-01-01

    This article presents the "U.S. News" ranking profiles of four colleges, namely: (1) Smith College; (2) Washington University in St. Louis; (3) Colorado State University at Fort Collins; and (4) Whitman College. Smith College was in the top 10 of the nation's liberal-arts colleges, or just outside it, almost since the "U.S.…

  10. Automated system for kinetic analysis of particle size distributions for pharmaceutically relevant systems.

    PubMed

    Green, John-Bruce D; Carter, Phillip W; Zhang, Yingqing; Patel, Dipa; Kotha, Priyanka; Gonyon, Thomas

    2014-01-01

    Detailing the kinetics of particle formation for pharmaceutically relevant solutions is challenging, especially when considering the combination of formulations, containers, and timescales of clinical importance. This paper describes a method for using commercial software Automate with a stream-selector valve capable of sampling container solutions from within an environmental chamber. The tool was built to monitor changes in particle size distributions via instrumental particle counters but can be adapted to other solution-based sensors. The tool and methodology were demonstrated to be highly effective for measuring dynamic changes in emulsion globule distributions as a function of storage and mixing conditions important for parenteral nutrition. Higher levels of agitation induced the fastest growth of large globules (≥5 μm) while the gentler conditions actually showed a decrease in the number of these large globules. The same methodology recorded calcium phosphate precipitation kinetics as a function of [Ca(2+)] and pH. This automated system is readily adaptable to a wide range of pharmaceutically relevant systems where the particle size is expected to vary with time. This instrumentation can dramatically reduce the time and resources needed to probe complex formulation issues while providing new insights for monitoring the kinetics as a function of key variables.

  11. Automated System for Kinetic Analysis of Particle Size Distributions for Pharmaceutically Relevant Systems

    PubMed Central

    Green, John-Bruce D.; Carter, Phillip W.; Zhang, Yingqing; Patel, Dipa; Kotha, Priyanka

    2014-01-01

    Detailing the kinetics of particle formation for pharmaceutically relevant solutions is challenging, especially when considering the combination of formulations, containers, and timescales of clinical importance. This paper describes a method for using commercial software Automate with a stream-selector valve capable of sampling container solutions from within an environmental chamber. The tool was built to monitor changes in particle size distributions via instrumental particle counters but can be adapted to other solution-based sensors. The tool and methodology were demonstrated to be highly effective for measuring dynamic changes in emulsion globule distributions as a function of storage and mixing conditions important for parenteral nutrition. Higher levels of agitation induced the fastest growth of large globules (≥5 μm) while the gentler conditions actually showed a decrease in the number of these large globules. The same methodology recorded calcium phosphate precipitation kinetics as a function of [Ca2+] and pH. This automated system is readily adaptable to a wide range of pharmaceutically relevant systems where the particle size is expected to vary with time. This instrumentation can dramatically reduce the time and resources needed to probe complex formulation issues while providing new insights for monitoring the kinetics as a function of key variables. PMID:25140276

  12. Exploring the Distribution of Genetic Markers of Pharmacogenomics Relevance in Brazilian and Mexican Populations

    PubMed Central

    Bonifaz-Peña, Vania; Contreras, Alejandra V.; Struchiner, Claudio Jose; Roela, Rosimeire A.; Furuya-Mazzotti, Tatiane K.; Chammas, Roger; Rangel-Escareño, Claudia; Uribe-Figueroa, Laura; Gómez-Vázquez, María José; McLeod, Howard L.; Hidalgo-Miranda, Alfredo

    2014-01-01

    Studies of pharmacogenomics-related traits are increasingly being performed to identify loci that affect either drug response or susceptibility to adverse drug reactions. However, the effect of the polymorphisms can differ in magnitude or be absent depending on the population being assessed. We used the Affymetrix Drug Metabolizing Enzymes and Transporters (DMET) Plus array to characterize the distribution of polymorphisms of pharmacogenetics and pharmacogenomics (PGx) relevance in two samples from the most populous Latin American countries, Brazil and Mexico. The sample from Brazil included 268 individuals from the southeastern state of Rio de Janeiro, and was stratified into census categories. The sample from Mexico comprised 45 Native American Zapotecas and 224 self-identified Mestizo individuals from 5 states located in geographically distant regions in Mexico. We evaluated the admixture proportions in the Brazilian and Mexican samples using a panel of Ancestry Informative Markers extracted from the DMET array, which was validated with genome-wide data. A substantial variation in ancestral proportions across census categories in Brazil, and geographic regions in Mexico was identified. We evaluated the extent of genetic differentiation (measured as FST values) of the genetic markers of the DMET Plus array between the relevant parental populations. Although the average levels of genetic differentiation are low, there is a long tail of markers showing large frequency differences, including markers located in genes belonging to the Cytochrome P450, Solute Carrier (SLC) and UDP-glucuronyltransferase (UGT) families as well as other genes of PGx relevance such as ABCC8, ADH1A, CHST3, PON1, PPARD, PPARG, and VKORC1. We show how differences in admixture history may have an important impact in the distribution of allele and genotype frequencies at the population level. PMID:25419701

  13. Exploring the distribution of genetic markers of pharmacogenomics relevance in Brazilian and Mexican populations.

    PubMed

    Bonifaz-Peña, Vania; Contreras, Alejandra V; Struchiner, Claudio Jose; Roela, Rosimeire A; Furuya-Mazzotti, Tatiane K; Chammas, Roger; Rangel-Escareño, Claudia; Uribe-Figueroa, Laura; Gómez-Vázquez, María José; McLeod, Howard L; Hidalgo-Miranda, Alfredo; Parra, Esteban J; Fernández-López, Juan Carlos; Suarez-Kurtz, Guilherme

    2014-01-01

    Studies of pharmacogenomics-related traits are increasingly being performed to identify loci that affect either drug response or susceptibility to adverse drug reactions. However, the effect of the polymorphisms can differ in magnitude or be absent depending on the population being assessed. We used the Affymetrix Drug Metabolizing Enzymes and Transporters (DMET) Plus array to characterize the distribution of polymorphisms of pharmacogenetics and pharmacogenomics (PGx) relevance in two samples from the most populous Latin American countries, Brazil and Mexico. The sample from Brazil included 268 individuals from the southeastern state of Rio de Janeiro, and was stratified into census categories. The sample from Mexico comprised 45 Native American Zapotecas and 224 self-identified Mestizo individuals from 5 states located in geographically distant regions in Mexico. We evaluated the admixture proportions in the Brazilian and Mexican samples using a panel of Ancestry Informative Markers extracted from the DMET array, which was validated with genome-wide data. A substantial variation in ancestral proportions across census categories in Brazil, and geographic regions in Mexico was identified. We evaluated the extent of genetic differentiation (measured as FST values) of the genetic markers of the DMET Plus array between the relevant parental populations. Although the average levels of genetic differentiation are low, there is a long tail of markers showing large frequency differences, including markers located in genes belonging to the Cytochrome P450, Solute Carrier (SLC) and UDP-glucuronyltransferase (UGT) families as well as other genes of PGx relevance such as ABCC8, ADH1A, CHST3, PON1, PPARD, PPARG, and VKORC1. We show how differences in admixture history may have an important impact in the distribution of allele and genotype frequencies at the population level.

  14. Ranking Information in Networks

    NASA Astrophysics Data System (ADS)

    Eliassi-Rad, Tina; Henderson, Keith

    Given a network, we are interested in ranking sets of nodes that score highest on user-specified criteria. For instance in graphs from bibliographic data (e.g. PubMed), we would like to discover sets of authors with expertise in a wide range of disciplines. We present this ranking task as a Top-K problem; utilize fixed-memory heuristic search; and present performance of both the serial and distributed search algorithms on synthetic and real-world data sets.

  15. Local populations and inaccuracies: Determining the relevant mitochondrial haplotype distributions for North West European cats.

    PubMed

    Wesselink, Monique; Desmyter, Stijn; Kuiper, Irene

    2017-09-01

    Typing of different portions of the feline mitochondrial control region has illustrated pronounced differences in haplotype distributions between cats from the Netherlands and other parts of the world. To gain a better understanding of the haplotype distribution of North West Continental Europe, 605bp of mitochondrial DNA was typed from randomly selected cats from the Netherlands (N=146), Belgium (N=64) and South West Germany (N=128). The genetic differences between these randomly sampled European populations correlate to the geographical distances, with the Dutch and the South West German populations furthest apart and the Belgian population as an intermediate (Fst values 0.01-0.03). Comparison of North West European mainland distributions to published feline mitochondrial haplotype distributions illustrated moderate to large genetic differentiation (Fst values 0.01-0.32). In this comparison, the correlation between geographical and genetic distance was absent, leading to founder effects and human impact on cat population structure and dispersion being considered as important parameters. When an accurate estimation of feline haplotype distribution is required in forensics, care should be taken when deciding whether extrapolating the frequency data from a certain source to a larger area (country/continent) is justified or whether additional typing of local populations is necessary. This may differ from case to case as local frequencies can be relevant, but can also be deceitful. To improve the applicability of forensic feline mitochondrial DNA studies, documentation and publishing of sampling strategies is advised, as is the implementation of measures to help eliminate potentially erroneous haplotypes. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. The Privilege of Ranking: Google Plays Ball.

    ERIC Educational Resources Information Center

    Wiggins, Richard

    2003-01-01

    Discussion of ranking systems used in various settings, including college football and academic admissions, focuses on the Google search engine. Explains the PageRank mathematical formula that scores Web pages by connecting the number of links; limitations, including authenticity and accuracy of ranked Web pages; relevancy; adjusting algorithms;…

  17. LFERs for soil organic carbon-water distribution coefficients (Koc) at environmentally relevant sorbate concentrations.

    PubMed

    Endo, Satoshi; Grathwohl, Peter; Haderlein, Stefan B; Schmidt, Torsten C

    2009-05-01

    Organic carbon-water distribution coefficients, Koc, for organic compounds at environmentally relevant, low sorbate concentrations may substantially differ from those at higher concentrations due to nonlinear sorption to soil organic matter. However, prediction methods for Koc such as linear free energy relationships (LFERs) are currently only available for high sorbate concentrations (i.e., near solubility limits), reflecting the lack of a set of consistent experimental data in an environmentally more relevant concentration range (i.e., orders of magnitude lower than solubilities). In this study, we determined Koc for two model sorbents of soil organic matter, peat and lignite, at sorbate concentrations of 4.3 and 19 mg/kg-organic-carbon, respectively, in batch suspensions. The measured Koc values for organic sorbates (51 for peat, 58 for lignite) of varying sizes and polarities were modeled successfully with polyparameter linear free energy relationships (PP-LFERs). The resulting PP-LFER for peat was significantly different from the PP-LFERs in the literature determined at near aqueous solubility limits of sorbates. The literature PP-LFERs were found to underestimate the measured Koc values for peat at the low concentration by up to 1 order of magnitude. The extent of underestimation highly depends on the sorbate properties and can be explained by differing sorption nonlinearities of the sorbates as predicted by a reported empirical relationship between the nonlinearity in peat and the sorbate dipolarity/polarizability parameter S. Lignite appearsto be a stronger sorbent toward many sorbates than typical soil organic matter irrespective of the concentration range and thus may not be representative for organic matter with regard to the magnitude of Koc. The present study offers the first PP-LFER equation for log Koc in soil organic matter at typical environmental sorbate concentrations.

  18. Multiple graph regularized protein domain ranking

    PubMed Central

    2012-01-01

    Background Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. Results To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. Conclusion The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications. PMID:23157331

  19. Word ranking in a single document by Jensen-Shannon divergence

    NASA Astrophysics Data System (ADS)

    Mehri, Ali; Jamaati, Maryam; Mehri, Hassan

    2015-08-01

    Ranking the words in human written texts, according to their relevance to text context, plays a crucial role in many text mining tasks. Highly relevant words concentrate in some limited areas, while the irrelevant ones have nearly random spatial distribution throughout the text. But in the randomly shuffled version of the text, all word types are distributed at random. The difference between spatial distribution of words in the original version of a text and its shuffled version seems a proper criterion for word relevance ranking. In this procedure, spatial distribution of each word type in the document is defined by box counting method. Then we apply Jensen-Shannon divergence to measure the difference between probability distributions of each word in the original text and its shuffled version. This metric properly distinguishes relevant words from irrelevants without requiring any previous knowledge about text structure.

  20. University Rankings in China

    ERIC Educational Resources Information Center

    Liu, Nian Cai; Liu, Li

    2005-01-01

    Since the mid 1990s of last Century, university rankings have become very popular in China. Six institutions have published such rankings; some of them have also detailed their ranking methodologies. This paper features a general introduction to university ranking in China, and to the methodologies of each ranking discussed. The paper also…

  1. Universal scaling in sports ranking

    NASA Astrophysics Data System (ADS)

    Deng, Weibing; Li, Wei; Cai, Xu; Bulou, Alain; Wang, Qiuping A.

    2012-09-01

    Ranking is a ubiquitous phenomenon in human society. On the web pages of Forbes, one may find all kinds of rankings, such as the world's most powerful people, the world's richest people, the highest-earning tennis players, and so on and so forth. Herewith, we study a specific kind—sports ranking systems in which players' scores and/or prize money are accrued based on their performances in different matches. By investigating 40 data samples which span 12 different sports, we find that the distributions of scores and/or prize money follow universal power laws, with exponents nearly identical for most sports. In order to understand the origin of this universal scaling we focus on the tennis ranking systems. By checking the data we find that, for any pair of players, the probability that the higher-ranked player tops the lower-ranked opponent is proportional to the rank difference between the pair. Such a dependence can be well fitted to a sigmoidal function. By using this feature, we propose a simple toy model which can simulate the competition of players in different matches. The simulations yield results consistent with the empirical findings. Extensive simulation studies indicate that the model is quite robust with respect to the modifications of some parameters.

  2. On the Relevancy of Efficient, Integrated Computer and Network Monitoring in HEP Distributed Online Environment

    NASA Astrophysics Data System (ADS)

    Carvalho, D.; Gavillet, Ph.; Delgado, V.; Albert, J. N.; Bellas, N.; Javello, J.; Miere, Y.; Ruffinoni, D.; Smith, G.

    Large Scientific Equipments are controlled by Computer Systems whose complexity is growing driven, on the one hand by the volume and variety of the information, its distributed nature, the sophistication of its treatment and, on the other hand by the fast evolution of the computer and network market. Some people call them genetically Large-Scale Distributed Data Intensive Information Systems or Distributed Computer Control Systems (DCCS) for those systems dealing more with real time control. Taking advantage of (or forced by) the distributed architecture, the tasks are more and more often implemented as Client-Server applications. In this framework the monitoring of the computer nodes, the communications network and the applications becomes of primary importance for ensuring the safe running and guaranteed performance of the system. With the future generation of HEP experiments, such as those at the LHC in view, it is proposed to integrate the various functions of DCCS monitoring into one general purpose Multi-layer System.

  3. Frequency-Range Distribution of Boulders Around Cone Crater: Relevance to Landing Site Hazard Avoidance

    NASA Technical Reports Server (NTRS)

    Clegg-Watkins, R. N.; Jolliff, B. L.; Lawrence, S. J.

    2016-01-01

    Boulders represent a landing hazard that must be addressed in the planning of future landings on the Moon. A boulder under a landing leg can contribute to deck tilt and boulders can damage spacecraft during landing. Using orbital data to characterize boulder populations at locations where landers have safely touched down (Apollo, Luna, Surveyor, and Chang'e-3 sites) is important for determining landing hazard criteria for future missions. Additionally, assessing the distribution of boulders can address broader science issues, e.g., how far craters distribute boulders and how this distribution varies as a function of crater size and age. The availability of new Lunar Reconnaissance Orbiter Camera (LROC) Narrow Angle Camera (NAC) images [1] enables the use of boulder size- and range frequency distributions for a variety of purposes [2-6]. Boulders degrade over time and primarily occur around young or fresh craters that are large enough to excavate bedrock. Here we use NAC images to analyze boulder distributions around Cone crater (340 m diameter) at the Apollo 14 site. Cone crater (CC) was selected because it is the largest crater where astronaut surface photography is available for a radial traverse to the rim. Cone crater is young (approximately 29 Ma [7]) relative to the time required to break down boulders [3,8], giving us a data point for boulder range-frequency distributions (BRFDs) as a function of crater age.

  4. Topical distribution of initial paresis of the limbs to predict clinically relevant spasticity after ischemic stroke: a retrospective cohort study.

    PubMed

    Picelli, A; Tamburin, S; Dambruoso, F; Midiri, A; Girardi, P; Santamato, A; Fiore, P; Smania, N

    2014-10-01

    The degree of initial paresis relates to spasticity development in stroke patients. However, the importance of proximal and distal paresis in predicting spasticity after stroke is unclear. To investigate the role of topical distribution of initial limb paresis to predict clinically relevant spasticity in adults with stroke. Retrospective cohort study Seventy-two first-ever ischemic stroke patients were examined. At the acute phase of illness, demographics and the European Stroke Scale motor items (maintenance of outstretched arm position, arm raising, wrist extension, grip strength, maintenance of outstretched leg position, leg flexion, foot dorsiflexion) were evaluated. At six months after the stroke onset, spasticity was assessed at the upper and lower limb with the modified Ashworth Scale. Clinically relevant spasticity was defined as modified Ashworth Scale ≥3 (0-5). The degree of initial paresis of the proximal muscles of the upper limb and the distal muscles of the lower limb showed the strongest association and the best profile of sensitivity-specificity in predicting clinically relevant spasticity at the upper and lower limb, respectively. Younger age showed higher risk for developing clinically relevant spasticity in the upper limb. Our findings support the hypothesis that the initial degree of proximal paresis of the upper limb and distal paresis of the lower limb as well as age may be considered early predictors of clinically relevant spasticity in adults with ischemic stroke. Our findings further improve the role of initial paresis as predictor of spasticity after stroke.

  5. Ranking species in mutualistic networks.

    PubMed

    Domínguez-García, Virginia; Muñoz, Miguel A

    2015-02-02

    Understanding the architectural subtleties of ecological networks, believed to confer them enhanced stability and robustness, is a subject of outmost relevance. Mutualistic interactions have been profusely studied and their corresponding bipartite networks, such as plant-pollinator networks, have been reported to exhibit a characteristic "nested" structure. Assessing the importance of any given species in mutualistic networks is a key task when evaluating extinction risks and possible cascade effects. Inspired in a recently introduced algorithm--similar in spirit to Google's PageRank but with a built-in non-linearity--here we propose a method which--by exploiting their nested architecture--allows us to derive a sound ranking of species importance in mutualistic networks. This method clearly outperforms other existing ranking schemes and can become very useful for ecosystem management and biodiversity preservation, where decisions on what aspects of ecosystems to explicitly protect need to be made.

  6. Ranking species in mutualistic networks

    PubMed Central

    Domínguez-García, Virginia; Muñoz, Miguel A.

    2015-01-01

    Understanding the architectural subtleties of ecological networks, believed to confer them enhanced stability and robustness, is a subject of outmost relevance. Mutualistic interactions have been profusely studied and their corresponding bipartite networks, such as plant-pollinator networks, have been reported to exhibit a characteristic “nested” structure. Assessing the importance of any given species in mutualistic networks is a key task when evaluating extinction risks and possible cascade effects. Inspired in a recently introduced algorithm –similar in spirit to Google's PageRank but with a built-in non-linearity– here we propose a method which –by exploiting their nested architecture– allows us to derive a sound ranking of species importance in mutualistic networks. This method clearly outperforms other existing ranking schemes and can become very useful for ecosystem management and biodiversity preservation, where decisions on what aspects of ecosystems to explicitly protect need to be made. PMID:25640575

  7. Measurement of Curie temperature distribution relevant to heat assisted magnetic recording

    NASA Astrophysics Data System (ADS)

    Chernyshov, Alex; Le, Thanh; Livshitz, Boris; Mryasov, Oleg; Miller, Charles; Acharya, Ram; Treves, David

    2015-05-01

    Heat-Assisted Magnetic Recording (HAMR) is a likely successor of Perpendicular Magnetic Recording (PMR) in the Hard disk drive industry. In PMR, recording performance is strongly affected by the following distributions in magnetic granular media: magnetic anisotropy field (HK), volume/grain size, and interaction field from neighboring grains. Since HAMR writing occurs in a narrow temperature region below Curie point (Tc), additional grain-to-grain Tc variation would strongly affect HAMR recording performance. Thus, Tc distribution should be examined for successful HAMR media development. In this paper, we demonstrate a new approach of extracting HK and Tc distributions (σHK and σTc) from thermo-remanence measurements. During the measurement process, a thin film is magnetically saturated, laser heated to specific peak temperature (for a time typically of 5 μs), then cooled to room temperature and magnetic thermo-remanence is measured. Analytical fit to the experimental curves enables independent evaluation of both σTc (±0.5% absolute) and σHK (±2% absolute). Parameters of the analytical statistical model include: temperature dependencies Ms(T), HK(T); mean field effective demagnetization factor N; grain size, HK; and Tc distributions. Thermal fluctuations are taken into account using Arrhenius-Neel formalism. Here, we report experimental σTc values as a function of grain volume. Increase of σTc with grain size reduction might be a limiting factor for HAMR extendibility.

  8. Tau Pathology Distribution in Alzheimer's disease Corresponds Differentially to Cognition-Relevant Functional Brain Networks

    PubMed Central

    Hansson, Oskar; Grothe, Michel J.; Strandberg, Tor Olof; Ohlsson, Tomas; Hägerström, Douglas; Jögi, Jonas; Smith, Ruben; Schöll, Michael

    2017-01-01

    Neuropathological studies have shown that the typical neurofibrillary pathology of hyperphosphorylated tau protein in Alzheimer's disease (AD) preferentially affects specific brain regions whereas others remain relatively spared. It has been suggested that the distinct regional distribution profile of tau pathology in AD may be a consequence of the intrinsic network structure of the human brain. The spatially distributed brain regions that are most affected by the spread of tau pathology may hence reflect an interconnected neuronal system. Here, we characterized the brain-wide regional distribution profile of tau pathology in AD using 18F-AV 1451 tau-sensitive positron emission tomography (PET) imaging, and studied this pattern in relation to the functional network organization of the human brain. Specifically, we quantified the spatial correspondence of the regional distribution pattern of PET-evidenced tau pathology in AD with functional brain networks characterized by large-scale resting state functional magnetic resonance imaging (rs-fMRI) data in healthy subjects. Regional distribution patterns of increased PET-evidenced tau pathology in AD compared to controls were characterized in two independent samples of prodromal and manifest AD cases (the Swedish BioFINDER study, n = 44; the ADNI study, n = 35). In the BioFINDER study we found that the typical AD tau pattern involved predominantly inferior, medial, and lateral temporal cortical areas, as well as the precuneus/posterior cingulate, and lateral parts of the parietal and occipital cortex. This pattern overlapped primarily with the dorsal attention, and to some extent with higher visual, limbic and parts of the default-mode network. PET-evidenced tau pathology in the ADNI replication sample, which represented a more prodromal group of AD cases, was less pronounced but showed a highly similar spatial distribution profile, suggesting an earlier-stage snapshot of a consistently progressing regional pattern. In

  9. Tau Pathology Distribution in Alzheimer's disease Corresponds Differentially to Cognition-Relevant Functional Brain Networks.

    PubMed

    Hansson, Oskar; Grothe, Michel J; Strandberg, Tor Olof; Ohlsson, Tomas; Hägerström, Douglas; Jögi, Jonas; Smith, Ruben; Schöll, Michael

    2017-01-01

    Neuropathological studies have shown that the typical neurofibrillary pathology of hyperphosphorylated tau protein in Alzheimer's disease (AD) preferentially affects specific brain regions whereas others remain relatively spared. It has been suggested that the distinct regional distribution profile of tau pathology in AD may be a consequence of the intrinsic network structure of the human brain. The spatially distributed brain regions that are most affected by the spread of tau pathology may hence reflect an interconnected neuronal system. Here, we characterized the brain-wide regional distribution profile of tau pathology in AD using (18)F-AV 1451 tau-sensitive positron emission tomography (PET) imaging, and studied this pattern in relation to the functional network organization of the human brain. Specifically, we quantified the spatial correspondence of the regional distribution pattern of PET-evidenced tau pathology in AD with functional brain networks characterized by large-scale resting state functional magnetic resonance imaging (rs-fMRI) data in healthy subjects. Regional distribution patterns of increased PET-evidenced tau pathology in AD compared to controls were characterized in two independent samples of prodromal and manifest AD cases (the Swedish BioFINDER study, n = 44; the ADNI study, n = 35). In the BioFINDER study we found that the typical AD tau pattern involved predominantly inferior, medial, and lateral temporal cortical areas, as well as the precuneus/posterior cingulate, and lateral parts of the parietal and occipital cortex. This pattern overlapped primarily with the dorsal attention, and to some extent with higher visual, limbic and parts of the default-mode network. PET-evidenced tau pathology in the ADNI replication sample, which represented a more prodromal group of AD cases, was less pronounced but showed a highly similar spatial distribution profile, suggesting an earlier-stage snapshot of a consistently progressing regional pattern

  10. The rank product method with two samples.

    PubMed

    Koziol, James A

    2010-11-05

    Breitling et al. (2004) introduced a statistical technique, the rank product method, for detecting differentially regulated genes in replicated microarray experiments. The technique has achieved widespread acceptance and is now used more broadly, in such diverse fields as RNAi analysis, proteomics, and machine learning. In this note, we extend the rank product method to the two sample setting, provide distribution theory attending the rank product method in this setting, and give numerical details for implementing the method.

  11. Rank 4 Premodular Categories

    SciTech Connect

    Bruillard, Paul J.; Galindo, Cesar; Ng, Siu Hung; Plavnik, Julia; Rowell, Eric; Wang, Zhenghan

    2016-09-01

    We consider the classification problem for rank 4 premodular categories. We uncover a formula for the 2nd Frobenius-Schur indicator of a premodular category is determined and the classification of rank 4 premodular categories (up to Grothendieck equivalence) is completed. In the appendix we show rank finiteness for premodular categories.

  12. PageRank and rank-reversal dependence on the damping factor

    NASA Astrophysics Data System (ADS)

    Son, S.-W.; Christensen, C.; Grassberger, P.; Paczuski, M.

    2012-12-01

    PageRank (PR) is an algorithm originally developed by Google to evaluate the importance of web pages. Considering how deeply rooted Google's PR algorithm is to gathering relevant information or to the success of modern businesses, the question of rank stability and choice of the damping factor (a parameter in the algorithm) is clearly important. We investigate PR as a function of the damping factor d on a network obtained from a domain of the World Wide Web, finding that rank reversal happens frequently over a broad range of PR (and of d). We use three different correlation measures, Pearson, Spearman, and Kendall, to study rank reversal as d changes, and we show that the correlation of PR vectors drops rapidly as d changes from its frequently cited value, d0=0.85. Rank reversal is also observed by measuring the Spearman and Kendall rank correlation, which evaluate relative ranks rather than absolute PR. Rank reversal happens not only in directed networks containing rank sinks but also in a single strongly connected component, which by definition does not contain any sinks. We relate rank reversals to rank pockets and bottlenecks in the directed network structure. For the network studied, the relative rank is more stable by our measures around d=0.65 than at d=d0.

  13. Growth kinetics of coliform bacteria under conditions relevant to drinking water distribution systems.

    PubMed

    Camper, A K; McFeters, G A; Characklis, W G; Jones, W L

    1991-08-01

    The growth of environmental and clinical coliform bacteria under conditions typical of drinking water distribution systems was examined. Four coliforms (Klebsiella pneumoniae, Escherichia coli, Enterobacter aerogenes, and Enterobacter cloacae) were isolated from an operating drinking water system for study; an enterotoxigenic E. coli strain and clinical isolates of K. pneumoniae and E. coli were also used. All but one of the coliforms tested were capable of growth in unsupplemented mineral salts medium; the environmental isolates had greater specific growth rates than did the clinical isolates. This trend was maintained when the organisms were grown with low levels (less than 1 mg liter-1) of yeast extract. The environmental K. pneumoniae isolate had a greater yield, higher specific growth rates, and a lower Ks value than the other organisms. The environmental E. coli and the enterotoxigenic E. coli strains had comparable yield, growth rate, and Ks values to those of the environmental K. pneumoniae strain, and all three showed significantly more successful growth than the clinical isolates. The environmental coliforms also grew well at low temperatures on low concentrations of yeast extract. Unsupplemented distribution water from the collaborating utility supported the growth of the environmental isolates. Growth of the K. pneumoniae water isolate was stimulated by the addition of autoclaved biofilm but not by tubercle material. These findings indicate that growth of environmental coliforms is possible under the conditions found in operating municipal drinking water systems and that these bacteria could be used in tests to determine assimilable organic carbon in potable water.

  14. Estimation of Spatially Distributed Evapotranspiration Using Remote Sensing and a Relevance Vector Machine

    NASA Astrophysics Data System (ADS)

    Maslova, I.; Bachour, R.; Walker, W. R.; Ticlavilca, A. M.; McKee, M.

    2014-12-01

    With the development of surface energy balance analyses, remote sensing has become a spatially explicit and quantitative methodology for understanding evapotranspiration (ET), a critical requirement for water resources planning and management. Limited temporal resolution of satellite images and cloudy skies present major limitations that impede continuous estimates of ET. This study introduces a practical approach that overcomes (in part) the previous limitations by implementing machine learning techniques that are accurate and robust. The analysis was applied to the Canal B service area of the Delta Canal Company in central Utah using data from the 2009-2011 growing seasons. Actual ET was calculated by an algorithm using data from satellite images. A relevance vector machine (RVM), which is a sparse Bayesian regression, was used to build a spatial model for ET. The RVM was trained with a set of inputs consisting of vegetation indexes, crops, and weather data. ET estimated via the algorithm was used as an output. The developed RVM model provided an accurate estimation of spatial ET based on a Nash-Sutcliffe coefficient (E) of 0.84 and a root-mean-squared error (RMSE) of 0.5 mmday-1. This methodology lays the groundwork for estimating ET at a spatial scale for the days when a satellite image is not available. It could also be used to forecast daily spatial ET if the vegetation indexes model inputs are extrapolated in time and the reference ET is forecasted accurately.

  15. Divergence of Acoustic Signals in a Widely Distributed Frog: Relevance of Inter-Male Interactions

    PubMed Central

    Velásquez, Nelson A.; Opazo, Daniel; Díaz, Javier; Penna, Mario

    2014-01-01

    Divergence of acoustic signals in a geographic scale results from diverse evolutionary forces acting in parallel and affecting directly inter-male vocal interactions among disjunct populations. Pleurodema thaul is a frog having an extensive latitudinal distribution in Chile along which males' advertisement calls exhibit an important variation. Using the playback paradigm we studied the evoked vocal responses of males of three populations of P. thaul in Chile, from northern, central and southern distribution. In each population, males were stimulated with standard synthetic calls having the acoustic structure of local and foreign populations. Males of both northern and central populations displayed strong vocal responses when were confronted with the synthetic call of their own populations, giving weaker responses to the call of the southern population. The southern population gave stronger responses to calls of the northern population than to the local call. Furthermore, males in all populations were stimulated with synthetic calls for which the dominant frequency, pulse rate and modulation depth were varied parametrically. Individuals from the northern and central populations gave lower responses to a synthetic call devoid of amplitude modulation relative to stimuli containing modulation depths between 30–100%, whereas the southern population responded similarly to all stimuli in this series. Geographic variation in the evoked vocal responses of males of P. thaul underlines the importance of inter-male interactions in driving the divergence of the acoustic traits and contributes evidence for a role of intra-sexual selection in the evolution of the sound communication system of this anuran. PMID:24489957

  16. Enhanced chlorine dioxide decay in the presence of metal oxides: relevance to drinking water distribution systems.

    PubMed

    Liu, Chao; von Gunten, Urs; Croué, Jean-Philippe

    2013-08-06

    Chlorine dioxide (ClO2) decay in the presence of typical metal oxides occurring in distribution systems was investigated. Metal oxides generally enhanced ClO2 decay in a second-order process via three pathways: (1) catalytic disproportionation with equimolar formation of chlorite and chlorate, (2) reaction to chlorite and oxygen, and (3) oxidation of a metal in a reduced form (e.g., cuprous oxide) to a higher oxidation state. Cupric oxide (CuO) and nickel oxide (NiO) showed significantly stronger abilities than goethite (α-FeOOH) to catalyze the ClO2 disproportionation (pathway 1), which predominated at higher initial ClO2 concentrations (56-81 μM). At lower initial ClO2 concentrations (13-31 μM), pathway 2 also contributed. The CuO-enhanced ClO2 decay is a base-assisted reaction with a third-order rate constant of 1.5 × 10(6) M(-2) s(-1) in the presence of 0.1 g L(-1) CuO at 21 ± 1 °C, which is 4-5 orders of magnitude higher than in the absence of CuO. The presence of natural organic matter (NOM) significantly enhanced the formation of chlorite and decreased the ClO2 disproportionation in the CuO-ClO2 system, probably because of a higher reactivity of CuO-activated ClO2 with NOM. Furthermore, a kinetic model was developed to simulate CuO-enhanced ClO2 decay at various pH values. Model simulations that agree well with the experimental data include a pre-equilibrium step with the rapid formation of a complex, namely, CuO-activated Cl2O4. The reaction of this complex with OH(-) is the rate-limiting and pH-dependent step for the overall reaction, producing chlorite and an intermediate that further forms chlorate and oxygen in parallel. These novel findings suggest that the possible ClO2 loss and the formation of chlorite/chlorate should be carefully considered in drinking water distribution systems containing copper pipes.

  17. Stable isotope distribution in precipitation in Romania and its relevance for palaeoclimatic studies

    NASA Astrophysics Data System (ADS)

    Perşoiu, Aurel; Nagavciuc, Viorica; Bădăluţă, Carmen

    2014-05-01

    A surge of recent studies in Romania have targeted various aspects of palaeoclimate (based on stable isotopes in ice, speleothems, tree rings), mineral water origin, wine and other juices provenance. However, while much needed, these studies lack a stable isotope in precipitation background, with only two LMWL's being published so far. In this paper we discuss the links between the stable isotopic composition of precipitation (δ18O and δ2H), climate (air temperature, precipitation amount and large scale circulation) and their relevance for the palaeocllimatic interpretation of stable isotope values in cave ice, cryogenic calcite and tree rings from different sites in Romania. Most of the precipitation in Romania is delivered by the Westerlies, bringing moisture from the North Atlantic; however, their influence is greatly reduced in the eastern half of the country where local evaporative sources play an important role in the precipitation balance. The SW is dominated by water masses from the Mediterranean Sea, while the SE corner clearly draws most of the moisture from the Black Sea and strongly depleted North Atlantic vapor masses. In 2012, Romania experienced the worst draught in 60 years, possibly due to a northward shift of the jest stream associated to blocking conditions in summer, which led to a more northern penetration of the Mediterranean-derived air masses, as well increased precipitation of re-evaporated waters. We have further analyzed cave drip water (δ18O and δ2H), cryogenic cave calcite (δ18O and δ13C) and tree rings (δ18O and δ13C) from selected sites across NW Romania, where the water isotopes in precipitation showed the best (and easiest to understand, given the climatic conditions in 2012) correlation with climatic parameters. Our results that 1) δ18O and δ2H in cave ice are a good proxy for late summer through early winter air temperature; 2) δ13C in cryogenic cave calcite are possible indicators of soil humidity and 3) δ18O in pine

  18. Cellular and tissue distribution of potassium: physiological relevance, mechanisms and regulation.

    PubMed

    Ahmad, Izhar; Maathuis, Frans J M

    2014-05-15

    Potassium (K(+)) is the most important cationic nutrient for all living organisms. Its cellular levels are significant (typically around 100mM) and are highly regulated. In plants K(+) affects multiple aspects such as growth, tolerance to biotic and abiotic stress and movement of plant organs. These processes occur at the cell, organ and whole plant level and not surprisingly, plants have evolved sophisticated mechanisms for the uptake, efflux and distribution of K(+) both within cells and between organs. Great progress has been made in the last decades regarding the molecular mechanisms of K(+) uptake and efflux, particularly at the cellular level. For long distance K(+) transport our knowledge is less complete but the principles behind the overall processes are largely understood. In this chapter we will discuss how both long distance transport between different organs and intracellular transport between organelles works in general and in particular for K(+). Where possible, we will provide examples of specific genes and proteins that are responsible for these phenomena.

  19. Spatial distribution of dissolved cadmium in the Jiulong river-estuary system: Relevance of anthropogenic perturbation

    NASA Astrophysics Data System (ADS)

    Wang, Deli; Yang, Xiqian; Zhai, Weidong; Li, Yan; Hong, Huasheng

    2015-12-01

    This study first examined the spatial distribution of dissolved cadmium (Cd) along with other hydrochemical parameters in a large subtropical river estuary system (the Jiulong River-Estuary, China) between 2008 and 2010, aiming to evaluate the impacts of the recently increasing anthropogenic perturbation in natural waters. The results showed that dissolved Cd was variable in the watershed with sporadically high concentrations (>0.6 nmol L-1). The significantly positive correlation of dissolved Cd with phosphate in the watershed (May 2008: dissolved Cd=0.22*P+0.0062, r=0.64, p<0.05) indicated that dissolved Cd levels have been elevated along with P by the increasing agricultural discharges and/or sewage effluents. The estuary was characterized with decreased levels of dissolved Cd in the highly turbid upper part (salinity: <5; dissolved Cd: <0.1 nmol L-1; Total Suspended Matter: 100-300 mg/L), and a mid-salinity maximum of dissolved Cd in the middle part, which were higher in Summer high river discharge period (0.40-0.54 nmol L-1) than in Fall low river discharge period (0.25-0.35 nmol L-1). Dissolved Cd generally decreased outwards in the lower estuary and nearby coastal waters as mixed with the low Cd-content seawater offshore (dissolved Cd= -0.025*Salinity+0.96, r=0.60, p<0.05). In particular, an enhancement of dissolved Cd (by ~0.2 nmol L-1) was observed in the lower estuary and estuarine plume zone as a result of sewage discharges nearby and/or Cd-enriched submarine groundwater discharges. Summarily, our exemplary study provides clear evidence that China's natural waters are currently subject to local perturbation due to the recently increasing anthropogenic activities.

  20. An aggregate analysis of personal care products in the environment: Identifying the distribution of environmentally-relevant concentrations.

    PubMed

    Hopkins, Zachary R; Blaney, Lee

    2016-01-01

    Over the past 3-4 decades, per capita consumption of personal care products (PCPs) has steadily risen, resulting in increased discharge of the active and inactive ingredients present in these products into wastewater collection systems. PCPs comprise a long list of compounds employed in toothpaste, sunscreen, lotions, soaps, body washes, and insect repellants, among others. While comprehensive toxicological studies are not yet available, an increasing body of literature has shown that PCPs of all classes can impact aquatic wildlife, bacteria, and/or mammalian cells at low concentrations. Ongoing research efforts have identified PCPs in a variety of environmental compartments, including raw wastewater, wastewater effluent, surface water, wastewater solids, sediment, groundwater, and drinking water. Here, an aggregate analysis of over 5000 reported detections was conducted to better understand the distribution of environmentally-relevant PCP concentrations in, and between, these compartments. The distributions were used to identify whether aggregated environmentally-relevant concentration ranges intersected with available toxicity data. For raw wastewater, wastewater effluent, and surface water, a clear overlap was present between the 25th-75th percentiles and identified toxicity levels. This analysis suggests that improved wastewater treatment of antimicrobials, UV filters, and polycyclic musks is required to prevent negative impacts on aquatic species. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Functional Multiplex PageRank

    NASA Astrophysics Data System (ADS)

    Iacovacci, Jacopo; Rahmede, Christoph; Arenas, Alex; Bianconi, Ginestra

    2016-10-01

    Recently it has been recognized that many complex social, technological and biological networks have a multilayer nature and can be described by multiplex networks. Multiplex networks are formed by a set of nodes connected by links having different connotations forming the different layers of the multiplex. Characterizing the centrality of the nodes in a multiplex network is a challenging task since the centrality of the node naturally depends on the importance associated to links of a certain type. Here we propose to assign to each node of a multiplex network a centrality called Functional Multiplex PageRank that is a function of the weights given to every different pattern of connections (multilinks) existent in the multiplex network between any two nodes. Since multilinks distinguish all the possible ways in which the links in different layers can overlap, the Functional Multiplex PageRank can describe important non-linear effects when large relevance or small relevance is assigned to multilinks with overlap. Here we apply the Functional Page Rank to the multiplex airport networks, to the neuronal network of the nematode C. elegans, and to social collaboration and citation networks between scientists. This analysis reveals important differences existing between the most central nodes of these networks, and the correlations between their so-called pattern to success.

  2. Centrality based Document Ranking

    DTIC Science & Technology

    2014-11-01

    approach. We model the documents to be ranked as nodes in a graph and place edges between documents based on their similarity. Given a query, we compute...similarity of the query with respect to every document in the graph . Based on these similarity values, documents are ranked for a given query...clinical documents using centrality based approach. We model the documents to be ranked as nodes in a graph and place edges between documents based on their

  3. Let your users do the ranking.

    SciTech Connect

    Spomer, Judith E.

    2010-12-01

    Ranking search results is a thorny issue for enterprise search. Search engines rank results using a variety of sophisticated algorithms, but users still complain that search can't ever seem to find anything useful or relevant! The challenge is to provide results that are ranked according to the users' definition of relevancy. Sandia National Laboratories has enhanced its commercial search engine to discover user preferences, re-ranking results accordingly. Immediate positive impact was achieved by modeling historical data consisting of user queries and subsequent result clicks. New data is incorporated into the model daily. An important benefit is that results improve naturally and automatically over time as a function of user actions. This session presents the method employed, how it was integrated with the search engine,metrics illustrating the subsequent improvement to the users' search experience, and plans for implementation with Sandia's FAST for SharePoint 2010 search engine.

  4. A Universal Rank-Size Law

    PubMed Central

    2016-01-01

    A mere hyperbolic law, like the Zipf’s law power function, is often inadequate to describe rank-size relationships. An alternative theoretical distribution is proposed based on theoretical physics arguments starting from the Yule-Simon distribution. A modeling is proposed leading to a universal form. A theoretical suggestion for the “best (or optimal) distribution”, is provided through an entropy argument. The ranking of areas through the number of cities in various countries and some sport competition ranking serves for the present illustrations. PMID:27812192

  5. On Rank and Nullity

    ERIC Educational Resources Information Center

    Dobbs, David E.

    2012-01-01

    This note explains how Emil Artin's proof that row rank equals column rank for a matrix with entries in a field leads naturally to the formula for the nullity of a matrix and also to an algorithm for solving any system of linear equations in any number of variables. This material could be used in any course on matrix theory or linear algebra.

  6. Memory Efficient Ranking.

    ERIC Educational Resources Information Center

    Moffat, Alistair; And Others

    1994-01-01

    Describes an approximate document ranking process that uses a compact array of in-memory, low-precision approximations for document length. Combined with another rule for reducing the memory required by partial similarity accumulators, the approximation heuristic allows the ranking of large document collections using less than one byte of memory…

  7. On Rank and Nullity

    ERIC Educational Resources Information Center

    Dobbs, David E.

    2012-01-01

    This note explains how Emil Artin's proof that row rank equals column rank for a matrix with entries in a field leads naturally to the formula for the nullity of a matrix and also to an algorithm for solving any system of linear equations in any number of variables. This material could be used in any course on matrix theory or linear algebra.

  8. Hierarchical partial order ranking.

    PubMed

    Carlsen, Lars

    2008-09-01

    Assessing the potential impact on environmental and human health from the production and use of chemicals or from polluted sites involves a multi-criteria evaluation scheme. A priori several parameters are to address, e.g., production tonnage, specific release scenarios, geographical and site-specific factors in addition to various substance dependent parameters. Further socio-economic factors may be taken into consideration. The number of parameters to be included may well appear to be prohibitive for developing a sensible model. The study introduces hierarchical partial order ranking (HPOR) that remedies this problem. By HPOR the original parameters are initially grouped based on their mutual connection and a set of meta-descriptors is derived representing the ranking corresponding to the single groups of descriptors, respectively. A second partial order ranking is carried out based on the meta-descriptors, the final ranking being disclosed though average ranks. An illustrative example on the prioritization of polluted sites is given.

  9. Comparing classical and quantum PageRanks

    NASA Astrophysics Data System (ADS)

    Loke, T.; Tang, J. W.; Rodriguez, J.; Small, M.; Wang, J. B.

    2017-01-01

    Following recent developments in quantum PageRanking, we present a comparative analysis of discrete-time and continuous-time quantum-walk-based PageRank algorithms. Relative to classical PageRank and to different extents, the quantum measures better highlight secondary hubs and resolve ranking degeneracy among peripheral nodes for all networks we studied in this paper. For the discrete-time case, we investigated the periodic nature of the walker's probability distribution for a wide range of networks and found that the dominant period does not grow with the size of these networks. Based on this observation, we introduce a new quantum measure using the maximum probabilities of the associated walker during the first couple of periods. This is particularly important, since it leads to a quantum PageRanking scheme that is scalable with respect to network size.

  10. A Markov chain model for image ranking system in social networks

    NASA Astrophysics Data System (ADS)

    Zin, Thi Thi; Tin, Pyke; Toriu, Takashi; Hama, Hiromitsu

    2014-03-01

    In today world, different kinds of networks such as social, technological, business and etc. exist. All of the networks are similar in terms of distributions, continuously growing and expanding in large scale. Among them, many social networks such as Facebook, Twitter, Flickr and many others provides a powerful abstraction of the structure and dynamics of diverse kinds of inter personal connection and interaction. Generally, the social network contents are created and consumed by the influences of all different social navigation paths that lead to the contents. Therefore, identifying important and user relevant refined structures such as visual information or communities become major factors in modern decision making world. Moreover, the traditional method of information ranking systems cannot be successful due to their lack of taking into account the properties of navigation paths driven by social connections. In this paper, we propose a novel image ranking system in social networks by using the social data relational graphs from social media platform jointly with visual data to improve the relevance between returned images and user intentions (i.e., social relevance). Specifically, we propose a Markov chain based Social-Visual Ranking algorithm by taking social relevance into account. By using some extensive experiments, we demonstrated the significant and effectiveness of the proposed social-visual ranking method.

  11. Expected value based ranking of intuitionistic fuzzy variables

    NASA Astrophysics Data System (ADS)

    Kumar, Tanuj; Bajaj, Rakesh Kumar; Kaushik, Rajeev

    2017-07-01

    In the present paper, we introduce the idea of intuitionistic fuzzy variables by means of credibility theory. The mean value of intuitionistic fuzzy variables is obtained with the help of credibility distribution. Then, we develop a more comprehensive ranking method based on mean value of intuitionistic fuzzy variable. Also, we analyze the consistency of the propose ranking method with existing ranking methods.

  12. Recurrent fuzzy ranking methods

    NASA Astrophysics Data System (ADS)

    Hajjari, Tayebeh

    2012-11-01

    With the increasing development of fuzzy set theory in various scientific fields and the need to compare fuzzy numbers in different areas. Therefore, Ranking of fuzzy numbers plays a very important role in linguistic decision-making, engineering, business and some other fuzzy application systems. Several strategies have been proposed for ranking of fuzzy numbers. Each of these techniques has been shown to produce non-intuitive results in certain case. In this paper, we reviewed some recent ranking methods, which will be useful for the researchers who are interested in this area.

  13. On the Number of Ranked Species Trees Producing Anomalous Ranked Gene Trees.

    PubMed

    Disanto, Filippo; Rosenberg, Noah A

    2014-01-01

    Analysis of probability distributions conditional on species trees has demonstrated the existence of anomalous ranked gene trees (ARGTs), ranked gene trees that are more probable than the ranked gene tree that accords with the ranked species tree. Here, to improve the characterization of ARGTs, we study enumerative and probabilistic properties of two classes of ranked labeled species trees, focusing on the presence or avoidance of certain subtree patterns associated with the production of ARGTs. We provide exact enumerations and asymptotic estimates for cardinalities of these sets of trees, showing that as the number of species increases without bound, the fraction of all ranked labeled species trees that are ARGT-producing approaches 1. This result extends beyond earlier existence results to provide a probabilistic claim about the frequency of ARGTs.

  14. Comments on the rank product method for analyzing replicated experiments.

    PubMed

    Koziol, James A

    2010-03-05

    Breitling et al. introduced a statistical technique, the rank product method, for detecting differentially regulated genes in replicated microarray experiments. The technique has achieved widespread acceptance and is now used more broadly, in such diverse fields as RNAi analysis, proteomics, and machine learning. In this note, we relate the rank product method to linear rank statistics and provide an alternative derivation of distribution theory attending the rank product method.

  15. Adiabatic Quantum Algorithm for Search Engine Ranking

    NASA Astrophysics Data System (ADS)

    Garnerone, Silvano; Zanardi, Paolo; Lidar, Daniel A.

    2012-06-01

    We propose an adiabatic quantum algorithm for generating a quantum pure state encoding of the PageRank vector, the most widely used tool in ranking the relative importance of internet pages. We present extensive numerical simulations which provide evidence that this algorithm can prepare the quantum PageRank state in a time which, on average, scales polylogarithmically in the number of web pages. We argue that the main topological feature of the underlying web graph allowing for such a scaling is the out-degree distribution. The top-ranked log⁡(n) entries of the quantum PageRank state can then be estimated with a polynomial quantum speed-up. Moreover, the quantum PageRank state can be used in “q-sampling” protocols for testing properties of distributions, which require exponentially fewer measurements than all classical schemes designed for the same task. This can be used to decide whether to run a classical update of the PageRank.

  16. Adiabatic quantum algorithm for search engine ranking.

    PubMed

    Garnerone, Silvano; Zanardi, Paolo; Lidar, Daniel A

    2012-06-08

    We propose an adiabatic quantum algorithm for generating a quantum pure state encoding of the PageRank vector, the most widely used tool in ranking the relative importance of internet pages. We present extensive numerical simulations which provide evidence that this algorithm can prepare the quantum PageRank state in a time which, on average, scales polylogarithmically in the number of web pages. We argue that the main topological feature of the underlying web graph allowing for such a scaling is the out-degree distribution. The top-ranked log(n) entries of the quantum PageRank state can then be estimated with a polynomial quantum speed-up. Moreover, the quantum PageRank state can be used in "q-sampling" protocols for testing properties of distributions, which require exponentially fewer measurements than all classical schemes designed for the same task. This can be used to decide whether to run a classical update of the PageRank.

  17. Sync-rank: Robust Ranking, Constrained Ranking and Rank Aggregation via Eigenvector and SDP Synchronization

    DTIC Science & Technology

    2015-04-28

    spectral algorithms, semidefinite programming, rank aggregation, partial rankings, least squares, singular value decomposition, densest subgraph problem. 1...provided by Google [42, 53], eBay’s feedback-based reputation mechanism [64], Amazon’s Mechanical Turk (MTurk) system for crowdsourcing which enables...compute a universal (non-negative) value πi associated to each item i, such that aij = πi πj , which can easily be seen as equivalent to the pairwise

  18. Exponential distribution of long heart beat intervals during atrial fibrillation and their relevance for white noise behaviour in power spectrum.

    PubMed

    Hennig, Thomas; Maass, Philipp; Hayano, Junichiro; Heinrichs, Stefan

    2006-11-01

    The statistical properties of heart beat intervals of 130 long-term surface electrocardiogram recordings during atrial fibrillation (AF) are investigated. We find that the distribution of interbeat intervals exhibits a characteristic exponential tail, which is absent during sinus rhythm, as tested in a corresponding control study with 72 healthy persons. The rate gamma of the exponential decay lies in the range 3-12 Hz and shows diurnal variations. It equals, up to statistical uncertainties, the level of the previously uncovered white noise part of the power spectrum, which is also characteristic for AF. The overall statistical features can be described by decomposing the intervals into two statistically independent times, where the first one is associated with a correlated process with 1/f noise characteristics, while the second one belongs to an uncorrelated process and is responsible for the exponential tail. It is suggested to use gamma as a further parameter for a better classification of AF and for the medical diagnosis. The relevance of the findings with respect to a general understanding of AF is discussed.

  19. Ranking of Rankings: Benchmarking Twenty-Five Higher Education Ranking Systems in Europe

    ERIC Educational Resources Information Center

    Stolz, Ingo; Hendel, Darwin D.; Horn, Aaron S.

    2010-01-01

    The purpose of this study is to evaluate the ranking practices of 25 European higher education ranking systems (HERSs). Ranking practices were assessed with 14 quantitative measures derived from the Berlin Principles on Ranking of Higher Education Institutions (BPs). HERSs were then ranked according to their degree of congruence with the BPs.…

  20. Ranking of Rankings: Benchmarking Twenty-Five Higher Education Ranking Systems in Europe

    ERIC Educational Resources Information Center

    Stolz, Ingo; Hendel, Darwin D.; Horn, Aaron S.

    2010-01-01

    The purpose of this study is to evaluate the ranking practices of 25 European higher education ranking systems (HERSs). Ranking practices were assessed with 14 quantitative measures derived from the Berlin Principles on Ranking of Higher Education Institutions (BPs). HERSs were then ranked according to their degree of congruence with the BPs.…

  1. Multiplex PageRank.

    PubMed

    Halu, Arda; Mondragón, Raúl J; Panzarasa, Pietro; Bianconi, Ginestra

    2013-01-01

    Many complex systems can be described as multiplex networks in which the same nodes can interact with one another in different layers, thus forming a set of interacting and co-evolving networks. Examples of such multiplex systems are social networks where people are involved in different types of relationships and interact through various forms of communication media. The ranking of nodes in multiplex networks is one of the most pressing and challenging tasks that research on complex networks is currently facing. When pairs of nodes can be connected through multiple links and in multiple layers, the ranking of nodes should necessarily reflect the importance of nodes in one layer as well as their importance in other interdependent layers. In this paper, we draw on the idea of biased random walks to define the Multiplex PageRank centrality measure in which the effects of the interplay between networks on the centrality of nodes are directly taken into account. In particular, depending on the intensity of the interaction between layers, we define the Additive, Multiplicative, Combined, and Neutral versions of Multiplex PageRank, and show how each version reflects the extent to which the importance of a node in one layer affects the importance the node can gain in another layer. We discuss these measures and apply them to an online multiplex social network. Findings indicate that taking the multiplex nature of the network into account helps uncover the emergence of rankings of nodes that differ from the rankings obtained from one single layer. Results provide support in favor of the salience of multiplex centrality measures, like Multiplex PageRank, for assessing the prominence of nodes embedded in multiple interacting networks, and for shedding a new light on structural properties that would otherwise remain undetected if each of the interacting networks were analyzed in isolation.

  2. A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text

    PubMed Central

    Miwa, Makoto; Ohta, Tomoko; Rak, Rafal; Rowley, Andrew; Kell, Douglas B.; Pyysalo, Sampo; Ananiadou, Sophia

    2013-01-01

    Motivation: To create, verify and maintain pathway models, curators must discover and assess knowledge distributed over the vast body of biological literature. Methods supporting these tasks must understand both the pathway model representations and the natural language in the literature. These methods should identify and order documents by relevance to any given pathway reaction. No existing system has addressed all aspects of this challenge. Method: We present novel methods for associating pathway model reactions with relevant publications. Our approach extracts the reactions directly from the models and then turns them into queries for three text mining-based MEDLINE literature search systems. These queries are executed, and the resulting documents are combined and ranked according to their relevance to the reactions of interest. We manually annotate document-reaction pairs with the relevance of the document to the reaction and use this annotation to study several ranking methods, using various heuristic and machine-learning approaches. Results: Our evaluation shows that the annotated document-reaction pairs can be used to create a rule-based document ranking system, and that machine learning can be used to rank documents by their relevance to pathway reactions. We find that a Support Vector Machine-based system outperforms several baselines and matches the performance of the rule-based system. The success of the query extraction and ranking methods are used to update our existing pathway search system, PathText. Availability: An online demonstration of PathText 2 and the annotated corpus are available for research purposes at http://www.nactem.ac.uk/pathtext2/. Contact: makoto.miwa@manchester.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23813008

  3. Ranking of Russian Higher Education Institutions

    ERIC Educational Resources Information Center

    Pokholkov, Yuri P.; Chuchalin, Alexander I.; Agranovich, Boris L.; Mogilnitsky, Sergey B.

    2007-01-01

    This article considers some patterns of ranking higher education institutions which are used in the Russian Federation to reveal strengths and weaknesses in meeting the national individual, societal and state-related needs, as well as those of the international academic community concerning relevant information on Russian higher education…

  4. Distribution and Metabolism of Lipocurc™ (Liposomal Curcumin) in Dog and Human Blood Cells: Species Selectivity and Pharmacokinetic Relevance.

    PubMed

    Bolger, Gordon T; Licollari, Albert; Tan, Aimin; Greil, Richard; Vcelar, Brigitta; Majeed, Muhammad; Helson, Lawrence

    2017-07-01

    The aim of this study was to investigate the distribution of curcumin (in the form of Lipocurc™) and its major metabolite tetrahydrocurcumin (THC) in Beagle dog and human red blood cells, peripheral blood mononuclear cells (PBMC) and hepatocytes. Lipocurc™ was used as the source of curcumin for the cell distribution assays. In vitro findings with red blood cells were also compared to in vivo pharmacokinetic data available from preclinical studies in dogs and phase I clinical studies in humans. High levels of curcumin were measured in PBMCs (625.5 ng/g w.w. cell pellet or 7,297 pg/10(6) cells in dog and 353.7 ng/g w.w. cell pellet or 6,809 pg/10(6) cells in human) and in hepatocytes (414.5 ng/g w.w. cell pellet or 14,005 pg/10(6) cells in dog and 813.5 ng/g w.w. cell pellet or 13,780 pg/10(6) cells in human). Lower curcumin levels were measured in red blood cells (dog: 78.4 ng/g w.w. cell pellet or 7.2 pg/10(6) cells, human: 201.5 ng/g w.w. cell pellet or 18.6 pg/10(6) cells). A decrease in the medium concentration of curcumin was observed in red blood cells and hepatocytes, but not in PBMCs. Red blood cell levels of THC were ~5-fold higher in dog compared to human and similar between dog and human for hepatocytes and PBMCs. The ratio of THC to curcumin found in the red blood cell medium following incubation was 6.3 for dog compared to 0.006 for human, while for PBMCs and hepatocytes the ratio of THC to curcumin in the medium did not display such marked species differences. There was an excellent correlation between the in vitro disposition of curcumin and THC following incubation with red blood cells and in vivo plasma levels of curcumin and THC in dog and human following intravenous infusion. The disposition of curcumin in blood cells is, therefore, species-dependent and of pharmacokinetic relevance. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  5. Podoplanin, E-cadherin, β-catenin, and CD44v6 in recurrent ameloblastoma: their distribution patterns and relevance.

    PubMed

    Siar, Chong Huat; Ishak, Ismadi; Ng, Kok Han

    2015-01-01

    Ameloblastoma is a benign but locally infiltrative odontogenic epithelial neoplasm with a high risk for recurrence. Podoplanin, a lymphatic endothelium marker, putatively promotes collective cell migration and invasiveness in this neoplasm. However, its role in the recurrent ameloblastoma (RA) remains unclear. As morphological, signaling, and genetic differences may exist between primary and recurrent tumors, clarification of their distribution patterns is of relevance. Podoplanin was examined immunohistochemically in conjunction with E-cadherin, β-catenin, and CD44v6 in 25 RA. Immunostaining according to tumor area, cellular type, and location, and relationship of these proteins were analyzed. Findings were compared with 25 unrelated primary ameloblastomas (UPA). All four proteins were detected in RA and UPA samples. Expression rates for each protein were not significantly different between these two groups. RA demonstrated significant upregulation of podoplanin at the invasive front (P < 0.05), whereas upregulation of β-catenin and CD44v6 and downregulation of E-cadherin at this site were not statistically significant (P > 0.05). Immunolocalization for all four proteins was predominantly membranous and less frequently cytoplasmic. Pre-ameloblast-like cells were podoplanin(+) /CD44v6(-), while stellate reticulum-like cells were podoplanin(-)/CD44v6(+). Acanthomatous, granular cell, and desmoplastic variants in both RA and UPA were podoplanin(-/low) but stained weak-to-moderate for E-cadherin, β-catenin, and CD44v6. Stromal fibroblasts and lymph channels were variably podoplanin-positive. Podoplanin, β-catenin, and CD44v6 upregulation at the tumor invasive fronts in RA and UPA supports a differential regulatory role by these molecules in mediating collective cell migration and local invasiveness. E-cadherin downregulation suggests altered cell adhesion function during tumor progression. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  6. Relevance of risk predictions derived from a chronic species sensitivity distribution with cadmium to aquatic populations and ecosystems

    USGS Publications Warehouse

    Mebane, C.A.

    2010-01-01

    Criteria to protect aquatic life are intended to protect diverse ecosystems, but in practice are usually developed from compilations of single-species toxicity tests using standard test organisms that were tested in laboratory environments. Species sensitivity distributions (SSDs) developed from these compilations are extrapolated to set aquatic ecosystem criteria. The protectiveness of the approach was critically reviewed with a chronic SSD for cadmium comprising 27 species within 21 genera. Within the data set, one genus had lower cadmium effects concentrations than the SSD fifth percentile-based criterion, so in theory this genus, the amphipod Hyalella, could be lost or at least allowed some level of harm by this criteria approach. However, population matrix modeling projected only slightly increased extinction risks for a temperate Hyalella population under scenarios similar to the SSD fifth percentile criterion. The criterion value was further compared to cadmium effects concentrations in ecosystem experiments and field studies. Generally, few adverse effects were inferred from ecosystem experiments at concentrations less than the SSD fifth percentile criterion. Exceptions were behavioral impairments in simplified food web studies. No adverse effects were apparent in field studies under conditions that seldom exceeded the criterion. At concentrations greater than the SSD fifth percentile, the magnitudes of adverse effects in the field studies were roughly proportional to the laboratory-based fraction of species with adverse effects in the SSD. Overall, the modeling and field validation comparisons of the chronic criterion values generally supported the relevance and protectiveness of the SSD fifth percentile approach with cadmium. ?? 2009 Society for Risk Analysis.

  7. Incorporating User Search Behavior into Relevance Feedback.

    ERIC Educational Resources Information Center

    Ruthven, Ian; Lalmas, Mounia; van Rijsbergen, Keith

    2003-01-01

    Presents five user experiments on incorporating behavioral information into the relevance feedback process in information retrieval, concentrating on ranking terms for query expansion and selecting new terms to add to the user's query. Topics include term ranking and user behavior; incorporating user behavior into term ranking; and user behavior…

  8. Diversifying customer review rankings.

    PubMed

    Krestel, Ralf; Dokoohaki, Nima

    2015-06-01

    E-commerce Web sites owe much of their popularity to consumer reviews accompanying product descriptions. On-line customers spend hours and hours going through heaps of textual reviews to decide which products to buy. At the same time, each popular product has thousands of user-generated reviews, making it impossible for a buyer to read everything. Current approaches to display reviews to users or recommend an individual review for a product are based on the recency or helpfulness of each review. In this paper, we present a framework to rank product reviews by optimizing the coverage of the ranking with respect to sentiment or aspects, or by summarizing all reviews with the top-K reviews in the ranking. To accomplish this, we make use of the assigned star rating for a product as an indicator for a review's sentiment polarity and compare bag-of-words (language model) with topic models (latent Dirichlet allocation) as a mean to represent aspects. Our evaluation on manually annotated review data from a commercial review Web site demonstrates the effectiveness of our approach, outperforming plain recency ranking by 30% and obtaining best results by combining language and topic model representations.

  9. Outflanking the Rankings Industry

    ERIC Educational Resources Information Center

    McGuire, Patricia

    2007-01-01

    In this article, the author argues that American higher education is allowing itself to be held hostage by the rankings industry, which can lead institutions to consider actions harmful to the public interest and encourage the public's infatuation with celebrity at the expense of substance. Instead of sitting quietly by during the upcoming ratings…

  10. Playing the Rankings Game

    ERIC Educational Resources Information Center

    Farrell, Elizabeth F.; Van Der Werf, Martin

    2007-01-01

    While some colleges claim not to care what "U.S. News & World Report" says, and experts cite problems in the way its annual rankings are done, many institutions scramble to improve their positions. There are well-documented examples of institutions that have solicited nominal donations from alumni to boost their percentage of giving, encouraged…

  11. Tool for Ranking Research Options

    NASA Technical Reports Server (NTRS)

    Ortiz, James N.; Scott, Kelly; Smith, Harold

    2005-01-01

    Tool for Research Enhancement Decision Support (TREDS) is a computer program developed to assist managers in ranking options for research aboard the International Space Station (ISS). It could likely also be adapted to perform similar decision-support functions in industrial and academic settings. TREDS provides a ranking of the options, based on a quantifiable assessment of all the relevant programmatic decision factors of benefit, cost, and risk. The computation of the benefit for each option is based on a figure of merit (FOM) for ISS research capacity that incorporates both quantitative and qualitative inputs. Qualitative inputs are gathered and partly quantified by use of the time-tested analytical hierarchical process and used to set weighting factors in the FOM corresponding to priorities determined by the cognizant decision maker(s). Then by use of algorithms developed specifically for this application, TREDS adjusts the projected benefit for each option on the basis of levels of technical implementation, cost, and schedule risk. Based partly on Excel spreadsheets, TREDS provides screens for entering cost, benefit, and risk information. Drop-down boxes are provided for entry of qualitative information. TREDS produces graphical output in multiple formats that can be tailored by users.

  12. Ranking facial attractiveness.

    PubMed

    Knight, Helen; Keith, Olly

    2005-08-01

    The first aim of this investigation was to assemble a group of photographs of 30 male and 30 female faces representing a standardized spectrum of facial attractiveness, against which orthognathic treatment outcomes could be compared. The second aim was to investigate the influence of the relationship between ANB differences and anterior lower face height (ALFH) percentages on facial attractiveness. The initial sample comprised standardized photographs of 41 female and 35 male Caucasian subjects. From these, the photographs of two groups of 30 male and 30 female subjects were compiled. A panel of six clinicians and six non-clinicians ranked the photographs. The results showed there to be a good level of reliability for each assessor when ranking the photographs on two occasions, particularly for the clinicians (female subjects r = 0.76-0.97, male subjects r = 0.72-0.94). Agreement among individuals within each group was also high, particularly when ranking facial attractiveness in male subjects (female subjects r = 0.57-0.84, male subjects r = 0.91-0.94). Antero-posterior (AP) discrepancies, as measured by soft tissue ANB, showed minimal correlation with facial attractiveness. However, a trend emerged that would suggest that in faces where the ANB varies widely from 5 degrees, the face is considered less attractive. The ALFH percentage also showed minimal correlation with facial attractiveness. However, there was a trend that suggested that greater ALFH percentages are considered less attractive in female faces, while in males the opposite trend was seen. Either of the two series of ranked photographs as judged by clinicians and non-clinicians could be used as a standard against which facial attractiveness could be assessed, as both were in total agreement about the most attractive faces. However, to judge the outcome of orthognathic treatment, the series of ranked photographs produced by the non-clinician group should be used as the 'standard' to reflect lay

  13. Beyond Zipf's Law: The Lavalette Rank Function and Its Properties.

    PubMed

    Fontanelli, Oscar; Miramontes, Pedro; Yang, Yaning; Cocho, Germinal; Li, Wentian

    Although Zipf's law is widespread in natural and social data, one often encounters situations where one or both ends of the ranked data deviate from the power-law function. Previously we proposed the Beta rank function to improve the fitting of data which does not follow a perfect Zipf's law. Here we show that when the two parameters in the Beta rank function have the same value, the Lavalette rank function, the probability density function can be derived analytically. We also show both computationally and analytically that Lavalette distribution is approximately equal, though not identical, to the lognormal distribution. We illustrate the utility of Lavalette rank function in several datasets. We also address three analysis issues on the statistical testing of Lavalette fitting function, comparison between Zipf's law and lognormal distribution through Lavalette function, and comparison between lognormal distribution and Lavalette distribution.

  14. Athletic Training Education Programs: To Rank or Not To Rank?

    PubMed Central

    Voll, Craig A.; Goodwin, Jeff E.; Pitney, William A.

    1999-01-01

    Objective: To discuss the literature regarding educational program ranking and to provide insights concerning undergraduate and graduate athletic training education ranking systems. Background: The demand for accountability and the need to evaluate the quality of educational programs have led to program ranking in many academic disciplines. As athletic training becomes more recognized within the medical community, determining a program's quality will become increasingly important. Description: We describe program rankings used in other disciplines for determining quality and providing measures of accountability. We discuss the strengths and weaknesses of both subjective and objective ranking systems, as well as the arguments for using program rankings in athletic training. Future directions for program ranking and potential research questions are suggested. Applications: Ranking systems on the basis of levels of perceived quality and academic productivity of programs that prepare future professionals will help potential undergraduate and graduate students make informed decisions when selecting an educational program. PMID:16558548

  15. Rank diversity of languages: generic behavior in computational linguistics.

    PubMed

    Cocho, Germinal; Flores, Jorge; Gershenson, Carlos; Pineda, Carlos; Sánchez, Sergio

    2015-01-01

    Statistical studies of languages have focused on the rank-frequency distribution of words. Instead, we introduce here a measure of how word ranks change in time and call this distribution rank diversity. We calculate this diversity for books published in six European languages since 1800, and find that it follows a universal lognormal distribution. Based on the mean and standard deviation associated with the lognormal distribution, we define three different word regimes of languages: "heads" consist of words which almost do not change their rank in time, "bodies" are words of general use, while "tails" are comprised by context-specific words and vary their rank considerably in time. The heads and bodies reflect the size of language cores identified by linguists for basic communication. We propose a Gaussian random walk model which reproduces the rank variation of words in time and thus the diversity. Rank diversity of words can be understood as the result of random variations in rank, where the size of the variation depends on the rank itself. We find that the core size is similar for all languages studied.

  16. Rank Diversity of Languages: Generic Behavior in Computational Linguistics

    PubMed Central

    Cocho, Germinal; Flores, Jorge; Gershenson, Carlos; Pineda, Carlos; Sánchez, Sergio

    2015-01-01

    Statistical studies of languages have focused on the rank-frequency distribution of words. Instead, we introduce here a measure of how word ranks change in time and call this distribution rank diversity. We calculate this diversity for books published in six European languages since 1800, and find that it follows a universal lognormal distribution. Based on the mean and standard deviation associated with the lognormal distribution, we define three different word regimes of languages: “heads” consist of words which almost do not change their rank in time, “bodies” are words of general use, while “tails” are comprised by context-specific words and vary their rank considerably in time. The heads and bodies reflect the size of language cores identified by linguists for basic communication. We propose a Gaussian random walk model which reproduces the rank variation of words in time and thus the diversity. Rank diversity of words can be understood as the result of random variations in rank, where the size of the variation depends on the rank itself. We find that the core size is similar for all languages studied. PMID:25849150

  17. Perceiving Action-Relevant Properties of Tools through Dynamic Touch: Effects of Mass Distribution, Exploration Style, and Intention

    ERIC Educational Resources Information Center

    Harrison, Steven J.; Hajnal, Alen; Lopresti-Goodman, Stacy; Isenhower, Robert W.; Kinsella-Shaw, J. M.

    2011-01-01

    At issue in the present series of experiments was the ability to prospectively perceive the action-relevant properties of hand-held tools by means of dynamic touch. In Experiment 1, participants judged object move-ability. In Experiment 2, participants judged how difficult an object would be to hold if held horizontally, and in Experiments 3 and…

  18. Perceiving Action-Relevant Properties of Tools through Dynamic Touch: Effects of Mass Distribution, Exploration Style, and Intention

    ERIC Educational Resources Information Center

    Harrison, Steven J.; Hajnal, Alen; Lopresti-Goodman, Stacy; Isenhower, Robert W.; Kinsella-Shaw, J. M.

    2011-01-01

    At issue in the present series of experiments was the ability to prospectively perceive the action-relevant properties of hand-held tools by means of dynamic touch. In Experiment 1, participants judged object move-ability. In Experiment 2, participants judged how difficult an object would be to hold if held horizontally, and in Experiments 3 and…

  19. Kinesiology Faculty Citations across Academic Rank

    ERIC Educational Resources Information Center

    Knudson, Duane

    2015-01-01

    Citations to research reports are used as a measure for the influence of a scholar's research line when seeking promotion, grants, and awards. The current study documented the distributions of citations to kinesiology scholars of various academic ranks. Google Scholar Citations was searched for user profiles using five research interest areas…

  20. Kinesiology Faculty Citations across Academic Rank

    ERIC Educational Resources Information Center

    Knudson, Duane

    2015-01-01

    Citations to research reports are used as a measure for the influence of a scholar's research line when seeking promotion, grants, and awards. The current study documented the distributions of citations to kinesiology scholars of various academic ranks. Google Scholar Citations was searched for user profiles using five research interest areas…

  1. Semi-quantitative spectrographic analysis and rank correlation in geochemistry

    USGS Publications Warehouse

    Flanagan, F.J.

    1957-01-01

    The rank correlation coefficient, rs, which involves less computation than the product-moment correlation coefficient, r, can be used to indicate the degree of relationship between two elements. The method is applicable in situations where the assumptions underlying normal distribution correlation theory may not be satisfied. Semi-quantitative spectrographic analyses which are reported as grouped or partly ranked data can be used to calculate rank correlations between elements. ?? 1957.

  2. Identification of Absorption, Distribution, Metabolism, and Excretion (ADME) Genes Relevant to Steatosis Using a Differential Gene Expression Approach

    EPA Science Inventory

    Absorption, distribution, metabolism, and excretion (ADME) parameters represent important connections between exposure to chemicals and the activation of molecular initiating events of Adverse Outcome Pathways (AOPs) in cellular, tissue, and organ level targets. ADME parameters u...

  3. Identification of Absorption, Distribution, Metabolism, and Excretion (ADME) Genes Relevant to Steatosis Using a Differential Gene Expression Approach

    EPA Science Inventory

    Absorption, distribution, metabolism, and excretion (ADME) parameters represent important connections between exposure to chemicals and the activation of molecular initiating events of Adverse Outcome Pathways (AOPs) in cellular, tissue, and organ level targets. ADME parameters u...

  4. University Rankings and Social Science

    ERIC Educational Resources Information Center

    Marginson, Simon

    2014-01-01

    University rankings widely affect the behaviours of prospective students and their families, university executive leaders, academic faculty, governments and investors in higher education. Yet the social science foundations of global rankings receive little scrutiny. Rankings that simply recycle reputation without any necessary connection to real…

  5. University Rankings and Social Science

    ERIC Educational Resources Information Center

    Marginson, Simon

    2014-01-01

    University rankings widely affect the behaviours of prospective students and their families, university executive leaders, academic faculty, governments and investors in higher education. Yet the social science foundations of global rankings receive little scrutiny. Rankings that simply recycle reputation without any necessary connection to real…

  6. Learning from partially annotated OPT images by contextual relevance ranking.

    PubMed

    Li, Wenqi; Zhang, Jianguo; Zheng, Wei-Shi; Coats, Maria; Carey, Frank A; McKenna, Stephen J

    2013-01-01

    Annotations delineating regions of interest can provide valuable information for training medical image classification and segmentation methods. However the process of obtaining annotations is tedious and time-consuming, especially for high-resolution volumetric images. In this paper we present a novel learning framework to reduce the requirement of manual annotations while achieving competitive classification performance. The approach is evaluated on a dataset with 59 3D optical projection tomography images of colorectal polyps. The results show that the proposed method can robustly infer patterns from partially annotated images with low computational cost.

  7. The Privileges of Rank

    PubMed Central

    MacLean, Alair

    2010-01-01

    This article examines the effects of peacetime cold war military service on the life course according to four potentially overlapping theories that state that military service (1) was a disruption, (2) was a positive turning point, (3) allowed veterans to accumulate advantage, and (4) was an agent of social reproduction. The article argues that the extent to which the effect of military service on veterans' lives corresponds with one or another of the preceding theories depends on historical shifts in three dimensions: conscription, conflict, and benefits. Military service during the peacetime draft era of the late 1950s had a neutral effect on the socioeconomic attainment of enlisted veterans. However, it had a positive effect on veterans who served as officers, which partly stemmed from status reproduction and selection. Yet net of pre-service and educational differences by rank, officers in this peacetime draft era were still able to accumulate advantage. PMID:20842210

  8. Forecasting Distributional Responses of Limber Pine to Climate Change at Management-Relevant Scales in Rocky Mountain National Park

    PubMed Central

    Monahan, William B.; Cook, Tammy; Melton, Forrest; Connor, Jeff; Bobowski, Ben

    2013-01-01

    Resource managers at parks and other protected areas are increasingly expected to factor climate change explicitly into their decision making frameworks. However, most protected areas are small relative to the geographic ranges of species being managed, so forecasts need to consider local adaptation and community dynamics that are correlated with climate and affect distributions inside protected area boundaries. Additionally, niche theory suggests that species' physiological capacities to respond to climate change may be underestimated when forecasts fail to consider the full breadth of climates occupied by the species rangewide. Here, using correlative species distribution models that contrast estimates of climatic sensitivity inferred from the two spatial extents, we quantify the response of limber pine (Pinus flexilis) to climate change in Rocky Mountain National Park (Colorado, USA). Models are trained locally within the park where limber pine is the community dominant tree species, a distinct structural-compositional vegetation class of interest to managers, and also rangewide, as suggested by niche theory. Model forecasts through 2100 under two representative concentration pathways (RCP 4.5 and 8.5 W/m2) show that the distribution of limber pine in the park is expected to move upslope in elevation, but changes in total and core patch area remain highly uncertain. Most of this uncertainty is biological, as magnitudes of projected change are considerably more variable between the two spatial extents used in model training than they are between RCPs, and novel future climates only affect local model predictions associated with RCP 8.5 after 2091. Combined, these results illustrate the importance of accounting for unknowns in species' climatic sensitivities when forecasting distributional scenarios that are used to inform management decisions. We discuss how our results for limber pine may be interpreted in the context of climate change vulnerability and used to

  9. Forecasting distributional responses of limber pine to climate change at management-relevant scales in Rocky Mountain National Park.

    PubMed

    Monahan, William B; Cook, Tammy; Melton, Forrest; Connor, Jeff; Bobowski, Ben

    2013-01-01

    Resource managers at parks and other protected areas are increasingly expected to factor climate change explicitly into their decision making frameworks. However, most protected areas are small relative to the geographic ranges of species being managed, so forecasts need to consider local adaptation and community dynamics that are correlated with climate and affect distributions inside protected area boundaries. Additionally, niche theory suggests that species' physiological capacities to respond to climate change may be underestimated when forecasts fail to consider the full breadth of climates occupied by the species rangewide. Here, using correlative species distribution models that contrast estimates of climatic sensitivity inferred from the two spatial extents, we quantify the response of limber pine (Pinus flexilis) to climate change in Rocky Mountain National Park (Colorado, USA). Models are trained locally within the park where limber pine is the community dominant tree species, a distinct structural-compositional vegetation class of interest to managers, and also rangewide, as suggested by niche theory. Model forecasts through 2100 under two representative concentration pathways (RCP 4.5 and 8.5 W/m(2)) show that the distribution of limber pine in the park is expected to move upslope in elevation, but changes in total and core patch area remain highly uncertain. Most of this uncertainty is biological, as magnitudes of projected change are considerably more variable between the two spatial extents used in model training than they are between RCPs, and novel future climates only affect local model predictions associated with RCP 8.5 after 2091. Combined, these results illustrate the importance of accounting for unknowns in species' climatic sensitivities when forecasting distributional scenarios that are used to inform management decisions. We discuss how our results for limber pine may be interpreted in the context of climate change vulnerability and used

  10. Test procedures and protocols: Their relevance to the figure of merit for thermal distribution systems. Volume 1: Informal report

    SciTech Connect

    Andrews, J.W.

    1993-09-01

    A conceptual framework is developed that categorizes measurement protocols for forced-air thermal distribution systems in small buildings. This framework is based on the distinction between two generic approaches. The {open_quote}system-comparison{close_quote} approach seeks to determine, via a pair of whole-house energy-use measurements, the difference in energy use between the house with the as-found duct system and the same house with no energy losses attributable to the thermal distribution system. The {open_quote}component loss-factor{close_quote} approach identifies and measures the individual causes of duct losses, and then builds up a value for the net overall duct efficiency, usually with the help of computer simulation. Examples of each approach are analyzed and related to a proposed Figure of Merit for thermal distribution systems. This Figure of Merit would serve as the basis for a Standard Method of Test analogous to those already in place for furnaces, boilers, air conditioners, and heat pumps.

  11. Relevance of octanol-water distribution measurements to the potential ecological uptake of multi-walled carbon nanotubes.

    PubMed

    Petersen, Elijah J; Huang, Qingguo; Weber, Walter J

    2010-05-01

    Many potential applications of carbon nanotubes (CNTs) require various physicochemical modifications prior to use, suggesting that nanotubes having varied properties may pose risks in ecosystems. A means for estimating bioaccumulation potentials of variously modified CNTs for incorporation in predictive fate models would be highly valuable. An approach commonly used for sparingly soluble organic contaminants, and previously suggested for use as well with carbonaceous nanomaterials, involves measurement of their octanol-water partitioning coefficient (KOW) values. To test the applicability of this approach, a methodology was developed to measure apparent octanol-water distribution behaviors for purified multi-walled carbon nanotubes and those acid treated. Substantial differences in apparent distribution coefficients between the two types of CNTs were observed, but these differences did not influence accumulation by either earthworms (Eisenia foetida) or oligochaetes (Lumbriculus variegatus), both of which showed minimal nanotube uptake for both types of nanotubes. The results suggest that traditional distribution behavior-based KOW approaches are likely not appropriate for predicting CNT bioaccumulation.

  12. Neophilia Ranking of Scientific Journals

    PubMed Central

    Packalen, Mikko; Bhattacharya, Jay

    2017-01-01

    The ranking of scientific journals is important because of the signal it sends to scientists about what is considered most vital for scientific progress. Existing ranking systems focus on measuring the influence of a scientific paper (citations)—these rankings do not reward journals for publishing innovative work that builds on new ideas. We propose an alternative ranking based on the proclivity of journals to publish papers that build on new ideas, and we implement this ranking via a text-based analysis of all published biomedical papers dating back to 1946. In addition, we compare our neophilia ranking to citation-based (impact factor) rankings; this comparison shows that the two ranking approaches are distinct. Prior theoretical work suggests an active role for our neophilia index in science policy. Absent an explicit incentive to pursue novel science, scientists underinvest in innovative work because of a coordination problem: for work on a new idea to flourish, many scientists must decide to adopt it in their work. Rankings that are based purely on influence thus do not provide sufficient incentives for publishing innovative work. By contrast, adoption of the neophilia index as part of journal-ranking procedures by funding agencies and university administrators would provide an explicit incentive for journals to publish innovative work and thus help solve the coordination problem by increasing scientists' incentives to pursue innovative work. PMID:28713181

  13. A LDA-based approach to promoting ranking diversity for genomics information retrieval.

    PubMed

    Chen, Yan; Yin, Xiaoshi; Li, Zhoujun; Hu, Xiaohua; Huang, Jimmy Xiangji

    2012-06-11

    In the biomedical domain, there are immense data and tremendous increase of genomics and biomedical relevant publications. The wealth of information has led to an increasing amount of interest in and need for applying information retrieval techniques to access the scientific literature in genomics and related biomedical disciplines. In many cases, the desired information of a query asked by biologists is a list of a certain type of entities covering different aspects that are related to the question, such as cells, genes, diseases, proteins, mutations, etc. Hence, it is important of a biomedical IR system to be able to provide relevant and diverse answers to fulfill biologists' information needs. However traditional IR model only concerns with the relevance between retrieved documents and user query, but does not take redundancy between retrieved documents into account. This will lead to high redundancy and low diversity in the retrieval ranked lists. In this paper, we propose an approach which employs a topic generative model called Latent Dirichlet Allocation (LDA) to promoting ranking diversity for biomedical information retrieval. Different from other approaches or models which consider aspects on word level, our approach assumes that aspects should be identified by the topics of retrieved documents. We present LDA model to discover topic distribution of retrieval passages and word distribution of each topic dimension, and then re-rank retrieval results with topic distribution similarity between passages based on N-size slide window. We perform our approach on TREC 2007 Genomics collection and two distinctive IR baseline runs, which can achieve 8% improvement over the highest Aspect MAP reported in TREC 2007 Genomics track. The proposed method is the first study of adopting topic model to genomics information retrieval, and demonstrates its effectiveness in promoting ranking diversity as well as in improving relevance of ranked lists of genomics search

  14. International ranking systems for universities and institutions: a critical appraisal

    PubMed Central

    Ioannidis, John PA; Patsopoulos, Nikolaos A; Kavvoura, Fotini K; Tatsioni, Athina; Evangelou, Evangelos; Kouri, Ioanna; Contopoulos-Ioannidis, Despina G; Liberopoulos, George

    2007-01-01

    Background Ranking of universities and institutions has attracted wide attention recently. Several systems have been proposed that attempt to rank academic institutions worldwide. Methods We review the two most publicly visible ranking systems, the Shanghai Jiao Tong University 'Academic Ranking of World Universities' and the Times Higher Education Supplement 'World University Rankings' and also briefly review other ranking systems that use different criteria. We assess the construct validity for educational and research excellence and the measurement validity of each of the proposed ranking criteria, and try to identify generic challenges in international ranking of universities and institutions. Results None of the reviewed criteria for international ranking seems to have very good construct validity for both educational and research excellence, and most don't have very good construct validity even for just one of these two aspects of excellence. Measurement error for many items is also considerable or is not possible to determine due to lack of publication of the relevant data and methodology details. The concordance between the 2006 rankings by Shanghai and Times is modest at best, with only 133 universities shared in their top 200 lists. The examination of the existing international ranking systems suggests that generic challenges include adjustment for institutional size, definition of institutions, implications of average measurements of excellence versus measurements of extremes, adjustments for scientific field, time frame of measurement and allocation of credit for excellence. Conclusion Naïve lists of international institutional rankings that do not address these fundamental challenges with transparent methods are misleading and should be abandoned. We make some suggestions on how focused and standardized evaluations of excellence could be improved and placed in proper context. PMID:17961208

  15. Imaging geochemical heterogeneities using inverse reactive transport modeling: An example relevant for characterizing arsenic mobilization and distribution

    NASA Astrophysics Data System (ADS)

    Fakhreddine, Sarah; Lee, Jonghyun; Kitanidis, Peter K.; Fendorf, Scott; Rolle, Massimo

    2016-02-01

    The spatial distribution of reactive minerals in the subsurface is often a primary factor controlling the fate and transport of contaminants in groundwater systems. However, direct measurement and estimation of heterogeneously distributed minerals are often costly and difficult to obtain. While previous studies have shown the utility of using hydrologic measurements combined with inverse modeling techniques for tomography of physical properties including hydraulic conductivity, these methods have seldom been used to image reactive geochemical heterogeneities. In this study, we focus on As-bearing reactive minerals as aquifer contaminants. We use synthetic applications to demonstrate the ability of inverse modeling techniques combined with mechanistic reactive transport models to image reactive mineral lenses in the subsurface and quantify estimation error using indirect, commonly measured groundwater parameters. Specifically, we simulate the mobilization of arsenic via kinetic oxidative dissolution of As-bearing pyrite due to dissolved oxygen in the ambient groundwater, which is an important mechanism for arsenic release in groundwater both under natural conditions and engineering applications such as managed aquifer recharge and recovery operations. The modeling investigation is carried out at various scales and considers different flow-through domains including (i) a 1D lab-scale column (80 cm), (ii) a 2D lab-scale setup (60 cm × 30 cm) and (iii) a 2D field-scale domain (20 m × 4 m). In these setups, synthetic dissolved oxygen data and forward reactive transport simulations are used to image the spatial distribution of As-bearing pyrite using the Principal Component Geostatistical Approach (PCGA) for inverse modeling.

  16. Wikipedia ranking of world universities

    NASA Astrophysics Data System (ADS)

    Lages, José; Patt, Antoine; Shepelyansky, Dima L.

    2016-03-01

    We use the directed networks between articles of 24 Wikipedia language editions for producing the wikipedia ranking of world Universities (WRWU) using PageRank, 2DRank and CheiRank algorithms. This approach allows to incorporate various cultural views on world universities using the mathematical statistical analysis independent of cultural preferences. The Wikipedia ranking of top 100 universities provides about 60% overlap with the Shanghai university ranking demonstrating the reliable features of this approach. At the same time WRWU incorporates all knowledge accumulated at 24 Wikipedia editions giving stronger highlights for historically important universities leading to a different estimation of efficiency of world countries in university education. The historical development of university ranking is analyzed during ten centuries of their history.

  17. Low-rank coal research

    SciTech Connect

    Weber, G. F.; Laudal, D. L.

    1989-01-01

    This work is a compilation of reports on ongoing research at the University of North Dakota. Topics include: Control Technology and Coal Preparation Research (SO{sub x}/NO{sub x} control, waste management), Advanced Research and Technology Development (turbine combustion phenomena, combustion inorganic transformation, coal/char reactivity, liquefaction reactivity of low-rank coals, gasification ash and slag characterization, fine particulate emissions), Combustion Research (fluidized bed combustion, beneficiation of low-rank coals, combustion characterization of low-rank coal fuels, diesel utilization of low-rank coals), Liquefaction Research (low-rank coal direct liquefaction), and Gasification Research (hydrogen production from low-rank coals, advanced wastewater treatment, mild gasification, color and residual COD removal from Synfuel wastewaters, Great Plains Gasification Plant, gasifier optimization).

  18. Sorption and competition of two persistent organic pesticides onto marine sediments: Relevance to their distribution in aquatic system.

    PubMed

    Soubaneh, Youssouf Djibril; Gagné, Jean-Pierre; Lebeuf, Michel; Nikiforov, Vladimir; Gouteux, Bruno; Osman, Awaleh Mohamed

    2015-07-01

    Sorption is a key process in the distribution of substances between environmental compartments in marine ecosystems. Two persistent organic pesticides, also known as toxaphene congeners, namely B8-1413 (P26) and B9-1679 (P50), are of special interest because they are not detected in sediments while relatively concentrated in marine mammals. Sorption-desorption, entrapment and competition behaviors of these pesticides onto marine sediments were studied to explain their environmental distribution. Data obtained under marine experimental conditions were fitted to sorption models to evaluate sorption coefficients and to assess the degree of B8-1413/B9-1679 entrapment of the two toxaphene congeners in sediments. Carbon normalized sorption coefficients (Koc) of both congeners were similar under in cold (2°C) marine (30 psu) conditions with high values ranging from 1.53×10(5) to 3.28×10(5) mL g(-1)indicative of a strong affinity to marine sediments However, the sorption-desorption investigations indicate that B8-1413/B9-1679 were on average 2.5 times less entrapped in sediments compared to B7-1450, a toxaphene congener known to accumulate predominantly in sediments. These results suggest that the low entrapment of B8-1413 and B9-1679 favor their availability and transfer to biological matrices.

  19. A ranking-theoretic approach to conditionals.

    PubMed

    Spohn, Wolfgang

    2013-08-01

    Conditionals somehow express conditional beliefs. However, conditional belief is a bi-propositional attitude that is generally not truth-evaluable, in contrast to unconditional belief. Therefore, this article opts for an expressivistic semantics for conditionals, grounds this semantics in the arguably most adequate account of conditional belief, that is, ranking theory, and dismisses probability theory for that purpose, because probabilities cannot represent belief. Various expressive options are then explained in terms of ranking theory, with the intention to set out a general interpretive scheme that is able to account for the most variegated usage of conditionals. The Ramsey test is only the first option. Relevance is another, familiar, but little understood item, which comes in several versions. This article adds a further family of expressive options, which is able to subsume also counterfactuals and causal conditionals, and indicates at the end how this family allows for partial recovery of truth conditions for conditionals.

  20. Security Techniques for Prevention of Rank Manipulation in Social Tagging Services including Robotic Domains

    PubMed Central

    2014-01-01

    With smartphone distribution becoming common and robotic applications on the rise, social tagging services for various applications including robotic domains have advanced significantly. Though social tagging plays an important role when users are finding the exact information through web search, reliability and semantic relation between web contents and tags are not considered. Spams are making ill use of this aspect and put irrelevant tags deliberately on contents and induce users to advertise contents when they click items of search results. Therefore, this study proposes a detection method for tag-ranking manipulation to solve the problem of the existing methods which cannot guarantee the reliability of tagging. Similarity is measured for ranking the grade of registered tag on the contents, and weighted values of each tag are measured by means of synonym relevance, frequency, and semantic distances between tags. Lastly, experimental evaluation results are provided and its efficiency and accuracy are verified through them. PMID:25114975

  1. Security techniques for prevention of rank manipulation in social tagging services including robotic domains.

    PubMed

    Choi, Okkyung; Jung, Hanyoung; Moon, Seungbin

    2014-01-01

    With smartphone distribution becoming common and robotic applications on the rise, social tagging services for various applications including robotic domains have advanced significantly. Though social tagging plays an important role when users are finding the exact information through web search, reliability and semantic relation between web contents and tags are not considered. Spams are making ill use of this aspect and put irrelevant tags deliberately on contents and induce users to advertise contents when they click items of search results. Therefore, this study proposes a detection method for tag-ranking manipulation to solve the problem of the existing methods which cannot guarantee the reliability of tagging. Similarity is measured for ranking the grade of registered tag on the contents, and weighted values of each tag are measured by means of synonym relevance, frequency, and semantic distances between tags. Lastly, experimental evaluation results are provided and its efficiency and accuracy are verified through them.

  2. Examination of the relevance of using radiochromic films in measuring entrance skin dose distribution in conventional digital mammography.

    PubMed

    Soliman, K; Bakkari, M

    2015-07-01

    Based on manufacturer specifications, radiochromic films are sensitive enough to be used for dosimetry in digital mammography (DM). The aim of this work was to study the feasibility of measuring entrance surface dose (ESD) distribution using Gafchromic XR-QA2 films. The films were irradiated following a standard clinical two-view screening mammography protocol using a full-field digital mammography (FFDM) imaging system. The films were then digitised using a flatbed scanner. The calibration curve relating the readings from a calibrated ionisation chamber and the films' net optical density (NOD) could not be obtained. The examination of the calibration data revealed non-sensitivity of the films to resolve dose differences below 20 mGy at 28 kVp. Therefore, radiochromic films were found not to be suitable for measuring ESD profiles in DM. A 2D map of the NOD of the irradiated films obtained using in-house developed MATLAB computer program is presented. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  3. On the size distribution of collision fragments of NLC dust particles and their relevance to meteoric smoke particles

    NASA Astrophysics Data System (ADS)

    Havnes, O.; Gumbel, J.; Antonsen, T.; Hedin, J.; La Hoz, C.

    2014-10-01

    We present the results from a new dust probe MUDD on the PHOCUS payload which was launched in July 2011. In the interior of MUDD all the incoming NLC/PMSE icy dust particles will collide, at an impact angle ~70° to the surface normal, with a grid constructed such that no dust particles can directly hit the bottom plate of the probe. Only collision fragments will continue down towards the bottom plate. We determine an energy distribution of the charged fragments by applying a variable electric field between the impact grid and the bottom plate of MUDD. We find that ~30% of the charged fragments have kinetic energies less than 10 eV, ~20% have energies between 10 and 20 eV while ~50% have energies above 20 eV. The transformation of limits in kinetic energy for ice or meteoric smoke particles (MSP) to radius is dependent on many assumptions, the most crucial being fragment velocity. We find, however, that the sizes of the charged fragments most probably are in the range of 1 to 2 nm if meteoric smoke particles (MSP), and slightly higher if ice particles. The observed high charging fraction and the dominance of fragment sizes below a few nm makes it very unlikely that the fragments can consist mainly of ice but that they must be predominantly MSP as predicted by Havnes and Næsheim (2007) and recently observed by Hervig et al. (2012). The MUDD results indicate that MSP are embedded in NLC/PMSE ice particles with a minimum volume filling factor of ~.05% in the unlikely case that all embedded MSP are released and charged. A few % volume filling factor (Hervig et al., 2012) can easily be reached if ~10% of the MSP are released and that their charging probability is ~0.1.

  4. Natural distribution of the femoral mechanical-anatomical angle in an osteoarthritic population and its relevance to total knee arthroplasty.

    PubMed

    Deakin, Angela H; Basanagoudar, Praveen L; Nunag, Perrico; Johnston, Andrew T; Sarungi, Martin

    2012-03-01

    A common surgical goal in TKA is to restore neutral alignment of the lower limb by making bone cuts perpendicular to the mechanical axes of the femur and tibia. Standard practice for many surgeons is to use the same distal femoral valgus resection angle for all patients, assuming little or no variation in the femoral mechanical-anatomical (FMA) angle between different patients' knees. This study analysed 174 pre-operative hip-knee-ankle radiographs of osteoarthritic knees (157 patients, 87 female and 70 male, mean age 70years and mean BMI 31.8). Measurements of mechanical femorotibial (MFT) and FMA angles were made. The mean FMA angle was 5.7° (SD 1.2°, range 2° to 9°). There was a statistically significant difference between the FMA angle for males and females with males tending to have larger FMA angles (p<0.001). There was a statistically significant correlation between MFT and FMA angle (r=-0.499) with varus knees tending to have larger FMA angles (p<0.001). These results indicate a wide distribution of FMA angle in an osteoarthritic population. In terms of achieving appropriate coronal alignment in TKA the use of a fixed valgus resection angle is not suitable for all patients and it may be preferable to adjust the distal femoral cut according to individual FMA angles. However if this angle is not available the cut may be adjusted according to pre-operative coronal alignment, using 6° for neutral/mild varus, >6° for more severe varus and <6° for valgus knees. Copyright © 2011 Elsevier B.V. All rights reserved.

  5. University Rankings in Critical Perspective

    ERIC Educational Resources Information Center

    Pusser, Brian; Marginson, Simon

    2013-01-01

    This article addresses global postsecondary ranking systems by using critical-theoretical perspectives on power. This research suggests rankings are at once a useful lens for studying power in higher education and an important instrument for the exercise of power in service of dominant norms in global higher education. (Contains 1 table and 1…

  6. University Ranking as Social Exclusion

    ERIC Educational Resources Information Center

    Amsler, Sarah S.; Bolsmann, Chris

    2012-01-01

    In this article we explore the dual role of global university rankings in the creation of a new, knowledge-identified, transnational capitalist class and in facilitating new forms of social exclusion. We examine how and why the practice of ranking universities has become widely defined by national and international organisations as an important…

  7. Technical Pitfalls in University Rankings

    ERIC Educational Resources Information Center

    Bougnol, Marie-Laure; Dulá, Jose H.

    2015-01-01

    Academicians, experts, and other stakeholders have contributed extensively to the literature on university rankings also known as "league tables". Often the tone is critical usually focused on the subjective aspects of the process; e.g., the list of the universities' attributes used in the rankings, their respective weights, and the size…

  8. Obsession with Rankings Goes Global

    ERIC Educational Resources Information Center

    Labi, Aisha

    2008-01-01

    A Chinese list of the world's top universities would seem an unlikely concern for French politicians. But this year, France's legislature took aim at the annual rankings produced by Shanghai Jiao Tong University, which claims to list the 500 best universities in the world. The highest-ranked French entry, Universite Pierre et Marie Curie, comes in…

  9. Obsession with Rankings Goes Global

    ERIC Educational Resources Information Center

    Labi, Aisha

    2008-01-01

    A Chinese list of the world's top universities would seem an unlikely concern for French politicians. But this year, France's legislature took aim at the annual rankings produced by Shanghai Jiao Tong University, which claims to list the 500 best universities in the world. The highest-ranked French entry, Universite Pierre et Marie Curie, comes in…

  10. University Rankings in Critical Perspective

    ERIC Educational Resources Information Center

    Pusser, Brian; Marginson, Simon

    2013-01-01

    This article addresses global postsecondary ranking systems by using critical-theoretical perspectives on power. This research suggests rankings are at once a useful lens for studying power in higher education and an important instrument for the exercise of power in service of dominant norms in global higher education. (Contains 1 table and 1…

  11. University Ranking as Social Exclusion

    ERIC Educational Resources Information Center

    Amsler, Sarah S.; Bolsmann, Chris

    2012-01-01

    In this article we explore the dual role of global university rankings in the creation of a new, knowledge-identified, transnational capitalist class and in facilitating new forms of social exclusion. We examine how and why the practice of ranking universities has become widely defined by national and international organisations as an important…

  12. US dermatology residency program rankings.

    PubMed

    Aquino, Lisa L; Wen, Ge; Wu, Jashin J

    2014-10-01

    Unlike many other adult specialties, US News & World Report does not rank dermatology residency programs annually. We conducted a study to rank individual US dermatology residency programs based on set criteria. For each residency program, data from 2008 related to a number of factors were collected, including annual amount of National Institutes of Health (NIH) and Dermatology Foundation (DF) funding received; number of publications from full-time faculty members; number of faculty lectures given at 5 annual society meetings; and number of full-time faculty members who were on the editorial boards of 6 dermatology journals with the highest impact factors. Most of the data were obtained through extensive Internet searches, and missing data were obtained by contacting individual residency programs. The programs were ranked based on the prior factors according to a weighted ranking algorithm. A list of overall rankings also was created.

  13. Phenomena Identification and Ranking Technique (PIRT) Panel Meeting Summary Report

    SciTech Connect

    Mark Holbrook

    2007-07-01

    Phenomena Identification and Ranking Technique (PIRT) is a systematic way of gathering information from experts on a specific subject and ranking the importance of the information. NRC, in collaboration with DOE and the working group, conducted the PIRT exercises to identify safety-relevant phenomena for NGNP, and to assess and rank the importance and knowledge base for each phenomenon. The overall objective was to provide NRC with an expert assessment of the safety-relevant NGNP phenomena, and an overall assessment of R and D needs for NGNP licensing. The PIRT process was applied to five major topical areas relevant to NGNP safety and licensing: (1) thermofluids and accident analysis (including neutronics), (2) fission product transport, (3) high temperature materials, (4) graphite, and (5) process heat for hydrogen cogeneration.

  14. Do lab-derived distribution coefficient values of pesticides match distribution coefficient values determined from column and field-scale experiments? A critical analysis of relevant literature.

    PubMed

    Vereecken, H; Vanderborght, J; Kasteel, R; Spiteller, M; Schäffer, A; Close, M

    2011-01-01

    In this study, we analyzed sorption parameters for pesticides that were derived from batch and column or batch and field experiments. The batch experiments analyzed in this study were run with the same pesticide and soil as in the column and field experiments. We analyzed the relationship between the pore water velocity of the column and field experiments, solute residence times, and sorption parameters, such as the organic carbon normalized distribution coefficient ( ) and the mass exchange coefficient in kinetic models, as well as the predictability of sorption parameters from basic soil properties. The batch/column analysis included 38 studies with a total of 139 observations. The batch/field analysis included five studies, resulting in a dataset of 24 observations. For the batch/column data, power law relationships between pore water velocity, residence time, and sorption constants were derived. The unexplained variability in these equations was reduced, taking into account the saturation status and the packing status (disturbed-undisturbed) of the soil sample. A new regression equation was derived that allows estimating the values derived from column experiments using organic matter and bulk density with an value of 0.56. Regression analysis of the batch/column data showed that the relationship between batch- and column-derived values depends on the saturation status and packing of the soil column. Analysis of the batch/field data showed that as the batch-derived value becomes larger, field-derived values tend to be lower than the corresponding batch-derived values, and vice versa. The present dataset also showed that the variability in the ratio of batch- to column-derived value increases with increasing pore water velocity, with a maximum value approaching 3.5.

  15. Research on B Cell Algorithm for Learning to Rank Method Based on Parallel Strategy

    PubMed Central

    Tian, Yuling; Zhang, Hongxian

    2016-01-01

    For the purposes of information retrieval, users must find highly relevant documents from within a system (and often a quite large one comprised of many individual documents) based on input query. Ranking the documents according to their relevance within the system to meet user needs is a challenging endeavor, and a hot research topic–there already exist several rank-learning methods based on machine learning techniques which can generate ranking functions automatically. This paper proposes a parallel B cell algorithm, RankBCA, for rank learning which utilizes a clonal selection mechanism based on biological immunity. The novel algorithm is compared with traditional rank-learning algorithms through experimentation and shown to outperform the others in respect to accuracy, learning time, and convergence rate; taken together, the experimental results show that the proposed algorithm indeed effectively and rapidly identifies optimal ranking functions. PMID:27487242

  16. PageRank model of opinion formation on social networks

    NASA Astrophysics Data System (ADS)

    Kandiah, Vivek; Shepelyansky, Dima L.

    2012-11-01

    We propose the PageRank model of opinion formation and investigate its rich properties on real directed networks of the Universities of Cambridge and Oxford, LiveJournal, and Twitter. In this model, the opinion formation of linked electors is weighted with their PageRank probability. Such a probability is used by the Google search engine for ranking of web pages. We find that the society elite, corresponding to the top PageRank nodes, can impose its opinion on a significant fraction of the society. However, for a homogeneous distribution of two opinions, there exists a bistability range of opinions which depends on a conformist parameter characterizing the opinion formation. We find that the LiveJournal and Twitter networks have a stronger tendency to a totalitarian opinion formation than the university networks. We also analyze the Sznajd model generalized for scale-free networks with the weighted PageRank vote of electors.

  17. Estimation of rank correlation for clustered data.

    PubMed

    Rosner, Bernard; Glynn, Robert J

    2017-06-30

    It is well known that the sample correlation coefficient (Rxy ) is the maximum likelihood estimator of the Pearson correlation (ρxy ) for independent and identically distributed (i.i.d.) bivariate normal data. However, this is not true for ophthalmologic data where X (e.g., visual acuity) and Y (e.g., visual field) are available for each eye and there is positive intraclass correlation for both X and Y in fellow eyes. In this paper, we provide a regression-based approach for obtaining the maximum likelihood estimator of ρxy for clustered data, which can be implemented using standard mixed effects model software. This method is also extended to allow for estimation of partial correlation by controlling both X and Y for a vector U_ of other covariates. In addition, these methods can be extended to allow for estimation of rank correlation for clustered data by (i) converting ranks of both X and Y to the probit scale, (ii) estimating the Pearson correlation between probit scores for X and Y, and (iii) using the relationship between Pearson and rank correlation for bivariate normally distributed data. The validity of the methods in finite-sized samples is supported by simulation studies. Finally, two examples from ophthalmology and analgesic abuse are used to illustrate the methods. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  18. Ranking Theory and Conditional Reasoning.

    PubMed

    Skovgaard-Olsen, Niels

    2016-05-01

    Ranking theory is a formal epistemology that has been developed in over 600 pages in Spohn's recent book The Laws of Belief, which aims to provide a normative account of the dynamics of beliefs that presents an alternative to current probabilistic approaches. It has long been received in the AI community, but it has not yet found application in experimental psychology. The purpose of this paper is to derive clear, quantitative predictions by exploiting a parallel between ranking theory and a statistical model called logistic regression. This approach is illustrated by the development of a model for the conditional inference task using Spohn's (2013) ranking theoretic approach to conditionals.

  19. Influence Analysis of Ranking Data.

    ERIC Educational Resources Information Center

    Poon, Wai-Yin; Chan, Wai

    2002-01-01

    Developed diagnostic measures to identify observations in Thurstonian models for ranking data that unduly influence parameter estimates obtained by the partition maximum likelihood approach of W. Chan and P. Bender (1998). (SLD)

  20. Influence Analysis of Ranking Data.

    ERIC Educational Resources Information Center

    Poon, Wai-Yin; Chan, Wai

    2002-01-01

    Developed diagnostic measures to identify observations in Thurstonian models for ranking data that unduly influence parameter estimates obtained by the partition maximum likelihood approach of W. Chan and P. Bender (1998). (SLD)

  1. Highlighting entanglement of cultures via ranking of multilingual Wikipedia articles.

    PubMed

    Eom, Young-Ho; Shepelyansky, Dima L

    2013-01-01

    How different cultures evaluate a person? Is an important person in one culture is also important in the other culture? We address these questions via ranking of multilingual Wikipedia articles. With three ranking algorithms based on network structure of Wikipedia, we assign ranking to all articles in 9 multilingual editions of Wikipedia and investigate general ranking structure of PageRank, CheiRank and 2DRank. In particular, we focus on articles related to persons, identify top 30 persons for each rank among different editions and analyze distinctions of their distributions over activity fields such as politics, art, science, religion, sport for each edition. We find that local heroes are dominant but also global heroes exist and create an effective network representing entanglement of cultures. The Google matrix analysis of network of cultures shows signs of the Zipf law distribution. This approach allows to examine diversity and shared characteristics of knowledge organization between cultures. The developed computational, data driven approach highlights cultural interconnections in a new perspective. Dated: June 26, 2013.

  2. Highlighting Entanglement of Cultures via Ranking of Multilingual Wikipedia Articles

    PubMed Central

    Eom, Young-Ho; Shepelyansky, Dima L.

    2013-01-01

    How different cultures evaluate a person? Is an important person in one culture is also important in the other culture? We address these questions via ranking of multilingual Wikipedia articles. With three ranking algorithms based on network structure of Wikipedia, we assign ranking to all articles in 9 multilingual editions of Wikipedia and investigate general ranking structure of PageRank, CheiRank and 2DRank. In particular, we focus on articles related to persons, identify top 30 persons for each rank among different editions and analyze distinctions of their distributions over activity fields such as politics, art, science, religion, sport for each edition. We find that local heroes are dominant but also global heroes exist and create an effective network representing entanglement of cultures. The Google matrix analysis of network of cultures shows signs of the Zipf law distribution. This approach allows to examine diversity and shared characteristics of knowledge organization between cultures. The developed computational, data driven approach highlights cultural interconnections in a new perspective. Dated: June 26, 2013 PMID:24098338

  3. Dimension Reduction for Object Ranking

    NASA Astrophysics Data System (ADS)

    Kamishima, Toshihiro; Akaho, Shotaro

    Ordered lists of objects are widely used as representational forms. Such ordered objects include Web search results and bestseller lists. Techniques for processing such ordinal data are being developed, particularly methods for an object ranking task: i.e., learning functions used to sort objects from sample orders. In this article, we propose two dimension reduction methods specifically designed to improve prediction performance in an object ranking task.

  4. Label Ranking Algorithms: A Survey

    NASA Astrophysics Data System (ADS)

    Vembu, Shankar; Gärtner, Thomas

    Label ranking is a complex prediction task where the goal is to map instances to a total order over a finite set of predefined labels. An interesting aspect of this problem is that it subsumes several supervised learning problems, such as multiclass prediction, multilabel classification, and hierarchical classification. Unsurprisingly, there exists a plethora of label ranking algorithms in the literature due, in part, to this versatile nature of the problem. In this paper, we survey these algorithms.

  5. Rank Pooling for Action Recognition.

    PubMed

    Fernando, Basura; Gavves, Efstratios; Oramas M, Jose Oramas; Ghodrati, Amir; Tuytelaars, Tinne

    2017-04-01

    We propose a function-based temporal pooling method that captures the latent structure of the video sequence data - e.g., how frame-level features evolve over time in a video. We show how the parameters of a function that has been fit to the video data can serve as a robust new video representation. As a specific example, we learn a pooling function via ranking machines. By learning to rank the frame-level features of a video in chronological order, we obtain a new representation that captures the video-wide temporal dynamics of a video, suitable for action recognition. Other than ranking functions, we explore different parametric models that could also explain the temporal changes in videos. The proposed functional pooling methods, and rank pooling in particular, is easy to interpret and implement, fast to compute and effective in recognizing a wide variety of actions. We evaluate our method on various benchmarks for generic action, fine-grained action and gesture recognition. Results show that rank pooling brings an absolute improvement of 7-10 average pooling baseline. At the same time, rank pooling is compatible with and complementary to several appearance and local motion based methods and features, such as improved trajectories and deep learning features.

  6. Ranking in evolving complex networks

    NASA Astrophysics Data System (ADS)

    Liao, Hao; Mariani, Manuel Sebastian; Medo, Matúš; Zhang, Yi-Cheng; Zhou, Ming-Yang

    2017-05-01

    Complex networks have emerged as a simple yet powerful framework to represent and analyze a wide range of complex systems. The problem of ranking the nodes and the edges in complex networks is critical for a broad range of real-world problems because it affects how we access online information and products, how success and talent are evaluated in human activities, and how scarce resources are allocated by companies and policymakers, among others. This calls for a deep understanding of how existing ranking algorithms perform, and which are their possible biases that may impair their effectiveness. Many popular ranking algorithms (such as Google's PageRank) are static in nature and, as a consequence, they exhibit important shortcomings when applied to real networks that rapidly evolve in time. At the same time, recent advances in the understanding and modeling of evolving networks have enabled the development of a wide and diverse range of ranking algorithms that take the temporal dimension into account. The aim of this review is to survey the existing ranking algorithms, both static and time-aware, and their applications to evolving networks. We emphasize both the impact of network evolution on well-established static algorithms and the benefits from including the temporal dimension for tasks such as prediction of network traffic, prediction of future links, and identification of significant nodes.

  7. Caipirini: using gene sets to rank literature

    PubMed Central

    2012-01-01

    Background Keeping up-to-date with bioscience literature is becoming increasingly challenging. Several recent methods help meet this challenge by allowing literature search to be launched based on lists of abstracts that the user judges to be 'interesting'. Some methods go further by allowing the user to provide a second input set of 'uninteresting' abstracts; these two input sets are then used to search and rank literature by relevance. In this work we present the service 'Caipirini' (http://caipirini.org) that also allows two input sets, but takes the novel approach of allowing ranking of literature based on one or more sets of genes. Results To evaluate the usefulness of Caipirini, we used two test cases, one related to the human cell cycle, and a second related to disease defense mechanisms in Arabidopsis thaliana. In both cases, the new method achieved high precision in finding literature related to the biological mechanisms underlying the input data sets. Conclusions To our knowledge Caipirini is the first service enabling literature search directly based on biological relevance to gene sets; thus, Caipirini gives the research community a new way to unlock hidden knowledge from gene sets derived via high-throughput experiments. PMID:22297131

  8. Ranking nodes in growing networks: When PageRank fails

    PubMed Central

    Mariani, Manuel Sebastian; Medo, Matúš; Zhang, Yi-Cheng

    2015-01-01

    PageRank is arguably the most popular ranking algorithm which is being applied in real systems ranging from information to biological and infrastructure networks. Despite its outstanding popularity and broad use in different areas of science, the relation between the algorithm’s efficacy and properties of the network on which it acts has not yet been fully understood. We study here PageRank’s performance on a network model supported by real data, and show that realistic temporal effects make PageRank fail in individuating the most valuable nodes for a broad range of model parameters. Results on real data are in qualitative agreement with our model-based findings. This failure of PageRank reveals that the static approach to information filtering is inappropriate for a broad class of growing systems, and suggest that time-dependent algorithms that are based on the temporal linking patterns of these systems are needed to better rank the nodes. PMID:26553630

  9. Ranking structures and rank-rank correlations of countries: The FIFA and UEFA cases

    NASA Astrophysics Data System (ADS)

    Ausloos, Marcel; Cloots, Rudi; Gadomski, Adam; Vitanov, Nikolay K.

    2014-04-01

    Ranking of agents competing with each other in complex systems may lead to paradoxes according to the pre-chosen different measures. A discussion is presented on such rank-rank, similar or not, correlations based on the case of European countries ranked by UEFA and FIFA from different soccer competitions. The first question to be answered is whether an empirical and simple law is obtained for such (self-) organizations of complex sociological systems with such different measuring schemes. It is found that the power law form is not the best description contrary to many modern expectations. The stretched exponential is much more adequate. Moreover, it is found that the measuring rules lead to some inner structures in both cases.

  10. Local Knowledge When Ranking Journals: Reproductive Effects and Resistant Possibilities

    ERIC Educational Resources Information Center

    Canagarajah, Suresh

    2014-01-01

    This article is based on the engagement of a US-based scholar and faculty members in a non-Western university in a mentoring exercise on publishing. It demonstrates how the "list" constructed in a particular academic department in the university for ranking relevant journals for publication has reproductive effects on knowledge…

  11. Relevancy 101

    NASA Technical Reports Server (NTRS)

    Lynnes, Chris; Newman, Doug

    2016-01-01

    Where we present an overview on why relevancy is a problem, how important it is and how we can improve it. The topic of relevancy is becoming increasingly important in earth data discovery as our audience is tuned to the accuracy of standard search engines like Google.

  12. Bayesian Plackett-Luce Mixture Models for Partially Ranked Data.

    PubMed

    Mollica, Cristina; Tardella, Luca

    2017-06-01

    The elicitation of an ordinal judgment on multiple alternatives is often required in many psychological and behavioral experiments to investigate preference/choice orientation of a specific population. The Plackett-Luce model is one of the most popular and frequently applied parametric distributions to analyze rankings of a finite set of items. The present work introduces a Bayesian finite mixture of Plackett-Luce models to account for unobserved sample heterogeneity of partially ranked data. We describe an efficient way to incorporate the latent group structure in the data augmentation approach and the derivation of existing maximum likelihood procedures as special instances of the proposed Bayesian method. Inference can be conducted with the combination of the Expectation-Maximization algorithm for maximum a posteriori estimation and the Gibbs sampling iterative procedure. We additionally investigate several Bayesian criteria for selecting the optimal mixture configuration and describe diagnostic tools for assessing the fitness of ranking distributions conditionally and unconditionally on the number of ranked items. The utility of the novel Bayesian parametric Plackett-Luce mixture for characterizing sample heterogeneity is illustrated with several applications to simulated and real preference ranked data. We compare our method with the frequentist approach and a Bayesian nonparametric mixture model both assuming the Plackett-Luce model as a mixture component. Our analysis on real datasets reveals the importance of an accurate diagnostic check for an appropriate in-depth understanding of the heterogenous nature of the partial ranking data.

  13. Learning to rank image tags with limited training examples.

    PubMed

    Songhe Feng; Zheyun Feng; Rong Jin

    2015-04-01

    With an increasing number of images that are available in social media, image annotation has emerged as an important research topic due to its application in image matching and retrieval. Most studies cast image annotation into a multilabel classification problem. The main shortcoming of this approach is that it requires a large number of training images with clean and complete annotations in order to learn a reliable model for tag prediction. We address this limitation by developing a novel approach that combines the strength of tag ranking with the power of matrix recovery. Instead of having to make a binary decision for each tag, our approach ranks tags in the descending order of their relevance to the given image, significantly simplifying the problem. In addition, the proposed method aggregates the prediction models for different tags into a matrix, and casts tag ranking into a matrix recovery problem. It introduces the matrix trace norm to explicitly control the model complexity, so that a reliable prediction model can be learned for tag ranking even when the tag space is large and the number of training images is limited. Experiments on multiple well-known image data sets demonstrate the effectiveness of the proposed framework for tag ranking compared with the state-of-the-art approaches for image annotation and tag ranking.

  14. The Globalization of College and University Rankings

    ERIC Educational Resources Information Center

    Altbach, Philip G.

    2012-01-01

    In the era of globalization, accountability, and benchmarking, university rankings have achieved a kind of iconic status. The major ones--the Academic Ranking of World Universities (ARWU, or the "Shanghai rankings"), the QS (Quacquarelli Symonds Limited) World University Rankings, and the "Times Higher Education" World…

  15. The Globalization of College and University Rankings

    ERIC Educational Resources Information Center

    Altbach, Philip G.

    2012-01-01

    In the era of globalization, accountability, and benchmarking, university rankings have achieved a kind of iconic status. The major ones--the Academic Ranking of World Universities (ARWU, or the "Shanghai rankings"), the QS (Quacquarelli Symonds Limited) World University Rankings, and the "Times Higher Education" World…

  16. Re-Ranking Algorithms for Name Tagging

    DTIC Science & Technology

    2006-06-01

    incorporating information from relation extraction, event extraction, and coreference. We evaluate three state- of-the-art re-ranking algorithms ( MaxEnt - Rank...select the best analysis. Various supervised learn- ing algorithms have been adapted to the task of re- ranking for NLP systems, such as MaxEnt -Rank... MaxEnt -Rank, SVMRank and a new algorithm, p-Norm Push Ranking – for this problem, and show how an approach based on multi-stage re-ranking can

  17. Identifying Epigenetic Biomarkers using Maximal Relevance and Minimal Redundancy Based Feature Selection for Multi-Omics Data.

    PubMed

    Mallik, Saurav; Bhadra, Tapas; Maulik, Ujjwal

    2017-01-01

    Epigenetic Biomarker discovery is an important task in bioinformatics. In this article, we develop a new framework of identifying statistically significant epigenetic biomarkers using maximal-relevance and minimal-redundancy criterion based feature (gene) selection for multi-omics dataset. Firstly, we determine the genes that have both expression as well as methylation values, and follow normal distribution. Similarly, we identify the genes which consist of both expression and methylation values, but do not follow normal distribution. For each case, we utilize a gene-selection method that provides maximal-relevant, but variable-weighted minimum-redundant genes as top ranked genes. For statistical validation, we apply t-test on both the expression and methylation data consisting of only the normally distributed top ranked genes to determine how many of them are both differentially expressed andmethylated. Similarly, we utilize Limma package for performing non-parametric Empirical Bayes test on both expression and methylation data comprising only the non-normally distributed top ranked genes to identify how many of them are both differentially expressed and methylated. We finally report the top-ranking significant gene-markerswith biological validation. Moreover, our framework improves positive predictive rate and reduces false positive rate in marker identification. In addition, we provide a comparative analysis of our gene-selection method as well as othermethods based on classificationperformances obtained using several well-known classifiers.

  18. Time evolution of Wikipedia network ranking

    NASA Astrophysics Data System (ADS)

    Eom, Young-Ho; Frahm, Klaus M.; Benczúr, András; Shepelyansky, Dima L.

    2013-12-01

    We study the time evolution of ranking and spectral properties of the Google matrix of English Wikipedia hyperlink network during years 2003-2011. The statistical properties of ranking of Wikipedia articles via PageRank and CheiRank probabilities, as well as the matrix spectrum, are shown to be stabilized for 2007-2011. A special emphasis is done on ranking of Wikipedia personalities and universities. We show that PageRank selection is dominated by politicians while 2DRank, which combines PageRank and CheiRank, gives more accent on personalities of arts. The Wikipedia PageRank of universities recovers 80% of top universities of Shanghai ranking during the considered time period.

  19. Let Us Rank Journalism Programs

    ERIC Educational Resources Information Center

    Weber, Joseph

    2014-01-01

    Unlike law, business, and medical schools, as well as universities in general, journalism schools and journalism programs have rarely been ranked. Publishers such as "U.S. News & World Report," "Forbes," "Bloomberg Businessweek," and "Washington Monthly" do not pay them much mind. What is the best…

  20. Let Us Rank Journalism Programs

    ERIC Educational Resources Information Center

    Weber, Joseph

    2014-01-01

    Unlike law, business, and medical schools, as well as universities in general, journalism schools and journalism programs have rarely been ranked. Publishers such as "U.S. News & World Report," "Forbes," "Bloomberg Businessweek," and "Washington Monthly" do not pay them much mind. What is the best…

  1. "Times Higher Education" 100 under 50 Ranking: Old Wine in a New Bottle?

    ERIC Educational Resources Information Center

    Soh, Kaycheng

    2013-01-01

    "Times Higher Education" 100 under 50 ranking is a new twist to the university ranking. It focuses on universities that have a history of 50 years or less with the purpose of offsetting the advantage of prestige of the older ones. This article re-analysed the data publicly available and looked into relevant conceptual and statistical issues. The…

  2. "Times Higher Education" 100 under 50 Ranking: Old Wine in a New Bottle?

    ERIC Educational Resources Information Center

    Soh, Kaycheng

    2013-01-01

    "Times Higher Education" 100 under 50 ranking is a new twist to the university ranking. It focuses on universities that have a history of 50 years or less with the purpose of offsetting the advantage of prestige of the older ones. This article re-analysed the data publicly available and looked into relevant conceptual and statistical issues. The…

  3. Fuzzy Multicriteria Ranking of Aluminium Coating Methods

    NASA Astrophysics Data System (ADS)

    Batzias, A. F.

    2007-12-01

    This work deals with multicriteria ranking of aluminium coating methods. The alternatives used are: sulfuric acid anodization, A1; oxalic acid anodization, A2; chromic acid anodization, A3; phosphoric acid anodization, A4; integral color anodizing, A5; chemical conversion coating, A6; electrostatic powder deposition, A7. The criteria used are: cost of production, f1; environmental friendliness of production process, f2; appearance (texture), f3; reflectivity, f4; response to coloring, f5; corrosion resistance, f6; abrasion resistance, f7; fatigue resistance, f8. Five experts coming from relevant industrial units set grades to the criteria vector and the preference matrix according to a properly modified Delphi method. Sensitivity analysis of the ranked first alternative A1 against the `second best', which was A3 at low and A7 at high resolution levels proved that the solution is robust. The dependence of anodized products quality on upstream processes is presented and the impact of energy price increase on industrial cost is discussed.

  4. An Efficient Web Page Ranking for Semantic Web

    NASA Astrophysics Data System (ADS)

    Chahal, P.; Singh, M.; Kumar, S.

    2014-01-01

    With the enormous amount of information presented on the web, the retrieval of relevant information has become a serious problem and is also the topic of research for last few years. The most common tools to retrieve information from web are search engines like Google. The Search engines are usually based on keyword searching and indexing of web pages. This approach is not very efficient as the result-set of web pages obtained include large irrelevant pages. Sometimes even the entire result-set may contain lot of irrelevant pages for the user. The next generation of search engines must address this problem. Recently, many semantic web search engines have been developed like Ontolook, Swoogle, which help in searching meaningful documents presented on semantic web. In this process the ranking of the retrieved web pages is very crucial. Some attempts have been made in ranking of semantic web pages but still the ranking of these semantic web documents is neither satisfactory and nor up to the user's expectations. In this paper we have proposed a semantic web based document ranking scheme that relies not only on the keywords but also on the conceptual instances present between the keywords. As a result only the relevant page will be on the top of the result-set of searched web pages. We explore all relevant relations between the keywords exploring the user's intention and then calculate the fraction of these relations on each web page to determine their relevance. We have found that this ranking technique gives better results than those by the prevailing methods.

  5. Scalable ranked retrieval using document images

    NASA Astrophysics Data System (ADS)

    Jain, Rajiv; Oard, Douglas W.; Doermann, David

    2013-12-01

    Despite the explosion of text on the Internet, hard copy documents that have been scanned as images still play a significant role for some tasks. The best method to perform ranked retrieval on a large corpus of document images, however, remains an open research question. The most common approach has been to perform text retrieval using terms generated by optical character recognition. This paper, by contrast, examines whether a scalable segmentation-free image retrieval algorithm, which matches sub-images containing text or graphical objects, can provide additional benefit in satisfying a user's information needs on a large, real world dataset. Results on 7 million scanned pages from the CDIP v1.0 test collection show that content based image retrieval finds a substantial number of documents that text retrieval misses, and that when used as a basis for relevance feedback can yield improvements in retrieval effectiveness.

  6. Beyond Zipf’s Law: The Lavalette Rank Function and Its Properties

    PubMed Central

    Miramontes, Pedro; Yang, Yaning; Cocho, Germinal

    2016-01-01

    Although Zipf’s law is widespread in natural and social data, one often encounters situations where one or both ends of the ranked data deviate from the power-law function. Previously we proposed the Beta rank function to improve the fitting of data which does not follow a perfect Zipf’s law. Here we show that when the two parameters in the Beta rank function have the same value, the Lavalette rank function, the probability density function can be derived analytically. We also show both computationally and analytically that Lavalette distribution is approximately equal, though not identical, to the lognormal distribution. We illustrate the utility of Lavalette rank function in several datasets. We also address three analysis issues on the statistical testing of Lavalette fitting function, comparison between Zipf’s law and lognormal distribution through Lavalette function, and comparison between lognormal distribution and Lavalette distribution. PMID:27658296

  7. Ranked retrieval of Computational Biology models

    PubMed Central

    2010-01-01

    Background The study of biological systems demands computational support. If targeting a biological problem, the reuse of existing computational models can save time and effort. Deciding for potentially suitable models, however, becomes more challenging with the increasing number of computational models available, and even more when considering the models' growing complexity. Firstly, among a set of potential model candidates it is difficult to decide for the model that best suits ones needs. Secondly, it is hard to grasp the nature of an unknown model listed in a search result set, and to judge how well it fits for the particular problem one has in mind. Results Here we present an improved search approach for computational models of biological processes. It is based on existing retrieval and ranking methods from Information Retrieval. The approach incorporates annotations suggested by MIRIAM, and additional meta-information. It is now part of the search engine of BioModels Database, a standard repository for computational models. Conclusions The introduced concept and implementation are, to our knowledge, the first application of Information Retrieval techniques on model search in Computational Systems Biology. Using the example of BioModels Database, it was shown that the approach is feasible and extends the current possibilities to search for relevant models. The advantages of our system over existing solutions are that we incorporate a rich set of meta-information, and that we provide the user with a relevance ranking of the models found for a query. Better search capabilities in model databases are expected to have a positive effect on the reuse of existing models. PMID:20701772

  8. Ranking Support Vector Machine with Kernel Approximation

    PubMed Central

    Dou, Yong

    2017-01-01

    Learning to rank algorithm has become important in recent years due to its successful application in information retrieval, recommender system, and computational biology, and so forth. Ranking support vector machine (RankSVM) is one of the state-of-art ranking models and has been favorably used. Nonlinear RankSVM (RankSVM with nonlinear kernels) can give higher accuracy than linear RankSVM (RankSVM with a linear kernel) for complex nonlinear ranking problem. However, the learning methods for nonlinear RankSVM are still time-consuming because of the calculation of kernel matrix. In this paper, we propose a fast ranking algorithm based on kernel approximation to avoid computing the kernel matrix. We explore two types of kernel approximation methods, namely, the Nyström method and random Fourier features. Primal truncated Newton method is used to optimize the pairwise L2-loss (squared Hinge-loss) objective function of the ranking model after the nonlinear kernel approximation. Experimental results demonstrate that our proposed method gets a much faster training speed than kernel RankSVM and achieves comparable or better performance over state-of-the-art ranking algorithms. PMID:28293256

  9. Ranking Support Vector Machine with Kernel Approximation.

    PubMed

    Chen, Kai; Li, Rongchun; Dou, Yong; Liang, Zhengfa; Lv, Qi

    2017-01-01

    Learning to rank algorithm has become important in recent years due to its successful application in information retrieval, recommender system, and computational biology, and so forth. Ranking support vector machine (RankSVM) is one of the state-of-art ranking models and has been favorably used. Nonlinear RankSVM (RankSVM with nonlinear kernels) can give higher accuracy than linear RankSVM (RankSVM with a linear kernel) for complex nonlinear ranking problem. However, the learning methods for nonlinear RankSVM are still time-consuming because of the calculation of kernel matrix. In this paper, we propose a fast ranking algorithm based on kernel approximation to avoid computing the kernel matrix. We explore two types of kernel approximation methods, namely, the Nyström method and random Fourier features. Primal truncated Newton method is used to optimize the pairwise L2-loss (squared Hinge-loss) objective function of the ranking model after the nonlinear kernel approximation. Experimental results demonstrate that our proposed method gets a much faster training speed than kernel RankSVM and achieves comparable or better performance over state-of-the-art ranking algorithms.

  10. A new mutually reinforcing network node and link ranking algorithm

    NASA Astrophysics Data System (ADS)

    Wang, Zhenghua; Dueñas-Osorio, Leonardo; Padgett, Jamie E.

    2015-10-01

    This study proposes a novel Normalized Wide network Ranking algorithm (NWRank) that has the advantage of ranking nodes and links of a network simultaneously. This algorithm combines the mutual reinforcement feature of Hypertext Induced Topic Selection (HITS) and the weight normalization feature of PageRank. Relative weights are assigned to links based on the degree of the adjacent neighbors and the Betweenness Centrality instead of assigning the same weight to every link as assumed in PageRank. Numerical experiment results show that NWRank performs consistently better than HITS, PageRank, eigenvector centrality, and edge betweenness from the perspective of network connectivity and approximate network flow, which is also supported by comparisons with the expensive N-1 benchmark removal criteria based on network efficiency. Furthermore, it can avoid some problems, such as the Tightly Knit Community effect, which exists in HITS. NWRank provides a new inexpensive way to rank nodes and links of a network, which has practical applications, particularly to prioritize resource allocation for upgrade of hierarchical and distributed networks, as well as to support decision making in the design of networks, where node and link importance depend on a balance of local and global integrity.

  11. A new mutually reinforcing network node and link ranking algorithm

    PubMed Central

    Wang, Zhenghua; Dueñas-Osorio, Leonardo; Padgett, Jamie E.

    2015-01-01

    This study proposes a novel Normalized Wide network Ranking algorithm (NWRank) that has the advantage of ranking nodes and links of a network simultaneously. This algorithm combines the mutual reinforcement feature of Hypertext Induced Topic Selection (HITS) and the weight normalization feature of PageRank. Relative weights are assigned to links based on the degree of the adjacent neighbors and the Betweenness Centrality instead of assigning the same weight to every link as assumed in PageRank. Numerical experiment results show that NWRank performs consistently better than HITS, PageRank, eigenvector centrality, and edge betweenness from the perspective of network connectivity and approximate network flow, which is also supported by comparisons with the expensive N-1 benchmark removal criteria based on network efficiency. Furthermore, it can avoid some problems, such as the Tightly Knit Community effect, which exists in HITS. NWRank provides a new inexpensive way to rank nodes and links of a network, which has practical applications, particularly to prioritize resource allocation for upgrade of hierarchical and distributed networks, as well as to support decision making in the design of networks, where node and link importance depend on a balance of local and global integrity. PMID:26492958

  12. Twisted Yangians of small rank

    NASA Astrophysics Data System (ADS)

    Guay, Nicolas; Regelskis, Vidas; Wendlandt, Curtis

    2016-04-01

    We study quantized enveloping algebras called twisted Yangians associated with the symmetric pairs of types CI, BDI, and DIII (in Cartan's classification) when the rank is small. We establish isomorphisms between these twisted Yangians and the well known Olshanskii's twisted Yangians of types AI and AII, and also with the Molev-Ragoucy reflection algebras associated with symmetric pairs of type AIII. We also construct isomorphisms with twisted Yangians in Drinfeld's original presentation.

  13. Knowledge-guided gene ranking by coordinative component analysis.

    PubMed

    Wang, Chen; Xuan, Jianhua; Li, Huai; Wang, Yue; Zhan, Ming; Hoffman, Eric P; Clarke, Robert

    2010-03-30

    In cancer, gene networks and pathways often exhibit dynamic behavior, particularly during the process of carcinogenesis. Thus, it is important to prioritize those genes that are strongly associated with the functionality of a network. Traditional statistical methods are often inept to identify biologically relevant member genes, motivating researchers to incorporate biological knowledge into gene ranking methods. However, current integration strategies are often heuristic and fail to incorporate fully the true interplay between biological knowledge and gene expression data. To improve knowledge-guided gene ranking, we propose a novel method called coordinative component analysis (COCA) in this paper. COCA explicitly captures those genes within a specific biological context that are likely to be expressed in a coordinative manner. Formulated as an optimization problem to maximize the coordinative effort, COCA is designed to first extract the coordinative components based on a partial guidance from knowledge genes and then rank the genes according to their participation strengths. An embedded bootstrapping procedure is implemented to improve statistical robustness of the solutions. COCA was initially tested on simulation data and then on published gene expression microarray data to demonstrate its improved performance as compared to traditional statistical methods. Finally, the COCA approach has been applied to stem cell data to identify biologically relevant genes in signaling pathways. As a result, the COCA approach uncovers novel pathway members that may shed light into the pathway deregulation in cancers. We have developed a new integrative strategy to combine biological knowledge and microarray data for gene ranking. The method utilizes knowledge genes for a guidance to first extract coordinative components, and then rank the genes according to their contribution related to a network or pathway. The experimental results show that such a knowledge-guided strategy

  14. SibRank: Signed bipartite network analysis for neighbor-based collaborative ranking

    NASA Astrophysics Data System (ADS)

    Shams, Bita; Haratizadeh, Saman

    2016-09-01

    Collaborative ranking is an emerging field of recommender systems that utilizes users' preference data rather than rating values. Unfortunately, neighbor-based collaborative ranking has gained little attention despite its more flexibility and justifiability. This paper proposes a novel framework, called SibRank that seeks to improve the state of the art neighbor-based collaborative ranking methods. SibRank represents users' preferences as a signed bipartite network, and finds similar users, through a novel personalized ranking algorithm in signed networks.

  15. A Document Clustering and Ranking System for Exploring MEDLINE Citations

    PubMed Central

    Lin, Yongjing; Li, Wenyuan; Chen, Keke; Liu, Ying

    2007-01-01

    Objective A major problem faced in biomedical informatics involves how best to present information retrieval results. When a single query retrieves many results, simply showing them as a long list often provides poor overview. With a goal of presenting users with reduced sets of relevant citations, this study developed an approach that retrieved and organized MEDLINE citations into different topical groups and prioritized important citations in each group. Design A text mining system framework for automatic document clustering and ranking organized MEDLINE citations following simple PubMed queries. The system grouped the retrieved citations, ranked the citations in each cluster, and generated a set of keywords and MeSH terms to describe the common theme of each cluster. Measurements Several possible ranking functions were compared, including citation count per year (CCPY), citation count (CC), and journal impact factor (JIF). We evaluated this framework by identifying as “important” those articles selected by the Surgical Oncology Society. Results Our results showed that CCPY outperforms CC and JIF, i.e., CCPY better ranked important articles than did the others. Furthermore, our text clustering and knowledge extraction strategy grouped the retrieval results into informative clusters as revealed by the keywords and MeSH terms extracted from the documents in each cluster. Conclusions The text mining system studied effectively integrated text clustering, text summarization, and text ranking and organized MEDLINE retrieval results into different topical groups. PMID:17600104

  16. Detecting determinism with improved sensitivity in time series: Rank-based nonlinear predictability score

    NASA Astrophysics Data System (ADS)

    Naro, Daniel; Rummel, Christian; Schindler, Kaspar; Andrzejak, Ralph G.

    2014-09-01

    The rank-based nonlinear predictability score was recently introduced as a test for determinism in point processes. We here adapt this measure to time series sampled from time-continuous flows. We use noisy Lorenz signals to compare this approach against a classical amplitude-based nonlinear prediction error. Both measures show an almost identical robustness against Gaussian white noise. In contrast, when the amplitude distribution of the noise has a narrower central peak and heavier tails than the normal distribution, the rank-based nonlinear predictability score outperforms the amplitude-based nonlinear prediction error. For this type of noise, the nonlinear predictability score has a higher sensitivity for deterministic structure in noisy signals. It also yields a higher statistical power in a surrogate test of the null hypothesis of linear stochastic correlated signals. We show the high relevance of this improved performance in an application to electroencephalographic (EEG) recordings from epilepsy patients. Here the nonlinear predictability score again appears of higher sensitivity to nonrandomness. Importantly, it yields an improved contrast between signals recorded from brain areas where the first ictal EEG signal changes were detected (focal EEG signals) versus signals recorded from brain areas that were not involved at seizure onset (nonfocal EEG signals).

  17. State Online College Job Market: Ranking the States

    ERIC Educational Resources Information Center

    Carnevale, Anthony; Jayasundera, Tamara; Repnikov, Dmitri; Gulish, Artem

    2015-01-01

    "State Online College Job Market: Ranking the States" analyzes the online college labor market on a state-by-state basis. We examine the geographic distribution of online job ads for college graduates within industries and occupational clusters, and compare the relative strength of the online college labor market across states. We…

  18. Testing for Correlation between Two Journal Ranking Methods: A Comparison of Citation Rankings and Expert Opinion Rankings.

    ERIC Educational Resources Information Center

    Russell, Robert Lowell, Jr.

    This study tests for correlation between two journal ranking methods--citation rankings and expert opinion surveys. Political science professors from four major universities were asked to rank a list of the 20 most highly cited political science journals. Citation data were taken from the "Social Sciences Citation Index Journal Citation…

  19. Class Rank Weighs Down True Learning

    ERIC Educational Resources Information Center

    Guskey, Thomas R.

    2014-01-01

    The process of determining class rank does not help students achieve more or reach higher levels of proficiency. Evidence indicates ranking students may diminish students' motivation. High school educators argue that they are compelled to rank-order graduating students because selective colleges and universities require information about…

  20. Rank Ordering or Judge-Awarded Ratings?

    ERIC Educational Resources Information Center

    Linacre, John M.

    Rank ordering examinees is an easier task for judges than is awarding numerical ratings. A measurement model for rankings based on Rasch's objectivity axioms provides linear, sample-independent and judge-independent measures. Estimates of examinee measures are obtained from the data set of rankings, along with standard errors and fit statistics.…

  1. 14 CFR 1214.1105 - Final ranking.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 14 Aeronautics and Space 5 2011-01-01 2010-01-01 true Final ranking. 1214.1105 Section 1214.1105 Aeronautics and Space NATIONAL AERONAUTICS AND SPACE ADMINISTRATION SPACE FLIGHT NASA Astronaut Candidate Recruitment and Selection Program § 1214.1105 Final ranking. Final rankings will be based on a combination of...

  2. 14 CFR 1214.1105 - Final ranking.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 14 Aeronautics and Space 5 2010-01-01 2010-01-01 false Final ranking. 1214.1105 Section 1214.1105 Aeronautics and Space NATIONAL AERONAUTICS AND SPACE ADMINISTRATION SPACE FLIGHT NASA Astronaut Candidate Recruitment and Selection Program § 1214.1105 Final ranking. Final rankings will be based on a combination of...

  3. 14 CFR 1214.1105 - Final ranking.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 14 Aeronautics and Space 5 2013-01-01 2013-01-01 false Final ranking. 1214.1105 Section 1214.1105 Aeronautics and Space NATIONAL AERONAUTICS AND SPACE ADMINISTRATION SPACE FLIGHT NASA Astronaut Candidate Recruitment and Selection Program § 1214.1105 Final ranking. Final rankings will be based on a combination of...

  4. 14 CFR 1214.1105 - Final ranking.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 14 Aeronautics and Space 5 2012-01-01 2012-01-01 false Final ranking. 1214.1105 Section 1214.1105 Aeronautics and Space NATIONAL AERONAUTICS AND SPACE ADMINISTRATION SPACE FLIGHT NASA Astronaut Candidate Recruitment and Selection Program § 1214.1105 Final ranking. Final rankings will be based on a combination of...

  5. A Comprehensive Analysis of Marketing Journal Rankings

    ERIC Educational Resources Information Center

    Steward, Michelle D.; Lewis, Bruce R.

    2010-01-01

    The purpose of this study is to offer a comprehensive assessment of journal standings in Marketing from two perspectives. The discipline perspective of rankings is obtained from a collection of published journal ranking studies during the past 15 years. The studies in the published ranking stream are assessed for reliability by examining internal…

  6. A Comprehensive Analysis of Marketing Journal Rankings

    ERIC Educational Resources Information Center

    Steward, Michelle D.; Lewis, Bruce R.

    2010-01-01

    The purpose of this study is to offer a comprehensive assessment of journal standings in Marketing from two perspectives. The discipline perspective of rankings is obtained from a collection of published journal ranking studies during the past 15 years. The studies in the published ranking stream are assessed for reliability by examining internal…

  7. The Academic Ranking of World Universities

    ERIC Educational Resources Information Center

    Liu, Nian Cai; Cheng, Ying

    2005-01-01

    Shanghai Jiao Tong University has published on the Internet an Academic Ranking of World Universities that has attracted worldwide attention. Institutions are ranked according to academic or research performance and ranking indicators include major international awards, highly cited researchers in important fields, articles published in selected…

  8. Talking in the Ranks: Gender and Military Discourse

    DTIC Science & Technology

    2007-11-02

    fading from American English , are marked nouns. "Actor", for example, increasingly appears as a non-gendered form of the noun, though the term "actress...34madam" has undergone a semantic, "transfer to values that are ranked low in the relevant society" namely, speakers of American English commonly refer...collectivist, American English reflects American egalitarianism and individualism by placing greater importance in intimacy. In an interesting contrast

  9. The DEPOSIT computer code based on the low rank approximations

    NASA Astrophysics Data System (ADS)

    Litsarev, Mikhail S.; Oseledets, Ivan V.

    2014-10-01

    We present a new version of the DEPOSIT computer code based on the low rank approximations. This approach is based on the two dimensional cross decomposition of matrices and separated representations of analytical functions. The cross algorithm is available in the distributed package and can be used independently. All integration routines related to the computation of the deposited energy T(b) are implemented in a new way (low rank separated representation format on homogeneous meshes). By using this approach a bug in integration routines of previous version of the code was found and fixed in the current version. The total computational time was significantly accelerated and is about several minutes.

  10. Ranking Geochemical Energy Availability in Hydrothermal Ecosystems

    NASA Astrophysics Data System (ADS)

    Holland, M. E.; Shock, E. L.; Meyer-Dombard, D.; Amend, J. P.

    2004-12-01

    The energy available to hyperthermophilic microorganisms in hot springs can be theoretically estimated using thermodynamic calculations based on geochemical measurements. The relative abundance of different geochemical energy sources (the "ranking" of these reactions) in particular hot springs may provide one explanation for the differences in hot spring microbial communities and also facilitate the culture of ecologically-relevant microorganisms. Geochemical sampling of seven Yellowstone National Park hot springs was repeated five times from 1999 to 2004 with the intent to compare the geochemistry and geochemical energy available to microorganisms. These seven hot springs were located in three separate regions of Yellowstone National Park: three hot springs, including Obsidian Pool, were sampled in the Mud Volcano area; two in the Sylvan Springs area (Gibbon Meadows); and one each in Imperial Meadows and Sentinel Meadows (Lower Geyser Basin). The hot springs were 75 to 93° C (with one 65° C exception) and spanned the bulk of the pH range at Yellowstone (pH 1.8 to 7.6). Geochemical measurements made on hot springs included redox-active species containing C, N, O, H, S, and Fe; these species were measured by field spectrophotometry and ion chromatography of fluid samples and gas chromatographic analysis of gas samples. From these measurements chemical affinities were calculated for 179 inorganic reactions which encompass the suite of autotrophic energy sources potentially available in each pool. Composite affinities for each reaction were compiled for each of the seven primary pools. The composite for each pool was assembled from repeat measurements from the primary pool as well as nearby pools with similar geochemistry. Calculations show that over half of these inorganic reactions could provide enough energy for a microorganism to survive, based on the threshold value of energy required by {it E. coli} (20 kJ per mole of electron pairs). Some microorganisms

  11. Issue Management Risk Ranking Systems

    SciTech Connect

    Novack, Steven David; Marshall, Frances Mc Clellan; Stromberg, Howard Merion; Grant, Gary Michael

    1999-06-01

    Thousands of safety issues have been collected on-line at the Idaho National Engineering and Environmental Laboratory (INEEL) as part of the Issue Management Plan. However, there has been no established approach to prioritize collected and future issues. The authors developed a methodology, based on hazards assessment, to identify and risk rank over 5000 safety issues collected at INEEL. This approach required that it was easily applied and understandable for site adaptation and commensurate with the Integrated Safety Plan. High-risk issues were investigated and mitigative/preventive measures were suggested and ranked based on a cost-benefit scheme to provide risk-informed safety measures. This methodology was consistent with other integrated safety management goals and tasks providing a site-wide risk informed decision tool to reduce hazardous conditions and focus resources on high-risk safety issues. As part of the issue management plan, this methodology was incorporated at the issue collection level and training was provided to management to better familiarize decision-makers with concepts of safety and risk. This prioritization methodology and issue dissemination procedure will be discussed. Results of issue prioritization and training efforts will be summarized. Difficulties and advantages of the process will be reported. Development and incorporation of this process into INEELs lessons learned reporting and the site-wide integrated safety management program will be shown with an emphasis on establishing self reliance and ownership of safety issues.

  12. Issue Management Risk Ranking Systems

    SciTech Connect

    F. M. Marshall; G. M. Grant; H. M. Stromberg; S. D. Novack

    1999-06-01

    Thousands of safety issues have been collected on-line at the Idaho National Engineering and Environmental Laboratory (INEEL) as part of the Issue Management Plan. However, there has been no established approach to prioritize collected and future issues. The authors developed a methodology, based on hazards assessment, to identify and risk rank over 5000 safety issues collected at INEEL. This approach required that it was easily applied and understandable for site adaptation and commensurate with the Integrated Safety Plan. High-risk issues were investigated and mitigative/preventive measures were suggested and ranked based on a cost-benefit scheme to provide risk-informed safety measures. This methodology was consistent with other integrated safety management goals and tasks providing a site-wide risk-informed decision tool to reduce hazardous conditions and focus resources on high-risk safety issues. As part of the issue management plan, this methodology was incorporated at the issue collection level and training was provided to management to better familiarize decision-makers with concepts of safety and risk. This prioritization methodology and issue dissemination procedure will be discussed. Results of issue prioritization and training efforts will be summarized. Difficulties and advantages of the process will be reported. Development and incorporation of this process into INEEL's lessons learned reporting and the site-wide integrated safety management program will be shown with an emphasis on establishing self reliance and ownership of safety issues.

  13. Cross ranking of cities and regions: population versus income

    NASA Astrophysics Data System (ADS)

    Cerqueti, Roy; Ausloos, Marcel

    2015-07-01

    This paper explores the relationship between the inner economical structure of communities and their population distribution through a rank-rank analysis of official data, along statistical physics ideas within two techniques. The data is taken on Italian cities. The analysis is performed both at a global (national) and at a more local (regional) level in order to distinguish ‘macro’ and ‘micro’ aspects. First, the rank-size rule is found not to be a standard power law, as in many other studies, but a doubly decreasing power law. Next, the Kendall τ and the Spearman ρ rank correlation coefficients which measure pair concordance and the correlation between fluctuations in two rankings, respectively,—as a correlation function does in thermodynamics, are calculated for finding rank correlation (if any) between demography and wealth. Results show non only global disparities for the whole (country) set, but also (regional) disparities, when comparing the number of cities in regions, the number of inhabitants in cities and that in regions, as well as when comparing the aggregated tax income of the cities and that of regions. Different outliers are pointed out and justified. Interestingly, two classes of cities in the country and two classes of regions in the country are found. ‘Common sense’ social, political, and economic considerations sustain the findings. More importantly, the methods show that they allow to distinguish communities, very clearly, when specific criteria are numerically sound. A specific modeling for the findings is presented, i.e. for the doubly decreasing power law and the two phase system, based on statistics theory, e.g. urn filling. The model ideas can be expected to hold when similar rank relationship features are observed in fields. It is emphasized that the analysis makes more sense than one through a Pearson Π value-value correlation analysis

  14. 24 CFR 599.401 - Ranking of applications.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... Communities § 599.401 Ranking of applications. (a) Ranking order. Rural and urban applications will be ranked... applications ranked first. (b) Separate ranking categories. After initial ranking, both rural and urban... 24 Housing and Urban Development 3 2010-04-01 2010-04-01 false Ranking of applications....

  15. Impact of Doximity Residency Rankings on Emergency Medicine Applicant Rank Lists.

    PubMed

    Peterson, William J; Hopson, Laura R; Khandelwal, Sorabh; White, Melissa; Gallahue, Fiona E; Burkhardt, John; Rolston, Aimee M; Santen, Sally A

    2016-05-01

    This study investigates the impact of the Doximity rankings on the rank list choices made by residency applicants in emergency medicine (EM). We sent an 11-item survey by email to all students who applied to EM residency programs at four different institutions representing diverse geographical regions. Students were asked questions about their perception of Doximity rankings and how it may have impacted their rank list decisions. Response rate was 58% of 1,372 opened electronic surveys. This study found that a majority of medical students applying to residency in EM were aware of the Doximity rankings prior to submitting rank lists (67%). One-quarter of these applicants changed the number of programs and ranks of those programs when completing their rank list based on the Doximity rankings (26%). Though the absolute number of programs changed on the rank lists was small, the results demonstrate that the EM Doximity rankings impact applicant decision-making in ranking residency programs. While applicants do not find the Doximity rankings to be important compared to other factors in the application process, the Doximity rankings result in a small change in residency applicant ranking behavior. This unvalidated ranking, based principally on reputational data rather than objective outcome criteria, thus has the potential to be detrimental to students, programs, and the public. We feel it important for specialties to develop consensus around measurable training outcomes and provide freely accessible metrics for candidate education.

  16. Two-dimensional ranking of Wikipedia articles

    NASA Astrophysics Data System (ADS)

    Zhirov, A. O.; Zhirov, O. V.; Shepelyansky, D. L.

    2010-10-01

    The Library of Babel, described by Jorge Luis Borges, stores an enormous amount of information. The Library exists ab aeterno. Wikipedia, a free online encyclopaedia, becomes a modern analogue of such a Library. Information retrieval and ranking of Wikipedia articles become the challenge of modern society. While PageRank highlights very well known nodes with many ingoing links, CheiRank highlights very communicative nodes with many outgoing links. In this way the ranking becomes two-dimensional. Using CheiRank and PageRank we analyze the properties of two-dimensional ranking of all Wikipedia English articles and show that it gives their reliable classification with rich and nontrivial features. Detailed studies are done for countries, universities, personalities, physicists, chess players, Dow-Jones companies and other categories.

  17. Ranking of simultaneously presented choice options in animal preference experiments.

    PubMed

    Halekoh, Ulrich; Jørgensen, Erik; Bak Jensen, Margit; Pedersen, Lene Juul; Studnitz, Merete; Højsgaard, Søren

    2007-08-01

    We considered experiments where animals chose one of all possible simultaneously presented options. The animals might be observed at repeated occasions. In the ethological literature the analysis is often focused on testing the global hypothesis of no difference in preferences by non-parametric methods. This fails to address the estimation of a ranking. Often this approach cannot adequately reflect the experimental setting and the repeated measurement structure. Therefore, we propose to model the choice probabilities for the options with a multinomial logistic model. The correlation induced by repeated measurements is incorporated by animal specific random intercepts. The ranking of the options is taken as the order of the choice probabilities. Adopting a Bayesian approach samples from the posterior distribution of the choice probabilities provide directly samples from the posterior of the rankings. Based on this an estimate of the ranking and description of its variability can be derived. The computation was performed via Markov chain Monte Carlo sampling and was implemented using WinBUGS. We illustrate our approach with an experiment to determine the preference of pigs for three different rooting materials. The proposed method allowed deriving an overall ranking for different combinations of the materials and the spatial positioning.

  18. Sparse Contextual Activation for Efficient Visual Re-Ranking.

    PubMed

    Bai, Song; Bai, Xiang

    2016-03-01

    In this paper, we propose an extremely efficient algorithm for visual re-ranking. By considering the original pairwise distance in the contextual space, we develop a feature vector called sparse contextual activation (SCA) that encodes the local distribution of an image. Hence, re-ranking task can be simply accomplished by vector comparison under the generalized Jaccard metric, which has its theoretical meaning in the fuzzy set theory. In order to improve the time efficiency of re-ranking procedure, inverted index is successfully introduced to speed up the computation of generalized Jaccard metric. As a result, the average time cost of re-ranking for a certain query can be controlled within 1 ms. Furthermore, inspired by query expansion, we also develop an additional method called local consistency enhancement on the proposed SCA to improve the retrieval performance in an unsupervised manner. On the other hand, the retrieval performance using a single feature may not be satisfactory enough, which inspires us to fuse multiple complementary features for accurate retrieval. Based on SCA, a robust feature fusion algorithm is exploited that also preserves the characteristic of high time efficiency. We assess our proposed method in various visual re-ranking tasks. Experimental results on Princeton shape benchmark (3D object), WM-SRHEC07 (3D competition), YAEL data set B (face), MPEG-7 data set (shape), and Ukbench data set (image) manifest the effectiveness and efficiency of SCA.

  19. Relations Among Some Low-Rank Subspace Recovery Models.

    PubMed

    Zhang, Hongyang; Lin, Zhouchen; Zhang, Chao; Gao, Junbin

    2015-09-01

    Recovering intrinsic low-dimensional subspaces from data distributed on them is a key preprocessing step to many applications. In recent years, a lot of work has modeled subspace recovery as low-rank minimization problems. We find that some representative models, such as robust principal component analysis (R-PCA), robust low-rank representation (R-LRR), and robust latent low-rank representation (R-LatLRR), are actually deeply connected. More specifically, we discover that once a solution to one of the models is obtained, we can obtain the solutions to other models in closed-form formulations. Since R-PCA is the simplest, our discovery makes it the center of low-rank subspace recovery models. Our work has two important implications. First, R-PCA has a solid theoretical foundation. Under certain conditions, we could find globally optimal solutions to these low-rank models at an overwhelming probability, although these models are nonconvex. Second, we can obtain significantly faster algorithms for these models by solving R-PCA first. The computation cost can be further cut by applying low-complexity randomized algorithms, for example, our novel l2,1 filtering algorithm, to R-PCA. Although for the moment the formal proof of our l2,1 filtering algorithm is not yet available, experiments verify the advantages of our algorithm over other state-of-the-art methods based on the alternating direction method.

  20. Network selection: a method for ranked lists selection.

    PubMed

    Cutillo, Luisa; Carissimo, Annamaria; Figini, Silvia

    2012-01-01

    We consider the problem of finding the set of rankings that best represents a given group of orderings on the same collection of elements (preference lists). This problem arises from social choice and voting theory, in which each voter gives a preference on a set of alternatives, and a system outputs a single preference order based on the observed voters' preferences. In this paper, we observe that, if the given set of preference lists is not homogeneous, a unique true underling ranking might not exist. Moreover only the lists that share the highest amount of information should be aggregated, and thus multiple rankings might provide a more feasible solution to the problem. In this light, we propose Network Selection, an algorithm that, given a heterogeneous group of rankings, first discovers the different communities of homogeneous rankings and then combines only the rank orderings belonging to the same community into a single final ordering. Our novel approach is inspired by graph theory; indeed our set of lists can be loosely read as the nodes of a network. As a consequence, only the lists populating the same community in the network would then be aggregated. In order to highlight the strength of our proposal, we show an application both on simulated and on two real datasets, namely a financial and a biological dataset. Experimental results on simulated data show that Network Selection can significantly outperform existing related methods. The other way around, the empirical evidence achieved on real financial data reveals that Network Selection is also able to select the most relevant variables in data mining predictive models, providing a clear superiority in terms of predictive power of the models built. Furthermore, we show the potentiality of our proposal in the bioinformatics field, providing an application to a biological microarray dataset.

  1. Network Selection: A Method for Ranked Lists Selection

    PubMed Central

    Figini, Silvia

    2012-01-01

    We consider the problem of finding the set of rankings that best represents a given group of orderings on the same collection of elements (preference lists). This problem arises from social choice and voting theory, in which each voter gives a preference on a set of alternatives, and a system outputs a single preference order based on the observed voters’ preferences. In this paper, we observe that, if the given set of preference lists is not homogeneous, a unique true underling ranking might not exist. Moreover only the lists that share the highest amount of information should be aggregated, and thus multiple rankings might provide a more feasible solution to the problem. In this light, we propose Network Selection, an algorithm that, given a heterogeneous group of rankings, first discovers the different communities of homogeneous rankings and then combines only the rank orderings belonging to the same community into a single final ordering. Our novel approach is inspired by graph theory; indeed our set of lists can be loosely read as the nodes of a network. As a consequence, only the lists populating the same community in the network would then be aggregated. In order to highlight the strength of our proposal, we show an application both on simulated and on two real datasets, namely a financial and a biological dataset. Experimental results on simulated data show that Network Selection can significantly outperform existing related methods. The other way around, the empirical evidence achieved on real financial data reveals that Network Selection is also able to select the most relevant variables in data mining predictive models, providing a clear superiority in terms of predictive power of the models built. Furthermore, we show the potentiality of our proposal in the bioinformatics field, providing an application to a biological microarray dataset. PMID:22937075

  2. Boolean versus ranked querying for biomedical systematic reviews

    PubMed Central

    2010-01-01

    Background The process of constructing a systematic review, a document that compiles the published evidence pertaining to a specified medical topic, is intensely time-consuming, often taking a team of researchers over a year, with the identification of relevant published research comprising a substantial portion of the effort. The standard paradigm for this information-seeking task is to use Boolean search; however, this leaves the user(s) the requirement of examining every returned result. Further, our experience is that effective Boolean queries for this specific task are extremely difficult to formulate and typically require multiple iterations of refinement before being finalized. Methods We explore the effectiveness of using ranked retrieval as compared to Boolean querying for the purpose of constructing a systematic review. We conduct a series of experiments involving ranked retrieval, using queries defined methodologically, in an effort to understand the practicalities of incorporating ranked retrieval into the systematic search task. Results Our results show that ranked retrieval by itself is not viable for this search task requiring high recall. However, we describe a refinement of the standard Boolean search process and show that ranking within a Boolean result set can improve the overall search performance by providing early indication of the quality of the results, thereby speeding up the iterative query-refinement process. Conclusions Outcomes of experiments suggest that an interactive query-development process using a hybrid ranked and Boolean retrieval system has the potential for significant time-savings over the current search process in the systematic reviewing. PMID:20937152

  3. VisualRank: applying PageRank to large-scale image search.

    PubMed

    Jing, Yushi; Baluja, Shumeet

    2008-11-01

    Because of the relative ease in understanding and processing text, commercial image-search systems often rely on techniques that are largely indistinguishable from text-search. Recently, academic studies have demonstrated the effectiveness of employing image-based features to provide alternative or additional signals. However, it remains uncertain whether such techniques will generalize to a large number of popular web queries, and whether the potential improvement to search quality warrants the additional computational cost. In this work, we cast the image-ranking problem into the task of identifying "authority" nodes on an inferred visual similarity graph and propose VisualRank to analyze the visual link structures among images. The images found to be "authorities" are chosen as those that answer the image-queries well. To understand the performance of such an approach in a real system, we conducted a series of large-scale experiments based on the task of retrieving images for 2000 of the most popular products queries. Our experimental results show significant improvement, in terms of user satisfaction and relevancy, in comparison to the most recent Google Image Search results. Maintaining modest computational cost is vital to ensuring that this procedure can be used in practice; we describe the techniques required to make this system practical for large scale deployment in commercial search engines.

  4. Adaptive linear rank tests for eQTL studies

    PubMed Central

    Szymczak, Silke; Scheinhardt, Markus O.; Zeller, Tanja; Wild, Philipp S.; Blankenberg, Stefan; Ziegler, Andreas

    2013-01-01

    Expression quantitative trait loci (eQTL) studies are performed to identify single-nucleotide polymorphisms that modify average expression values of genes, proteins, or metabolites, depending on the genotype. As expression values are often not normally distributed, statistical methods for eQTL studies should be valid and powerful in these situations. Adaptive tests are promising alternatives to standard approaches, such as the analysis of variance or the Kruskal–Wallis test. In a two-stage procedure, skewness and tail length of the distributions are estimated and used to select one of several linear rank tests. In this study, we compare two adaptive tests that were proposed in the literature using extensive Monte Carlo simulations of a wide range of different symmetric and skewed distributions. We derive a new adaptive test that combines the advantages of both literature-based approaches. The new test does not require the user to specify a distribution. It is slightly less powerful than the locally most powerful rank test for the correct distribution and at least as powerful as the maximin efficiency robust rank test. We illustrate the application of all tests using two examples from different eQTL studies. PMID:22933317

  5. Adaptive linear rank tests for eQTL studies.

    PubMed

    Szymczak, Silke; Scheinhardt, Markus O; Zeller, Tanja; Wild, Philipp S; Blankenberg, Stefan; Ziegler, Andreas

    2013-02-10

    Expression quantitative trait loci (eQTL) studies are performed to identify single-nucleotide polymorphisms that modify average expression values of genes, proteins, or metabolites, depending on the genotype. As expression values are often not normally distributed, statistical methods for eQTL studies should be valid and powerful in these situations. Adaptive tests are promising alternatives to standard approaches, such as the analysis of variance or the Kruskal-Wallis test. In a two-stage procedure, skewness and tail length of the distributions are estimated and used to select one of several linear rank tests. In this study, we compare two adaptive tests that were proposed in the literature using extensive Monte Carlo simulations of a wide range of different symmetric and skewed distributions. We derive a new adaptive test that combines the advantages of both literature-based approaches. The new test does not require the user to specify a distribution. It is slightly less powerful than the locally most powerful rank test for the correct distribution and at least as powerful as the maximin efficiency robust rank test. We illustrate the application of all tests using two examples from different eQTL studies. Copyright © 2012 John Wiley & Sons, Ltd.

  6. Calculating PageRank in a changing network with added or removed edges

    NASA Astrophysics Data System (ADS)

    Engström, Christopher; Silvestrov, Sergei

    2017-01-01

    PageRank was initially developed by S. Brinn and L. Page in 1998 to rank homepages on the Internet using the stationary distribution of a Markov chain created using the web graph. Due to the large size of the web graph and many other real world networks fast methods to calculate PageRank is needed and even if the original way of calculating PageRank using a Power iterations is rather fast, many other approaches have been made to improve the speed further. In this paper we will consider the problem of recalculating PageRank of a changing network where the PageRank of a previous version of the network is known. In particular we will consider the special case of adding or removing edges to a single vertex in the graph or graph component.

  7. Military Education: DOD Needs To Develop Performance Goals and Metrics for Advanced Distributed Learning in Professional Military Education. Report to the Ranking Minority Member Committee on Armed Services, House of Representatives. GAO-04-873

    ERIC Educational Resources Information Center

    US Government Accountability Office, 2004

    2004-01-01

    As part of its transformation to prepare the armed forces to meet current and future challenges, the Department of Defense (DOD) is expanding its use of advanced distributed learning (ADL) techniques in senior- and intermediate-level officer professional military education (PME).To determine whether DOD uses a systematic process for evaluating the…

  8. Improve Biomedical Information Retrieval using Modified Learning to Rank Methods.

    PubMed

    Xu, Bo; Lin, Hongfei; Lin, Yuan; Ma, Yunlong; Yang, Liang; Wang, Jian; Yang, Zhihao

    2016-06-14

    In these years, the number of biomedical articles has increased exponentially, which becomes a problem for biologists to capture all the needed information manually. Information retrieval technologies, as the core of search engines, can deal with the problem automatically, providing users with the needed information. However, it is a great challenge to apply these technologies directly for biomedical retrieval, because of the abundance of domain specific terminologies. To enhance biomedical retrieval, we propose a novel framework based on learning to rank. Learning to rank is a series of state-of-the-art information retrieval techniques, and has been proved effective in many information retrieval tasks. In the proposed framework, we attempt to tackle the problem of the abundance of terminologies by constructing ranking models, which focus on not only retrieving the most relevant documents, but also diversifying the searching results to increase the completeness of the resulting list for a given query. In the model training, we propose two novel document labeling strategies, and combine several traditional retrieval models as learning features. Besides, we also investigate the usefulness of different learning to rank approaches in our framework. Experimental results on TREC Genomics datasets demonstrate the effectiveness of our framework for biomedical information retrieval.

  9. Decision Tree Modeling for Ranking Data

    NASA Astrophysics Data System (ADS)

    Yu, Philip L. H.; Wan, Wai Ming; Lee, Paul H.

    Ranking/preference data arises from many applications in marketing, psychology, and politics. We establish a new decision tree model for the analysis of ranking data by adopting the concept of classification and regression tree. The existing splitting criteria are modified in a way that allows them to precisely measure the impurity of a set of ranking data. Two types of impurity measures for ranking data are introduced, namelyg-wise and top-k measures. Theoretical results show that the new measures exhibit properties of impurity functions. In model assessment, the area under the ROC curve (AUC) is applied to evaluate the tree performance. Experiments are carried out to investigate the predictive performance of the tree model for complete and partially ranked data and promising results are obtained. Finally, a real-world application of the proposed methodology to analyze a set of political rankings data is presented.

  10. Error Analysis of Stochastic Gradient Descent Ranking.

    PubMed

    Chen, Hong; Tang, Yi; Li, Luoqing; Yuan, Yuan; Li, Xuelong; Tang, Yuanyan

    2012-12-31

    Ranking is always an important task in machine learning and information retrieval, e.g., collaborative filtering, recommender systems, drug discovery, etc. A kernel-based stochastic gradient descent algorithm with the least squares loss is proposed for ranking in this paper. The implementation of this algorithm is simple, and an expression of the solution is derived via a sampling operator and an integral operator. An explicit convergence rate for leaning a ranking function is given in terms of the suitable choices of the step size and the regularization parameter. The analysis technique used here is capacity independent and is novel in error analysis of ranking learning. Experimental results on real-world data have shown the effectiveness of the proposed algorithm in ranking tasks, which verifies the theoretical analysis in ranking error.

  11. Dynamics of ranking processes in complex systems.

    PubMed

    Blumm, Nicholas; Ghoshal, Gourab; Forró, Zalán; Schich, Maximilian; Bianconi, Ginestra; Bouchaud, Jean-Philippe; Barabási, Albert-László

    2012-09-21

    The world is addicted to ranking: everything, from the reputation of scientists, journals, and universities to purchasing decisions is driven by measured or perceived differences between them. Here, we analyze empirical data capturing real time ranking in a number of systems, helping to identify the universal characteristics of ranking dynamics. We develop a continuum theory that not only predicts the stability of the ranking process, but shows that a noise-induced phase transition is at the heart of the observed differences in ranking regimes. The key parameters of the continuum theory can be explicitly measured from data, allowing us to predict and experimentally document the existence of three phases that govern ranking stability.

  12. Error analysis of stochastic gradient descent ranking.

    PubMed

    Chen, Hong; Tang, Yi; Li, Luoqing; Yuan, Yuan; Li, Xuelong; Tang, Yuanyan

    2013-06-01

    Ranking is always an important task in machine learning and information retrieval, e.g., collaborative filtering, recommender systems, drug discovery, etc. A kernel-based stochastic gradient descent algorithm with the least squares loss is proposed for ranking in this paper. The implementation of this algorithm is simple, and an expression of the solution is derived via a sampling operator and an integral operator. An explicit convergence rate for leaning a ranking function is given in terms of the suitable choices of the step size and the regularization parameter. The analysis technique used here is capacity independent and is novel in error analysis of ranking learning. Experimental results on real-world data have shown the effectiveness of the proposed algorithm in ranking tasks, which verifies the theoretical analysis in ranking error.

  13. Dynamics of Ranking Processes in Complex Systems

    NASA Astrophysics Data System (ADS)

    Blumm, Nicholas; Ghoshal, Gourab; Forró, Zalán; Schich, Maximilian; Bianconi, Ginestra; Bouchaud, Jean-Philippe; Barabási, Albert-László

    2012-09-01

    The world is addicted to ranking: everything, from the reputation of scientists, journals, and universities to purchasing decisions is driven by measured or perceived differences between them. Here, we analyze empirical data capturing real time ranking in a number of systems, helping to identify the universal characteristics of ranking dynamics. We develop a continuum theory that not only predicts the stability of the ranking process, but shows that a noise-induced phase transition is at the heart of the observed differences in ranking regimes. The key parameters of the continuum theory can be explicitly measured from data, allowing us to predict and experimentally document the existence of three phases that govern ranking stability.

  14. Sample size calculation for testing differences between cure rates with the optimal log-rank test.

    PubMed

    Wu, Jianrong

    2017-01-01

    In this article, sample size calculations are developed for use when the main interest is in the differences between the cure rates of two groups. Following the work of Ewell and Ibrahim, the asymptotic distribution of the weighted log-rank test is derived under the local alternative. The optimal log-rank test under the proportional distributions alternative is discussed, and sample size formulas for the optimal and standard log-rank tests are derived. Simulation results show that the proposed formulas provide adequate sample size estimation for trial designs and that the optimal log-rank test is more efficient than the standard log-rank test, particularly when both cure rates and percentages of censoring are small.

  15. Otto Rank: beginnings, endings, and current experience.

    PubMed

    Novey, R

    1983-01-01

    I have traced the theories of Otto Rank as they appeared in his major technical writings. Against this background, I have discussed references to Rank in past and contemporary psychoanalytic literature. This paper describes three important contributions of Rank--his birth trauma theory, leading to his theory of the birth of the self; his emphasis on present experience (forerunner of the current "here-and-now" theory); and his writings about the creative potential of the termination process.

  16. On Boolean matrices with full factor rank

    SciTech Connect

    Shitov, Ya

    2013-11-30

    It is demonstrated that every (0,1)-matrix of size n×m having Boolean rank n contains a column with at least √n/2−1 zero entries. This bound is shown to be asymptotically optimal. As a corollary, it is established that the size of a full-rank Boolean matrix is bounded from above by a function of its tropical and determinantal ranks. Bibliography: 16 titles.

  17. Robust rankings: Review of multivariate assessments illustrated by the Shanghai rankings.

    PubMed

    Freyer, Leo

    2014-01-01

    Defined errors are entered into data collections in order to test their influence on the reliability of multivariate rankings. Random numbers and real ranking data serve as data origins. In the course of data collection small random errors often lead to a switch in ranking, which can influence the general ranking picture considerably. For stabilisation an objective weighting method is evaluated. The robustness of these rankings is then compared to the original forms. Robust forms of the published Shanghai top 100 rankings are calculated and compared to each other. As a result, the possibilities and restrictions of this type of weighting become recognisable.

  18. Distribution

    Treesearch

    John R. Jones

    1985-01-01

    Quaking aspen is the most widely distributed native North American tree species (Little 1971, Sargent 1890). It grows in a great diversity of regions, environments, and communities (Harshberger 1911). Only one deciduous tree species in the world, the closely related Eurasian aspen (Populus tremula), has a wider range (Weigle and Frothingham 1911)....

  19. Distributions.

    ERIC Educational Resources Information Center

    Bowers, Wayne A.

    This monograph was written for the Conference of the New Instructional Materials in Physics, held at the University of Washington in summer, 1965. It is intended for students who have had an introductory college physics course. It seeks to provide an introduction to the idea of distributions in general, and to some aspects of the subject in…

  20. Augmenting the Deliberative Method for Ranking Risks.

    PubMed

    Susel, Irving; Lasley, Trace; Montezemolo, Mark; Piper, Joel

    2016-01-01

    The Department of Homeland Security (DHS) characterized and prioritized the physical cross-border threats and hazards to the nation stemming from terrorism, market-driven illicit flows of people and goods (illegal immigration, narcotics, funds, counterfeits, and weaponry), and other nonmarket concerns (movement of diseases, pests, and invasive species). These threats and hazards pose a wide diversity of consequences with very different combinations of magnitudes and likelihoods, making it very challenging to prioritize them. This article presents the approach that was used at DHS to arrive at a consensus regarding the threats and hazards that stand out from the rest based on the overall risk they pose. Due to time constraints for the decision analysis, it was not feasible to apply multiattribute methodologies like multiattribute utility theory or the analytic hierarchy process. Using a holistic approach was considered, such as the deliberative method for ranking risks first published in this journal. However, an ordinal ranking alone does not indicate relative or absolute magnitude differences among the risks. Therefore, the use of the deliberative method for ranking risks is not sufficient for deciding whether there is a material difference between the top-ranked and bottom-ranked risks, let alone deciding what the stand-out risks are. To address this limitation of ordinal rankings, the deliberative method for ranking risks was augmented by adding an additional step to transform the ordinal ranking into a ratio scale ranking. This additional step enabled the selection of stand-out risks to help prioritize further analysis.

  1. Ranking chemicals based on chronic toxicity data.

    PubMed

    De Rosa, C T; Stara, J F; Durkin, P R

    1985-12-01

    During the past 3 years, EPA's ECAO/Cincinnati has developed a method to rank chemicals based on chronic toxicity data. This ranking system reflects two primary attributes of every chemical: the minimum effective dose and the type of effect elicited at that dose. The purpose for developing this chronic toxicity ranking system was to provide the EPA with the technical background required to adjust the RQs of hazardous substances designated in Section 101(14) of CERCLA or "Superfund." This approach may have applications to other areas of interest to the EPA and other regulatory agencies where ranking of chemicals based on chronic toxicity is desired.

  2. Rank-based decompositions of morphological templates.

    PubMed

    Sussner, P; Ritter, G X

    2000-01-01

    Methods for matrix decomposition have found numerous applications in image processing, in particular for the problem of template decomposition. Since existing matrix decomposition techniques are mainly concerned with the linear domain, we consider it timely to investigate matrix decomposition techniques in the nonlinear domain with applications in image processing. The mathematical basis for these investigations is the new theory of rank within minimax algebra. Thus far, only minimax decompositions of rank 1 and rank 2 matrices into outer product expansions are known to the image processing community. We derive a heuristic algorithm for the decomposition of matrices having arbitrary rank.

  3. A Comparison of Teacher Rankings of Reading Readiness, Metropolitan Readiness Test Score Rankings, and Socioeconomic Status Rankings of First Graders.

    ERIC Educational Resources Information Center

    Elijah, David V., Jr.

    The purpose of this study was: (1) to determine to what extent teacher rankings of reading readiness compare with reading readiness test results, (2) to determine to what extent teacher rankings of reading readiness compare with pupil socioeconomic status, and (3) to determine to what extent readiness test results compare with pupil socioeconomic…

  4. Statistical regularities in the rank-citation profile of scientists

    PubMed Central

    Petersen, Alexander M.; Stanley, H. Eugene; Succi, Sauro

    2011-01-01

    Recent science of science research shows that scientific impact measures for journals and individual articles have quantifiable regularities across both time and discipline. However, little is known about the scientific impact distribution at the scale of an individual scientist. We analyze the aggregate production and impact using the rank-citation profile ci(r) of 200 distinguished professors and 100 assistant professors. For the entire range of paper rank r, we fit each ci(r) to a common distribution function. Since two scientists with equivalent Hirsch h-index can have significantly different ci(r) profiles, our results demonstrate the utility of the βi scaling parameter in conjunction with hi for quantifying individual publication impact. We show that the total number of citations Ci tallied from a scientist's Ni papers scales as . Such statistical regularities in the input-output patterns of scientists can be used as benchmarks for theoretical models of career progress. PMID:22355696

  5. Statistical regularities in the rank-citation profile of scientists

    NASA Astrophysics Data System (ADS)

    Petersen, Alexander M.; Stanley, H. Eugene; Succi, Sauro

    2011-12-01

    Recent science of science research shows that scientific impact measures for journals and individual articles have quantifiable regularities across both time and discipline. However, little is known about the scientific impact distribution at the scale of an individual scientist. We analyze the aggregate production and impact using the rank-citation profile ci(r) of 200 distinguished professors and 100 assistant professors. For the entire range of paper rank r, we fit each ci(r) to a common distribution function. Since two scientists with equivalent Hirsch h-index can have significantly different ci(r) profiles, our results demonstrate the utility of the βi scaling parameter in conjunction with hi for quantifying individual publication impact. We show that the total number of citations Ci tallied from a scientist's Ni papers scales as . Such statistical regularities in the input-output patterns of scientists can be used as benchmarks for theoretical models of career progress.

  6. Reliable detection of directional couplings using rank statistics.

    PubMed

    Chicharro, Daniel; Andrzejak, Ralph G

    2009-08-01

    To detect directional couplings from time series various measures based on distances in reconstructed state spaces were introduced. These measures can, however, be biased by asymmetries in the dynamics' structure, noise color, or noise level, which are ubiquitous in experimental signals. Using theoretical reasoning and results from model systems we identify the various sources of bias and show that most of them can be eliminated by an appropriate normalization. We furthermore diminish the remaining biases by introducing a measure based on ranks of distances. This rank-based measure outperforms existing distance-based measures concerning both sensitivity and specificity for directional couplings. Therefore, our findings are relevant for a reliable detection of directional couplings from experimental signals.

  7. Iran Mortality and Measures of Risk: Rankings for Public policy

    PubMed Central

    Aalabaf-Sabaghi, M

    2010-01-01

    Background: This paper offers mortality risk rankings for Iranian mortality data. It extends methods to include mixed cohorts, tests changes in mortality risks, compares measures of risk and discusses public policy implications. Methods: The methodology used in risk measures takes current practice and extends it to include variations in population dynamics. The specification is presented and compared with existing literature. Results: Our findings confirm literature results in the re-ordering that takes place when different risk measures are used. In addition, we find there is consistency in risk rankings between 1999 and 2000 records of Iranian mortality data. Thus, these risk measures are stable, robust across time and relay risk information consistently. Conclusions: There are considerable implications in adopting particular risk measures for public policy. However, given properties of risk measures discussed here, it is clear that policy makers can select relevant risk measures depending on their priorities. PMID:23112989

  8. A Spatial Overlay Ranking Method for a Geospatial Search of Text Objects

    USGS Publications Warehouse

    Lanfear, Kenneth J.

    2006-01-01

    Earth-science researchers need the capability to find relevant information by location and topic. Conventional geographic techniques that simply check whether polygons intersect can efficiently achieve a high recall on location, but can not achieve precision for ranking results in likely order of importance to the reader. A spatial overlay ranking based upon how well an object's footprint matches the search area provides a more effective way to spatially search a collection of reports, and avoids many of the problems associated with an 'in/out' (True/False) boolean search. Moreover, spatial overlay ranking appears to work well even when spatial extent is defined only by a simple bounding box.

  9. Ranking Slope Stability in Frozen Terrain

    NASA Astrophysics Data System (ADS)

    Stothoff, S.; Dinwiddie, C. L.; Walter, G. R.; Necsoiu, M.

    2011-12-01

    Motivated by the need to assess the risk of permafrost thaw to infrastructure, such as roads, bridges, and pipelines, a landscape-scale approach was developed to rank the risk of slope failures and thermokarst development in areas of seasonally frozen soils underlain by permafrost. The approach has two parts: (i) identifying locations where permafrost thaw is likely to occur under future climates, and (ii) identifying areas where thaw would have consequences with respect to a disturbance. The developed screening tool uses (i) land classification maps developed from remotely sensed data and (ii) a thermohydrologic hazard risk assessment to identify areas susceptible to slope instability under current and future climate states. The screening tool combines a numerical ground thawing and freezing dynamics model for calculating the thickness of the active layer and depth of permafrost with a simple slope stability model that is based upon the Level I Stability Analysis (LISA) approach of Harrell et al. (1992). Instead of using the numerical models directly within probabilistic sampling, a response function for the factor of safety in slope stability is developed from numerical simulations that systematically vary input parameters across their range of applicability. The response function is used within Monte Carlo sampling for each grid cell in a landscape model, with a probability distribution for each input parameter assigned to each grid cell based on (i) classes defined for each grid cell; (ii) a digital elevation model; (iii) empirical, mathematical, and numerical interpretive models; and (iv) probabilistic descriptions of the parameters in the interpretive models. For example, the root cohesion distribution is defined by vegetation class, with vegetation spread across the landscape using Landsat-derived vegetation classification maps. The probability of slope failure is the fraction of parameter realizations that result in a factor of safety less than 1. Ranking

  10. Physician Location Selection and Distribution. A Bibliography of Relevant Articles, Reports and Data Sources. Health Manpower Policy Discussion Paper Series No. D3.

    ERIC Educational Resources Information Center

    Crane, Stephen C.; Reynolds, Juanita

    This bibliography provides background material on two general issues of how physicians are distributed geographically and how physicians choose a practice location. The report is divided into five major categories of information: overview summary of annotated articles, reference key to location decision factors, reference key to public policy…

  11. The County Health Rankings: rationale and methods.

    PubMed

    Remington, Patrick L; Catlin, Bridget B; Gennuso, Keith P

    2015-01-01

    Annually since 2010, the University of Wisconsin Population Health Institute and the Robert Wood Johnson Foundation have produced the County Health Rankings-a "population health checkup" for the nation's over 3,000 counties. The purpose of this paper is to review the background and rationale for the Rankings, explain in detail the methods we use to create the health rankings in each state, and discuss the strengths and limitations associated with ranking the health of communities. We base the Rankings on a conceptual model of population health that includes both health outcomes (mortality and morbidity) and health factors (health behaviors, clinical care, social and economic factors, and the physical environment). Data for over 30 measures available at the county level are assembled from a number of national sources. Z-scores are calculated for each measure, multiplied by their assigned weights, and summed to create composite measure scores. Composite scores are then ordered and counties are ranked from best to worst health within each state. Health outcomes and related health factors vary significantly within states, with over two-fold differences between the least healthy counties versus the healthiest counties for measures such as premature mortality, teen birth rates, and percent of children living in poverty. Ranking within each state depicts disparities that are not apparent when counties are ranked across the entire nation. The County Health Rankings can be used to clearly demonstrate differences in health by place, raise awareness of the many factors that influence health, and stimulate community health improvement efforts. The Rankings draws upon the human instinct to compete by facilitating comparisons between neighboring or peer counties within states. Since no population health model, or rankings based off such models, will ever perfectly describe the health of its population, we encourage users to look to local sources of data to understand more about

  12. Comparison of SCImago journal rank indicator with journal impact factor.

    PubMed

    Falagas, Matthew E; Kouranos, Vasilios D; Arencibia-Jorge, Ricardo; Karageorgopoulos, Drosos E

    2008-08-01

    The application of currently available sophisticated algorithms of citation analysis allows for the incorporation of the "quality" of citations in the evaluation of scientific journals. We sought to compare the newly introduced SCImago journal rank (SJR) indicator with the journal impact factor (IF). We retrieved relevant information from the official Web sites hosting the above indices and their source databases. The SJR indicator is an open-access resource, while the journal IF requires paid subscription. The SJR indicator (based on Scopus data) lists considerably more journal titles published in a wider variety of countries and languages, than the journal IF (based on Web of Science data). Both indices divide citations to a journal by articles of the journal, during a specific time period. However, contrary to the journal IF, the SJR indicator attributes different weight to citations depending on the "prestige" of the citing journal without the influence of journal self-citations; prestige is estimated with the application of the PageRank algorithm in the network of journals. In addition, the SJR indicator includes the total number of documents of a journal in the denominator of the relevant calculation, whereas the journal IF includes only "citable" articles (mainly original articles and reviews). A 3-yr period is analyzed in both indices but with the use of different approaches. Regarding the top 100 journals in the 2006 journal IF ranking order, the median absolute change in their ranking position with the use of the SJR indicator is 32 (1st quartile: 12; 3rd quartile: 75). Although further validation is warranted, the novel SJR indicator poses as a serious alternative to the well-established journal IF, mainly due to its open-access nature, larger source database, and assessment of the quality of citations.

  13. Time-Aware Service Ranking Prediction in the Internet of Things Environment

    PubMed Central

    Huang, Yuze; Huang, Jiwei; Cheng, Bo; He, Shuqing; Chen, Junliang

    2017-01-01

    With the rapid development of the Internet of things (IoT), building IoT systems with high quality of service (QoS) has become an urgent requirement in both academia and industry. During the procedures of building IoT systems, QoS-aware service selection is an important concern, which requires the ranking of a set of functionally similar services according to their QoS values. In reality, however, it is quite expensive and even impractical to evaluate all geographically-dispersed IoT services at a single client to obtain such a ranking. Nevertheless, distributed measurement and ranking aggregation have to deal with the high dynamics of QoS values and the inconsistency of partial rankings. To address these challenges, we propose a time-aware service ranking prediction approach named TSRPred for obtaining the global ranking from the collection of partial rankings. Specifically, a pairwise comparison model is constructed to describe the relationships between different services, where the partial rankings are obtained by time series forecasting on QoS values. The comparisons of IoT services are formulated by random walks, and thus, the global ranking can be obtained by sorting the steady-state probabilities of the underlying Markov chain. Finally, the efficacy of TSRPred is validated by simulation experiments based on large-scale real-world datasets. PMID:28448451

  14. Time-Aware Service Ranking Prediction in the Internet of Things Environment.

    PubMed

    Huang, Yuze; Huang, Jiwei; Cheng, Bo; He, Shuqing; Chen, Junliang

    2017-04-27

    With the rapid development of the Internet of things (IoT), building IoT systems with high quality of service (QoS) has become an urgent requirement in both academia and industry. During the procedures of building IoT systems, QoS-aware service selection is an important concern, which requires the ranking of a set of functionally similar services according to their QoS values. In reality, however, it is quite expensive and even impractical to evaluate all geographically-dispersed IoT services at a single client to obtain such a ranking. Nevertheless, distributed measurement and ranking aggregation have to deal with the high dynamics of QoS values and the inconsistency of partial rankings. To address these challenges, we propose a time-aware service ranking prediction approach named TSRPred for obtaining the global ranking from the collection of partial rankings. Specifically, a pairwise comparison model is constructed to describe the relationships between different services, where the partial rankings are obtained by time series forecasting on QoS values. The comparisons of IoT services are formulated by random walks, and thus, the global ranking can be obtained by sorting the steady-state probabilities of the underlying Markov chain. Finally, the efficacy of TSRPred is validated by simulation experiments based on large-scale real-world datasets.

  15. Industrial activated sludge exhibit unique bacterial community composition at high taxonomic ranks.

    PubMed

    Ibarbalz, Federico M; Figuerola, Eva L M; Erijman, Leonardo

    2013-07-01

    Biological degradation of domestic and industrial wastewater by activated sludge depends on a common process of separation of the diverse self-assembled and self-sustained microbial flocs from the treated wastewater. Previous surveys of bacterial communities indicated the presence of a common core of bacterial phyla in municipal activated sludge, an observation consistent with the concept of ecological coherence of high taxonomic ranks. The aim of this work was to test whether this critical feature brings about a common pattern of abundance distribution of high bacterial taxa in industrial and domestic activated sludge, and to relate the bacterial community structure of industrial activated sludge with relevant operational parameters. We have applied 454 pyrosequencing of 16S rRNA genes to evaluate bacterial communities in full-scale biological wastewater treatment plants sampled at different times, including seven systems treating wastewater from different industries and one plant that treats domestic wastewater, and compared our datasets with the data from municipal wastewater treatment plants obtained by three different laboratories. We observed that each industrial activated sludge system exhibited a unique bacterial community composition, which is clearly distinct from the common profile of bacterial phyla or classes observed in municipal plants. The influence of process parameters on the bacterial community structure was evaluated using constrained analysis of principal coordinates (CAP). Part of the differences in the bacterial community structure between industrial wastewater treatment systems were explained by dissolved oxygen and pH. Despite the ecological relevance of floc formation for the assembly of bacterial communities in activated sludge, the wastewater characteristics are likely to be the major determinant that drives bacterial composition at high taxonomic ranks.

  16. College Rankings: History, Criticism and Reform

    ERIC Educational Resources Information Center

    Myers, Luke; Robe, Jonathan

    2009-01-01

    Today, college quality rankings in news magazines and guidebooks are a big business with tangible impacts on the operation of higher education institutions. The college rankings published annually by "U.S. News and World Report" ("U.S. News") are so influential that Don Hossler of Indiana University derisively claims that higher education is the…

  17. Public Perception of Cancer Survival Rankings

    ERIC Educational Resources Information Center

    Jensen, Jakob D.; Scherr, Courtney L.; Brown, Natasha; Jones, Christina; Christy, Katheryn

    2013-01-01

    Past research has observed that certain subgroups (e.g., individuals who are overweight/obese) have inaccurate estimates of survival rates for particular cancers (e.g., colon cancer). However, no study has examined whether the lay public can accurately rank cancer survival rates in comparison with one another (i.e., rank cancers from most deadly…

  18. A Rational Method for Ranking Engineering Programs.

    ERIC Educational Resources Information Center

    Glower, Donald D.

    1980-01-01

    Compares two methods for ranking academic programs, the opinion poll v examination of career successes of the program's alumni. For the latter, "Who's Who in Engineering" and levels of research funding provided data. Tables display resulting data and compare rankings by the two methods for chemical engineering and civil engineering. (CS)

  19. Ranking of Scientists: A New Approach.

    ERIC Educational Resources Information Center

    Sen, B. K.; Pandalai, T. A.; Karanjai, Aruna

    1998-01-01

    Proposes a formula for the ranking of scientists based on diachronous citation counts. Generalizes the fact that the citation-generation potential is not the same for all papers, and states that the proposed method of ranking does not replace peer review, but rather acts as an aid for them. (Author/LRW)

  20. Fundamental Measurement of Rank-Ordered Objects.

    ERIC Educational Resources Information Center

    Linacre, John M.

    A Rasch measurement model can be constructed to meet the requirements of rank ordered data. If multiple rankings of the same objects are available, then the parameters of the objects can be estimated, along with their standard errors and also with statistics summarizing the fit of the data to the measurement model. This paper summarizes the…

  1. Ranking scientific publications: the effect of nonlinearity

    NASA Astrophysics Data System (ADS)

    Yao, Liyang; Wei, Tian; Zeng, An; Fan, Ying; di, Zengru

    2014-10-01

    Ranking the significance of scientific publications is a long-standing challenge. The network-based analysis is a natural and common approach for evaluating the scientific credit of papers. Although the number of citations has been widely used as a metric to rank papers, recently some iterative processes such as the well-known PageRank algorithm have been applied to the citation networks to address this problem. In this paper, we introduce nonlinearity to the PageRank algorithm when aggregating resources from different nodes to further enhance the effect of important papers. The validation of our method is performed on the data of American Physical Society (APS) journals. The results indicate that the nonlinearity improves the performance of the PageRank algorithm in terms of ranking effectiveness, as well as robustness against malicious manipulations. Although the nonlinearity analysis is based on the PageRank algorithm, it can be easily extended to other iterative ranking algorithms and similar improvements are expected.

  2. Embedded feature ranking for ensemble MLP classifiers.

    PubMed

    Windeatt, Terry; Duangsoithong, Rakkrit; Smith, Raymond

    2011-06-01

    A feature ranking scheme for multilayer perceptron (MLP) ensembles is proposed, along with a stopping criterion based upon the out-of-bootstrap estimate. To solve multi-class problems feature ranking is combined with modified error-correcting output coding. Experimental results on benchmark data demonstrate the versatility of the MLP base classifier in removing irrelevant features.

  3. A Ranking Method for Evaluating Constructed Responses

    ERIC Educational Resources Information Center

    Attali, Yigal

    2014-01-01

    This article presents a comparative judgment approach for holistically scored constructed response tasks. In this approach, the grader rank orders (rather than rate) the quality of a small set of responses. A prior automated evaluation of responses guides both set formation and scaling of rankings. Sets are formed to have similar prior scores and…

  4. Ranking scientific publications: the effect of nonlinearity.

    PubMed

    Yao, Liyang; Wei, Tian; Zeng, An; Fan, Ying; Di, Zengru

    2014-10-17

    Ranking the significance of scientific publications is a long-standing challenge. The network-based analysis is a natural and common approach for evaluating the scientific credit of papers. Although the number of citations has been widely used as a metric to rank papers, recently some iterative processes such as the well-known PageRank algorithm have been applied to the citation networks to address this problem. In this paper, we introduce nonlinearity to the PageRank algorithm when aggregating resources from different nodes to further enhance the effect of important papers. The validation of our method is performed on the data of American Physical Society (APS) journals. The results indicate that the nonlinearity improves the performance of the PageRank algorithm in terms of ranking effectiveness, as well as robustness against malicious manipulations. Although the nonlinearity analysis is based on the PageRank algorithm, it can be easily extended to other iterative ranking algorithms and similar improvements are expected.

  5. Mining Feedback in Ranking and Recommendation Systems

    ERIC Educational Resources Information Center

    Zhuang, Ziming

    2009-01-01

    The amount of online information has grown exponentially over the past few decades, and users become more and more dependent on ranking and recommendation systems to address their information seeking needs. The advance in information technologies has enabled users to provide feedback on the utilities of the underlying ranking and recommendation…

  6. A Different Approach to University Rankings

    ERIC Educational Resources Information Center

    Tofallis, Chris

    2012-01-01

    Educationalists are well able to find fault with rankings on numerous grounds and may reject them outright. However, given that they are here to stay, we could also try to improve them wherever possible. All currently published university rankings combine various measures to produce an overall score using an additive approach. The individual…

  7. Ranking scientific publications: the effect of nonlinearity

    PubMed Central

    Yao, Liyang; Wei, Tian; Zeng, An; Fan, Ying; Di, Zengru

    2014-01-01

    Ranking the significance of scientific publications is a long-standing challenge. The network-based analysis is a natural and common approach for evaluating the scientific credit of papers. Although the number of citations has been widely used as a metric to rank papers, recently some iterative processes such as the well-known PageRank algorithm have been applied to the citation networks to address this problem. In this paper, we introduce nonlinearity to the PageRank algorithm when aggregating resources from different nodes to further enhance the effect of important papers. The validation of our method is performed on the data of American Physical Society (APS) journals. The results indicate that the nonlinearity improves the performance of the PageRank algorithm in terms of ranking effectiveness, as well as robustness against malicious manipulations. Although the nonlinearity analysis is based on the PageRank algorithm, it can be easily extended to other iterative ranking algorithms and similar improvements are expected. PMID:25322852

  8. Canadian University Rankings: Buyer Beware Once Again

    ERIC Educational Resources Information Center

    Page, Stewart; Cramer, Kenneth M.; Page, Laura

    2010-01-01

    We present a data-based perspective concerning recent (e.g., 2008) "Maclean's" magazine rankings of Canadian universities, including cluster analysis of the 2008 data. Canadian universities empirically resemble and relate to each other in a manner different from their formal classification and final rank ordering in the…

  9. Rankings and the Global Reputation Race

    ERIC Educational Resources Information Center

    Hazelkorn, Ellen

    2014-01-01

    This chapter delves into the growing influence and impact of rankings on higher education, as a lens through which to view how the race for reputation and status is changing the higher education landscape, both globally and nationally. The author considers the extent to which rankings are driving policy choices and institutional decisions and the…

  10. Rankings and the Global Reputation Race

    ERIC Educational Resources Information Center

    Hazelkorn, Ellen

    2014-01-01

    This chapter delves into the growing influence and impact of rankings on higher education, as a lens through which to view how the race for reputation and status is changing the higher education landscape, both globally and nationally. The author considers the extent to which rankings are driving policy choices and institutional decisions and the…

  11. Public Perception of Cancer Survival Rankings

    ERIC Educational Resources Information Center

    Jensen, Jakob D.; Scherr, Courtney L.; Brown, Natasha; Jones, Christina; Christy, Katheryn

    2013-01-01

    Past research has observed that certain subgroups (e.g., individuals who are overweight/obese) have inaccurate estimates of survival rates for particular cancers (e.g., colon cancer). However, no study has examined whether the lay public can accurately rank cancer survival rates in comparison with one another (i.e., rank cancers from most deadly…

  12. A Ranking Method for Evaluating Constructed Responses

    ERIC Educational Resources Information Center

    Attali, Yigal

    2014-01-01

    This article presents a comparative judgment approach for holistically scored constructed response tasks. In this approach, the grader rank orders (rather than rate) the quality of a small set of responses. A prior automated evaluation of responses guides both set formation and scaling of rankings. Sets are formed to have similar prior scores and…

  13. The Rankings Game: Who's Playing Whom?

    ERIC Educational Resources Information Center

    Burness, John F.

    2008-01-01

    This summer, Forbes magazine published its new rankings of "America's Best Colleges," implying that it had developed a methodology that would give the public the information that it needed to choose a college wisely. "U.S. News & World Report," which in 1983 published the first annual ranking, just announced its latest ratings last week--including…

  14. Gender Equity in Academic Rank and Salary.

    ERIC Educational Resources Information Center

    Smart, John C.

    1991-01-01

    Study of gender disparities in rank/salary of college faculty used causal model to examine variables commonly used in human capital and structural/functional perspectives that have guided most research on gender equity. More than 60 percent of total effect of gender on academic rank/salaries is indirect. Model's usefulness and implications for…

  15. Rankings as a Catalyst: Improving Student Performance

    ERIC Educational Resources Information Center

    Cyr, John; Fyfe, Diane

    2004-01-01

    In this article, the authors discuss why ranking has become so popular in schools. Those who promote ranking believe that it provides information to help parents and students choose a "good school"; that it puts pressure on the underperforming school to improve; and that it provides an opportunity for a school to be scrutinized through a…

  16. University Rankings: Status Quo, Dilemmas, and Prospects

    ERIC Educational Resources Information Center

    Hongcai, Wang

    2009-01-01

    It has been exactly twenty years since the term "university rankings" came into being in China, and people have become relatively rational about the process, after an impetuous beginning. In a sense, the appearance of university rankings in China indicates the birth of something new, or the beginning of social voices in higher education…

  17. The Rankings Game: Who's Playing Whom?

    ERIC Educational Resources Information Center

    Burness, John F.

    2008-01-01

    This summer, Forbes magazine published its new rankings of "America's Best Colleges," implying that it had developed a methodology that would give the public the information that it needed to choose a college wisely. "U.S. News & World Report," which in 1983 published the first annual ranking, just announced its latest ratings last week--including…

  18. Iterative resource allocation for ranking spreaders in complex networks

    NASA Astrophysics Data System (ADS)

    Ren, Zhuo-Ming; Zeng, An; Chen, Duan-Bing; Liao, Hao; Liu, Jian-Guo

    2014-05-01

    Ranking the spreading influence of nodes in networks is a very important issue with wide applications in many different fields. Various topology-based centrality measures have been proposed to identify influential spreaders. However, the spreading influence of a node is usually not only determined by its own centrality but also largely influenced by the centrality of neighbors. To incorporate the centrality information of neighbors in ranking spreaders, we design an iterative resource allocation (IRA) process in which the resource of nodes distributes to their neighbors according to neighbors' centrality. After iterations, the resource amount on each node will be stable and the final resources of nodes are used to rank their spreading influence. The iterative process can be applied to many traditional centrality measures including degree, K-shell, closeness, and betweenness. The validation of our method is based on the susceptible-infected-recovered (SIR) spreading in four representative real datasets. The results show that the ranking accuracy of the traditional centrality measures is remarkably enhanced by IRA.

  19. Ranking tributaries for setting remediation priorities in a TMDL context.

    PubMed

    Stringfellow, William T

    2008-05-01

    The San Joaquin River (SJR) in the Central Valley of California has been designated an impaired waterbody based on its loss of fisheries-related beneficial uses and the river is now subject to regulation under total maximum daily load (TMDL) rules. For impaired waterbodies, numeric standards alone may not be sufficient to establish remediation priorities and priorities must be established by comparing drainages to each other. Data collected as part of regional water quality (WQ) studies in the SJR Valley were not normally distributed, so nonparametric methods based on ranking were used to compare the WQ of individual tributaries and drainages. Normalized rank means (NRMs) were calculated from ranked data and NRMs were mapped to identify priority drainages for WQ improvement activities. NRMs for individual parameters were combined into indexes that are useful for examining the relative importance of different drainages for multiple parameters simultaneously. Indexes were developed for eutrophication and overall WQ. This ranking approach is being proposed as an easily understood, transparent, and scientifically rigorous method to assess the relative WQ impact of individual drainages and set watershed remediation priorities.

  20. Distribution of MdACS3 null alleles in apple (Malus × domestica Borkh.) and its relevance to the fruit ripening characters

    PubMed Central

    Bai, Songling; Wang, Aide; Igarashi, Megumi; Kon, Tomoyuki; Fukasawa-Akada, Tomoko; Li, Tianzhong; Harada, Takeo; Hatsuyama, Yoshimichi

    2012-01-01

    Expression of MdACS3a, one of the ripening-related ACC synthase genes, plays a pivotal role in initiating the burst of ethylene production by MdACS1 in apple fruit. Although previous studies have demonstrated the presence of MdACS3a-null alleles through deficiency of transcription activity or loss of enzyme activity due to amino acid substitution, which may affect the storage properties of certain fruit cultivars, an overall picture of these null alleles in cultivars is still lacking. The present study investigated the distribution of null allelic genes in 103 cultivars and 172 breeding selections by using a simple sequence repeat (SSR) marker linked to them. The results indicated that both allelic genes were widely distributed throughout the examined cultivars and selections, some occurring as the null genotype, either homozygously or heterozygously, with each null allele. The implications of MdACS3a distribution results and the influence of its null allelotypes in fruit characters are discussed. PMID:23136513

  1. ContrastRank: a new method for ranking putative cancer driver genes and classification of tumor samples

    PubMed Central

    Tian, Rui; Basu, Malay K.; Capriotti, Emidio

    2014-01-01

    Motivation: The recent advance in high-throughput sequencing technologies is generating a huge amount of data that are becoming an important resource for deciphering the genotype underlying a given phenotype. Genome sequencing has been extensively applied to the study of the cancer genomes. Although a few methods have been already proposed for the detection of cancer-related genes, their automatic identification is still a challenging task. Using the genomic data made available by The Cancer Genome Atlas Consortium (TCGA), we propose a new prioritization approach based on the analysis of the distribution of putative deleterious variants in a large cohort of cancer samples. Results: In this paper, we present ContastRank, a new method for the prioritization of putative impaired genes in cancer. The method is based on the comparison of the putative defective rate of each gene in tumor versus normal and 1000 genome samples. We show that the method is able to provide a ranked list of putative impaired genes for colon, lung and prostate adenocarcinomas. The list significantly overlaps with the list of known cancer driver genes previously published. More importantly, by using our scoring approach, we can successfully discriminate between TCGA normal and tumor samples. A binary classifier based on ContrastRank score reaches an overall accuracy >90% and the area under the curve (AUC) of receiver operating characteristics (ROC) >0.95 for all the three types of adenocarcinoma analyzed in this paper. In addition, using ContrastRank score, we are able to discriminate the three tumor types with a minimum overall accuracy of 77% and AUC of 0.83. Conclusions: We describe ContrastRank, a method for prioritizing putative impaired genes in cancer. The method is based on the comparison of exome sequencing data from different cohorts and can detect putative cancer driver genes. ContrastRank can also be used to estimate a global score for an individual genome about the risk of

  2. A Ranking Approach to Genomic Selection

    PubMed Central

    Blondel, Mathieu; Onogi, Akio; Iwata, Hiroyoshi; Ueda, Naonori

    2015-01-01

    Background Genomic selection (GS) is a recent selective breeding method which uses predictive models based on whole-genome molecular markers. Until now, existing studies formulated GS as the problem of modeling an individual’s breeding value for a particular trait of interest, i.e., as a regression problem. To assess predictive accuracy of the model, the Pearson correlation between observed and predicted trait values was used. Contributions In this paper, we propose to formulate GS as the problem of ranking individuals according to their breeding value. Our proposed framework allows us to employ machine learning methods for ranking which had previously not been considered in the GS literature. To assess ranking accuracy of a model, we introduce a new measure originating from the information retrieval literature called normalized discounted cumulative gain (NDCG). NDCG rewards more strongly models which assign a high rank to individuals with high breeding value. Therefore, NDCG reflects a prerequisite objective in selective breeding: accurate selection of individuals with high breeding value. Results We conducted a comparison of 10 existing regression methods and 3 new ranking methods on 6 datasets, consisting of 4 plant species and 25 traits. Our experimental results suggest that tree-based ensemble methods including McRank, Random Forests and Gradient Boosting Regression Trees achieve excellent ranking accuracy. RKHS regression and RankSVM also achieve good accuracy when used with an RBF kernel. Traditional regression methods such as Bayesian lasso, wBSR and BayesC were found less suitable for ranking. Pearson correlation was found to correlate poorly with NDCG. Our study suggests two important messages. First, ranking methods are a promising research direction in GS. Second, NDCG can be a useful evaluation measure for GS. PMID:26068103

  3. MedlineRanker: flexible ranking of biomedical literature

    PubMed Central

    Fontaine, Jean-Fred; Barbosa-Silva, Adriano; Schaefer, Martin; Huska, Matthew R.; Muro, Enrique M.; Andrade-Navarro, Miguel A.

    2009-01-01

    The biomedical literature is represented by millions of abstracts available in the Medline database. These abstracts can be queried with the PubMed interface, which provides a keyword-based Boolean search engine. This approach shows limitations in the retrieval of abstracts related to very specific topics, as it is difficult for a non-expert user to find all of the most relevant keywords related to a biomedical topic. Additionally, when searching for more general topics, the same approach may return hundreds of unranked references. To address these issues, text mining tools have been developed to help scientists focus on relevant abstracts. We have implemented the MedlineRanker webserver, which allows a flexible ranking of Medline for a topic of interest without expert knowledge. Given some abstracts related to a topic, the program deduces automatically the most discriminative words in comparison to a random selection. These words are used to score other abstracts, including those from not yet annotated recent publications, which can be then ranked by relevance. We show that our tool can be highly accurate and that it is able to process millions of abstracts in a practical amount of time. MedlineRanker is free for use and is available at http://cbdm.mdc-berlin.de/tools/medlineranker. PMID:19429696

  4. MedlineRanker: flexible ranking of biomedical literature.

    PubMed

    Fontaine, Jean-Fred; Barbosa-Silva, Adriano; Schaefer, Martin; Huska, Matthew R; Muro, Enrique M; Andrade-Navarro, Miguel A

    2009-07-01

    The biomedical literature is represented by millions of abstracts available in the Medline database. These abstracts can be queried with the PubMed interface, which provides a keyword-based Boolean search engine. This approach shows limitations in the retrieval of abstracts related to very specific topics, as it is difficult for a non-expert user to find all of the most relevant keywords related to a biomedical topic. Additionally, when searching for more general topics, the same approach may return hundreds of unranked references. To address these issues, text mining tools have been developed to help scientists focus on relevant abstracts. We have implemented the MedlineRanker webserver, which allows a flexible ranking of Medline for a topic of interest without expert knowledge. Given some abstracts related to a topic, the program deduces automatically the most discriminative words in comparison to a random selection. These words are used to score other abstracts, including those from not yet annotated recent publications, which can be then ranked by relevance. We show that our tool can be highly accurate and that it is able to process millions of abstracts in a practical amount of time. MedlineRanker is free for use and is available at http://cbdm.mdc-berlin.de/tools/medlineranker.

  5. EXAMINING SOCIOECONOMIC HEALTH DISPARITIES USING A RANK-DEPENDENT RÉNYI INDEX

    PubMed Central

    Talih, Makram

    2015-01-01

    The Rényi index (RI) is a one-parameter class of indices that summarize health disparities among population groups by measuring divergence between the distributions of disease burden and population shares of these groups. The rank-dependent RI introduced in this paper is a two-parameter class of health disparity indices that also accounts for the association between socioeconomic rank and health; it may be derived from a rank-dependent social welfare function. Two competing classes are discussed and the rank-dependent RI is shown to be more robust to changes in the distribution of either socioeconomic rank or health. The standard error and sampling distribution of the rank-dependent RI are evaluated using linearization and re-sampling techniques, and the methodology is illustrated using health survey data from the U.S. National Health and Nutrition Examination Survey and registry data from the U.S. Surveillance, Epidemiology and End Results Program. Such data underlie many population-based objectives within the U.S. Healthy People 2020 initiative. The rank-dependent RI provides a unified mathematical framework for eliciting various societal positions with regards to the policies that are tied to such wide-reaching public health initiatives. For example, if population groups with lower socioeconomic position were ascertained to be more likely to utilize costly public programs, then the parameters of the RI could be selected to reflect prioritizing those population groups for intervention or treatment. PMID:26566419

  6. Web Image Search Re-ranking with Click-based Similarity and Typicality.

    PubMed

    Yang, Xiaopeng; Mei, Tao; Zhang, Yong Dong; Liu, Jie; Satoh, Shin'ichi

    2016-07-20

    In image search re-ranking, besides the well known semantic gap, intent gap, which is the gap between the representation of users' query/demand and the real intent of the users, is becoming a major problem restricting the development of image retrieval. To reduce human effects, in this paper, we use image click-through data, which can be viewed as the "implicit feedback" from users, to help overcome the intention gap, and further improve the image search performance. Generally, the hypothesis visually similar images should be close in a ranking list and the strategy images with higher relevance should be ranked higher than others are widely accepted. To obtain satisfying search results, thus, image similarity and the level of relevance typicality are determinate factors correspondingly. However, when measuring image similarity and typicality, conventional re-ranking approaches only consider visual information and initial ranks of images, while overlooking the influence of click-through data. This paper presents a novel re-ranking approach, named spectral clustering re-ranking with click-based similarity and typicality (SCCST). First, to learn an appropriate similarity measurement, we propose click-based multi-feature similarity learning algorithm (CMSL), which conducts metric learning based on clickbased triplets selection, and integrates multiple features into a unified similarity space via multiple kernel learning. Then based on the learnt click-based image similarity measure, we conduct spectral clustering to group visually and semantically similar images into same clusters, and get the final re-rank list by calculating click-based clusters typicality and withinclusters click-based image typicality in descending order. Our experiments conducted on two real-world query-image datasets with diverse representative queries show that our proposed reranking approach can significantly improve initial search results, and outperform several existing re-ranking approaches.

  7. Discoveries far from the lamppost with matrix elements and ranking

    DOE PAGES

    Debnath, Dipsikha; Gainer, James S.; Matchev, Konstantin T.

    2015-04-01

    The prevalence of null results in searches for new physics at the LHC motivates the effort to make these searches as model-independent as possible. We describe procedures for adapting the Matrix Element Method for situations where the signal hypothesis is not known a priori. We also present general and intuitive approaches for performing analyses and presenting results, which involve the flattening of background distributions using likelihood information. The first flattening method involves ranking events by background matrix element, the second involves quantile binning with respect to likelihood (and other) variables, and the third method involves reweighting histograms by the inversemore » of the background distribution.« less

  8. Poisson statistics of PageRank probabilities of Twitter and Wikipedia networks

    NASA Astrophysics Data System (ADS)

    Frahm, Klaus M.; Shepelyansky, Dima L.

    2014-04-01

    We use the methods of quantum chaos and Random Matrix Theory for analysis of statistical fluctuations of PageRank probabilities in directed networks. In this approach the effective energy levels are given by a logarithm of PageRank probability at a given node. After the standard energy level unfolding procedure we establish that the nearest spacing distribution of PageRank probabilities is described by the Poisson law typical for integrable quantum systems. Our studies are done for the Twitter network and three networks of Wikipedia editions in English, French and German. We argue that due to absence of level repulsion the PageRank order of nearby nodes can be easily interchanged. The obtained Poisson law implies that the nearby PageRank probabilities fluctuate as random independent variables.

  9. Citation analysis in journal rankings: medical informatics in the library and information science literature.

    PubMed Central

    Vishwanatham, R

    1998-01-01

    Medical informatics is an interdisciplinary field. Medical informatics articles will be found in the literature of various disciplines including library and information science publications. The purpose of this study was to provide an objectively ranked list of journals that publish medical informatics articles relevant to library and information science. Library Literature, Library and Information Science Abstracts, and Social Science Citation Index were used to identify articles published on the topic of medical informatics and to identify a ranked list of journals. This study also used citation analysis to identify the most frequently cited journals relevant to library and information science. PMID:9803294

  10. Citation analysis in journal rankings: medical informatics in the library and information science literature.

    PubMed

    Vishwanatham, R

    1998-10-01

    Medical informatics is an interdisciplinary field. Medical informatics articles will be found in the literature of various disciplines including library and information science publications. The purpose of this study was to provide an objectively ranked list of journals that publish medical informatics articles relevant to library and information science. Library Literature, Library and Information Science Abstracts, and Social Science Citation Index were used to identify articles published on the topic of medical informatics and to identify a ranked list of journals. This study also used citation analysis to identify the most frequently cited journals relevant to library and information science.

  11. Regulation of gene expression and subcellular protein distribution in MLO-Y4 osteocytic cells by lysophosphatidic acid: Relevance to dendrite outgrowth.

    SciTech Connect

    Waters, Katrina M.; Jacobs, Jon M.; Gritsenko, Marina A.; Karin, Norman J.

    2011-02-26

    Osteoblastic and osteocytic cells are highly responsive to the lipid growth factor lysophosphatidic acid (LPA) but the mechanisms by which LPA alters bone cell functions are largely unknown. A major effect of LPA on osteocytic cells is the stimulation of dendrite membrane outgrowth, a process that we predicted to require changes in gene expression and protein distribution. We employed DNA microarrays for global transcriptional profiling of MLO-Y4 osteocytic cells grown for 6 and 24h in the presence or absence of LPA. We identified 932 transcripts that displayed statistically significant changes in abundance of at least 1.25-fold in response to LPA treatment. Gene ontology (GO) analysis revealed that the regulated gene products were linked to diverse cellular processes, including DNA repair, response to unfolded protein, ossification, protein-RNA complex assembly, and amine biosynthesis. Gene products associated with the regulation of actin microfilament dynamics displayed the most robust expression changes, and LPA-induced dendritogenesis in vitro was blocked by the stress fiber inhibitor cytochalasin D. Mass spectrometry-based proteomic analysis of MLO-Y4 cells revealed significant LPA-induced changes in the abundance of 284 proteins at 6h and 844 proteins at 24h. GO analysis of the proteomic data linked the effects of LPA to cell processes that control of protein distribution and membrane outgrowth, including protein localization, protein complex assembly, Golgi vesicle transport, cytoskeleton-dependent transport, and membrane invagination/endocytosis. Dendrites were isolated from LPA-treated MLO-Y4 cells and subjected to proteomic analysis to quantitatively assess the subcellular distribution of proteins. Sets of 129 and 36 proteins were enriched in the dendrite fraction as compared to whole cells after 6h and 24h of LPA exposure, respectively. Protein markers indicated that membranous organelles were largely excluded from the dendrites. Highly represented among

  12. RankExplorer: Visualization of Ranking Changes in Large Time Series Data.

    PubMed

    Shi, Conglei; Cui, Weiwei; Liu, Shixia; Xu, Panpan; Chen, Wei; Qu, Huamin

    2012-12-01

    For many applications involving time series data, people are often interested in the changes of item values over time as well as their ranking changes. For example, people search many words via search engines like Google and Bing every day. Analysts are interested in both the absolute searching number for each word as well as their relative rankings. Both sets of statistics may change over time. For very large time series data with thousands of items, how to visually present ranking changes is an interesting challenge. In this paper, we propose RankExplorer, a novel visualization method based on ThemeRiver to reveal the ranking changes. Our method consists of four major components: 1) a segmentation method which partitions a large set of time series curves into a manageable number of ranking categories; 2) an extended ThemeRiver view with embedded color bars and changing glyphs to show the evolution of aggregation values related to each ranking category over time as well as the content changes in each ranking category; 3) a trend curve to show the degree of ranking changes over time; 4) rich user interactions to support interactive exploration of ranking changes. We have applied our method to some real time series data and the case studies demonstrate that our method can reveal the underlying patterns related to ranking changes which might otherwise be obscured in traditional visualizations.

  13. Inhibition effect of enteropeptidase on RANKL-RANK signalling by cleavage of RANK.

    PubMed

    Zhao, Yunfeng; Jin, Mengmeng; Ma, Juan; Zhang, Shiqian; Li, Wei; Chen, Yuan; Zhou, Yingsheng; Tao, Hong; Liu, Yu; Wang, Lei; Han, Huamin; Niu, Ge; Tao, Hua; Liu, Changzhen; Gao, Bin

    2013-09-17

    Enteropeptidase can cleave trypsinogen on the sequence of Asp-Asp-Asp-Asp-Lys and plays an important role in food digestion. The RANKL-RANK signalling pathway plays a pivotal role in bone remodelling. In this study, we reported that enteropeptidase can inhibit the RANKL-RANK signalling pathway through the cleavage of RANK. A surrogate peptide blocking assay indicated that enteropeptidase could specifically cleave RANK on the sequence NEEDK. Osteoclast differentiation assay and NF-κB activity assay confirmed that enteropeptidase could inhibit osteoclastogenesis in vitro through the cleavage of RANK. This is the first study to prove that the RANKL-RANK signalling pathway can be inhibited by cleavage of RANK instead of targeting RANKL.

  14. Goal relevance as a quantitative model of human task relevance.

    PubMed

    Tanner, James; Itti, Laurent

    2017-03-01

    The concept of relevance is used ubiquitously in everyday life. However, a general quantitative definition of relevance has been lacking, especially as pertains to quantifying the relevance of sensory observations to one's goals. We propose a theoretical definition for the information value of data observations with respect to a goal, which we call "goal relevance." We consider the probability distribution of an agent's subjective beliefs over how a goal can be achieved. When new data are observed, its goal relevance is measured as the Kullback-Leibler divergence between belief distributions before and after the observation. Theoretical predictions about the relevance of different obstacles in simulated environments agreed with the majority response of 38 human participants in 83.5% of trials, beating multiple machine-learning models. Our new definition of goal relevance is general, quantitative, explicit, and allows one to put a number onto the previously elusive notion of relevance of observations to a goal. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  15. Size-dependent distribution of radiocesium in riverbed sediments and its relevance to the migration of radiocesium in river systems after the Fukushima Daiichi Nuclear Power Plant accident.

    PubMed

    Tanaka, Kazuya; Iwatani, Hokuto; Sakaguchi, Aya; Fan, Qiaohui; Takahashi, Yoshio

    2015-01-01

    We investigated the particle size distribution of radiocesium in riverbed sediments after the Fukushima Daiichi Nuclear Power Plant accident. Riverbed sediments were collected in the Abukuma River system in Fukushima and Miyagi Prefectures. The collected sediments were separated into 11 fractions, ranging from granular size (>2000 μm) to clay size (<2 μm) fractions. Cesium-137 concentrations were higher in the smaller particle size fractions, possibly reflecting specific surface areas and the mineralogy, in particular the clay mineral content. A gap in (137)Cs concentration was observed between the silt size and sand size fractions of riverbed sediments at downstream sites, whereas riverbed sediments at an upstream site did not show such a concentration gap. It is likely that selective transport of small particles in suspended state from upstream areas resulted in an accumulation of radiocesium in downstream areas.

  16. Discriminative Multi-view Interactive Image Re-ranking.

    PubMed

    Li, Jun; Xu, Chang; Yang, Wankou; Sun, Changyin; Tao, Dacheng

    2017-01-10

    -Given unreliable visual patterns and insufficient query information, content-based image retrieval (CBIR) is often suboptimal and requires image re-ranking using auxiliary information. In this paper, we propose Discriminative Multi-view INTeractive Image Re-ranking (DMINTIR), which integrates User Relevance Feedback (URF) capturing users' intentions and multiple features that sufficiently describe the images. In DMINTIR, heterogeneous property features are incorporated in the multi-view learning scheme to exploit their complementarities. In addition, a discriminatively learned weight vector is obtained to reassign updated scores and target images for reranking. Compared to other multi-view learning techniques, our scheme not only generates a compact representation in the latent space from the redundant multi-view features but also maximally preserves the discriminative information in feature encoding by the large-margin principle. Furthermore, the generalization error bound of the proposed algorithm is theoretically analyzed and shown to be improved by the interactions between the latent space and discriminant function learning. Experimental results on two benchmark datasets demonstrate that our approach boosts baseline retrieval quality and is competitive with other state-of-the-art re-ranking strategies.

  17. Statistically efficient tomography of low rank states with incomplete measurements

    NASA Astrophysics Data System (ADS)

    Acharya, Anirudh; Kypraios, Theodore; Guţă, Mădălin

    2016-04-01

    The construction of physically relevant low dimensional state models, and the design of appropriate measurements are key issues in tackling quantum state tomography for large dimensional systems. We consider the statistical problem of estimating low rank states in the set-up of multiple ions tomography, and investigate how the estimation error behaves with a reduction in the number of measurement settings, compared with the standard ion tomography setup. We present extensive simulation results showing that the error is robust with respect to the choice of states of a given rank, the random selection of settings, and that the number of settings can be significantly reduced with only a negligible increase in error. We present an argument to explain these findings based on a concentration inequality for the Fisher information matrix. In the more general setup of random basis measurements we use this argument to show that for certain rank r states it suffices to measure in O(r{log}d) bases to achieve the average Fisher information over all bases. We present numerical evidence for random states of up to eight atoms, which suggests that a similar behaviour holds in the case of Pauli bases measurements, for randomly chosen states. The relation to similar problems in compressed sensing is also discussed.

  18. Ranked set sampling with unequal samples.

    PubMed

    Bhoj, D S

    2001-09-01

    A ranked set sampling procedure with unequal samples (RSSU) is proposed and used to estimate the population mean. This estimator is then compared with the estimators based on the ranked set sampling (RSS) and median ranked set sampling (MRSS) procedures. It is shown that the relative precisions of the estimator based on RSSU are higher than those of the estimators based on RSS and MRSS. An example of estimating the mean diameter at breast height of longleaf-pine trees on the Wade Tract in Thomas County, Georgia, is presented.

  19. Otto Rank and man's urge to immortality.

    PubMed

    Goldwert, M

    1985-04-01

    Otto Rank, one of Sigmund Freud's original followers, posited the existence of an "urge to immortality" as man's deepest drive. In his Psychology and the Soul, Rank traced the desire for immortality through four historical eras, with particular emphasis on the creativity of the hero and the artist. By the end of his life, Rank had not only repudiated orthodox psychoanalysis and developed then abandoned a psychology of the will, he had moved "beyond psychology" to a religious view of history and the nature of man.

  20. Ranking hospitals according to acute myocardial infarction mortality: should transfers be included?

    PubMed

    Kosseim, Mylène; Mayo, Nancy E; Scott, Susan; Hanley, James A; Brophy, James; Gagnon, Bruno; Pilote, Louise

    2006-07-01

    The objective of this population-based observational cohort study was to estimate the extent to which the inclusion/exclusion of transferred patients with acute myocardial infarction (AMI) impacts on hospital performance rankings. The authors studied 91,633 adult patients admitted to 116 acute care hospitals in Quebec, Canada, with a primary diagnosis of AMI between 1992 and 1999. Hospital performance ranks, based on 30-day AMI mortality rates, were estimated with hierarchical models and compared using 3 different methods for handling transferred patients (exclude all transfers; include transfers and assign outcome to the referring hospital; include transfers and assign outcome to the receiving hospital). The explanatory variable of interest was the hospital to which the patient's outcome was attributed. Using the 3 methods, 4 hospitals were ranked "best performers" once, and 1 hospital ranked among the best in 2 of the 3 analyses performed. Nine hospitals were ranked "worst performers" at least once (4 of which ranked among the "worst" once only, 2 ranked among the "worst" twice, and 3 were consistently ranked "worst performers" in all analyses). There was significant variation in mortality rates among hospitals, and the difference in the rates between the highest and lowest ranking hospitals exceeded the clinically relevant benchmark of 1%. Performance evaluation studies that compare hospital mortality rates typically exclude transferred patients. However, methods used to deal with AMI patient transfers influenced hospital ranks when comparing 30-day mortality rates. Excluding transfers may lead to an inaccurate depiction of the quality of healthcare services in regionalized healthcare systems that call for the timely interhospital transfer of patients with AMI.

  1. Impact factor distribution revisited

    NASA Astrophysics Data System (ADS)

    Huang, Ding-wei

    2017-09-01

    We explore the consistency of a new type of frequency distribution, where the corresponding rank distribution is Lavalette distribution. Empirical data of journal impact factors can be well described. This distribution is distinct from Poisson distribution and negative binomial distribution, which were suggested by previous study. By a log transformation, we obtain a bell-shaped distribution, which is then compared to Gaussian and catenary curves. Possible mechanisms behind the shape of impact factor distribution are suggested.

  2. Effectiveness of journal ranking schemes as a tool for locating information.

    PubMed

    Stringer, Michael J; Sales-Pardo, Marta; Nunes Amaral, Luís A

    2008-02-27

    The rise of electronic publishing, preprint archives, blogs, and wikis is raising concerns among publishers, editors, and scientists about the present day relevance of academic journals and traditional peer review. These concerns are especially fuelled by the ability of search engines to automatically identify and sort information. It appears that academic journals can only remain relevant if acceptance of research for publication within a journal allows readers to infer immediate, reliable information on the value of that research. Here, we systematically evaluate the effectiveness of journals, through the work of editors and reviewers, at evaluating unpublished research. We find that the distribution of the number of citations to a paper published in a given journal in a specific year converges to a steady state after a journal-specific transient time, and demonstrate that in the steady state the logarithm of the number of citations has a journal-specific typical value. We then develop a model for the asymptotic number of citations accrued by papers published in a journal that closely matches the data. Our model enables us to quantify both the typical impact and the range of impacts of papers published in a journal. Finally, we propose a journal-ranking scheme that maximizes the efficiency of locating high impact research.

  3. Effectiveness of Journal Ranking Schemes as a Tool for Locating Information

    PubMed Central

    Stringer, Michael J.; Sales-Pardo, Marta; Nunes Amaral, Luís A.

    2008-01-01

    Background The rise of electronic publishing [1], preprint archives, blogs, and wikis is raising concerns among publishers, editors, and scientists about the present day relevance of academic journals and traditional peer review [2]. These concerns are especially fuelled by the ability of search engines to automatically identify and sort information [1]. It appears that academic journals can only remain relevant if acceptance of research for publication within a journal allows readers to infer immediate, reliable information on the value of that research. Methodology/Principal Findings Here, we systematically evaluate the effectiveness of journals, through the work of editors and reviewers, at evaluating unpublished research. We find that the distribution of the number of citations to a paper published in a given journal in a specific year converges to a steady state after a journal-specific transient time, and demonstrate that in the steady state the logarithm of the number of citations has a journal-specific typical value. We then develop a model for the asymptotic number of citations accrued by papers published in a journal that closely matches the data. Conclusions/Significance Our model enables us to quantify both the typical impact and the range of impacts of papers published in a journal. Finally, we propose a journal-ranking scheme that maximizes the efficiency of locating high impact research. PMID:18301760

  4. Moving target imaging using sparse and low-rank structure

    NASA Astrophysics Data System (ADS)

    Mason, Eric; Yazici, Birsen

    2016-05-01

    In this paper we present a method for passive radar detection of ground moving targets using sparsely distributed apertures. We assume the scene is illuminated by a source of opportunity and measure the backscattered signal. We correlate measurements from two different receivers, then form a linear forward model that operates on a rank one, positive semi-definite (PSD) operator, formed by taking the tensor product of the phase-space reflectivity function with its self. Utilizing this structure, image formation and velocity estimation are defined in a constrained optimization framework. Additionally, image formation and velocity estimation are formulated as separate optimization problems, this results in computational savings. Position estimation is posed as a rank one PSD constrained least squares problem. Then, velocity estimation is performed as a cardinality constrained least squares problem, solved using a greedy algorithm. We demonstrate the performance of our method with numerical simulations, demonstrate improvement over back-projection imaging, and evaluate the effect of spatial diversity.

  5. DebtRank-transparency: Controlling systemic risk in financial networks

    PubMed Central

    Thurner, Stefan; Poledna, Sebastian

    2013-01-01

    Nodes in a financial network, such as banks, cannot assess the true risks associated with lending to other nodes in the network, unless they have full information on the riskiness of all other nodes. These risks can be estimated by using network metrics (as DebtRank) of the interbank liability network. With a simple agent based model we show that systemic risk in financial networks can be drastically reduced by increasing transparency, i.e. making the DebtRank of individual banks visible to others, and by imposing a rule, that reduces interbank borrowing from systemically risky nodes. This scheme does not reduce the efficiency of the financial network, but fosters a more homogeneous risk-distribution within the system in a self-organized critical way. The reduction of systemic risk is due to a massive reduction of cascading failures in the transparent system. A regulation-policy implementation of the proposed scheme is discussed. PMID:23712454

  6. Relevance of biotic parameters in the assessment of the spatial distribution of gastrointestinal metal and protein levels during spawning period of European chub (Squalius cephalus L.).

    PubMed

    Filipović Marijić, Vlatka; Raspor, Biserka

    2014-06-01

    The present field study, conducted during the spawning period (April/May) of European chub (Squalius cephalus L.) from the Sava River in Croatia, indicates that seasonal changes of fish physiological state might cause variability in gastrointestinal metal (Cd, Cu, Fe, Mn and Zn), total cytosolic protein and metallothionein (MT) levels. During the period of fish spawning and increased metabolic activity, a significant relationship with chub hepatosomatic index was evident for Fe and Mn in gastrointestinal tissue (r = 0.35 and 0.26, respectively) and in cytosolic fraction (r = 0.32 and 0.41, respectively) and for Zn and Fe in the gut content (r = 0.36 and 0.31, respectively). Total cytosolic protein and MT concentrations followed the same spatial distribution as Fe and Mn in all gastrointestinal fractions and as Zn in the sub-cellular fractions, with higher levels at upstream locations. Due to the role of essential metals in metabolic processes and gonad development, increased feeding and spawning activity in April/May resulted in higher gastrointestinal essential metal (Fe, Mn and Zn) and MT concentrations, which probably follow an increase in Zn concentrations, known as the primary MT inducer. Therefore, biotic factors should be considered as important confounding factors in metal exposure assessment, while their influence on gastrointestinal metal and protein levels should be interpreted depending on the season studied.

  7. Texas Students Rank Prestige of Careers.

    ERIC Educational Resources Information Center

    Hale, Dennis

    1979-01-01

    A survey of 701 Texas high school students revealed that they ranked the prestige of six careers in the following order: (1) minister, (2) television reporter, (3) accountant, (4) policeman, (5) high school teacher, (6) newspaper reporter. (GT)

  8. Green Power Partnership Top Partner Rankings

    EPA Pesticide Factsheets

    EPA's Green Power Partnership is a voluntary program designed to reduce the environmental impact of electricity generation by promoting renewable energy. Top Partner Rankings highlight the annual green power use of leading Green Power Partners.

  9. Superfund Hazard Ranking System Training Course

    EPA Pesticide Factsheets

    The Hazard Ranking System (HRS) training course is a four and ½ day, intermediate-level course designed for personnel who are required to compile, draft, and review preliminary assessments (PAs), site inspections (SIs), and HRS documentation records/packag

  10. Rasch analysis of rank-ordered data.

    PubMed

    Linacre, John M

    2006-01-01

    Theoretical and practical aspects of several methods for the construction of linear measures from rank-ordered data are presented. The final partial-rankings of 356 professional golfers participating in 47 stroke-play tournaments are used for illustration. The methods include decomposing the rankings into independent paired comparisons without ties, into dependent paired comparisons without ties and into independent paired comparisons with ties. A further method, which is easier to implement, entails modeling each tournament as a partial-credit item in which the rank of each golfer is treated as the observation of a category on a partial-credit rating scale. For the golf data, the partial-credit method yields measures with greater face validity than the paired comparison methods. The methods are implemented with the computer programs FACETS and WINSTEPS.

  11. Quantum Navigation and Ranking in Complex Networks

    NASA Astrophysics Data System (ADS)

    Sánchez-Burillo, Eduardo; Duch, Jordi; Gómez-Gardeñes, Jesús; Zueco, David

    2012-08-01

    Complex networks are formal frameworks capturing the interdependencies between the elements of large systems and databases. This formalism allows to use network navigation methods to rank the importance that each constituent has on the global organization of the system. A key example is Pagerank navigation which is at the core of the most used search engine of the World Wide Web. Inspired in this classical algorithm, we define a quantum navigation method providing a unique ranking of the elements of a network. We analyze the convergence of quantum navigation to the stationary rank of networks and show that quantumness decreases the number of navigation steps before convergence. In addition, we show that quantum navigation allows to solve degeneracies found in classical ranks. By implementing the quantum algorithm in real networks, we confirm these improvements and show that quantum coherence unveils new hierarchical features about the global organization of complex systems.

  12. Low-rank coal oil agglomeration

    DOEpatents

    Knudson, Curtis L.; Timpe, Ronald C.

    1991-01-01

    A low-rank coal oil agglomeration process. High mineral content, a high ash content subbituminous coals are effectively agglomerated with a bridging oil which is partially water soluble and capable of entering the pore structure, and usually coal derived.

  13. Quantum Navigation and Ranking in Complex Networks

    PubMed Central

    Sánchez-Burillo, Eduardo; Duch, Jordi; Gómez-Gardeñes, Jesús; Zueco, David

    2012-01-01

    Complex networks are formal frameworks capturing the interdependencies between the elements of large systems and databases. This formalism allows to use network navigation methods to rank the importance that each constituent has on the global organization of the system. A key example is Pagerank navigation which is at the core of the most used search engine of the World Wide Web. Inspired in this classical algorithm, we define a quantum navigation method providing a unique ranking of the elements of a network. We analyze the convergence of quantum navigation to the stationary rank of networks and show that quantumness decreases the number of navigation steps before convergence. In addition, we show that quantum navigation allows to solve degeneracies found in classical ranks. By implementing the quantum algorithm in real networks, we confirm these improvements and show that quantum coherence unveils new hierarchical features about the global organization of complex systems. PMID:22930671

  14. Ranking Forestry Investments With Parametric Linear Programming

    Treesearch

    Paul A. Murphy

    1976-01-01

    Parametric linear programming is introduced as a technique for ranking forestry investments under multiple constraints; it combines the advantages of simple tanking and linear programming as capital budgeting tools.

  15. Multicenter evaluation of MIC distributions for epidemiologic cutoff value definition to detect amphotericin B, posaconazole, and itraconazole resistance among the most clinically relevant species of Mucorales.

    PubMed

    Espinel-Ingroff, A; Chakrabarti, A; Chowdhary, A; Cordoba, S; Dannaoui, E; Dufresne, P; Fothergill, A; Ghannoum, M; Gonzalez, G M; Guarro, J; Kidd, S; Lass-Flörl, C; Meis, J F; Pelaez, T; Tortorano, A M; Turnidge, J

    2015-03-01

    Clinical breakpoints (CBPs) have not been established for the Mucorales and any antifungal agent. In lieu of CBPs, epidemiologic cutoff values (ECVs) are proposed for amphotericin B, posaconazole, and itraconazole and four Mucorales species. Wild-type (WT) MIC distributions (organisms in a species-drug combination with no detectable acquired resistance mechanisms) were defined with available pooled CLSI MICs from 14 laboratories (Argentina, Australia, Canada, Europe, India, Mexico, and the United States) as follows: 10 Apophysomyces variabilis, 32 Cunninghamella bertholletiae, 136 Lichtheimia corymbifera, 10 Mucor indicus, 123 M. circinelloides, 19 M. ramosissimus, 349 Rhizopus arrhizus, 146 R. microsporus, 33 Rhizomucor pusillus, and 36 Syncephalastrum racemosum isolates. CLSI broth microdilution MICs were aggregated for the analyses. ECVs comprising ≥95% and ≥97.5% of the modeled populations were as follows: amphotericin B ECVs for L. corymbifera were 1 and 2 μg/ml, those for M. circinelloides were 1 and 2 μg/ml, those for R. arrhizus were 2 and 4 μg/ml, and those for R. microsporus were 2 and 2 μg/ml, respectively; posaconazole ECVs for L. corymbifera were 1 and 2, those for M. circinelloides were 4 and 4, those for R. arrhizus were 1 and 2, and those for R. microsporus were 1 and 2, respectively; both itraconazole ECVs for R. arrhizus were 2 μg/ml. ECVs may aid in detecting emerging resistance or isolates with reduced susceptibility (non-WT MICs) to the agents evaluated. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  16. A New Powerful Nonparametric Rank Test for Ordered Alternative Problem

    PubMed Central

    Shan, Guogen; Young, Daniel; Kang, Le

    2014-01-01

    We propose a new nonparametric test for ordered alternative problem based on the rank difference between two observations from different groups. These groups are assumed to be independent from each other. The exact mean and variance of the test statistic under the null distribution are derived, and its asymptotic distribution is proven to be normal. Furthermore, an extensive power comparison between the new test and other commonly used tests shows that the new test is generally more powerful than others under various conditions, including the same type of distribution, and mixed distributions. A real example from an anti-hypertensive drug trial is provided to illustrate the application of the tests. The new test is therefore recommended for use in practice due to easy calculation and substantial power gain. PMID:25405757

  17. Block models and personalized PageRank

    PubMed Central

    Kloumann, Isabel M.; Ugander, Johan; Kleinberg, Jon

    2017-01-01

    Methods for ranking the importance of nodes in a network have a rich history in machine learning and across domains that analyze structured data. Recent work has evaluated these methods through the “seed set expansion problem”: given a subset S of nodes from a community of interest in an underlying graph, can we reliably identify the rest of the community? We start from the observation that the most widely used techniques for this problem, personalized PageRank and heat kernel methods, operate in the space of “landing probabilities” of a random walk rooted at the seed set, ranking nodes according to weighted sums of landing probabilities of different length walks. Both schemes, however, lack an a priori relationship to the seed set objective. In this work, we develop a principled framework for evaluating ranking methods by studying seed set expansion applied to the stochastic block model. We derive the optimal gradient for separating the landing probabilities of two classes in a stochastic block model and find, surprisingly, that under reasonable assumptions the gradient is asymptotically equivalent to personalized PageRank for a specific choice of the PageRank parameter α that depends on the block model parameters. This connection provides a formal motivation for the success of personalized PageRank in seed set expansion and node ranking generally. We use this connection to propose more advanced techniques incorporating higher moments of landing probabilities; our advanced methods exhibit greatly improved performance, despite being simple linear classification rules, and are even competitive with belief propagation. PMID:27999183

  18. Monte Carlo simulations guided by imaging to predict the in vitro ranking of radiosensitizing nanoparticles.

    PubMed

    Retif, Paul; Reinhard, Aurélie; Paquot, Héna; Jouan-Hureaux, Valérie; Chateau, Alicia; Sancey, Lucie; Barberi-Heyob, Muriel; Pinel, Sophie; Bastogne, Thierry

    This article addresses the in silico-in vitro prediction issue of organometallic nanoparticles (NPs)-based radiosensitization enhancement. The goal was to carry out computational experiments to quickly identify efficient nanostructures and then to preferentially select the most promising ones for the subsequent in vivo studies. To this aim, this interdisciplinary article introduces a new theoretical Monte Carlo computational ranking method and tests it using 3 different organometallic NPs in terms of size and composition. While the ranking predicted in a classical theoretical scenario did not fit the reference results at all, in contrast, we showed for the first time how our accelerated in silico virtual screening method, based on basic in vitro experimental data (which takes into account the NPs cell biodistribution), was able to predict a relevant ranking in accordance with in vitro clonogenic efficiency. This corroborates the pertinence of such a prior ranking method that could speed up the preclinical development of NPs in radiation therapy.

  19. Monte Carlo simulations guided by imaging to predict the in vitro ranking of radiosensitizing nanoparticles

    PubMed Central

    Retif, Paul; Reinhard, Aurélie; Paquot, Héna; Jouan-Hureaux, Valérie; Chateau, Alicia; Sancey, Lucie; Barberi-Heyob, Muriel; Pinel, Sophie; Bastogne, Thierry

    2016-01-01

    This article addresses the in silico–in vitro prediction issue of organometallic nanoparticles (NPs)-based radiosensitization enhancement. The goal was to carry out computational experiments to quickly identify efficient nanostructures and then to preferentially select the most promising ones for the subsequent in vivo studies. To this aim, this interdisciplinary article introduces a new theoretical Monte Carlo computational ranking method and tests it using 3 different organometallic NPs in terms of size and composition. While the ranking predicted in a classical theoretical scenario did not fit the reference results at all, in contrast, we showed for the first time how our accelerated in silico virtual screening method, based on basic in vitro experimental data (which takes into account the NPs cell biodistribution), was able to predict a relevant ranking in accordance with in vitro clonogenic efficiency. This corroborates the pertinence of such a prior ranking method that could speed up the preclinical development of NPs in radiation therapy. PMID:27920524

  20. Hierarchical Rank Aggregation with Applications to Nanotoxicology.

    PubMed

    Patel, Trina; Telesca, Donatello; Rallo, Robert; George, Saji; Xia, Tian; Nel, André E

    2013-06-01

    The development of high throughput screening (HTS) assays in the field of nanotoxicology provide new opportunities for the hazard assessment and ranking of engineered nanomaterials (ENMs). It is often necessary to rank lists of materials based on multiple risk assessment parameters, often aggregated across several measures of toxicity and possibly spanning an array of experimental platforms. Bayesian models coupled with the optimization of loss functions have been shown to provide an effective framework for conducting inference on ranks. In this article we present various loss-function-based ranking approaches for comparing ENM within experiments and toxicity parameters. Additionally, we propose a framework for the aggregation of ranks across different sources of evidence while allowing for differential weighting of this evidence based on its reliability and importance in risk ranking. We apply these methods to high throughput toxicity data on two human cell-lines, exposed to eight different nanomaterials, and measured in relation to four cytotoxicity outcomes. This article has supplementary material online.

  1. Hierarchical Rank Aggregation with Applications to Nanotoxicology

    PubMed Central

    Telesca, Donatello; Rallo, Robert; George, Saji; Xia, Tian; Nel, André E.

    2014-01-01

    The development of high throughput screening (HTS) assays in the field of nanotoxicology provide new opportunities for the hazard assessment and ranking of engineered nanomaterials (ENMs). It is often necessary to rank lists of materials based on multiple risk assessment parameters, often aggregated across several measures of toxicity and possibly spanning an array of experimental platforms. Bayesian models coupled with the optimization of loss functions have been shown to provide an effective framework for conducting inference on ranks. In this article we present various loss-function-based ranking approaches for comparing ENM within experiments and toxicity parameters. Additionally, we propose a framework for the aggregation of ranks across different sources of evidence while allowing for differential weighting of this evidence based on its reliability and importance in risk ranking. We apply these methods to high throughput toxicity data on two human cell-lines, exposed to eight different nanomaterials, and measured in relation to four cytotoxicity outcomes. This article has supplementary material online. PMID:24839387

  2. Estimation of vanadium water quality benchmarks for the protection of aquatic life with relevance to the Athabasca Oil Sands region using species sensitivity distributions.

    PubMed

    Schiffer, Stephanie; Liber, Karsten

    2017-06-21

    Elevated vanadium (V) concentrations in oil sands coke, which is produced and stored on site of some major Athabasca Oil Sands companies, could pose a risk to aquatic ecosystems in northern Alberta, Canada, depending on its future storage and utilization. In the present study, V toxicity was determined in reconstituted Athabasca River water to various freshwater organisms, including 2 midge species (Chironomus dilutus and Chironomus riparius; 4-d and 30-d to 40-d exposures) and 2 freshwater fish species (Oncorhynchus mykiss and Pimephales promelas; 4-d and 28-d exposures) to facilitate estimation of water quality benchmarks. The acute toxicity of V was 52.0 and 63.2 mg/L for C. dilutus and C. riparius, respectively, and 4.0 and 14.8 mg V/L for P. promelas and O. mykiss, respectively. Vanadium exposure significantly impaired adult emergence of C. dilutus and C. riparius at concentrations ≥16.7 (31.6% reduction) and 8.3 (18.0% reduction) mg/L, respectively. Chronic toxicity in fish presented as lethality, with chronic 28-d LC50s of 0.5 and 4.3 mg/L for P. promelas and O. mykiss, respectively. These data were combined with data from the peer-reviewed literature, and separate acute and chronic species sensitivity distributions (SSDs) were constructed. The acute and chronic hazardous concentrations endangering only 5% of species (HC5) were estimated as 0.64 and 0.05 mg V/L, respectively. These new data for V toxicity to aquatic organisms ensure that there are now adequate data available for regulatory agencies to develop appropriate water quality guidelines for use in the Athabasca Oil Sands region and elsewhere. Until then, the HC5 values presented in the present study could serve as interim benchmarks for the protection of aquatic life from exposure to hazardous levels of V in local aquatic environments. Environ Toxicol Chem 2017;9999:1-11. © 2017 SETAC. © 2017 SETAC.

  3. Low-rank coal study: national needs for resource development. Volume 3. Technology evaluation

    SciTech Connect

    Not Available

    1980-11-01

    Technologies applicable to the development and use of low-rank coals are analyzed in order to identify specific needs for research, development, and demonstration (RD and D). Major sections of the report address the following technologies: extraction; transportation; preparation, handling and storage; conventional combustion and environmental control technology; gasification; liquefaction; and pyrolysis. Each of these sections contains an introduction and summary of the key issues with regard to subbituminous coal and lignite; description of all relevant technology, both existing and under development; a description of related environmental control technology; an evaluation of the effects of low-rank coal properties on the technology; and summaries of current commercial status of the technology and/or current RD and D projects relevant to low-rank coals.

  4. PageRank as a method to rank biomedical literature by importance.

    PubMed

    Yates, Elliot J; Dixon, Louise C

    2015-01-01

    Optimal ranking of literature importance is vital in overcoming article overload. Existing ranking methods are typically based on raw citation counts, giving a sum of 'inbound' links with no consideration of citation importance. PageRank, an algorithm originally developed for ranking webpages at the search engine, Google, could potentially be adapted to bibliometrics to quantify the relative importance weightings of a citation network. This article seeks to validate such an approach on the freely available, PubMed Central open access subset (PMC-OAS) of biomedical literature. On-demand cloud computing infrastructure was used to extract a citation network from over 600,000 full-text PMC-OAS articles. PageRanks and citation counts were calculated for each node in this network. PageRank is highly correlated with citation count (R = 0.905, P < 0.01) and we thus validate the former as a surrogate of literature importance. Furthermore, the algorithm can be run in trivial time on cheap, commodity cluster hardware, lowering the barrier of entry for resource-limited open access organisations. PageRank can be trivially computed on commodity cluster hardware and is linearly correlated with citation count. Given its putative benefits in quantifying relative importance, we suggest it may enrich the citation network, thereby overcoming the existing inadequacy of citation counts alone. We thus suggest PageRank as a feasible supplement to, or replacement of, existing bibliometric ranking methods.

  5. Ranking metrics in gene set enrichment analysis: do they matter?

    PubMed

    Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

    2017-05-12

    There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner

  6. Methods for Ranking and Selection in Large-Scale Inference

    NASA Astrophysics Data System (ADS)

    Henderson, Nicholas C.

    This thesis addresses two distinct problems: one related to ranking and selection for large-scale inference and another related to latent class modeling of longitudinal count data. The first part of the thesis focuses on the problem of identifying leading measurement units from a large collection with a focus on settings with differing levels of estimation precision across measurement units. The main approach presented is a Bayesian ranking procedure that populates the list of top units in a way that maximizes the expected overlap between the true and reported top lists for all list sizes. This procedure relates unit-specific posterior upper tail probabilities with their empirical distribution to yield a ranking variable. It discounts high-variance units less than other common methods and thus achieves improved operating characteristics in the models considered. In the second part of the thesis, we introduce and describe a finite mixture model for longitudinal count data where, conditional on the class label, the subject-specific observations are assumed to arise from a discrete autoregressive process. This approach offers notable computational advantages over related methods due to the within-class closed form of the likelihood function and, as we describe, has a within-class correlation structure which improves model identifiability. We also outline computational strategies for estimating model parameters, and we describe a novel measure of the underlying separation between latent classes and discuss its relation to posterior classification.

  7. Efficiency, Costs, Rankings and Heterogeneity: The Case of US Higher Education

    ERIC Educational Resources Information Center

    Agasisti, Tommaso; Johnes, Geraint

    2015-01-01

    Among the major trends in the higher education (HE) sector, the development of rankings as a policy and managerial tool is of particular relevance. However, despite the diffusion of these instruments, it is still not clear how they relate with traditional performance measures, like unit costs and efficiency scores. In this paper, we estimate a…

  8. An Economist Looks at College Rankings: An Anti-Benthamite View.

    ERIC Educational Resources Information Center

    Orr, Daniel

    1984-01-01

    Opinions about quality are seen as the only relevant measures of quality that can be collected. A different view of quality in higher education, and a different set of university rankings are proposed. A system that draws pairwise comparisons of university performance across nine disciplines is described. (MLW)

  9. Effects of OCR Errors on Ranking and Feedback Using the Vector Space Model.

    ERIC Educational Resources Information Center

    Taghva, Kazem; And Others

    1996-01-01

    Reports on the performance of the vector space model in the presence of OCR (optical character recognition) errors in information retrieval. Highlights include precision and recall, a full-text test collection, smart vector representation, impact of weighting parameters, ranking variability, and the effect of relevance feedback. (Author/LRW)

  10. Efficiency, Costs, Rankings and Heterogeneity: The Case of US Higher Education

    ERIC Educational Resources Information Center

    Agasisti, Tommaso; Johnes, Geraint

    2015-01-01

    Among the major trends in the higher education (HE) sector, the development of rankings as a policy and managerial tool is of particular relevance. However, despite the diffusion of these instruments, it is still not clear how they relate with traditional performance measures, like unit costs and efficiency scores. In this paper, we estimate a…

  11. Forming first-ranked early-type galaxies through hierarchical dissipationless merging

    NASA Astrophysics Data System (ADS)

    Solanes, José M.; Perea, Jaime D.; Darriba, Laura; García-Gómez, Carlos; Bosma, Albert; Athanassoula, Evangelia

    2016-09-01

    We have developed a computationally competitive N-body model of a previrialized aggregation of galaxies in a flat Λ cold dark matter Universe to assess the role of the multiple mergers that take place during the formation stage of such systems in the configuration of the remnants assembled at their centres. An analysis of a suite of 48 simulations of low-mass forming groups (Mtot,gr ˜ 1013 h-1 M⊙) demonstrates that the gravitational dynamics involved in their hierarchical collapse is capable of creating realistic first-ranked galaxies without the aid of dissipative processes. Our simulations indicate that the brightest group galaxies (BGGs) constitute a distinct population from other group members, sketching a scenario in which the assembly path of these objects is dictated largely by the formation of their host system. We detect significant differences in the distribution of Sérsic indices and total magnitudes, as well as a luminosity gap between BGGs and the next brightest galaxy that is positively correlated with the total luminosity of the parent group. Such gaps arise from both the grow of BGGs at the expense of lesser companions and the decrease in the relevance of second-ranked objects in equal measure. This results in a dearth of intermediate-mass galaxies which explains the characteristic central dip detected in their luminosity functions in dynamically young galaxy aggregations. The fact that the basic global properties of our BGGs define a thin mass Fundamental Plane strikingly similar to that followed by giant early-type galaxies in the local Universe reinforces confidence in the results obtained.

  12. Ranking welding intensity in pyroclastic deposits

    NASA Astrophysics Data System (ADS)

    Quane, Steven L.; Russell, James K.

    2005-02-01

    Welding of pyroclastic deposits involves flattening of glassy pyroclasts under a compactional load at temperatures above the glass transition temperature. Progressive welding is recorded by changes in the petrographic (e.g., fabric) and physical (e.g., density) properties of the deposits. Mapping the intensity of welding can be integral to studies of pyroclastic deposits, but making systematic comparisons between deposits can be problematical. Here we develop a scheme for ranking welding intensity in pyroclastic deposits on the basis of petrographic textural observations (e.g., oblateness of pumice lapilli and micro-fabric orientation) and measurements of physical properties, including density, porosity, point load strength and uniaxial compressive strength. Our dataset comprises measurements on 100 samples collected from a single cooling unit of the Bandelier Tuff and parallel measurements on 8 samples of more densely welded deposits. The proposed classification comprises six ranks of welding intensity ranging from unconsolidated (Rank I) to obsidian-like vitrophyre (Rank VI) and should allow for reproducible mapping of subtle variations in welding intensity between different deposits. The application of the ranking scheme is demonstrated by using published physical property data on welded pyroclastic deposits to map the total accumulated strain and to reconstruct their pre-welding thicknesses.

  13. Model diagnostics in reduced-rank estimation

    PubMed Central

    Chen, Kun

    2016-01-01

    Reduced-rank methods are very popular in high-dimensional multivariate analysis for conducting simultaneous dimension reduction and model estimation. However, the commonly-used reduced-rank methods are not robust, as the underlying reduced-rank structure can be easily distorted by only a few data outliers. Anomalies are bound to exist in big data problems, and in some applications they themselves could be of the primary interest. While naive residual analysis is often inadequate for outlier detection due to potential masking and swamping, robust reduced-rank estimation approaches could be computationally demanding. Under Stein's unbiased risk estimation framework, we propose a set of tools, including leverage score and generalized information score, to perform model diagnostics and outlier detection in large-scale reduced-rank estimation. The leverage scores give an exact decomposition of the so-called model degrees of freedom to the observation level, which lead to exact decomposition of many commonly-used information criteria; the resulting quantities are thus named information scores of the observations. The proposed information score approach provides a principled way of combining the residuals and leverage scores for anomaly detection. Simulation studies confirm that the proposed diagnostic tools work well. A pattern recognition example with hand-writing digital images and a time series analysis example with monthly U.S. macroeconomic data further demonstrate the efficacy of the proposed approaches. PMID:28003860

  14. Modeling Area-Level Health Rankings

    PubMed Central

    Courtemanche, Charles; Soneji, Samir; Tchernis, Rusty

    2015-01-01

    Objective Rank county health using a Bayesian factor analysis model. Data Sources Secondary county data from the National Center for Health Statistics (through 2007) and Behavioral Risk Factor Surveillance System (through 2009). Study Design Our model builds on the existing county health rankings (CHRs) by using data-derived weights to compute ranks from mortality and morbidity variables, and by quantifying uncertainty based on population, spatial correlation, and missing data. We apply our model to Wisconsin, which has comprehensive data, and Texas, which has substantial missing information. Data Collection Methods The data were downloaded from www.countyhealthrankings.org. Principal Findings Our estimated rankings are more similar to the CHRs for Wisconsin than Texas, as the data-derived factor weights are closer to the assigned weights for Wisconsin. The correlations between the CHRs and our ranks are 0.89 for Wisconsin and 0.65 for Texas. Uncertainty is especially severe for Texas given the state's substantial missing data. Conclusions The reliability of comprehensive CHRs varies from state to state. We advise focusing on the counties that remain among the least healthy after incorporating alternate weighting methods and accounting for uncertainty. Our results also highlight the need for broader geographic coverage in health data. PMID:26256684

  15. A low rank approach to automatic differentiation.

    SciTech Connect

    Abdel-Khalik, H. S.; Hovland, P. D.; Lyons, A.; Stover, T. E.; Utke, J.; Mathematics and Computer Science; North Carolina State Univ.; Univ. of Chicago

    2008-01-01

    This manuscript introduces a new approach for increasing the efficiency of automatic differentiation (AD) computations for estimating the first order derivatives comprising the Jacobian matrix of a complex large-scale computational model. The objective is to approximate the entire Jacobian matrix with minimized computational and storage resources. This is achieved by finding low rank approximations to a Jacobian matrix via the Efficient Subspace Method (ESM). Low rank Jacobian matrices arise in many of today's important scientific and engineering problems, e.g. nuclear reactor calculations, weather climate modeling, geophysical applications, etc. A low rank approximation replaces the original Jacobian matrix J (whose size is dictated by the size of the input and output data streams) with matrices of much smaller dimensions (determined by the numerical rank of the Jacobian matrix). This process reveals the rank of the Jacobian matrix and can be obtained by ESM via a series of r randomized matrix-vector products of the form: Jq, and J{sup T} {omega} which can be evaluated by the AD forward and reverse modes, respectively.

  16. Social class rank, essentialism, and punitive judgment.

    PubMed

    Kraus, Michael W; Keltner, Dacher

    2013-08-01

    Recent evidence suggests that perceptions of social class rank influence a variety of social cognitive tendencies, from patterns of causal attribution to moral judgment. In the present studies we tested the hypotheses that upper-class rank individuals would be more likely to endorse essentialist lay theories of social class categories (i.e., that social class is founded in genetically based, biological differences) than would lower-class rank individuals and that these beliefs would decrease support for restorative justice--which seeks to rehabilitate offenders, rather than punish unlawful action. Across studies, higher social class rank was associated with increased essentialism of social class categories (Studies 1, 2, and 4) and decreased support for restorative justice (Study 4). Moreover, manipulated essentialist beliefs decreased preferences for restorative justice (Study 3), and the association between social class rank and class-based essentialist theories was explained by the tendency to endorse beliefs in a just world (Study 2). Implications for how class-based essentialist beliefs potentially constrain social opportunity and mobility are discussed.

  17. Groundwater contaminant plume ranking. [UMTRA Project

    SciTech Connect

    Not Available

    1988-08-01

    Containment plumes at Uranium Mill Tailings Remedial Action (UMTRA) Project sites were ranked to assist in Subpart B (i.e., restoration requirements of 40 CFR Part 192) compliance strategies for each site, to prioritize aquifer restoration, and to budget future requests and allocations. The rankings roughly estimate hazards to the environment and human health, and thus assist in determining for which sites cleanup, if appropriate, will provide the greatest benefits for funds available. The rankings are based on the scores that were obtained using the US Department of Energy's (DOE) Modified Hazard Ranking System (MHRS). The MHRS and HRS consider and score three hazard modes for a site: migration, fire and explosion, and direct contact. The migration hazard mode score reflects the potential for harm to humans or the environment from migration of a hazardous substance off a site by groundwater, surface water, and air; it is a composite of separate scores for each of these routes. For ranking the containment plumes at UMTRA Project sites, it was assumed that each site had been remediated in compliance with the EPA standards and that relict contaminant plumes were present. Therefore, only the groundwater route was scored, and the surface water and air routes were not considered. Section 2.0 of this document describes the assumptions and procedures used to score the groundwater route, and Section 3.0 provides the resulting scores for each site. 40 tabs.

  18. Ranking USRDS provider specific SMRs from 1998-2001

    PubMed Central

    Louis, Thomas A.; Paddock, Susan M.; Ridgeway, Greg

    2009-01-01

    Provider profiling (ranking/percentiling) is prevalent in health services research. Bayesian models coupled with optimizing a loss function provide an effective framework for computing non-standard inferences such as ranks. Inferences depend on the posterior distribution and should be guided by inferential goals. However, even optimal methods might not lead to definitive results and ranks should be accompanied by valid uncertainty assessments. We outline the Bayesian approach and use estimated Standardized Mortality Ratios (SMRs) in 1998-2001 from the United States Renal Data System (USRDS) as a platform to identify issues and demonstrate approaches. Our analyses extend Liu et al. (2004) by computing estimates developed by Lin et al. (2006) that minimize errors in classifying providers above or below a percentile cut-point, by combining evidence over multiple years via a first-order, autoregressive model on log(SMR), and by use of a nonparametric prior. Results show that ranks/percentiles based on maximum likelihood estimates of the SMRs and those based on testing whether an SMR = 1 substantially under-perform the optimal estimates. Combining evidence over the four years using the autoregressive model reduces uncertainty, improving performance over percentiles based on only one year. Furthermore, percentiles based on posterior probabilities of exceeding a properly chosen SMR threshold are essentially identical to those produced by minimizing classification loss. Uncertainty measures effectively calibrate performance, showing that considerable uncertainty remains even when using optimal methods. Findings highlight the importance of using loss function guided percentiles and the necessity of accompanying estimates with uncertainty assessments. PMID:19343106

  19. Combining results of microarray experiments: a rank aggregation approach.

    PubMed

    DeConde, Robert P; Hawley, Sarah; Falcon, Seth; Clegg, Nigel; Knudsen, Beatrice; Etzioni, Ruth

    2006-01-01

    As technology for microarray analysis becomes widespread, it is becoming increasingly important to be able to compare and combine the results of experiments that explore the same scientific question. In this article, we present a rank-aggregation approach for combining results from several microarray studies. The motivation for this approach is twofold; first, the final results of microarray studies are typically expressed as lists of genes, rank-ordered by a measure of the strength of evidence that they are functionally involved in the disease process, and second, using the information on this rank-ordered metric means that we do not have to concern ourselves with data on the actual expression levels, which may not be comparable across experiments. Our approach draws on methods for combining top-k lists from the computer science literature on meta-search. The meta-search problem shares several important features with that of combining microarray experiments, including the fact that there are typically few lists with many elements and the elements may not be common to all lists. We implement two meta-search algorithms, which use a Markov chain framework to convert pairwise preferences between list elements into a stationary distribution that represents an aggregate ranking (Dwork et al, 2001). We explore the behavior of the algorithms in hypothetical examples and a simulated dataset and compare their performance with that of an algorithm based on the order-statistics model of Thurstone (Thurstone, 1927). We apply all three algorithms to aggregate the results of five microarray studies of prostate cancer.

  20. An accelerated procedure for recursive feature ranking on microarray data.

    PubMed

    Furlanello, C; Serafini, M; Merler, S; Jurman, G

    2003-01-01

    We describe a new wrapper algorithm for fast feature ranking in classification problems. The Entropy-based Recursive Feature Elimination (E-RFE) method eliminates chunks of uninteresting features according to the entropy of the weights distribution of a SVM classifier. With specific regard to DNA microarray datasets, the method is designed to support computationally intensive model selection in classification problems in which the number of features is much larger than the number of samples. We test E-RFE on synthetic and real data sets, comparing it with other SVM-based methods. The speed-up obtained with E-RFE supports predictive modeling on high dimensional microarray data.

  1. Distribution System White Papers

    EPA Pesticide Factsheets

    EPA worked with stakeholders and developed a series of white papers on distribution system issues ranked of potentially significant public health concern (see list below) to serve as background material for EPA, expert and stakeholder discussions.

  2. Population models and simulation methods: The case of the Spearman rank correlation.

    PubMed

    Astivia, Oscar L Olvera; Zumbo, Bruno D

    2017-01-31

    The purpose of this paper is to highlight the importance of a population model in guiding the design and interpretation of simulation studies used to investigate the Spearman rank correlation. The Spearman rank correlation has been known for over a hundred years to applied researchers and methodologists alike and is one of the most widely used non-parametric statistics. Still, certain misconceptions can be found, either explicitly or implicitly, in the published literature because a population definition for this statistic is rarely discussed within the social and behavioural sciences. By relying on copula distribution theory, a population model is presented for the Spearman rank correlation, and its properties are explored both theoretically and in a simulation study. Through the use of the Iman-Conover algorithm (which allows the user to specify the rank correlation as a population parameter), simulation studies from previously published articles are explored, and it is found that many of the conclusions purported in them regarding the nature of the Spearman correlation would change if the data-generation mechanism better matched the simulation design. More specifically, issues such as small sample bias and lack of power of the t-test and r-to-z Fisher transformation disappear when the rank correlation is calculated from data sampled where the rank correlation is the population parameter. A proof for the consistency of the sample estimate of the rank correlation is shown as well as the flexibility of the copula model to encompass results previously published in the mathematical literature.

  3. Factors influencing subjective ranking of driver distractions.

    PubMed

    Patel, Jayesh; Ball, David J; Jones, Huw

    2008-01-01

    Driver distraction is recognised as a significant cause of road traffic incidents. However, the more objective measurement and ranking of the relative importance of individual distractions in contributing to incidents tend to differ from subjectively-held rankings. To investigate this, the present study examines qualitative characteristics of 14 driver distractions to determine if these characteristics might explain the discrepancy. The conclusion is that for laypersons, qualitative characteristics, such as equity and familiarity, do contribute to their ranking of driver distractions. This poses some interesting issues for risk managers. For example, should safety interventions aimed at driver distractions be based purely on factual data and life-saving potential, or should they accommodate qualitative factors of salience to the public?

  4. Higher-rank fields and currents

    NASA Astrophysics Data System (ADS)

    Gelfond, O. A.; Vasiliev, M. A.

    2016-10-01

    Sp(2 M) invariant field equations in the space ℳ M with symmetric matrix coordinates are classified. Analogous results are obtained for Minkowski-like subspaces of ℳ M which include usual 4 d Minkowski space as a particular case. The constructed equations are associated with the tensor products of the Fock (singleton) representation of Sp(2 M) of any rank r. The infinite set of higher-spin conserved currents multilinear in rank-one fields in ℳ M is found. The associated conserved charges are supported by rM-r(r-1)/2 -dimensional differential forms in ℳ M , that are closed by virtue of the rank-2 r field equations. The cohomology groups H p ( σ - r ) with all p and r, which determine the form of appropriate gauge fields and their field equations, are found both for ℳ M and for its Minkowski-like subspace.

  5. Social rank strategies in hierarchical relationships.

    PubMed

    Fournier, Marc A; Moskowitz, D S; Zuroff, David C

    2002-08-01

    Social rank theorists propose that threat appraisals evoke escalation behavior toward subordinates and de-escalation behavior toward superiors. These hypotheses were examined among records of behavior sampled ecologically from the work environments of 90 individuals. At the level of the event, situated threat appraisals (feeling criticized) predicted different kinds of behavior across status situations. Individuals tended to quarrel when criticized by subordinates and to submit when criticized by superiors. At the level of the person, aggregated rank appraisals (feeling inferior) predicted different kinds of behavior across status situations. Individuals who typically felt more inferior tended to quarrel more frequently with subordinates and to submit more frequently with superiors. Findings implicated inferiority and threat as fundamental dimensions underlying the behavior of the social rank system.

  6. Ranking of facial profiles among Asians.

    PubMed

    Lew, K K; Soh, G; Loh, E

    1992-01-01

    The purpose of this study was to determine the facial profile preferences in a sample of 1,189 Asian teenagers (aged 15.3 +/- 3.2 years). Five facial profile types were computer-generated by trained personnel (orthodontists and oral maxillofacial surgeons) to represent distinct facial types. Subjects were asked to rank the profiles in descending order of attractiveness. The ranking was as follows: orthognathic profile, bimaxillary retrusive profile, bimaxillary protrusive profile, mandibular retrognathic profile, and mandibular prognathic profile. The differences in rank scores between all the profile types were statistically significant (p < 0.05). Assessment of profile types among lay personnel could provide clinicians an indication into the relative attractiveness among profile types and health care workers in treatment prioritization among dysmorphic facial types.

  7. Adjoints and Low-rank Covariance Representation

    NASA Technical Reports Server (NTRS)

    Tippett, Michael K.; Cohn, Stephen E.

    2000-01-01

    Quantitative measures of the uncertainty of Earth System estimates can be as important as the estimates themselves. Second moments of estimation errors are described by the covariance matrix, whose direct calculation is impractical when the number of degrees of freedom of the system state is large. Ensemble and reduced-state approaches to prediction and data assimilation replace full estimation error covariance matrices by low-rank approximations. The appropriateness of such approximations depends on the spectrum of the full error covariance matrix, whose calculation is also often impractical. Here we examine the situation where the error covariance is a linear transformation of a forcing error covariance. We use operator norms and adjoints to relate the appropriateness of low-rank representations to the conditioning of this transformation. The analysis is used to investigate low-rank representations of the steady-state response to random forcing of an idealized discrete-time dynamical system.

  8. Validation of SmartRank: A likelihood ratio software for searching national DNA databases with complex DNA profiles.

    PubMed

    Benschop, Corina C G; van de Merwe, Linda; de Jong, Jeroen; Vanvooren, Vanessa; Kempenaers, Morgane; Kees van der Beek, C P; Barni, Filippo; Reyes, Eusebio López; Moulin, Léa; Pene, Laurent; Haned, Hinda; Sijen, Titia

    2017-07-01

    Searching a national DNA database with complex and incomplete profiles usually yields very large numbers of possible matches that can present many candidate suspects to be further investigated by the forensic scientist and/or police. Current practice in most forensic laboratories consists of ordering these 'hits' based on the number of matching alleles with the searched profile. Thus, candidate profiles that share the same number of matching alleles are not differentiated and due to the lack of other ranking criteria for the candidate list it may be difficult to discern a true match from the false positives or notice that all candidates are in fact false positives. SmartRank was developed to put forward only relevant candidates and rank them accordingly. The SmartRank software computes a likelihood ratio (LR) for the searched profile and each profile in the DNA database and ranks database entries above a defined LR threshold according to the calculated LR. In this study, we examined for mixed DNA profiles of variable complexity whether the true donors are retrieved, what the number of false positives above an LR threshold is and the ranking position of the true donors. Using 343 mixed DNA profiles over 750 SmartRank searches were performed. In addition, the performance of SmartRank and CODIS were compared regarding DNA database searches and SmartRank was found complementary to CODIS. We also describe the applicable domain of SmartRank and provide guidelines. The SmartRank software is open-source and freely available. Using the best practice guidelines, SmartRank enables obtaining investigative leads in criminal cases lacking a suspect. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Using rank-order geostatistics for spatial interpolation of highly skewed data in a heavy-metal contaminated site.

    PubMed

    Juang, K W; Lee, D Y; Ellsworth, T R

    2001-01-01

    The spatial distribution of a pollutant in contaminated soils is usually highly skewed. As a result, the sample variogram often differs considerably from its regional counterpart and the geostatistical interpolation is hindered. In this study, rank-order geostatistics with standardized rank transformation was used for the spatial interpolation of pollutants with a highly skewed distribution in contaminated soils when commonly used nonlinear methods, such as logarithmic and normal-scored transformations, are not suitable. A real data set of soil Cd concentrations with great variation and high skewness in a contaminated site of Taiwan was used for illustration. The spatial dependence of ranks transformed from Cd concentrations was identified and kriging estimation was readily performed in the standardized-rank space. The estimated standardized rank was back-transformed into the concentration space using the middle point model within a standardized-rank interval of the empirical distribution function (EDF). The spatial distribution of Cd concentrations was then obtained. The probability of Cd concentration being higher than a given cutoff value also can be estimated by using the estimated distribution of standardized ranks. The contour maps of Cd concentrations and the probabilities of Cd concentrations being higher than the cutoff value can be simultaneously used for delineation of hazardous areas of contaminated soils.

  10. Resonances under rank-one perturbations

    NASA Astrophysics Data System (ADS)

    Bourget, Olivier; Cortés, Víctor H.; Del Río, Rafael; Fernández, Claudio

    2017-09-01

    We study resonances generated by rank-one perturbations of self-adjoint operators with eigenvalues embedded in the continuous spectrum. Instability of these eigenvalues is analyzed and almost exponential decay for the associated resonant states is exhibited. We show how these results can be applied to Sturm-Liouville operators. Main tools are the Aronszajn-Donoghue theory for rank-one perturbations, a reduction process of the resolvent based on the Feshbach-Livsic formula, the Fermi golden rule, and a careful analysis of the Fourier transform of quasi-Lorentzian functions. We relate these results to sojourn time estimates and spectral concentration phenomena.

  11. Locally asymptotically rank-based procedures for testing autoregressive moving average dependence

    PubMed Central

    Hallin, Marc; Puri, Madan L.

    1988-01-01

    The problem of testing a given autoregressive moving average (ARMA) model (in which the density of the generating white noise is unspecified) against other ARMA models is considered. A distribution-free asymptotically most powerful test, based on a generalized linear serial rank statistic, is provided against contiguous ARMA alternatives with specified coefficients. In the case in which the ARMA model in the alternative has unspecified coefficients, the asymptotic sufficiency (in the sense of Hájek) of a finite-dimensional vector of rank statistics is established. This asymptotic sufficiency is used to derive an asymptotically maximin most powerful test, based on a generalized quadratic serial rank statistic. The asymptotically maximin optimal test statistic can be interpreted as a rank-based, weighted version of the classical Box-Pierce portmanteau statistic, to which it reduces, in some particular problems, under gaussian assumptions. PMID:16593917

  12. Systemic testing on Bradley-Terry model against nonlinear ranking hierarchy.

    PubMed

    Shev, Aaron; Fujii, Kevin; Hsieh, Fushing; McCowan, Brenda

    2014-01-01

    We take a system point of view toward constructing any power or ranking hierarchy onto a society of human or animal players. The most common hierarchy is the linear ranking, which is habitually used in nearly all real-world problems. A stronger version of linear ranking via increasing and unvarying winning potentials, known as Bradley-Terry model, is particularly popular. Only recently non-linear ranking hierarchy is discussed and developed through recognition of dominance information contents beyond direct dyadic win-and-loss. We take this development further by rigorously arguing for the necessity of accommodating system's global pattern information contents, and then introducing a systemic testing on Bradley-Terry model. Our test statistic with an ensemble based empirical distribution favorably compares with the Deviance test equipped with a Chi-squared asymptotic approximation. Several simulated and real data sets are analyzed throughout our development.

  13. Deans' Perceptions of Published Rankings of Business Programs

    ERIC Educational Resources Information Center

    Athavale, Manoj; Bott, Jennifer; Myring, Mark; Richardson, Lynne

    2017-01-01

    Using a survey of college of business deans, the authors investigate perceptions of published rankings of academic programs. Published rankings have become quite prominent, and anecdotal evidence suggests great efforts are being undertaken to be included in rankings or enhance rankings. The authors conducted a survey of business school deans to…

  14. World University Rankings: Take with a Large Pinch of Salt

    ERIC Educational Resources Information Center

    Cheng, Soh Kay

    2011-01-01

    Equating the unequal is misleading, and this happens consistently in comparing rankings from different university ranking systems, as the NUT saga shows. This article illustrates the problem by analyzing the 2011 rankings of the top 100 universities in the AWUR, QSWUR and THEWUR ranking results. It also discusses the reasons why the rankings…

  15. 5 CFR 451.302 - Ranks for senior career employees.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 5 Administrative Personnel 1 2010-01-01 2010-01-01 false Ranks for senior career employees. 451... AWARDS Presidential Rank Awards § 451.302 Ranks for senior career employees. (a) The circumstances under... Professional to a senior career employee are set forth in 5 U.S.C. 4507a. (b) To be eligible for a rank...

  16. 5 CFR 451.302 - Ranks for senior career employees.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 5 Administrative Personnel 1 2011-01-01 2011-01-01 false Ranks for senior career employees. 451... AWARDS Presidential Rank Awards § 451.302 Ranks for senior career employees. (a) The circumstances under... Professional to a senior career employee are set forth in 5 U.S.C. 4507a. (b) To be eligible for a rank...

  17. Nominal versus Attained Weights in Universitas 21 Ranking

    ERIC Educational Resources Information Center

    Soh, Kaycheng

    2014-01-01

    Universitas 21 Ranking of National Higher Education Systems (U21 Ranking) is one of the three new ranking systems appearing in 2012. In contrast with the other systems, U21 Ranking uses countries as the unit of analysis. It has several features which lend it with greater trustworthiness, but it also shared some methodological issues with the other…

  18. Examining Major Rankings According to the Berlin Principles

    ERIC Educational Resources Information Center

    Cheng, Ying; Liu, Nian Cai

    2008-01-01

    While the ranking of higher education institutions (HEIs) has become more and more popular, there are increasing concerns about the quality of such ranking. In response to such legitimate expectations, in May 2006, the International Ranking Expert Group (IREG) developed and endorsed a guideline document--the Berlin Principles on Ranking of Higher…

  19. Academic Quality Rankings of American Colleges and Universities.

    ERIC Educational Resources Information Center

    Webster, David S.

    Past and current methods used in academic quality rankings of U.S. colleges and universities are discussed. In addition to a literature and historical review, modern quality rankings are compared with early (pre-1959) rankings, including past rankings of medical, dental, legal and black education. Also considered are the exemplary 1982 evaluation…

  20. Rehabbing the Rankings: Fool's Errand or the Lord's Work?

    ERIC Educational Resources Information Center

    Kuh, George D.

    2011-01-01

    For better or worse, rankings shape public conceptions of collegiate quality. This paper reviews the history of rankings, analyzes what they represent, explores recent efforts to employ indicators in addition to institutional resources and reputation on which the most popular rankings are based, and evaluates the extent to which rankings serve…

  1. Statistical and Mathematical Aspects of Ranking: Lessons from Poland

    ERIC Educational Resources Information Center

    Rocki, Marek

    2005-01-01

    This paper presents both the formal and methodological variety of ranking approaches, and asks whether a really objective ranking is possible. A ranking represents compiled information, provided according to a criterion or set of criteria. Its purpose is to highlight real or perceived differences in quality. Ranking methodology refers to its…

  2. 5 CFR 451.302 - Ranks for senior career employees.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 5 Administrative Personnel 1 2013-01-01 2013-01-01 false Ranks for senior career employees. 451... AWARDS Presidential Rank Awards § 451.302 Ranks for senior career employees. (a) The circumstances under... Professional to a senior career employee are set forth in 5 U.S.C. 4507a. (b) To be eligible for a rank award...

  3. 5 CFR 451.302 - Ranks for senior career employees.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 5 Administrative Personnel 1 2012-01-01 2012-01-01 false Ranks for senior career employees. 451... AWARDS Presidential Rank Awards § 451.302 Ranks for senior career employees. (a) The circumstances under... Professional to a senior career employee are set forth in 5 U.S.C. 4507a. (b) To be eligible for a rank award...

  4. 5 CFR 451.302 - Ranks for senior career employees.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 5 Administrative Personnel 1 2014-01-01 2014-01-01 false Ranks for senior career employees. 451... AWARDS Presidential Rank Awards § 451.302 Ranks for senior career employees. (a) The circumstances under... Professional to a senior career employee are set forth in 5 U.S.C. 4507a. (b) To be eligible for a rank award...

  5. On Classification of Modular Categories by Rank: Table A.1

    SciTech Connect

    Bruillard, Paul; Ng, Siu-Hung; Rowell, Eric C.; Wang, Zhenghan

    2016-04-10

    The feasibility of a classification-by-rank program for modular categories follows from the Rank-Finiteness Theorem. We develop arithmetic, representation theoretic and algebraic methods for classifying modular categories by rank. As an application, we determine all possible fusion rules for all rank=5 modular categories and describe the corresponding monoidal equivalence classes.

  6. Nominal versus Attained Weights in Universitas 21 Ranking

    ERIC Educational Resources Information Center

    Soh, Kaycheng

    2014-01-01

    Universitas 21 Ranking of National Higher Education Systems (U21 Ranking) is one of the three new ranking systems appearing in 2012. In contrast with the other systems, U21 Ranking uses countries as the unit of analysis. It has several features which lend it with greater trustworthiness, but it also shared some methodological issues with the other…

  7. Ending the Reign of the Fraser Institute's School Rankings

    ERIC Educational Resources Information Center

    Raptis, Helen

    2012-01-01

    The Fraser Institute "Report Card" of school rankings has won the hearts of parents and the press. For over a decade, the rankings have been particularly burdensome for low-ranking (usually low socio-economic status, high-poverty) schools when parents of high-achieving children move them to higher-ranking schools. In February 2010, after…

  8. Earthdata Search: The Relevance of Relevance

    NASA Technical Reports Server (NTRS)

    Quinn, Patrick

    2016-01-01

    Through recent usability studies, the issue of relevance became increasingly clear in the Earthdata Search Client. After all, if a user can't find the data they are looking for, nothing else we do matters. This presentation walks through usability testing findings and recent relevance improvements made to the Earthdata Search Client.

  9. Statistical Optimality in Multipartite Ranking and Ordinal Regression.

    PubMed

    Uematsu, Kazuki; Lee, Yoonkyung

    2015-05-01

    Statistical optimality in multipartite ranking is investigated as an extension of bipartite ranking. We consider the optimality of ranking algorithms through minimization of the theoretical risk which combines pairwise ranking errors of ordinal categories with differential ranking costs. The extension shows that for a certain class of convex loss functions including exponential loss, the optimal ranking function can be represented as a ratio of weighted conditional probability of upper categories to lower categories, where the weights are given by the misranking costs. This result also bridges traditional ranking methods such as proportional odds model in statistics with various ranking algorithms in machine learning. Further, the analysis of multipartite ranking with different costs provides a new perspective on non-smooth list-wise ranking measures such as the discounted cumulative gain and preference learning. We illustrate our findings with simulation study and real data analysis.

  10. Ranking Workplace Competencies: Student and Graduate Perceptions.

    ERIC Educational Resources Information Center

    Rainsbury, Elizabeth; Hodges, Dave; Burchell, Noel; Lay, Mark

    2002-01-01

    New Zealand business students and graduates made similar rankings of the five most important workplace competencies: computer literacy, customer service orientation, teamwork and cooperation, self-confidence, and willingness to learn. Graduates placed greater importance on most of the 24 competencies, resulting in a statistically significant…

  11. Document Ranking Using an Enriched Thesaurus.

    ERIC Educational Resources Information Center

    Rada, Roy; And Others

    1991-01-01

    Describes a study funded by the Commission of the European Community that applied document retrieval algorithms to the "Excerpta Medica" database using the EMTREE thesaurus. Nonhierarchical relations were added to enrich the thesaurus, and document ranking is discussed in terms of the conceptual distance between the documents and the…

  12. Suppression pheromone and cockroach rank formation

    NASA Astrophysics Data System (ADS)

    Kou, Rong; Chang, Huan-Wen; Chen, Shu-Chun; Ho, Hsiao-Yung

    2009-06-01

    Although agonistic behaviors in the male lobster cockroach ( Nauphoeta cinerea) are well known, the formation of an unstable hierarchy has long been a puzzle. In this study, we investigate how the unstable dominance hierarchy in N. cinerea is maintained via a pheromone signaling system. In agonistic interactions, aggressive posture (AP) is an important behavioral index of aggression. This study showed that, during the formation of a governing hierarchy, thousands of nanograms of 3-hydroxy-2-butanone (3H-2B) were released by the AP-adopting dominant in the first encounter fight, then during the early domination period and that this release of 3H-2B was related to rank maintenance, but not to rank establishment. For rank maintenance, 3H-2B functioned as a suppression pheromone, which suppressed the fighting capability of rivals and kept them in a submissive state. During the period of rank maintenance, as the dominant male gradually decreased his 3H-2B release, the fighting ability of the subordinate gradually developed, as shown by the increasing odds of a subordinate adopting an AP (OSAP). The OSAP was negatively correlated with the amount of 3H-2B released by the dominant and positively correlated with the number of domination days. The same OSAP could be achieved earlier by reducing the amount of 3H-2B released by the dominant indicates that whether the subordinate adopts an offensive strategy depends on what the dominant is doing.

  13. Rankings of the States 1996: Addendum.

    ERIC Educational Resources Information Center

    National Education Association, Washington, DC. Research Div.

    Certain data from the Bureau of the Census were not available at the time of publication of the National Education Association report "Rankings of the States 1996" because of a change in the Bureau's schedule for issuing that data. This addendum contains the previously unavailable tables of finance data for state and local governments and…

  14. Deep impact: unintended consequences of journal rank

    PubMed Central

    Brembs, Björn; Button, Katherine; Munafò, Marcus

    2013-01-01

    Most researchers acknowledge an intrinsic hierarchy in the scholarly journals (“journal rank”) that they submit their work to, and adjust not only their submission but also their reading strategies accordingly. On the other hand, much has been written about the negative effects of institutionalizing journal rank as an impact measure. So far, contributions to the debate concerning the limitations of journal rank as a scientific impact assessment tool have either lacked data, or relied on only a few studies. In this review, we present the most recent and pertinent data on the consequences of our current scholarly communication system with respect to various measures of scientific quality (such as utility/citations, methodological soundness, expert ratings or retractions). These data corroborate previous hypotheses: using journal rank as an assessment tool is bad scientific practice. Moreover, the data lead us to argue that any journal rank (not only the currently-favored Impact Factor) would have this negative impact. Therefore, we suggest that abandoning journals altogether, in favor of a library-based scholarly communication system, will ultimately be necessary. This new system will use modern information technology to vastly improve the filter, sort and discovery functions of the current journal system. PMID:23805088

  15. Ranking Workplace Competencies: Student and Graduate Perceptions.

    ERIC Educational Resources Information Center

    Rainsbury, Elizabeth; Hodges, Dave; Burchell, Noel; Lay, Mark

    2002-01-01

    New Zealand business students and graduates made similar rankings of the five most important workplace competencies: computer literacy, customer service orientation, teamwork and cooperation, self-confidence, and willingness to learn. Graduates placed greater importance on most of the 24 competencies, resulting in a statistically significant…

  16. Low-rank coal oil agglomeration

    DOEpatents

    Knudson, C.L.; Timpe, R.C.

    1991-07-16

    A low-rank coal oil agglomeration process is described. High mineral content, a high ash content subbituminous coals are effectively agglomerated with a bridging oil which is partially water soluble and capable of entering the pore structure, and is usually coal-derived.

  17. Spanish Universities and the "Ranking 2005" Initiative

    ERIC Educational Resources Information Center

    De Miguel, Jesus M.; Vaquera, Elizabeth; Sanchez, Jara D.

    2005-01-01

    This article assesses the quality of the Spanish higher education system, focusing mainly on the methodological challenges that the existence of public and private universities represents in the calculation of global higher education rankings. Researchers from the University of Barcelona and the University of Pennsylvania calculated the first…

  18. City Life: Rankings (Livability) versus Perceptions (Satisfaction)

    ERIC Educational Resources Information Center

    Okulicz-Kozaryn, Adam

    2013-01-01

    I investigate the relationship between the popular Mercer city ranking (livability) and survey data (satisfactions). Livability aims to capture "objective" quality of life such as infrastructure. Survey items capture "subjective" quality of life such as satisfaction with city. The relationship between objective measures of quality of life and…

  19. Efficiently Ranking Hyphotheses in Machine Learning

    NASA Technical Reports Server (NTRS)

    Chien, Steve

    1997-01-01

    This paper considers the problem of learning the ranking of a set of alternatives based upon incomplete information (e.g. a limited number of observations). At each decision cycle, the system can output a complete ordering on the hypotheses or decide to gather additional information (e.g. observation) at some cost.

  20. An Application of Sylvester's Rank Inequality

    ERIC Educational Resources Information Center

    Kung, Sidney H.

    2011-01-01

    Using two well known criteria for the diagonalizability of a square matrix plus an extended form of Sylvester's Rank Inequality, the author presents a new condition for the diagonalization of a real matrix from which one can obtain the eigenvectors by simply multiplying some associated matrices without solving a linear system of simultaneous…

  1. Measures of Agreement for Incompletely Ranked Data.

    ERIC Educational Resources Information Center

    Iachan, Ronaldo

    1984-01-01

    Measures of agreement for ordinal-scaled data are suggested that make use of the k categories with the highest ranks. The proposed measures are applied to the Self-Directed Search in order to evaluate its agreement with self-assessment (translation ability) or with the work environment (congruence). (Author)

  2. An Application of Sylvester's Rank Inequality

    ERIC Educational Resources Information Center

    Kung, Sidney H.

    2011-01-01

    Using two well known criteria for the diagonalizability of a square matrix plus an extended form of Sylvester's Rank Inequality, the author presents a new condition for the diagonalization of a real matrix from which one can obtain the eigenvectors by simply multiplying some associated matrices without solving a linear system of simultaneous…

  3. World University Ranking Methodologies: Stability and Variability

    ERIC Educational Resources Information Center

    Fidler, Brian; Parsons, Christine

    2008-01-01

    There has been a steady growth in the number of national university league tables over the last 25 years. By contrast, "World University Rankings" are a more recent development and have received little serious academic scrutiny in peer-reviewed publications. Few researchers have evaluated the sources of data and the statistical…

  4. George Wilbur: Otto Rank and Hanns Sachs.

    PubMed

    Roazen, Paul

    2006-01-01

    George Wilbur, a pioneering Cape Cod psychoanalytic psychiatrist, was a long-standing editor of the journal "American Imago," and an excellent source of information about the Viennese analysts Otto Rank and Hanns Sachs. Wilbur was also knowledgeable about the early reception of psychoanalysis in the Boston community.

  5. Spanish Universities and the "Ranking 2005" Initiative

    ERIC Educational Resources Information Center

    De Miguel, Jesus M.; Vaquera, Elizabeth; Sanchez, Jara D.

    2005-01-01

    This article assesses the quality of the Spanish higher education system, focusing mainly on the methodological challenges that the existence of public and private universities represents in the calculation of global higher education rankings. Researchers from the University of Barcelona and the University of Pennsylvania calculated the first…

  6. Efficiently Ranking Hyphotheses in Machine Learning

    NASA Technical Reports Server (NTRS)

    Chien, Steve

    1997-01-01

    This paper considers the problem of learning the ranking of a set of alternatives based upon incomplete information (e.g. a limited number of observations). At each decision cycle, the system can output a complete ordering on the hypotheses or decide to gather additional information (e.g. observation) at some cost.

  7. Ranks, Rates, and Numbers--and Confusion

    ERIC Educational Resources Information Center

    Bracey, Gerald W.

    2008-01-01

    The United States may be the most rank-crazy country in the world, but the world is catching up. The author cites the Organization for Economic and Cooperating and Development (OECD). When the International Association for the Evaluation of Educational Achievement (IEA) started its international studies--the First International Mathematics Study…

  8. City Life: Rankings (Livability) versus Perceptions (Satisfaction)

    ERIC Educational Resources Information Center

    Okulicz-Kozaryn, Adam

    2013-01-01

    I investigate the relationship between the popular Mercer city ranking (livability) and survey data (satisfactions). Livability aims to capture "objective" quality of life such as infrastructure. Survey items capture "subjective" quality of life such as satisfaction with city. The relationship between objective measures of quality of life and…

  9. An Optimization-Based Method for Feature Ranking in Nonlinear Regression Problems.

    PubMed

    Bravi, Luca; Piccialli, Veronica; Sciandrone, Marco

    2016-02-03

    In this paper, we consider the feature ranking problem, where, given a set of training instances, the task is to associate a score with the features in order to assess their relevance. Feature ranking is a very important tool for decision support systems, and may be used as an auxiliary step of feature selection to reduce the high dimensionality of real-world data. We focus on regression problems by assuming that the process underlying the generated data can be approximated by a continuous function (for instance, a feedforward neural network). We formally state the notion of relevance of a feature by introducing a minimum zero-norm inversion problem of a neural network, which is a nonsmooth, constrained optimization problem. We employ a concave approximation of the zero-norm function, and we define a smooth, global optimization problem to be solved in order to assess the relevance of the features. We present the new feature ranking method based on the solution of instances of the global optimization problem depending on the available training data. Computational experiments on both artificial and real data sets are performed, and point out that the proposed feature ranking method is a valid alternative to existing methods in terms of effectiveness. The obtained results also show that the method is costly in terms of CPU time, and this may be a limitation in the solution of large-dimensional problems.

  10. VaRank: a simple and powerful tool for ranking genetic variants

    PubMed Central

    Geoffroy, Véronique; Pizot, Cécile; Redin, Claire; Piton, Amélie; Vasli, Nasim; Stoetzel, Corinne; Blavier, André; Laporte, Jocelyn

    2015-01-01

    Background. Most genetic disorders are caused by single nucleotide variations (SNVs) or small insertion/deletions (indels). High throughput sequencing has broadened the catalogue of human variation, including common polymorphisms, rare variations or disease causing mutations. However, identifying one variation among hundreds or thousands of others is still a complex task for biologists, geneticists and clinicians. Results. We have developed VaRank, a command-line tool for the ranking of genetic variants detected by high-throughput sequencing. VaRank scores and prioritizes variants annotated either by Alamut Batch or SnpEff. A barcode allows users to quickly view the presence/absence of variants (with homozygote/heterozygote status) in analyzed samples. VaRank supports the commonly used VCF input format for variants analysis thus allowing it to be easily integrated into NGS bioinformatics analysis pipelines. VaRank has been successfully applied to disease-gene identification as well as to molecular diagnostics setup for several hundred patients. Conclusions. VaRank is implemented in Tcl/Tk, a scripting language which is platform-independent but has been tested only on Unix environment. The source code is available under the GNU GPL, and together with sample data and detailed documentation can be downloaded from http://www.lbgi.fr/VaRank/. PMID:25780760

  11. VaRank: a simple and powerful tool for ranking genetic variants.

    PubMed

    Geoffroy, Véronique; Pizot, Cécile; Redin, Claire; Piton, Amélie; Vasli, Nasim; Stoetzel, Corinne; Blavier, André; Laporte, Jocelyn; Muller, Jean

    2015-01-01

    Background. Most genetic disorders are caused by single nucleotide variations (SNVs) or small insertion/deletions (indels). High throughput sequencing has broadened the catalogue of human variation, including common polymorphisms, rare variations or disease causing mutations. However, identifying one variation among hundreds or thousands of others is still a complex task for biologists, geneticists and clinicians. Results. We have developed VaRank, a command-line tool for the ranking of genetic variants detected by high-throughput sequencing. VaRank scores and prioritizes variants annotated either by Alamut Batch or SnpEff. A barcode allows users to quickly view the presence/absence of variants (with homozygote/heterozygote status) in analyzed samples. VaRank supports the commonly used VCF input format for variants analysis thus allowing it to be easily integrated into NGS bioinformatics analysis pipelines. VaRank has been successfully applied to disease-gene identification as well as to molecular diagnostics setup for several hundred patients. Conclusions. VaRank is implemented in Tcl/Tk, a scripting language which is platform-independent but has been tested only on Unix environment. The source code is available under the GNU GPL, and together with sample data and detailed documentation can be downloaded from http://www.lbgi.fr/VaRank/.

  12. Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition

    PubMed Central

    Ong, Frank; Lustig, Michael

    2016-01-01

    We present a natural generalization of the recent low rank + sparse matrix decomposition and consider the decomposition of matrices into components of multiple scales. Such decomposition is well motivated in practice as data matrices often exhibit local correlations in multiple scales. Concretely, we propose a multi-scale low rank modeling that represents a data matrix as a sum of block-wise low rank matrices with increasing scales of block sizes. We then consider the inverse problem of decomposing the data matrix into its multi-scale low rank components and approach the problem via a convex formulation. Theoretically, we show that under various incoherence conditions, the convex program recovers the multi-scale low rank components either exactly or approximately. Practically, we provide guidance on selecting the regularization parameters and incorporate cycle spinning to reduce blocking artifacts. Experimentally, we show that the multi-scale low rank decomposition provides a more intuitive decomposition than conventional low rank methods and demonstrate its effectiveness in four applications, including illumination normalization for face images, motion separation for surveillance videos, multi-scale modeling of the dynamic contrast enhanced magnetic resonance imaging and collaborative filtering exploiting age information. PMID:28450978

  13. Beyond Low Rank + Sparse: Multi-scale Low Rank Matrix Decomposition.

    PubMed

    Ong, Frank; Lustig, Michael

    2016-06-01

    We present a natural generalization of the recent low rank + sparse matrix decomposition and consider the decomposition of matrices into components of multiple scales. Such decomposition is well motivated in practice as data matrices often exhibit local correlations in multiple scales. Concretely, we propose a multi-scale low rank modeling that represents a data matrix as a sum of block-wise low rank matrices with increasing scales of block sizes. We then consider the inverse problem of decomposing the data matrix into its multi-scale low rank components and approach the problem via a convex formulation. Theoretically, we show that under various incoherence conditions, the convex program recovers the multi-scale low rank components either exactly or approximately. Practically, we provide guidance on selecting the regularization parameters and incorporate cycle spinning to reduce blocking artifacts. Experimentally, we show that the multi-scale low rank decomposition provides a more intuitive decomposition than conventional low rank methods and demonstrate its effectiveness in four applications, including illumination normalization for face images, motion separation for surveillance videos, multi-scale modeling of the dynamic contrast enhanced magnetic resonance imaging and collaborative filtering exploiting age information.

  14. Statistical regularities in the rank-citation profile of scientists.

    PubMed

    Petersen, Alexander M; Stanley, H Eugene; Succi, Sauro

    2011-01-01

    Recent science of science research shows that scientific impact measures for journals and individual articles have quantifiable regularities across both time and discipline. However, little is known about the scientific impact distribution at the scale of an individual scientist. We analyze the aggregate production and impact using the rank-citation profile c(i)(r) of 200 distinguished professors and 100 assistant professors. For the entire range of paper rank r, we fit each c(i)(r) to a common distribution function. Since two scientists with equivalent Hirsch h-index can have significantly different c(i)(r) profiles, our results demonstrate the utility of the β(i) scaling parameter in conjunction with h(i) for quantifying individual publication impact. We show that the total number of citations C(i) tallied from a scientist's N(i) papers scales as [Formula: see text]. Such statistical regularities in the input-output patterns of scientists can be used as benchmarks for theoretical models of career progress.

  15. Metric Ranking of Invariant Networks with Belief Propagation

    SciTech Connect

    Tao, Changxia; Ge, Yong; Song, Qinbao; Ge, Yuan; Omitaomu, Olufemi A

    2014-01-01

    The management of large-scale distributed information systems relies on the effective use and modeling of monitoring data collected at various points in the distributed information systems. A promising approach is to discover invariant relationships among the monitoring data and generate invariant networks, where a node is a monitoring data source (metric) and a link indicates an invariant relationship between two monitoring data. Such an invariant network representation can help system experts to localize and diagnose the system faults by examining those broken invariant relationships and their related metrics, because system faults usually propagate among the monitoring data and eventually lead to some broken invariant relationships. However, at one time, there are usually a lot of broken links (invariant relationships) within an invariant network. Without proper guidance, it is difficult for system experts to manually inspect this large number of broken links. Thus, a critical challenge is how to effectively and efficiently rank metrics (nodes) of invariant networks according to the anomaly levels of metrics. The ranked list of metrics will provide system experts with useful guidance for them to localize and diagnose the system faults. To this end, we propose to model the nodes and the broken links as a Markov Random Field (MRF), and develop an iteration algorithm to infer the anomaly of each node based on belief propagation (BP). Finally, we validate the proposed algorithm on both realworld and synthetic data sets to illustrate its effectiveness.

  16. Evaluation and ranking of restoration strategies for radioactively contaminated sites.

    PubMed

    Zeevaert, T; Bousher, A; Brendler, V; Jensen, P H; Nordlinder, S

    2001-01-01

    An international project, whose aim was the development of a transparent and robust method for evaluating and ranking restoration strategies for radioactively contaminated sites (RESTRAT), was carried out under the Fourth Framework of the Nuclear Fission Safety Programme of the EU. The evaluation and ranking procedure used was based on the principles of justification and optimisation for radiation protection. A multi-attribute utility analysis was applied to allow for the inclusion of radiological health effects, economic costs and social factors. Values of these attributes were converted into utility values by applying linear utility functions and weighting factors, derived from scaling constants and expert judgement. The uncertainties and variabilities associated with these utility functions and weighting factors were dealt with by a probabilistic approach which utilised a Latin Hypercube Sampling technique. Potentially relevant restoration techniques were identified and their characteristics determined through a literature review. The methodology developed by this project has been illustrated by application to representative examples of different categories of contaminated sites; a waste disposal site, a uranium tailing site and a contaminated freshwater river.

  17. Social Image Tag Ranking by Two-View Learning

    NASA Astrophysics Data System (ADS)

    Zhuang, Jinfeng; Hoi, Steven C. H.

    Tags play a central role in text-based social image retrieval and browsing. However, the tags annotated by web users could be noisy, irrelevant, and often incomplete for describing the image contents, which may severely deteriorate the performance of text-based image retrieval models. In order to solve this problem, researchers have proposed techniques to rank the annotated tags of a social image according to their relevance to the visual content of the image. In this paper, we aim to overcome the challenge of social image tag ranking for a corpus of social images with rich user-generated tags by proposing a novel two-view learning approach. It can effectively exploit both textual and visual contents of social images to discover the complicated relationship between tags and images. Unlike the conventional learning approaches that usually assumes some parametric models, our method is completely data-driven and makes no assumption about the underlying models, making the proposed solution practically more effective. We formulate our method as an optimization task and present an efficient algorithm to solve it. To evaluate the efficacy of our method, we conducted an extensive set of experiments by applying our technique to both text-based social image retrieval and automatic image annotation tasks. Our empirical results showed that the proposed method can be more effective than the conventional approaches.

  18. Ranking of characteristic features in combined wrapper approaches to selection.

    PubMed

    Stańczyk, Urszula

    The performance of a classification system of any type can suffer from irrelevant or redundant data, contained in characteristic features that describe objects of the universe. To estimate relevance of attributes and select their subset for a constructed classifier typically either a filter, wrapper, or an embedded approach, is implemented. The paper presents a combined wrapper framework, where in a pre-processing step, a ranking of variables is established by a simple wrapper model employing sequential backward search procedure. Next, another predictor exploits this resulting ordering of features in their reduction. The proposed methodology is illustrated firstly for a binary classification task of authorship attribution from stylometric domain, and then for additional verification for a waveform dataset from UCI machine learning repository.

  19. Power Scaling of Uplink Massive MIMO Systems With Arbitrary-Rank Channel Means

    NASA Astrophysics Data System (ADS)

    Zhang, Qi; Jin, Shi; Wong, Kai-Kit; Zhu, Hongbo; Matthaiou, Michail

    2014-10-01

    This paper investigates the uplink achievable rates of massive multiple-input multiple-output (MIMO) antenna systems in Ricean fading channels, using maximal-ratio combining (MRC) and zero-forcing (ZF) receivers, assuming perfect and imperfect channel state information (CSI). In contrast to previous relevant works, the fast fading MIMO channel matrix is assumed to have an arbitrary-rank deterministic component as well as a Rayleigh-distributed random component. We derive tractable expressions for the achievable uplink rate in the large-antenna limit, along with approximating results that hold for any finite number of antennas. Based on these analytical results, we obtain the scaling law that the users' transmit power should satisfy, while maintaining a desirable quality of service. In particular, it is found that regardless of the Ricean $K$-factor, in the case of perfect CSI, the approximations converge to the same constant value as the exact results, as the number of base station antennas, $M$, grows large, while the transmit power of each user can be scaled down proportionally to $1/M$. If CSI is estimated with uncertainty, the same result holds true but only when the Ricean $K$-factor is non-zero. Otherwise, if the channel experiences Rayleigh fading, we can only cut the transmit power of each user proportionally to $1/\\sqrt M$. In addition, we show that with an increasing Ricean $K$-factor, the uplink rates will converge to fixed values for both MRC and ZF receivers.

  20. Pulling Rank: Military Rank Affects Hormone Levels and Fairness in an Allocation Experiment

    PubMed Central

    Siart, Benjamin; Pflüger, Lena S.; Wallner, Bernard

    2016-01-01

    Status within social hierarchies has great effects on the lives of socially organized mammals. Its effects on human behavior and related physiology, however, is relatively little studied. The present study investigated the impact of military rank on fairness and behavior in relation to salivary cortisol (C) and testosterone (T) levels in male soldiers. For this purpose 180 members of the Austrian Armed Forces belonging to two distinct rank groups participated in two variations of a computer-based guard duty allocation experiment. The rank groups were (1) warrant officers (high rank, HR) and (2) enlisted men (low rank, LR). One soldier from each rank group participated in every experiment. At the beginning of the experiment, one participant was assigned to start standing guard and the other participant at rest. The participant who started at rest could choose if and when to relieve his fellow soldier and therefore had control over the experiment. In order to trigger perception of unfair behavior, an additional experiment was conducted which was manipulated by the experimenter. In the manipulated version both soldiers started in the standing guard position and were never relieved, believing that their opponent was at rest, not relieving them. Our aim was to test whether unfair behavior causes a physiological reaction. Saliva samples for hormone analysis were collected at regular intervals throughout the experiment. We found that in the un-manipulated setup high-ranking soldiers spent less time standing guard than lower ranking individuals. Rank was a significant predictor for C but not for T levels during the experiment. C levels in the HR group were higher than in the LR group. C levels were also elevated in the manipulated experiment compared to the un-manipulated experiment, especially in LR. We assume that the elevated C levels in HR were caused by HR feeling their status challenged by the situation of having to negotiate with an individual of lower military rank

  1. Pulling Rank: Military Rank Affects Hormone Levels and Fairness in an Allocation Experiment.

    PubMed

    Siart, Benjamin; Pflüger, Lena S; Wallner, Bernard

    2016-01-01

    Status within social hierarchies has great effects on the lives of socially organized mammals. Its effects on human behavior and related physiology, however, is relatively little studied. The present study investigated the impact of military rank on fairness and behavior in relation to salivary cortisol (C) and testosterone (T) levels in male soldiers. For this purpose 180 members of the Austrian Armed Forces belonging to two distinct rank groups participated in two variations of a computer-based guard duty allocation experiment. The rank groups were (1) warrant officers (high rank, HR) and (2) enlisted men (low rank, LR). One soldier from each rank group participated in every experiment. At the beginning of the experiment, one participant was assigned to start standing guard and the other participant at rest. The participant who started at rest could choose if and when to relieve his fellow soldier and therefore had control over the experiment. In order to trigger perception of unfair behavior, an additional experiment was conducted which was manipulated by the experimenter. In the manipulated version both soldiers started in the standing guard position and were never relieved, believing that their opponent was at rest, not relieving them. Our aim was to test whether unfair behavior causes a physiological reaction. Saliva samples for hormone analysis were collected at regular intervals throughout the experiment. We found that in the un-manipulated setup high-ranking soldiers spent less time standing guard than lower ranking individuals. Rank was a significant predictor for C but not for T levels during the experiment. C levels in the HR group were higher than in the LR group. C levels were also elevated in the manipulated experiment compared to the un-manipulated experiment, especially in LR. We assume that the elevated C levels in HR were caused by HR feeling their status challenged by the situation of having to negotiate with an individual of lower military rank

  2. Quantile rank maps: a new tool for understanding individual brain development

    PubMed Central

    Chen, Huaihou; Kelly, Clare; Castellanos, Xavier; He, Ye; Zuo, Xi-Nian; Reiss, Philip T.

    2015-01-01

    We propose a novel method for neurodevelopmental brain mapping that displays how an individual’s values for a quantity of interest compare with age-specific norms. By estimating smoothly age-varying distributions at a set of brain regions of interest, we derive age-dependent region-wise quantile ranks for a given individual, which can be presented in the form of a brain map. Such quantile rank maps could potentially be used for clinical screening. Bootstrap-based confidence intervals are proposed for the quantile rank estimates. We also propose a recalibrated Kolmogorov-Smirnov test for detecting group differences in the age-varying distribution. This test is shown to be more robust to model misspecification than a linear regression-based test. The proposed methods are applied to brain imaging data from the Nathan Kline Institute Rockland Sample and from the Autism Brain Imaging Data Exchange (ABIDE) sample. PMID:25585020

  3. Adaptive two-pass rank order filter to remove impulse noise in highly corrupted images.

    PubMed

    Xu, Xiaoyin; Miller, Eric L; Chen, Dongbin; Sarhadi, Mansoor

    2004-02-01

    In this paper, we present an adaptive two-pass rank order filter to remove impulse noise in highly corrupted images. When the noise ratio is high, rank order filters, such as the median filter for example, can produce unsatisfactory results. Better results can be obtained by applying the filter twice, which we call two-pass filtering. To further improve the performance, we develop an adaptive two-pass rank order filter. Between the passes of filtering, an adaptive process is used to detect irregularities in the spatial distribution of the estimated impulse noise. The adaptive process then selectively replaces some pixels changed by the first pass of filtering with their original observed pixel values. These pixels are then kept unchanged during the second filtering. In combination, the adaptive process and the second filter eliminate more impulse noise and restore some pixels that are mistakenly altered by the first filtering. As a final result, the reconstructed image maintains a higher degree of fidelity and has a smaller amount of noise. The idea of adaptive two-pass processing can be applied to many rank order filters, such as a center-weighted median filter (CWMF), adaptive CWMF, lower-upper-middle filter, and soft-decision rank-order-mean filter. Results from computer simulations are used to demonstrate the performance of this type of adaptation using a number of basic rank order filters.

  4. A linear functional strategy for regularized ranking.

    PubMed

    Kriukova, Galyna; Panasiuk, Oleksandra; Pereverzyev, Sergei V; Tkachenko, Pavlo

    2016-01-01

    Regularization schemes are frequently used for performing ranking tasks. This topic has been intensively studied in recent years. However, to be effective a regularization scheme should be equipped with a suitable strategy for choosing a regularization parameter. In the present study we discuss an approach, which is based on the idea of a linear combination of regularized rankers corresponding to different values of the regularization parameter. The coefficients of the linear combination are estimated by means of the so-called linear functional strategy. We provide a theoretical justification of the proposed approach and illustrate them by numerical experiments. Some of them are related with ranking the risk of nocturnal hypoglycemia of diabetes patients.

  5. On higher rank coisotropic A-branes

    NASA Astrophysics Data System (ADS)

    Herbst, Manfred

    2012-02-01

    This article is devoted to a world sheet analysis of A-type D-branes in N=(2,2) supersymmetric non-linear sigma models. In addition to the familiar Lagrangian submanifolds with flat connection we reproduce the rank one A-branes of Kapustin and Orlov, which are supported on coisotropic submanifolds. The main focus is however on gauge fields of higher rank and on tachyon profiles on brane-antibrane pairs. This will lead to the notion of a complex of coisotropic A-branes. A particular role is played by the noncommutative geometry on the brane world volume. It ensures that brane-antibrane pairs localize again on coisotropic submanifolds.

  6. Social Bookmarking Induced Active Page Ranking

    NASA Astrophysics Data System (ADS)

    Takahashi, Tsubasa; Kitagawa, Hiroyuki; Watanabe, Keita

    Social bookmarking services have recently made it possible for us to register and share our own bookmarks on the web and are attracting attention. The services let us get structured data: (URL, Username, Timestamp, Tag Set). And these data represent user interest in web pages. The number of bookmarks is a barometer of web page value. Some web pages have many bookmarks, but most of those bookmarks may have been posted far in the past. Therefore, even if a web page has many bookmarks, their value is not guaranteed. If most of the bookmarks are very old, the page may be obsolete. In this paper, by focusing on the timestamp sequence of social bookmarkings on web pages, we model their activation levels representing current values. Further, we improve our previously proposed ranking method for web search by introducing the activation level concept. Finally, through experiments, we show effectiveness of the proposed ranking method.

  7. Ranking inter-relationships between clusters

    NASA Astrophysics Data System (ADS)

    Wang, Tingting; Chen, Feng; Phoebe Chen, Yi-Ping

    2011-12-01

    The evaluation of the relationships between clusters is important to identify vital unknown information in many real-life applications, such as in the fields of crime detection, evolution trees, metallurgical industry and biology engraftment. This article proposes a method called 'mode pattern + mutual information' to rank the inter-relationship between clusters. The idea of the mode pattern is used to find outstanding objects from each cluster, and the mutual information criterion measures the close proximity of a pair of clusters. Our approach is different from the conventional algorithms of classifying and clustering, because our focus is not to classify objects into different clusters, but instead, we aim to rank the inter-relationship between clusters when the clusters are given. We conducted experiments on a wide range of real-life datasets, including image data and cancer diagnosis data. The experimental results show that our algorithm is effective and promising.

  8. Probabilistic Low-Rank Multitask Learning.

    PubMed

    Kong, Yu; Shao, Ming; Li, Kang; Fu, Yun

    2017-01-04

    In this paper, we consider the problem of learning multiple related tasks simultaneously with the goal of improving the generalization performance of individual tasks. The key challenge is to effectively exploit the shared information across multiple tasks as well as preserve the discriminative information for each individual task. To address this, we propose a novel probabilistic model for multitask learning (MTL) that can automatically balance between low-rank and sparsity constraints. The former assumes a low-rank structure of the underlying predictive hypothesis space to explicitly capture the relationship of different tasks and the latter learns the incoherent sparse patterns private to each task. We derive and perform inference via variational Bayesian methods. Experimental results on both regression and classification tasks on real-world applications demonstrate the effectiveness of the proposed method in dealing with the MTL problems.

  9. Moving object detection via low-rank total variation regularization

    NASA Astrophysics Data System (ADS)

    Wang, Pengcheng; Chen, Qian; Shao, Na

    2016-09-01

    Moving object detection is a challenging task in video surveillance. Recently proposed Robust Principal Component Analysis (RPCA) can recover the outlier patterns from the low-rank data under some mild conditions. However, the l-penalty in RPCA doesn't work well in moving object detection because the irrepresentable condition is often not satisfied. In this paper, a method based on total variation (TV) regularization scheme is proposed. In our model, image sequences captured with a static camera are highly related, which can be described using a low-rank matrix. Meanwhile, the low-rank matrix can absorb background motion, e.g. periodic and random perturbation. The foreground objects in the sequence are usually sparsely distributed and drifting continuously, and can be treated as group outliers from the highly-related background scenes. Instead of l-penalty, we exploit the total variation of the foreground. By minimizing the total variation energy, the outliers tend to collapse and finally converge to be the exact moving objects. The TV-penalty is superior to the l-penalty especially when the outlier is in the majority for some pixels, and our method can estimate the outlier explicitly with less bias but higher variance. To solve the problem, a joint optimization function is formulated and can be effectively solved through the inexact Augmented Lagrange Multiplier (ALM) method. We evaluate our method along with several state-of-the-art approaches in MATLAB. Both qualitative and quantitative results demonstrate that our proposed method works effectively on a large range of complex scenarios.

  10. Anaerobic bioprocessing of low-rank coals

    SciTech Connect

    Jain, M.K.; Narayan, R.; Han, O.

    1992-04-15

    The overall goal of this project is to find biological methods to remove carboxylic functionalities from low-rank coals and to assess the properties of the modified coal towards coal liquefaction. The main objectives for this quarter were: (1) continuation of microbial consortia development and maintenance, (2) crude enzyme study using best decarboxylating organisms, (3) decarboxylation of lignite, demineralized Wyodak coal and model polymers, and (4) characterization of biotreated coals.

  11. A theory of measuring, electing, and ranking

    PubMed Central

    Balinski, Michel; Laraki, Rida

    2007-01-01

    The impossibility theorems that abound in the theory of social choice show that there can be no satisfactory method for electing and ranking in the context of the traditional, 700-year-old model. A more realistic model, whose antecedents may be traced to Laplace and Galton, leads to a new theory that avoids all impossibilities with a simple and eminently practical method, “the majority judgement.” It has already been tested. PMID:17496140

  12. Relevance Is the Issue.

    ERIC Educational Resources Information Center

    Smeltzer, Larry

    1993-01-01

    Points out that good research must be applied, theoretical, rigorous, and relevant all at the same time. Argues for relevant research that develops and tests theoretical constructs that provide useful business knowledge. (SR)

  13. Medical School Ranking and Student Research Opportunities.

    PubMed

    Havnaer, Annika G; Greenberg, Paul B

    2016-10-04

    This study aimed to characterize the current state of student research opportunities in a sample of US medical schools ranked in three different tiers. The authors examined the websites for five US medical schools in each of the first, second, and third tiers per National Institutes of Health funding and U.S. News & World Report rankings. Available research opportunities were identified and categorized. There were 26 schools in the first (n=6), second (n=10), and third (n=10) tiers. From the first, second, and third tiers, 4/6 (67%), 1/10 (10%) and none, respectively, required a research experience (p=0.003); 6/6 (100%), 4/10 (40%) and 1/10 (10%), respectively, offered internally funded one-year research (p=0.002); and 5/6 (83%), 4/10 (40%) and 2/10 (20%), respectively, offered student research days (p=0.045). Higher ranked schools provided more opportunities for student research by providing internally funded one-year research, requiring research, and offering student research days. [Full article available at http://rimed.org/rimedicaljournal-2016-10.asp].

  14. [2013 research ranking of Spanish public universities].

    PubMed

    Buela-Casal, Gualberto; Quevedo-Blasco, Raúl; Guillén-Riquelme, Alejandro

    2015-01-01

    The evaluation of research production and productivity is becoming increasingly necessary for universities. Having reliable and clear data is extremely useful in order to uncover strengths and weaknesses. The objective of this article is to update the research ranking of Spanish public universities with the 2013 data. Assessment was carried out based on articles in journals indexed in the JCR, research periods, R+D projects, doctoral theses, FPU grants, doctoral studies awarded with a citation of excellence, and patents, providing a rating, both for each individual indicator and globally, in production and productivity. The same methodology as previous editions was followed. In the global ranking, the universities with a higher production are Barcelona, Complutense of Madrid, and Granada. In productivity, the first positions are held by the universities Pompeu Fabra, Pablo de Olavide, and the Autonomous University of Barcelona. Differences can be found between the universities in production and productivity, while there are also certain similarities with regard to the position of Spanish universities in international rankings.

  15. Distribution of CFTR mutations in the Czech population: positive impact of integrated clinical and laboratory expertise, detection of novel/de novo alleles and relevance for related/derived populations.

    PubMed

    Křenková, Petra; Piskáčková, Tereza; Holubová, Andrea; Balaščaková, Miroslava; Krulišová, Veronika; Čamajová, Jana; Turnovec, Marek; Libik, Malgorzata; Norambuena, Patricia; Štambergová, Alexandra; Dvořáková, Lenka; Skalická, Veronika; Bartošová, Jana; Kučerová, Tereza; Fila, Libor; Zemková, Dana; Vávrová, Věra; Koudová, Monika; Macek, Milan; Krebsová, Alice; Macek, Milan

    2013-09-01

    This two decade long study presents a comprehensive overview of the CFTR mutation distribution in a representative cohort of 600 Czech CF patients derived from all regions of the Czech Republic. We examined the most common CF-causing mutations using the Elucigene CF-EU2v1™ assay, followed by MLPA, mutation scanning and/or sequencing of the entire CFTR coding region and splice site junctions. We identified 99.5% of all mutations (1194/1200 CFTR alleles) in the Czech CF population. Altogether 91 different CFTR mutations, of which 20 were novel, were detected. One case of de novo mutation and a novel polymorphism was revealed. The commercial assay achieved 90.7%, the MLPA added 1.0% and sequencing increased the detection rate by 7.8%. These comprehensive data provide a basis for the improvement of CF DNA diagnostics and/or newborn screening in our country. In addition, they are relevant to related Central European populations with lower mutation detection rates, as well as to the sizeable North American "Bohemian diaspora". Copyright © 2012 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.

  16. Mutual enrichment in ranked lists and the statistical assessment of position weight matrix motifs

    PubMed Central

    2014-01-01

    Background Statistics in ranked lists is useful in analysing molecular biology measurement data, such as differential expression, resulting in ranked lists of genes, or ChIP-Seq, which yields ranked lists of genomic sequences. State of the art methods study fixed motifs in ranked lists of sequences. More flexible models such as position weight matrix (PWM) motifs are more challenging in this context, partially because it is not clear how to avoid the use of arbitrary thresholds. Results To assess the enrichment of a PWM motif in a ranked list we use a second ranking on the same set of elements induced by the PWM. Possible orders of one ranked list relative to another can be modelled as permutations. Due to sample space complexity, it is difficult to accurately characterize tail distributions in the group of permutations. In this paper we develop tight upper bounds on tail distributions of the size of the intersection of the top parts of two uniformly and independently drawn permutations. We further demonstrate advantages of this approach using our software implementation, mmHG-Finder, which is publicly available, to study PWM motifs in several datasets. In addition to validating known motifs, we found GC-rich strings to be enriched amongst the promoter sequences of long non-coding RNAs that are specifically expressed in thyroid and prostate tissue samples and observed a statistical association with tissue specific CpG hypo-methylation. Conclusions We develop tight bounds that can be calculated in polynomial time. We demonstrate utility of mutual enrichment in motif search and assess performance for synthetic and biological datasets. We suggest that thyroid and prostate-specific long non-coding RNAs are regulated by transcription factors that bind GC-rich sequences, such as EGR1, SP1 and E2F3. We further suggest that this regulation is associated with DNA hypo-methylation. PMID:24708618

  17. Low-rank coal research under the UND/DOE cooperative agreement. Quarterly technical progress report, April 1983-June 1983

    SciTech Connect

    Wiltsee, Jr., G. A.

    1983-01-01

    Progress reports are presented for the following tasks: (1) gasification wastewater treatment and reuse; (2) fine coal cleaning; (3) coal-water slurry preparation; (4) low-rank coal liquefaction; (5) combined flue gas cleanup/simultaneous SO/sub x/-NO/sub x/ control; (6) particulate control and hydrocarbons and trace element emissions from low-rank coals; (7) waste characterization; (8) combustion research and ash fowling; (9) fluidized-bed combustion of low-rank coals; (10) ash and slag characterization; (11) organic structure of coal; (12) distribution of inorganics in low-rank coals; (13) physical properties and moisture of low-rank coals; (14) supercritical solvent extraction; and (15) pyrolysis and devolatilization.

  18. Ranking welding intensity in pyroclastic deposits

    NASA Astrophysics Data System (ADS)

    Quane, S. L.; Russell, J. K.

    2003-04-01

    Pyroclastic deposits emplaced at high temperatures and having sufficient thickness become welded. The welding process involves sintering, compaction and flattening of hot glassy pyroclastic material and is attended by systematic changes in physical properties. Historically, the terms nonwelded, incipiently welded, partially welded with pumice, partially welded with fiamme, moderately welded and densely welded have been used as field descriptors for welding intensity (e.g., Smith &Bailey, 1966; Smith, 1979; Ross &Smith, 1980; Streck &Grunder, 1995). While using these descriptive words is often effective for delineating variations of welding intensity within a single deposit, their qualitative character does not provide for consistency between field areas or workers, and inhibits accurate comparison between deposits. Hence, there is a need for a universal classification of welding intensity in pyroclastic deposits. Here we develop an objective ranking system. The system recognizes 8 ranks (I to VIII) based on measurements of physical properties and petrographic characteristics. The physical property measurements include both lab and field observations: density, porosity, uniaxial compressive strength, point load strength, fiamme elongation, and foliation/fabric. The values are normalized in order to make the system universal. The rank divisions are adaptations of a rock mass-rating scheme based on rock strength (Hoek &Brown, 1980) and previous divisions of welding degree based on physical properties (e.g., density: Ragan &Sheridan, 1972, Streck &Grunder, 1995; fiamme elongation: Peterson, 1979). Each rank comprises a range of normalized values for each of the physical properties and a corresponding set of petrographic characteristics. Our new ranking system provides a consistent, objective means by which each sample or section of welded tuff can be evaluated, thus providing a much needed uniformity in nomenclature for degree of welding. References: Hoek, E. &Brown, E

  19. Social choice functions: A tool for ranking variables involved in action plans against road noise.

    PubMed

    Ruiz-Padillo, Alejandro; de Oliveira, Thiago B F; Alves, Matheus; Bazzan, Ana L C; Ruiz, Diego P

    2016-08-01

    Traffic noise is gaining importance in planning and operation of roads in developing countries, and particularly in Europe and Latin America. Many variables with different degrees of importance influence the perception of noise from roads. Thus, the problem of prioritizing road stretches for action against such noise is an important issue in environmental noise management. For example, it can be addressed using multicriteria methods. However, these methodologies require criteria or suitable variables to be ranked according to their relative importance. In the present study, for this ranking, a list of nine variables involved in the decision-making process (called "road stretch priority variables") was presented in the form of questionnaires to high-level experts from Andalusia, southern Spain. These experts ranked the variables by relevance. Using the same data, seven social choice functions (Plurality, Raynaud, Kemeny-Young, Copeland, Simpson, Schulze, and Borda) were used in order to rank the variables. The results indicate that the most important variables were those that take into account the parameters of greatest exposure for the citizens, followed by variables related to the intensity of the problem analyzed. The results show that a combination of the use of social choice functions on aggregated information from expert panels can provide a consensus for ranking priority variables related to road stretches. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Quantification of Transporter and Receptor Proteins in Dog Brain Capillaries and Choroid Plexus: Relevance for the Distribution in Brain and CSF of Selected BCRP and P-gp Substrates.

    PubMed

    Braun, Clemens; Sakamoto, Atsushi; Fuchs, Holger; Ishiguro, Naoki; Suzuki, Shinobu; Cui, Yunhai; Klinder, Klaus; Watanabe, Michitoshi; Terasaki, Tetsuya; Sauer, Achim

    2017-10-02

    Transporters at the blood-brain barrier (BBB) and the blood-cerebrospinal fluid barrier (BCSFB) play a pivotal role as gatekeepers for efflux or uptake of endogenous and exogenous molecules. The protein expression of a number of them has already been determined in the brains of rodents, nonhuman primates, and humans using quantitative targeted absolute proteomics (QTAP). The dog is an important animal model for drug discovery and development, especially for safety evaluations. The purpose of the present study was to clarify the relevance of the transporter protein expression for drug distribution in the dog brain and CSF. We used QTAP to examine the protein expression of 17 selected transporters and receptors at the dog BBB and BCSFB. For the first time, we directly linked the expression of two efflux transporters, P-glycoprotein (P-gp) and breast cancer resistance protein (BCRP), to regional brain and CSF distribution using specific substrates. Two cocktails, each containing one P-gp substrate (quinidine or apafant) and one BCRP substrate (dantrolene or daidzein) were infused intravenously prior to collection of the brain. Transporter expression varied only slightly between the capillaries of different brain regions and did not result in region-specific distribution of the investigated substrates. There were, however, distinct differences between brain capillaries and choroid plexus. Largest differences were observed for BCRP and P-gp: both were highly expressed in brain capillaries, but no BCRP and only low amounts of P-gp were detected in the choroid plexus. Kp,uu,brain and Kp,uu,CSF of both P-gp substrates were indicative of drug efflux. Also, Kp,uu,brain for the BCRP substrates was low. In contrast, Kp,uu,CSF for both BCRP substrates was close to unity, resulting in Kp,uu,CSF/Kp,uu,brain ratios of 7 and 8, respectively. We conclude that the drug transporter expression profiles differ between the BBB and BCSFB in dogs, that there are species differences in the

  1. Military rank and AIDS proportionate mortality in the Brazilian Navy.

    PubMed

    Silva, Marlene; Santana, Vilma; Dourado, Inês

    2007-02-01

    This study describes AIDS mortality and occupational factors among servicemen in the Brazilian Navy. This is a proportional mortality study of 2,586 servicemen's death certificates (20-72 years of age) recorded from 1991 to 1995. Death certificates and occupational histories came from the Brazilian Navy Insurance System archives. Association was measured using proportionate mortality odds ratios obtained with unconditional logistic regression. AIDS proportionate mortality was estimated at 4.8% (n = 125) and increased during the study period, particularly among servicemen under 50 years of age and those with low rank. As compared to other occupations, there was relative excess AIDS in the "management" (proportionate mortality odds ratio, PMORage-adjusted = 2.45; 95%CI: 1.27-4.71), "secretarial" (PMORage-adjusted = 2.49; 95%CI: 1.22-5.08), and "janitorial" (PMORage-adjusted = 2.61; 95%CI: 1.10-6.16) occupational groups. AIDS proportionate mortality was higher among male than female military members. Higher rates were observed in some occupational groups when the members were low ranking. Power distribution, gender issues, and low socioeconomic status require further investigation using more appropriate methods.

  2. Low-Rank Coal Grinding Performance Versus Power Plant Performance

    SciTech Connect

    Rajive Ganguli; Sukumar Bandopadhyay

    2008-12-31

    The intent of this project was to demonstrate that Alaskan low-rank coal, which is high in volatile content, need not be ground as fine as bituminous coal (typically low in volatile content) for optimum combustion in power plants. The grind or particle size distribution (PSD), which is quantified by percentage of pulverized coal passing 74 microns (200 mesh), affects the pulverizer throughput in power plants. The finer the grind, the lower the throughput. For a power plant to maintain combustion levels, throughput needs to be high. The problem of particle size is compounded for Alaskan coal since it has a low Hardgrove grindability index (HGI); that is, it is difficult to grind. If the thesis of this project is demonstrated, then Alaskan coal need not be ground to the industry standard, thereby alleviating somewhat the low HGI issue (and, hopefully, furthering the salability of Alaskan coal). This project studied the relationship between PSD and power plant efficiency, emissions, and mill power consumption for low-rank high-volatile-content Alaskan coal. The emissions studied were CO, CO{sub 2}, NO{sub x}, SO{sub 2}, and Hg (only two tests). The tested PSD range was 42 to 81 percent passing 76 microns. Within the tested range, there was very little correlation between PSD and power plant efficiency, CO, NO{sub x}, and SO{sub 2}. Hg emissions were very low and, therefore, did not allow comparison between grind sizes. Mill power consumption was lower for coarser grinds.

  3. Social dominance rank influences wheel running behavior in mice.

    PubMed

    Vargas-Pérez, Héctor; Sellings, Laurie; Grieder, Taryn; Díaz, José-Luis

    2009-07-03

    Dominance hierarchies within social groups determine resource distribution. Resources, such as food and access to mating partners, can act as reinforcers. The present study examined the effect of social rank on access to wheel running-a reinforcing behavior performed by laboratory animals. Mice were identified as dominant or subordinate and given access to a running wheel access under solitary or social conditions. In the solitary condition, subordinate and dominant mice spent equal amounts of time on the running wheel. In the social condition, when one wheel was present, subordinate mice spent less time on the wheel than did dominant mice. Conversely, when two wheels were present, subordinates spent more time on the wheel than did dominant mice. When mice were given 24h access to one running wheel in the social condition, dominant mice ran more than subordinates during the dark cycle. Subordinate mice did not compensate for the lack of running wheel access by schedule shifting. These results suggest that social rank influences access to reinforcers by behavioral interference rather than by social inhibition.

  4. Global Low-Rank Image Restoration With Gaussian Mixture Model.

    PubMed

    Zhang, Sibo; Jiao, Licheng; Liu, Fang; Wang, Shuang

    2017-06-27

    Low-rank restoration has recently attracted a lot of attention in the research of computer vision. Empirical studies show that exploring the low-rank property of the patch groups can lead to superior restoration performance, however, there is limited achievement on the global low-rank restoration because the rank minimization at image level is too strong for the natural images which seldom match the low-rank condition. In this paper, we describe a flexible global low-rank restoration model which introduces the local statistical properties into the rank minimization. The proposed model can effectively recover the latent global low-rank structure via nuclear norm, as well as the fine details via Gaussian mixture model. An alternating scheme is developed to estimate the Gaussian parameters and the restored image, and it shows excellent convergence and stability. Besides, experiments on image and video sequence datasets show the effectiveness of the proposed method in image inpainting problems.

  5. Rings whose p-ranks do not exceed 1

    SciTech Connect

    Guseva, O. S.; Tsarev, A. V. E-mail: an-tsarev@yandex.ru

    2014-04-30

    We consider associative torsion-free rings of finite rank whose p-ranks do not exceed 1. For these rings, certain analogues of Wedderburn's theorem on finite-dimensional algebras are found. Bibliography: 11 titles. (paper)

  6. Sum of ranking differences to rank stationary phases used in packed column supercritical fluid chromatography.

    PubMed

    West, Caroline; Khalikova, Maria A; Lesellier, Eric; Héberger, Károly

    2015-08-28

    The identification of a suitable stationary phase in supercritical fluid chromatography (SFC) is a major source of difficulty for those with little experience in this technique. Several protocols have been suggested for column classification in high-performance liquid chromatography (HPLC), gas chromatography (GC), and SFC. However, none of the proposed classification schemes received general acceptance. A fair way to compare columns was proposed with the sum of ranking differences (SRD). In this project, we used the retention data obtained for 86 test compounds with varied polarity and structure, analyzed on 71 different stationary phases encompassing the full range in polarity of commercial packed columns currently available to the SFC chromatographer, with a single set of mobile phase and operating conditions (carbon dioxide-methanol mobile phase, 25°C, 150bar outlet pressure, 3ml/min). First, a reference column was selected and the 70 remaining columns were ranked based on this reference column and the retention data obtained on the 86 analytes. As these analytes previously served for the calculation of linear solvation energy relationships (LSER) on the 71 columns, SRD ranks were compared to LSER methodology. Finally, an external comparison based on the analysis of 10 other analytes (UV filters) related the observed selectivity to SRD ranking. Comparison of elution orders of the UV filters to the SRD rankings is highly supportive of the adequacy of SRD methodology to select similar and dissimilar columns.

  7. On the significance of the fourth-rank orientational order parameter of fluorophores in membranes

    NASA Astrophysics Data System (ADS)

    Pottel, H.; Herreman, W.; van der Meer, B. W.; Ameloot, M.

    1986-02-01

    Using information theory, the orientational distribution function of cylindrically symmetric probe molecules in uniaxial systems is constructed from orientational order parameters. If only the second-rank order parameter < P4 > is known, a distribution of the gaussian type results. If also the fourth-rank order parameter < P4 > is known, it is possible to distinguish between several hypothetical models. A reanalysis of time-resolved fluorescence anisotropy data indicates that the so-called < P4 >-distribution (introduced by Zannoni and described below) is a good model for diphenylhexatriene in non-oriented membranes. The behaviour of the distribution function is related to the fluctuation in P2. This fluctuation is proportional to < P4 >, at fixed < P2 >.

  8. Visualizing Rank Time Series of Wikipedia Top-Viewed Pages.

    PubMed

    Xia, Jing; Hou, Yumeng; Chen, Yingjie Victor; Qian, Zhenyu Cheryl; Ebert, David S; Chen, Wei

    2017-01-01

    Visual clutter is a common challenge when visualizing large rank time series data. WikiTopReader, a reader of Wikipedia page rank, lets users explore connections among top-viewed pages by connecting page-rank behaviors with page-link relations. Such a combination enhances the unweighted Wikipedia page-link network and focuses attention on the page of interest. A set of user evaluations shows that the system effectively represents evolving ranking patterns and page-wise correlation.

  9. Ranking Quality in Higher Education: Guiding or Misleading?

    ERIC Educational Resources Information Center

    Bergseth, Brita; Petocz, Peter; Abrandt Dahlgren, Madeleine

    2014-01-01

    The study examines two different models of measuring, assessing and ranking quality in higher education. Do different systems of quality assessment lead to equivalent conclusions about the quality of education? This comparative study is based on the rankings of 24 Swedish higher education institutions. Two ranking actors have independently…

  10. Academic Ranking of World Universities by Broad Subject Fields

    ERIC Educational Resources Information Center

    Cheng, Ying; Liu, Nian Cai

    2007-01-01

    Upon numerous requests to provide ranking of world universities by broad subject fields/schools/colleges and by subject fields/programs/departments, the authors present the ranking methodologies and problems that arose from the research by the Institute of Higher Education, Shanghai Jiao Tong University on the Academic Ranking of World…

  11. Ranking Scholarly Publishers in Political Science: An Alternative Approach

    ERIC Educational Resources Information Center

    Garand, James C.; Giles, Micheal W.

    2011-01-01

    Previous research has documented how political scientists evaluate and rank scholarly journals, but the evaluation and ranking of scholarly book publishers has drawn less attention. In this article, we use data from a survey of 603 American political scientists to generate a ranking of scholarly publishers in political science. We used open-ended…

  12. Ranking Scholarly Publishers in Political Science: An Alternative Approach

    ERIC Educational Resources Information Center

    Garand, James C.; Giles, Micheal W.

    2011-01-01

    Previous research has documented how political scientists evaluate and rank scholarly journals, but the evaluation and ranking of scholarly book publishers has drawn less attention. In this article, we use data from a survey of 603 American political scientists to generate a ranking of scholarly publishers in political science. We used open-ended…

  13. University Rankings 2.0: New Frontiers in Institutional Comparisons

    ERIC Educational Resources Information Center

    Usher, Alex

    2009-01-01

    The number of university rankings systems in use around the world has increased dramatically over the last decade. As they have spread, they have mutated; no longer are ranking systems simply clones of the original ranking systems such as "US News" and "World Report". A number of different types of "mutation" have occurred, so that there are now…

  14. 14 CFR § 1214.1105 - Final ranking.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 14 Aeronautics and Space 5 2014-01-01 2014-01-01 false Final ranking. § 1214.1105 Section § 1214.1105 Aeronautics and Space NATIONAL AERONAUTICS AND SPACE ADMINISTRATION SPACE FLIGHT NASA Astronaut Candidate Recruitment and Selection Program § 1214.1105 Final ranking. Final rankings will be based on a...

  15. 25 CFR 1001.3 - Priority ranking for negotiations.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 25 Indians 2 2013-04-01 2013-04-01 false Priority ranking for negotiations. 1001.3 Section 1001.3... PROGRAM § 1001.3 Priority ranking for negotiations. In addition to the eligibility criteria identified above, a tribe or consortium of tribes seeking priority ranking for negotiations must submit a...

  16. 25 CFR 1001.3 - Priority ranking for negotiations.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 25 Indians 2 2012-04-01 2012-04-01 false Priority ranking for negotiations. 1001.3 Section 1001.3... PROGRAM § 1001.3 Priority ranking for negotiations. In addition to the eligibility criteria identified above, a tribe or consortium of tribes seeking priority ranking for negotiations must submit a...

  17. 25 CFR 1001.3 - Priority ranking for negotiations.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 25 Indians 2 2014-04-01 2014-04-01 false Priority ranking for negotiations. 1001.3 Section 1001.3... PROGRAM § 1001.3 Priority ranking for negotiations. In addition to the eligibility criteria identified above, a tribe or consortium of tribes seeking priority ranking for negotiations must submit a...

  18. 25 CFR 1001.3 - Priority ranking for negotiations.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... 25 Indians 2 2010-04-01 2010-04-01 false Priority ranking for negotiations. 1001.3 Section 1001.3... PROGRAM § 1001.3 Priority ranking for negotiations. In addition to the eligibility criteria identified above, a tribe or consortium of tribes seeking priority ranking for negotiations must submit a...

  19. 25 CFR 1001.3 - Priority ranking for negotiations.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 25 Indians 2 2011-04-01 2011-04-01 false Priority ranking for negotiations. 1001.3 Section 1001.3... PROGRAM § 1001.3 Priority ranking for negotiations. In addition to the eligibility criteria identified above, a tribe or consortium of tribes seeking priority ranking for negotiations must submit a...

  20. On Graph Isomorphism and the PageRank Algorithm

    DTIC Science & Technology

    2008-09-01

    53 33. PageRank: An Algorithm for Ordering Vertices [PBM+98] .......................................63 34. Paw Graph [Wes01...64 35. Applying the PageRank Perturbation to the Paw Graph, α...0.85 ............................64 36. Paw Graph’s PageRank Vector, α = 0.85

  1. Tutorial: Calculating Percentile Rank and Percentile Norms Using SPSS

    ERIC Educational Resources Information Center

    Baumgartner, Ted A.

    2009-01-01

    Practitioners can benefit from using norms, but they often have to develop their own percentile rank and percentile norms. This article is a tutorial on how to quickly and easily calculate percentile rank and percentile norms using SPSS, and this information is presented for a data set. Some issues in calculating percentile rank and percentile…

  2. Rankings in Institutional Strategies and Processes: Impact or Illusion?

    ERIC Educational Resources Information Center

    Hazelkorn, Ellen; Loukkola, Tia; Zhang, Thérèse

    2014-01-01

    The "Rankings in Institutional Strategies and Processes" (RISP) project is the first pan-European study of the impact and influence of rankings on European higher education institutions. The project has sought to build understanding of how rankings impact and influence the development of institutional strategies and processes and its…

  3. Academic Ranking--From Its Genesis to Its International Expansion

    ERIC Educational Resources Information Center

    Vieira, Rosilene C.; Lima, Manolita C.

    2015-01-01

    Given the visibility and popularity of rankings that encompass the measurement of quality of post-graduate courses, for instance, the MBA (Master of Business Administration) or graduate studies program (MSc and PhD) as do global academic rankings--Academic Ranking of World Universities-ARWU, Times Higher/Thomson Reuters World University Ranking…

  4. Control by Numbers: New Managerialism and Ranking in Higher Education

    ERIC Educational Resources Information Center

    Lynch, Kathleen

    2015-01-01

    This paper analyses the role of rankings as an instrument of new managerialism. It shows how rankings are reconstituting the purpose of universities, the role of academics and the definition of what it is to be a student. The paper opens by examining the forces that have facilitated the emergence of the ranking industry and the ideologies…

  5. Ranking Quality in Higher Education: Guiding or Misleading?

    ERIC Educational Resources Information Center

    Bergseth, Brita; Petocz, Peter; Abrandt Dahlgren, Madeleine

    2014-01-01

    The study examines two different models of measuring, assessing and ranking quality in higher education. Do different systems of quality assessment lead to equivalent conclusions about the quality of education? This comparative study is based on the rankings of 24 Swedish higher education institutions. Two ranking actors have independently…

  6. Rankings in Institutional Strategies and Processes: Impact or Illusion?

    ERIC Educational Resources Information Center

    Hazelkorn, Ellen; Loukkola, Tia; Zhang, Thérèse

    2014-01-01

    The "Rankings in Institutional Strategies and Processes" (RISP) project is the first pan-European study of the impact and influence of rankings on European higher education institutions. The project has sought to build understanding of how rankings impact and influence the development of institutional strategies and processes and its…

  7. Control by Numbers: New Managerialism and Ranking in Higher Education

    ERIC Educational Resources Information Center

    Lynch, Kathleen

    2015-01-01

    This paper analyses the role of rankings as an instrument of new managerialism. It shows how rankings are reconstituting the purpose of universities, the role of academics and the definition of what it is to be a student. The paper opens by examining the forces that have facilitated the emergence of the ranking industry and the ideologies…

  8. Academic Ranking of World Universities by Broad Subject Fields

    ERIC Educational Resources Information Center

    Cheng, Ying; Liu, Nian Cai

    2007-01-01

    Upon numerous requests to provide ranking of world universities by broad subject fields/schools/colleges and by subject fields/programs/departments, the authors present the ranking methodologies and problems that arose from the research by the Institute of Higher Education, Shanghai Jiao Tong University on the Academic Ranking of World…

  9. Higher Education Ranking and Leagues Tables: Lessons Learned from Benchmarking

    ERIC Educational Resources Information Center

    Proulx, Roland

    2007-01-01

    The paper intends to contribute to the debate on ranking and league tables by adopting a critical approach to ranking methodologies from the point of view of a university benchmarking exercise. The absence of a strict benchmarking exercise in the ranking process has been, in the opinion of the author, one of the major problems encountered in the…

  10. World University Rankings: Ambiguous Signals. Go8 Backgrounder 30

    ERIC Educational Resources Information Center

    Group of Eight (NJ1), 2012

    2012-01-01

    The current main world university rankings broadly group the leading research universities of nations. Australia's Go8 universities are generally within the top 250 ranked universities, with several institutions in the top 50-100 on some measures. This recognition is commendable, however imperfect the individual rankings may be. Use is made of…

  11. Higher Education Ranking and Leagues Tables: Lessons Learned from Benchmarking

    ERIC Educational Resources Information Center

    Proulx, Roland

    2007-01-01

    The paper intends to contribute to the debate on ranking and league tables by adopting a critical approach to ranking methodologies from the point of view of a university benchmarking exercise. The absence of a strict benchmarking exercise in the ranking process has been, in the opinion of the author, one of the major problems encountered in the…

  12. The Importance of Rank Position. CEP Discussion Paper No. 1241

    ERIC Educational Resources Information Center

    Murphy, Richard; Weinhardt, Felix

    2013-01-01

    We find an individual's rank within their reference group has effects on later objective outcomes. To evaluate the impact of local rank, we use a large administrative dataset tracking over two million students in England from primary through to secondary school. Academic rank within primary school has sizable, robust and significant effects on…

  13. Note: A manifold ranking based saliency detection method for camera

    NASA Astrophysics Data System (ADS)

    Zhang, Libo; Sun, Yihan; Luo, Tiejian; Rahman, Mohammad Muntasir

    2016-09-01

    Research focused on salient object region in natural scenes has attracted a lot in computer vision and has widely been used in many applications like object detection and segmentation. However, an accurate focusing on the salient region, while taking photographs of the real-world scenery, is still a challenging task. In order to deal with the problem, this paper presents a novel approach based on human visual system, which works better with the usage of both background prior and compactness prior. In the proposed method, we eliminate the unsuitable boundary with a fixed threshold to optimize the image boundary selection which can provide more precise estimations. Then, the object detection, which is optimized with compactness prior, is obtained by ranking with background queries. Salient objects are generally grouped together into connected areas that have compact spatial distributions. The experimental results on three public datasets demonstrate that the precision and robustness of the proposed algorithm have been improved obviously.

  14. Porphyrin analysis and coal rank. A porphyrin index of coalification

    SciTech Connect

    Bonnett, R.; Hughes, P.S. )

    1989-03-01

    The stable aromatic nature of the porphyrin nucleus might be expected to make biomarkers containing it excellent bases for the study of the maturation of sedimentary deposits. Thus the porphyrin macroring can be thought of as an inert carrier of information contained in eight or nine peripheral substituents the increased cracking of which would reveal increased maturation. For non-migrating fossil fuels such as lignite and coal, a relationship between the distribution of porphyrin molecular mass and coal rank would result. This idea is examined for a series of well characterized bituminous coals from the British Carboniferous. Extraction of porphyrins and metalloporphyrins is carried out with methanolic sulfuric acid, and the gallium porphyrin concentrates are analyzed both by HPLC and by mass spectrometry. A Porphyrin Index of Coalification (PIC Number) is derived and related to other maturity indices. Within the range of examples chosen it appears to provide a useful scientifically-based indicator of coal maturity.

  15. Robust Generalized Low Rank Approximations of Matrices.

    PubMed

    Shi, Jiarong; Yang, Wei; Zheng, Xiuyun

    2015-01-01

    In recent years, the intrinsic low rank structure of some datasets has been extensively exploited to reduce dimensionality, remove noise and complete the missing entries. As a well-known technique for dimensionality reduction and data compression, Generalized Low Rank Approximations of Matrices (GLRAM) claims its superiority on computation time and compression ratio over the SVD. However, GLRAM is very sensitive to sparse large noise or outliers and its robust version does not have been explored or solved yet. To address this problem, this paper proposes a robust method for GLRAM, named Robust GLRAM (RGLRAM). We first formulate RGLRAM as an l1-norm optimization problem which minimizes the l1-norm of the approximation errors. Secondly, we apply the technique of Augmented Lagrange Multipliers (ALM) to solve this l1-norm minimization problem and derive a corresponding iterative scheme. Then the weak convergence of the proposed algorithm is discussed under mild conditions. Next, we investigate a special case of RGLRAM and extend RGLRAM to a general tensor case. Finally, the extensive experiments on synthetic data show that it is possible for RGLRAM to exactly recover both the low rank and the sparse components while it may be difficult for previous state-of-the-art algorithms. We also discuss three issues on RGLRAM: the sensitivity to initialization, the generalization ability and the relationship between the running time and the size/number of matrices. Moreover, the experimental results on images of faces with large corruptions illustrate that RGLRAM obtains the best denoising and compression performance than other methods.

  16. Robust Generalized Low Rank Approximations of Matrices

    PubMed Central

    Shi, Jiarong; Yang, Wei; Zheng, Xiuyun

    2015-01-01

    In recent years, the intrinsic low rank structure of some datasets has been extensively exploited to reduce dimensionality, remove noise and complete the missing entries. As a well-known technique for dimensionality reduction and data compression, Generalized Low Rank Approximations of Matrices (GLRAM) claims its superiority on computation time and compression ratio over the SVD. However, GLRAM is very sensitive to sparse large noise or outliers and its robust version does not have been explored or solved yet. To address this problem, this paper proposes a robust method for GLRAM, named Robust GLRAM (RGLRAM). We first formulate RGLRAM as an l1-norm optimization problem which minimizes the l1-norm of the approximation errors. Secondly, we apply the technique of Augmented Lagrange Multipliers (ALM) to solve this l1-norm minimization problem and derive a corresponding iterative scheme. Then the weak convergence of the proposed algorithm is discussed under mild conditions. Next, we investigate a special case of RGLRAM and extend RGLRAM to a general tensor case. Finally, the extensive experiments on synthetic data show that it is possible for RGLRAM to exactly recover both the low rank and the sparse components while it may be difficult for previous state-of-the-art algorithms. We also discuss three issues on RGLRAM: the sensitivity to initialization, the generalization ability and the relationship between the running time and the size/number of matrices. Moreover, the experimental results on images of faces with large corruptions illustrate that RGLRAM obtains the best denoising and compression performance than other methods. PMID:26367116

  17. Simple approach for ranking structure determining residues.

    PubMed

    Luna-Martínez, Oscar D; Vidal-Limón, Abraham; Villalba-Velázquez, Miryam I; Sánchez-Alcalá, Rosalba; Garduño-Juárez, Ramón; Uversky, Vladimir N; Becerril, Baltazar

    2016-01-01

    Mutating residues has been a common task in order to study structural properties of the protein of interest. Here, we propose and validate a simple method that allows the identification of structural determinants; i.e., residues essential for preservation of the stability of global structure, regardless of the protein topology. This method evaluates all of the residues in a 3D structure of a given globular protein by ranking them according to their connectivity and movement restrictions without topology constraints. Our results matched up with sequence-based predictors that look up for intrinsically disordered segments, suggesting that protein disorder can also be described with the proposed methodology.

  18. Simple approach for ranking structure determining residues

    PubMed Central

    Luna-Martínez, Oscar D.; Vidal-Limón, Abraham; Villalba-Velázquez, Miryam I.; Sánchez-Alcalá, Rosalba; Garduño-Juárez, Ramón; Uversky, Vladimir N.

    2016-01-01

    Mutating residues has been a common task in order to study structural properties of the protein of interest. Here, we propose and validate a simple method that allows the identification of structural determinants; i.e., residues essential for preservation of the stability of global structure, regardless of the protein topology. This method evaluates all of the residues in a 3D structure of a given globular protein by ranking them according to their connectivity and movement restrictions without topology constraints. Our results matched up with sequence-based predictors that look up for intrinsically disordered segments, suggesting that protein disorder can also be described with the proposed methodology. PMID:27366642

  19. Compressive Sensing via Nonlocal Smoothed Rank Function

    PubMed Central

    Fan, Ya-Ru; Liu, Jun; Zhao, Xi-Le

    2016-01-01

    Compressive sensing (CS) theory asserts that we can reconstruct signals and images with only a small number of samples or measurements. Recent works exploiting the nonlocal similarity have led to better results in various CS studies. To better exploit the nonlocal similarity, in this paper, we propose a non-convex smoothed rank function based model for CS image reconstruction. We also propose an efficient alternating minimization method to solve the proposed model, which reduces a difficult and coupled problem to two tractable subproblems. Experimental results have shown that the proposed method performs better than several existing state-of-the-art CS methods for image reconstruction. PMID:27583683

  20. Compressive Sensing via Nonlocal Smoothed Rank Function.

    PubMed

    Fan, Ya-Ru; Huang, Ting-Zhu; Liu, Jun; Zhao, Xi-Le

    2016-01-01

    Compressive sensing (CS) theory asserts that we can reconstruct signals and images with only a small number of samples or measurements. Recent works exploiting the nonlocal similarity have led to better results in various CS studies. To better exploit the nonlocal similarity, in this paper, we propose a non-convex smoothed rank function based model for CS image reconstruction. We also propose an efficient alternating minimization method to solve the proposed model, which reduces a difficult and coupled problem to two tractable subproblems. Experimental results have shown that the proposed method performs better than several existing state-of-the-art CS methods for image reconstruction.