WIND Toolkit Offshore Summary Dataset
DOE Office of Scientific and Technical Information (OSTI.GOV)
Draxl, Caroline; Musial, Walt; Scott, George
This dataset contains summary statistics for offshore wind resources for the continental United States derived from the Wind Integration National Datatset (WIND) Toolkit. These data are available in two formats: GDB - Compressed geodatabases containing statistical summaries aligned with lease blocks (aliquots) stored in a GIS format. These data are partitioned into Pacific, Atlantic, and Gulf resource regions. HDF5 - Statistical summaries of all points in the offshore Pacific, Atlantic, and Gulf offshore regions. These data are located on the original WIND Toolkit grid and have not been reassigned or downsampled to lease blocks. These data were developed under contractmore » by NREL for the Bureau of Oceanic Energy Management (BOEM).« less
Efficient summary statistical representation when change localization fails.
Haberman, Jason; Whitney, David
2011-10-01
People are sensitive to the summary statistics of the visual world (e.g., average orientation/speed/facial expression). We readily derive this information from complex scenes, often without explicit awareness. Given the fundamental and ubiquitous nature of summary statistical representation, we tested whether this kind of information is subject to the attentional constraints imposed by change blindness. We show that information regarding the summary statistics of a scene is available despite limited conscious access. In a novel experiment, we found that while observers can suffer from change blindness (i.e., not localize where change occurred between two views of the same scene), observers could nevertheless accurately report changes in the summary statistics (or "gist") about the very same scene. In the experiment, observers saw two successively presented sets of 16 faces that varied in expression. Four of the faces in the first set changed from one emotional extreme (e.g., happy) to another (e.g., sad) in the second set. Observers performed poorly when asked to locate any of the faces that changed (change blindness). However, when asked about the ensemble (which set was happier, on average), observer performance remained high. Observers were sensitive to the average expression even when they failed to localize any specific object change. That is, even when observers could not locate the very faces driving the change in average expression between the two sets, they nonetheless derived a precise ensemble representation. Thus, the visual system may be optimized to process summary statistics in an efficient manner, allowing it to operate despite minimal conscious access to the information presented.
Bryce, Richard; Losada Carreño, Ignacio; Kumler, Andrew; Hodge, Bri-Mathias; Roberts, Billy; Brancucci Martinez-Anido, Carlo
2018-08-01
This article contains data and summary statistics of solar irradiance and dry bulb temperature across the Hawaiian archipelago resolved on a monthly basis and spanning years 1998-2015. This data was derived in association with an article titled "Consequences of Neglecting the Interannual Variability of the Solar Resource: A Case Study of Photovoltaic Power Among the Hawaiian Islands" (Bryce et al., 2018 [7]). The solar irradiance data is presented in terms of Direct Normal Irradiance (DNI), Diffuse Horizontal Irradiance (DHI), and Global Horizontal Irradiance (GHI) and was obtained from the satellite-derived data contained in the National Solar Radiation Database (NSRDB). The temperature data is also obtained from this source. We have processed the NSRDB data and compiled these monthly resolved data sets, along with interannual summary statistics including the interannual coefficient of variability.
Your Travtek Driving Experience -- Rental Users Study Data Summary
DOT National Transportation Integrated Search
1993-11-01
THIS REPORT DOCUMENTS THE QUESTIONNAIRE DATA COLLECTED AND THE INSTRUMENTS USED FOR THE TRAVTEK EVALUATION TASK BL - RENTAL USERS STUDY. IT PRESENTS SUMMARY STATISTICS FOR THE PRIMARY DRIVERS DERIVED FROM THE RENTER STUDY, WHICH WAS CONDUCTED FROM MA...
Your TravTek driving experience : rental users study : data summary
DOT National Transportation Integrated Search
1993-11-01
This report documents the questionnaire data collected and the instruments used for the TravTek : Evaluation Task Bl - Rental Users Study. It presents summary statistics for the primary drivers : derived from the renter study, which was conducted fro...
Falgreen, Steffen; Laursen, Maria Bach; Bødker, Julie Støve; Kjeldsen, Malene Krag; Schmitz, Alexander; Nyegaard, Mette; Johnsen, Hans Erik; Dybkær, Karen; Bøgsted, Martin
2014-06-05
In vitro generated dose-response curves of human cancer cell lines are widely used to develop new therapeutics. The curves are summarised by simplified statistics that ignore the conventionally used dose-response curves' dependency on drug exposure time and growth kinetics. This may lead to suboptimal exploitation of data and biased conclusions on the potential of the drug in question. Therefore we set out to improve the dose-response assessments by eliminating the impact of time dependency. First, a mathematical model for drug induced cell growth inhibition was formulated and used to derive novel dose-response curves and improved summary statistics that are independent of time under the proposed model. Next, a statistical analysis workflow for estimating the improved statistics was suggested consisting of 1) nonlinear regression models for estimation of cell counts and doubling times, 2) isotonic regression for modelling the suggested dose-response curves, and 3) resampling based method for assessing variation of the novel summary statistics. We document that conventionally used summary statistics for dose-response experiments depend on time so that fast growing cell lines compared to slowly growing ones are considered overly sensitive. The adequacy of the mathematical model is tested for doxorubicin and found to fit real data to an acceptable degree. Dose-response data from the NCI60 drug screen were used to illustrate the time dependency and demonstrate an adjustment correcting for it. The applicability of the workflow was illustrated by simulation and application on a doxorubicin growth inhibition screen. The simulations show that under the proposed mathematical model the suggested statistical workflow results in unbiased estimates of the time independent summary statistics. Variance estimates of the novel summary statistics are used to conclude that the doxorubicin screen covers a significant diverse range of responses ensuring it is useful for biological interpretations. Time independent summary statistics may aid the understanding of drugs' action mechanism on tumour cells and potentially renew previous drug sensitivity evaluation studies.
2014-01-01
Background In vitro generated dose-response curves of human cancer cell lines are widely used to develop new therapeutics. The curves are summarised by simplified statistics that ignore the conventionally used dose-response curves’ dependency on drug exposure time and growth kinetics. This may lead to suboptimal exploitation of data and biased conclusions on the potential of the drug in question. Therefore we set out to improve the dose-response assessments by eliminating the impact of time dependency. Results First, a mathematical model for drug induced cell growth inhibition was formulated and used to derive novel dose-response curves and improved summary statistics that are independent of time under the proposed model. Next, a statistical analysis workflow for estimating the improved statistics was suggested consisting of 1) nonlinear regression models for estimation of cell counts and doubling times, 2) isotonic regression for modelling the suggested dose-response curves, and 3) resampling based method for assessing variation of the novel summary statistics. We document that conventionally used summary statistics for dose-response experiments depend on time so that fast growing cell lines compared to slowly growing ones are considered overly sensitive. The adequacy of the mathematical model is tested for doxorubicin and found to fit real data to an acceptable degree. Dose-response data from the NCI60 drug screen were used to illustrate the time dependency and demonstrate an adjustment correcting for it. The applicability of the workflow was illustrated by simulation and application on a doxorubicin growth inhibition screen. The simulations show that under the proposed mathematical model the suggested statistical workflow results in unbiased estimates of the time independent summary statistics. Variance estimates of the novel summary statistics are used to conclude that the doxorubicin screen covers a significant diverse range of responses ensuring it is useful for biological interpretations. Conclusion Time independent summary statistics may aid the understanding of drugs’ action mechanism on tumour cells and potentially renew previous drug sensitivity evaluation studies. PMID:24902483
Cohen, Trevor; Blatter, Brett; Patel, Vimla
2005-01-01
Certain applications require computer systems to approximate intended human meaning. This is achievable in constrained domains with a finite number of concepts. Areas such as psychiatry, however, draw on concepts from the world-at-large. A knowledge structure with broad scope is required to comprehend such domains. Latent Semantic Analysis (LSA) is an unsupervised corpus-based statistical method that derives quantitative estimates of the similarity between words and documents from their contextual usage statistics. The aim of this research was to evaluate the ability of LSA to derive meaningful associations between concepts relevant to the assessment of dangerousness in psychiatry. An expert reference model of dangerousness was used to guide the construction of a relevant corpus. Derived associations between words in the corpus were evaluated qualitatively. A similarity-based scoring function was used to assign dangerousness categories to discharge summaries. LSA was shown to derive intuitive relationships between concepts and correlated significantly better than random with human categorization of psychiatric discharge summaries according to dangerousness. The use of LSA to derive a simulated knowledge structure can extend the scope of computer systems beyond the boundaries of constrained conceptual domains. PMID:16779020
Radar derived spatial statistics of summer rain. Volume 3: Appendices
NASA Technical Reports Server (NTRS)
Ronnenburg, C.; Bassnett, A.; Knapp, H.; Vann, W. A.
1975-01-01
A collection of selected important memoranda written during the course of the experiment. It contains detailed information on: (1) frequency diversity, (2) radar controller and radar video processor, (3) SPANDAR calibration, and (4) meteorological summaries.
Generalized massive optimal data compression
NASA Astrophysics Data System (ADS)
Alsing, Justin; Wandelt, Benjamin
2018-05-01
In this paper, we provide a general procedure for optimally compressing N data down to n summary statistics, where n is equal to the number of parameters of interest. We show that compression to the score function - the gradient of the log-likelihood with respect to the parameters - yields n compressed statistics that are optimal in the sense that they preserve the Fisher information content of the data. Our method generalizes earlier work on linear Karhunen-Loéve compression for Gaussian data whilst recovering both lossless linear compression and quadratic estimation as special cases when they are optimal. We give a unified treatment that also includes the general non-Gaussian case as long as mild regularity conditions are satisfied, producing optimal non-linear summary statistics when appropriate. As a worked example, we derive explicitly the n optimal compressed statistics for Gaussian data in the general case where both the mean and covariance depend on the parameters.
Kathleen M. Bergen; Daniel G. Brown; James F. Rutherford; Eric J. Gustafson
2005-01-01
A ca. 1980 national-scale land-cover classification based on aerial photo interpretation was combined with 2000 AVHRR satellite imagery to derive land cover and land-cover change information for forest, urban, and agriculture categories over a seven-state region in the U.S. To derive useful land-cover change data using a heterogeneous dataset and to validate our...
Raymond L. Czaplewski
2005-01-01
Forest Service Research and Development (R&D) and State and Private Forestry Deputy Areas, in partnership with the National Forest System Remote Sensing Applications Center (RSAC), built a 250-m resolution (6.25-ha pixel) dataset for the entire USA. It assembles multi-seasonal hyperspectral MODIS data and derivatives, Landsat derivatives (i.e., summary statistics...
Petsch, Harold E.
1979-01-01
Statistical summaries of daily streamflow data for 189 stations west of the Continental Divide in Colorado are presented in this report. Duration tables, high-flow sequence tables, and low-flow sequence tables provide information about daily mean discharge. The mean, variance, standard deviation, skewness, and coefficient of variation are provided for monthly and annual flows. Percentages of average flow are provided for monthly flows and first-order serial-correlation coefficients are provided for annual flows. The text explain the nature and derivation of the data and illustrates applications of the tabulated information by examples. The data may be used by agencies and individuals engaged in water studies. (USGS)
Mutual interference between statistical summary perception and statistical learning.
Zhao, Jiaying; Ngo, Nhi; McKendrick, Ryan; Turk-Browne, Nicholas B
2011-09-01
The visual system is an efficient statistician, extracting statistical summaries over sets of objects (statistical summary perception) and statistical regularities among individual objects (statistical learning). Although these two kinds of statistical processing have been studied extensively in isolation, their relationship is not yet understood. We first examined how statistical summary perception influences statistical learning by manipulating the task that participants performed over sets of objects containing statistical regularities (Experiment 1). Participants who performed a summary task showed no statistical learning of the regularities, whereas those who performed control tasks showed robust learning. We then examined how statistical learning influences statistical summary perception by manipulating whether the sets being summarized contained regularities (Experiment 2) and whether such regularities had already been learned (Experiment 3). The accuracy of summary judgments improved when regularities were removed and when learning had occurred in advance. In sum, calculating summary statistics impeded statistical learning, and extracting statistical regularities impeded statistical summary perception. This mutual interference suggests that statistical summary perception and statistical learning are fundamentally related.
Petsch, Harold E.
1979-01-01
Statistical summaries of daily streamflow data for 246 stations east of the Continental Divide in Colorado and adjacent States are presented in this report. Duration tables, high-flow sequence tables, and low-flow sequence tables provide information about daily mean discharge. The mean, variance, standard deviation, skewness, and coefficient of variation are provided for monthly and annual flows. Percentages of average flow are provided for monthly flows and first-order serial-correlation coefficients are provided for annual flows. The text explains the nature and derivation of the data and illustrates applications of the tabulated information by examples. The data may be used by agencies and individuals engaged in water studies. (USGS)
IMNN: Information Maximizing Neural Networks
NASA Astrophysics Data System (ADS)
Charnock, Tom; Lavaux, Guilhem; Wandelt, Benjamin D.
2018-04-01
This software trains artificial neural networks to find non-linear functionals of data that maximize Fisher information: information maximizing neural networks (IMNNs). As compressing large data sets vastly simplifies both frequentist and Bayesian inference, important information may be inadvertently missed. Likelihood-free inference based on automatically derived IMNN summaries produces summaries that are good approximations to sufficient statistics. IMNNs are robustly capable of automatically finding optimal, non-linear summaries of the data even in cases where linear compression fails: inferring the variance of Gaussian signal in the presence of noise, inferring cosmological parameters from mock simulations of the Lyman-α forest in quasar spectra, and inferring frequency-domain parameters from LISA-like detections of gravitational waveforms. In this final case, the IMNN summary outperforms linear data compression by avoiding the introduction of spurious likelihood maxima.
Apprentices and Trainees 2014. Annual. Australian Vocational Education and Training Statistics
ERIC Educational Resources Information Center
National Centre for Vocational Education Research (NCVER), 2014
2014-01-01
This annual publication provides a summary of training activity in apprenticeships and traineeships in Australia, including information on training rates and duration of training, from 2004 to 2014. The figures in this publication are derived from the National Apprentice and Trainee Collection no. 83 (March, 2015 estimates), which is compiled…
Australian Vocational Education and Training Statistics: Apprentices and Trainees. Annual, 2008
ERIC Educational Resources Information Center
National Centre for Vocational Education Research (NCVER), 2009
2009-01-01
This annual publication provides a summary of training activity in apprenticeships and traineeships in Australia, from the period 1998 to 2008, including information on training rates, attrition rates, completion rates, training within the trades and duration of training. The figures in this publication are derived from the National Apprentice and…
Apprentices and Trainees 2016. Annual. Australian Vocational Education and Training Statistics:
ERIC Educational Resources Information Center
National Centre for Vocational Education Research (NCVER), 2017
2017-01-01
This annual publication provides a summary of training activity in apprenticeships and traineeships in Australia, including information on training rates and duration of training. The figures in this publication are derived from the National Apprentice and Trainee Collection no. 91 (March 2017 estimates), which is compiled under the Australian…
Guan, Yongtao; Li, Yehua; Sinha, Rajita
2011-01-01
In a cocaine dependence treatment study, we use linear and nonlinear regression models to model posttreatment cocaine craving scores and first cocaine relapse time. A subset of the covariates are summary statistics derived from baseline daily cocaine use trajectories, such as baseline cocaine use frequency and average daily use amount. These summary statistics are subject to estimation error and can therefore cause biased estimators for the regression coefficients. Unlike classical measurement error problems, the error we encounter here is heteroscedastic with an unknown distribution, and there are no replicates for the error-prone variables or instrumental variables. We propose two robust methods to correct for the bias: a computationally efficient method-of-moments-based method for linear regression models and a subsampling extrapolation method that is generally applicable to both linear and nonlinear regression models. Simulations and an application to the cocaine dependence treatment data are used to illustrate the efficacy of the proposed methods. Asymptotic theory and variance estimation for the proposed subsampling extrapolation method and some additional simulation results are described in the online supplementary material. PMID:21984854
Automatic physical inference with information maximizing neural networks
NASA Astrophysics Data System (ADS)
Charnock, Tom; Lavaux, Guilhem; Wandelt, Benjamin D.
2018-04-01
Compressing large data sets to a manageable number of summaries that are informative about the underlying parameters vastly simplifies both frequentist and Bayesian inference. When only simulations are available, these summaries are typically chosen heuristically, so they may inadvertently miss important information. We introduce a simulation-based machine learning technique that trains artificial neural networks to find nonlinear functionals of data that maximize Fisher information: information maximizing neural networks (IMNNs). In test cases where the posterior can be derived exactly, likelihood-free inference based on automatically derived IMNN summaries produces nearly exact posteriors, showing that these summaries are good approximations to sufficient statistics. In a series of numerical examples of increasing complexity and astrophysical relevance we show that IMNNs are robustly capable of automatically finding optimal, nonlinear summaries of the data even in cases where linear compression fails: inferring the variance of Gaussian signal in the presence of noise, inferring cosmological parameters from mock simulations of the Lyman-α forest in quasar spectra, and inferring frequency-domain parameters from LISA-like detections of gravitational waveforms. In this final case, the IMNN summary outperforms linear data compression by avoiding the introduction of spurious likelihood maxima. We anticipate that the automatic physical inference method described in this paper will be essential to obtain both accurate and precise cosmological parameter estimates from complex and large astronomical data sets, including those from LSST and Euclid.
Willis, Brian H; Riley, Richard D
2017-09-20
An important question for clinicians appraising a meta-analysis is: are the findings likely to be valid in their own practice-does the reported effect accurately represent the effect that would occur in their own clinical population? To this end we advance the concept of statistical validity-where the parameter being estimated equals the corresponding parameter for a new independent study. Using a simple ('leave-one-out') cross-validation technique, we demonstrate how we may test meta-analysis estimates for statistical validity using a new validation statistic, Vn, and derive its distribution. We compare this with the usual approach of investigating heterogeneity in meta-analyses and demonstrate the link between statistical validity and homogeneity. Using a simulation study, the properties of Vn and the Q statistic are compared for univariate random effects meta-analysis and a tailored meta-regression model, where information from the setting (included as model covariates) is used to calibrate the summary estimate to the setting of application. Their properties are found to be similar when there are 50 studies or more, but for fewer studies Vn has greater power but a higher type 1 error rate than Q. The power and type 1 error rate of Vn are also shown to depend on the within-study variance, between-study variance, study sample size, and the number of studies in the meta-analysis. Finally, we apply Vn to two published meta-analyses and conclude that it usefully augments standard methods when deciding upon the likely validity of summary meta-analysis estimates in clinical practice. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Alternative Natural Resource Monitoring Strategies in the Mexican States of Jalisco and Colima
Cele Aguirre-Bravo; Hans Schreuder
2005-01-01
This paper presents a strategy for inventorying and monitoring the natural resources in the Mexican states of Jalisco and Colima. The strategy emphasizes a strong linkage between remote sensing with field sampling design to produce statistical summaries and spatial estimates at multiple scales and resolution levels. Outputs derived from this strategy are expected to...
VET Student Outcomes 2017. Australian Vocational Education and Training Statistics
ERIC Educational Resources Information Center
National Centre for Vocational Education Research (NCVER), 2017
2017-01-01
This publication provides a summary of the outcomes of students who completed their vocational education and training (VET) in Australia during 2016. The outcomes are reported for students in receipt of Commonwealth or state funding and those who paid for their training by other means. The figures are derived from the National Student Outcomes…
Probabilistic Evaluation of Competing Climate Models
NASA Astrophysics Data System (ADS)
Braverman, A. J.; Chatterjee, S.; Heyman, M.; Cressie, N.
2017-12-01
A standard paradigm for assessing the quality of climate model simulations is to compare what these models produce for past and present time periods, to observations of the past and present. Many of these comparisons are based on simple summary statistics called metrics. Here, we propose an alternative: evaluation of competing climate models through probabilities derived from tests of the hypothesis that climate-model-simulated and observed time sequences share common climate-scale signals. The probabilities are based on the behavior of summary statistics of climate model output and observational data, over ensembles of pseudo-realizations. These are obtained by partitioning the original time sequences into signal and noise components, and using a parametric bootstrap to create pseudo-realizations of the noise sequences. The statistics we choose come from working in the space of decorrelated and dimension-reduced wavelet coefficients. We compare monthly sequences of CMIP5 model output of average global near-surface temperature anomalies to similar sequences obtained from the well-known HadCRUT4 data set, as an illustration.
Lloyd-Jones, Luke R; Robinson, Matthew R; Yang, Jian; Visscher, Peter M
2018-04-01
Genome-wide association studies (GWAS) have identified thousands of loci that are robustly associated with complex diseases. The use of linear mixed model (LMM) methodology for GWAS is becoming more prevalent due to its ability to control for population structure and cryptic relatedness and to increase power. The odds ratio (OR) is a common measure of the association of a disease with an exposure ( e.g. , a genetic variant) and is readably available from logistic regression. However, when the LMM is applied to all-or-none traits it provides estimates of genetic effects on the observed 0-1 scale, a different scale to that in logistic regression. This limits the comparability of results across studies, for example in a meta-analysis, and makes the interpretation of the magnitude of an effect from an LMM GWAS difficult. In this study, we derived transformations from the genetic effects estimated under the LMM to the OR that only rely on summary statistics. To test the proposed transformations, we used real genotypes from two large, publicly available data sets to simulate all-or-none phenotypes for a set of scenarios that differ in underlying model, disease prevalence, and heritability. Furthermore, we applied these transformations to GWAS summary statistics for type 2 diabetes generated from 108,042 individuals in the UK Biobank. In both simulation and real-data application, we observed very high concordance between the transformed OR from the LMM and either the simulated truth or estimates from logistic regression. The transformations derived and validated in this study improve the comparability of results from prospective and already performed LMM GWAS on complex diseases by providing a reliable transformation to a common comparative scale for the genetic effects. Copyright © 2018 by the Genetics Society of America.
Riley, Richard D.
2017-01-01
An important question for clinicians appraising a meta‐analysis is: are the findings likely to be valid in their own practice—does the reported effect accurately represent the effect that would occur in their own clinical population? To this end we advance the concept of statistical validity—where the parameter being estimated equals the corresponding parameter for a new independent study. Using a simple (‘leave‐one‐out’) cross‐validation technique, we demonstrate how we may test meta‐analysis estimates for statistical validity using a new validation statistic, Vn, and derive its distribution. We compare this with the usual approach of investigating heterogeneity in meta‐analyses and demonstrate the link between statistical validity and homogeneity. Using a simulation study, the properties of Vn and the Q statistic are compared for univariate random effects meta‐analysis and a tailored meta‐regression model, where information from the setting (included as model covariates) is used to calibrate the summary estimate to the setting of application. Their properties are found to be similar when there are 50 studies or more, but for fewer studies Vn has greater power but a higher type 1 error rate than Q. The power and type 1 error rate of Vn are also shown to depend on the within‐study variance, between‐study variance, study sample size, and the number of studies in the meta‐analysis. Finally, we apply Vn to two published meta‐analyses and conclude that it usefully augments standard methods when deciding upon the likely validity of summary meta‐analysis estimates in clinical practice. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28620945
Observers Exploit Stochastic Models of Sensory Change to Help Judge the Passage of Time
Ahrens, Misha B.; Sahani, Maneesh
2011-01-01
Summary Sensory stimulation can systematically bias the perceived passage of time [1–5], but why and how this happens is mysterious. In this report, we provide evidence that such biases may ultimately derive from an innate and adaptive use of stochastically evolving dynamic stimuli to help refine estimates derived from internal timekeeping mechanisms [6–15]. A simplified statistical model based on probabilistic expectations of stimulus change derived from the second-order temporal statistics of the natural environment [16, 17] makes three predictions. First, random noise-like stimuli whose statistics violate natural expectations should induce timing bias. Second, a previously unexplored obverse of this effect is that similar noise stimuli with natural statistics should reduce the variability of timing estimates. Finally, this reduction in variability should scale with the interval being timed, so as to preserve the overall Weber law of interval timing. All three predictions are borne out experimentally. Thus, in the context of our novel theoretical framework, these results suggest that observers routinely rely on sensory input to augment their sense of the passage of time, through a process of Bayesian inference based on expectations of change in the natural environment. PMID:21256018
Global, Regional, and National Fossil-Fuel CO2 Emissions, 1751 - 2008 (Version 2011)
Boden, Thomas A. [CDIAC, Oak Ridge National Laboratory; Marland, G. [CDIAC, Oak Ridge National Laboratory; Andres, Robert J. [CDIAC, Oak Ridge National Laboratory
2011-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2010), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2010) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions (1751 - 2010) (V. 2013)
Boden, Thomas A. [CDIAC, Oak Ridge National Laboratory; Andres, Robert J. [CDIAC, Oak Ridge National Laboratory; Marland, G.
2013-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2013), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2012) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions (1751 - 2014) (V. 2017)
Boden, T. A. [CDIAC, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (USA); Andres, R. J. [CDIAC, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (USA); Marland, G. [Appalachian State University, Boone, NC (USA)
2017-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2017), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2017) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions (1751 - 2013) (V. 2016)
Boden, T. A. [CDIAC, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (USA); Andres, R. J. [CDIAC, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (USA); Marland, G. [Appalachian State University, Boone, NC (USA)
2016-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2016), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2016) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions (1751 - 2011) (V. 2015)
Boden, T. A. [CDIAC, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (USA); Andres, R. J. [CDIAC, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (USA); Marland, G. [Appalachian State University Boone, NC (USA)
2015-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2014), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2014) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions (1751 - 2009) (V. 2012)
Boden, Thomas A. [CDIAC, Oak Ridge National Laboratory; Andres, Robert J. [Oak Ridge National Laboratory; Marland, G. [Research Institute for Environment, Energy and Economics, Appalachian State University
2012-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2012), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2011) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions, 1751 - 2007 (Version 2010)
Boden, Thomas A. [CDIAC, Oak Ridge National Laboratory; Marland, G. [CDIAC, Oak Ridge National Laboratory; Andres, Robert J. [CDIAC, Oak Ridge National Laboratory
2010-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2009), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2009) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Global, Regional, and National Fossil-Fuel CO2 Emissions, 1751 - 2006 (published 2009)
Boden, Thomas A. [CDIAC, Oak Ridge National Laboratory; Marland, G. [CDIAC, Oak Ridge National Laboratory; Andres, Robert J. [CDIAC, Oak Ridge National Laboratory
2009-01-01
Publications containing historical energy statistics make it possible to estimate fossil fuel CO2 emissions back to 1751. Etemad et al. (1991) published a summary compilation that tabulates coal, brown coal, peat, and crude oil production by nation and year. Footnotes in the Etemad et al.(1991) publication extend the energy statistics time series back to 1751. Summary compilations of fossil fuel trade were published by Mitchell (1983, 1992, 1993, 1995). Mitchell's work tabulates solid and liquid fuel imports and exports by nation and year. These pre-1950 production and trade data were digitized and CO2 emission calculations were made following the procedures discussed in Marland and Rotty (1984) and Boden et al. (1995). Further details on the contents and processing of the historical energy statistics are provided in Andres et al. (1999). The 1950 to present CO2 emission estimates are derived primarily from energy statistics published by the United Nations (2008), using the methods of Marland and Rotty (1984). The energy statistics were compiled primarily from annual questionnaires distributed by the U.N. Statistical Office and supplemented by official national statistical publications. As stated in the introduction of the Statistical Yearbook, "in a few cases, official sources are supplemented by other sources and estimates, where these have been subjected to professional scrutiny and debate and are consistent with other independent sources." Data from the U.S. Department of Interior's Geological Survey (USGS 2008) were used to estimate CO2 emitted during cement production. Values for emissions from gas flaring were derived primarily from U.N. data but were supplemented with data from the U.S. Department of Energy's Energy Information Administration (1994), Rotty (1974), and data provided by G. Marland. Greater details about these methods are provided in Marland and Rotty (1984), Boden et al. (1995), and Andres et al. (1999).
Atlas of current and potential future distributions of common trees of the eastern United States
Louis R. Iverson; Anantha M. Prasad; Betsy J. Hale; Elaine Kennedy Sutherland
1999-01-01
This atlas documents the current and possible future distribution of 80 common tree species in the Eastern United States and gives detailed information on environmental characteristics defining these distributions. Also included are outlines of life history characteristics and summary statistics for these species. Much of the data are derived from Forest Inventory and...
R2 & NE State - 2010 Census; Housing and Population Summary
The TIGER/Line Files are shapefiles and related database files (.dbf) that are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database (MTDB). The MTDB represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line File is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. States and equivalent entities are the primary governmental divisions of the United States. In addition to the fifty States, the Census Bureau treats the District of Columbia, Puerto Rico, and each of the Island Areas (American Samoa, the Commonwealth of the Northern Mariana Islands, Guam, and the U.S. Virgin Islands) as the statistical equivalents of States for the purpose of data presentation.This table contains housing data derived from the U.S. Census 2010 Summary file 1 database for states. The 2010 Summary File 1 (SF 1) contains data compiled from the 2010 Decennial Census questions. This table contains data on housing units, owner and rental.This table contains population data derived from the U.S. Census 2010 Summary file 1 database for states. The 2010 Summary File 1 (SF 1) contains data compiled from the 2010 Decennial Census questions. This table contains data on ancestry, age, and sex.
Liu, Dungang; Liu, Regina; Xie, Minge
2014-01-01
Meta-analysis has been widely used to synthesize evidence from multiple studies for common hypotheses or parameters of interest. However, it has not yet been fully developed for incorporating heterogeneous studies, which arise often in applications due to different study designs, populations or outcomes. For heterogeneous studies, the parameter of interest may not be estimable for certain studies, and in such a case, these studies are typically excluded from conventional meta-analysis. The exclusion of part of the studies can lead to a non-negligible loss of information. This paper introduces a metaanalysis for heterogeneous studies by combining the confidence density functions derived from the summary statistics of individual studies, hence referred to as the CD approach. It includes all the studies in the analysis and makes use of all information, direct as well as indirect. Under a general likelihood inference framework, this new approach is shown to have several desirable properties, including: i) it is asymptotically as efficient as the maximum likelihood approach using individual participant data (IPD) from all studies; ii) unlike the IPD analysis, it suffices to use summary statistics to carry out the CD approach. Individual-level data are not required; and iii) it is robust against misspecification of the working covariance structure of the parameter estimates. Besides its own theoretical significance, the last property also substantially broadens the applicability of the CD approach. All the properties of the CD approach are further confirmed by data simulated from a randomized clinical trials setting as well as by real data on aircraft landing performance. Overall, one obtains an unifying approach for combining summary statistics, subsuming many of the existing meta-analysis methods as special cases. PMID:26190875
Code of Federal Regulations, 2010 CFR
2010-10-01
... statistical summaries and other information it maintains? 40.111 Section 40.111 Transportation Office of the... Testing Laboratories § 40.111 When and how must a laboratory disclose statistical summaries and other information it maintains? (a) As a laboratory, you must transmit an aggregate statistical summary, by employer...
Evidence for a Global Sampling Process in Extraction of Summary Statistics of Item Sizes in a Set.
Tokita, Midori; Ueda, Sachiyo; Ishiguchi, Akira
2016-01-01
Several studies have shown that our visual system may construct a "summary statistical representation" over groups of visual objects. Although there is a general understanding that human observers can accurately represent sets of a variety of features, many questions on how summary statistics, such as an average, are computed remain unanswered. This study investigated sampling properties of visual information used by human observers to extract two types of summary statistics of item sets, average and variance. We presented three models of ideal observers to extract the summary statistics: a global sampling model without sampling noise, global sampling model with sampling noise, and limited sampling model. We compared the performance of an ideal observer of each model with that of human observers using statistical efficiency analysis. Results suggest that summary statistics of items in a set may be computed without representing individual items, which makes it possible to discard the limited sampling account. Moreover, the extraction of summary statistics may not necessarily require the representation of individual objects with focused attention when the sets of items are larger than 4.
NASA Technical Reports Server (NTRS)
Klemin, Alexander; Warner, Edward P; Denkinger, George M
1918-01-01
Part 1 gives details of models tested and methods of testing of the Eiffel 36 wing alone and the JN2 aircraft. Characteristics and performance curves for standard JN are included. Part 2 presents a statistical analysis of the following: lift and drag contributed by body and chassis tested without wings; lift and drag contributed by tail, tested without wings; the effect on lift and drift of interference between the wings of a biplane combination; lift and drag contributed by the addition of body, chassis, and tail to a biplane combination; total parasite resistance; effect of varying size of tail, keeping angle of setting constant; effect of varying length of body and size of tail at the same time, keeping constant moment of tail surface about the center of gravity; forces on the tail and the effects of downwash; effect of size and setting of tail on statical longitudinal stability effects of length of body on stability; the effects of the various elements of an airplane on longitudinal stability and the placing of the force vectors. Part 3 presents the fundamental principals of dynamical stability; computations of resistance derivatives; solution of the stability equation; dynamical stability of the Curtiss JN2; tabulation of resistance derivatives; discussion of the resistance derivatives; formation and solution of stability equations; physical conceptions of the resistance derivatives; elements contributing to damping and an investigation of low speed conditions. Part 4 includes a summary of the results of the statistical investigation and a summary of the results for dynamic stability.
Selecting Summary Statistics in Approximate Bayesian Computation for Calibrating Stochastic Models
Burr, Tom
2013-01-01
Approximate Bayesian computation (ABC) is an approach for using measurement data to calibrate stochastic computer models, which are common in biology applications. ABC is becoming the “go-to” option when the data and/or parameter dimension is large because it relies on user-chosen summary statistics rather than the full data and is therefore computationally feasible. One technical challenge with ABC is that the quality of the approximation to the posterior distribution of model parameters depends on the user-chosen summary statistics. In this paper, the user requirement to choose effective summary statistics in order to accurately estimate the posterior distribution of model parameters is investigated and illustrated by example, using a model and corresponding real data of mitochondrial DNA population dynamics. We show that for some choices of summary statistics, the posterior distribution of model parameters is closely approximated and for other choices of summary statistics, the posterior distribution is not closely approximated. A strategy to choose effective summary statistics is suggested in cases where the stochastic computer model can be run at many trial parameter settings, as in the example. PMID:24288668
Selecting summary statistics in approximate Bayesian computation for calibrating stochastic models.
Burr, Tom; Skurikhin, Alexei
2013-01-01
Approximate Bayesian computation (ABC) is an approach for using measurement data to calibrate stochastic computer models, which are common in biology applications. ABC is becoming the "go-to" option when the data and/or parameter dimension is large because it relies on user-chosen summary statistics rather than the full data and is therefore computationally feasible. One technical challenge with ABC is that the quality of the approximation to the posterior distribution of model parameters depends on the user-chosen summary statistics. In this paper, the user requirement to choose effective summary statistics in order to accurately estimate the posterior distribution of model parameters is investigated and illustrated by example, using a model and corresponding real data of mitochondrial DNA population dynamics. We show that for some choices of summary statistics, the posterior distribution of model parameters is closely approximated and for other choices of summary statistics, the posterior distribution is not closely approximated. A strategy to choose effective summary statistics is suggested in cases where the stochastic computer model can be run at many trial parameter settings, as in the example.
2012 Anthropometric Survey of U.S. Army Personnel: Methods and Summary Statistics
2014-12-05
s to complete. The software for participant scanning, CyScan for the whole-body and head scanners and INFOOT for the foot scanner...information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and ...design and engineering needs, as well as those anticipated well into the future . Ninety-four directly measured dimensions, 39 derived
DOT National Transportation Integrated Search
2001-01-01
The Bureau of Transportation Statistics (BTS) Airport Activity Statistics of Certificated Air Carriers: Summary Tables presents summary data for all scheduled and nonscheduled service by large certificated U.S. air carriers including the volume of pa...
19 CFR 141.61 - Completion of entry and entry summary documentation.
Code of Federal Regulations, 2010 CFR
2010-04-01
... on CBP Form 7501. (e) Statistical information—(1) Information required on entry summary or withdrawal... a separate statistical reporting number, the applicable information required by the General Statistical Notes, Harmonized Tariff Schedule of the United States (HTSUS), must be shown on the entry summary...
Linked Micromaps: Statistical Summaries in a Spatial Context
Communicating summaries of spatial data to decision makers and the public is challenging. We present a graphical method that provides both a geographic context and a statistical summary for such spatial data. Monitoring programs have a need for such geographical summaries. For ...
Evaluation and application of summary statistic imputation to discover new height-associated loci.
Rüeger, Sina; McDaid, Aaron; Kutalik, Zoltán
2018-05-01
As most of the heritability of complex traits is attributed to common and low frequency genetic variants, imputing them by combining genotyping chips and large sequenced reference panels is the most cost-effective approach to discover the genetic basis of these traits. Association summary statistics from genome-wide meta-analyses are available for hundreds of traits. Updating these to ever-increasing reference panels is very cumbersome as it requires reimputation of the genetic data, rerunning the association scan, and meta-analysing the results. A much more efficient method is to directly impute the summary statistics, termed as summary statistics imputation, which we improved to accommodate variable sample size across SNVs. Its performance relative to genotype imputation and practical utility has not yet been fully investigated. To this end, we compared the two approaches on real (genotyped and imputed) data from 120K samples from the UK Biobank and show that, genotype imputation boasts a 3- to 5-fold lower root-mean-square error, and better distinguishes true associations from null ones: We observed the largest differences in power for variants with low minor allele frequency and low imputation quality. For fixed false positive rates of 0.001, 0.01, 0.05, using summary statistics imputation yielded a decrease in statistical power by 9, 43 and 35%, respectively. To test its capacity to discover novel associations, we applied summary statistics imputation to the GIANT height meta-analysis summary statistics covering HapMap variants, and identified 34 novel loci, 19 of which replicated using data in the UK Biobank. Additionally, we successfully replicated 55 out of the 111 variants published in an exome chip study. Our study demonstrates that summary statistics imputation is a very efficient and cost-effective way to identify and fine-map trait-associated loci. Moreover, the ability to impute summary statistics is important for follow-up analyses, such as Mendelian randomisation or LD-score regression.
Evaluation and application of summary statistic imputation to discover new height-associated loci
2018-01-01
As most of the heritability of complex traits is attributed to common and low frequency genetic variants, imputing them by combining genotyping chips and large sequenced reference panels is the most cost-effective approach to discover the genetic basis of these traits. Association summary statistics from genome-wide meta-analyses are available for hundreds of traits. Updating these to ever-increasing reference panels is very cumbersome as it requires reimputation of the genetic data, rerunning the association scan, and meta-analysing the results. A much more efficient method is to directly impute the summary statistics, termed as summary statistics imputation, which we improved to accommodate variable sample size across SNVs. Its performance relative to genotype imputation and practical utility has not yet been fully investigated. To this end, we compared the two approaches on real (genotyped and imputed) data from 120K samples from the UK Biobank and show that, genotype imputation boasts a 3- to 5-fold lower root-mean-square error, and better distinguishes true associations from null ones: We observed the largest differences in power for variants with low minor allele frequency and low imputation quality. For fixed false positive rates of 0.001, 0.01, 0.05, using summary statistics imputation yielded a decrease in statistical power by 9, 43 and 35%, respectively. To test its capacity to discover novel associations, we applied summary statistics imputation to the GIANT height meta-analysis summary statistics covering HapMap variants, and identified 34 novel loci, 19 of which replicated using data in the UK Biobank. Additionally, we successfully replicated 55 out of the 111 variants published in an exome chip study. Our study demonstrates that summary statistics imputation is a very efficient and cost-effective way to identify and fine-map trait-associated loci. Moreover, the ability to impute summary statistics is important for follow-up analyses, such as Mendelian randomisation or LD-score regression. PMID:29782485
DISSCO: direct imputation of summary statistics allowing covariates
Xu, Zheng; Duan, Qing; Yan, Song; Chen, Wei; Li, Mingyao; Lange, Ethan; Li, Yun
2015-01-01
Background: Imputation of individual level genotypes at untyped markers using an external reference panel of genotyped or sequenced individuals has become standard practice in genetic association studies. Direct imputation of summary statistics can also be valuable, for example in meta-analyses where individual level genotype data are not available. Two methods (DIST and ImpG-Summary/LD), that assume a multivariate Gaussian distribution for the association summary statistics, have been proposed for imputing association summary statistics. However, both methods assume that the correlations between association summary statistics are the same as the correlations between the corresponding genotypes. This assumption can be violated in the presence of confounding covariates. Methods: We analytically show that in the absence of covariates, correlation among association summary statistics is indeed the same as that among the corresponding genotypes, thus serving as a theoretical justification for the recently proposed methods. We continue to prove that in the presence of covariates, correlation among association summary statistics becomes the partial correlation of the corresponding genotypes controlling for covariates. We therefore develop direct imputation of summary statistics allowing covariates (DISSCO). Results: We consider two real-life scenarios where the correlation and partial correlation likely make practical difference: (i) association studies in admixed populations; (ii) association studies in presence of other confounding covariate(s). Application of DISSCO to real datasets under both scenarios shows at least comparable, if not better, performance compared with existing correlation-based methods, particularly for lower frequency variants. For example, DISSCO can reduce the absolute deviation from the truth by 3.9–15.2% for variants with minor allele frequency <5%. Availability and implementation: http://www.unc.edu/∼yunmli/DISSCO. Contact: yunli@med.unc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25810429
DISSCO: direct imputation of summary statistics allowing covariates.
Xu, Zheng; Duan, Qing; Yan, Song; Chen, Wei; Li, Mingyao; Lange, Ethan; Li, Yun
2015-08-01
Imputation of individual level genotypes at untyped markers using an external reference panel of genotyped or sequenced individuals has become standard practice in genetic association studies. Direct imputation of summary statistics can also be valuable, for example in meta-analyses where individual level genotype data are not available. Two methods (DIST and ImpG-Summary/LD), that assume a multivariate Gaussian distribution for the association summary statistics, have been proposed for imputing association summary statistics. However, both methods assume that the correlations between association summary statistics are the same as the correlations between the corresponding genotypes. This assumption can be violated in the presence of confounding covariates. We analytically show that in the absence of covariates, correlation among association summary statistics is indeed the same as that among the corresponding genotypes, thus serving as a theoretical justification for the recently proposed methods. We continue to prove that in the presence of covariates, correlation among association summary statistics becomes the partial correlation of the corresponding genotypes controlling for covariates. We therefore develop direct imputation of summary statistics allowing covariates (DISSCO). We consider two real-life scenarios where the correlation and partial correlation likely make practical difference: (i) association studies in admixed populations; (ii) association studies in presence of other confounding covariate(s). Application of DISSCO to real datasets under both scenarios shows at least comparable, if not better, performance compared with existing correlation-based methods, particularly for lower frequency variants. For example, DISSCO can reduce the absolute deviation from the truth by 3.9-15.2% for variants with minor allele frequency <5%. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
The geographic mosaic of Ecuadorian Y-chromosome ancestry.
Toscanini, U; Gaviria, A; Pardo-Seco, J; Gómez-Carballa, A; Moscoso, F; Vela, M; Cobos, S; Lupero, A; Zambrano, A K; Martinón-Torres, F; Carabajo-Marcillo, A; Yunga-León, R; Ugalde-Noritz, N; Ordoñez-Ugalde, A; Salas, A
2018-03-01
Ecuadorians originated from a complex mixture of Native American indigenous people with Europeans and Africans. We analyzed Y-chromosome STRs (Y-STRs) in a sample of 415 Ecuadorians (145 using the AmpFlSTR ® Yfiler™ system [Life Technologies, USA] and 270 using the PowerPlex ® Y23 system [Promega Corp., USA]; hereafter Yfiler and PPY23, respectively) representing three main ecological continental regions of the country, namely Amazon rainforest, Andes, and Pacific coast. Diversity values are high in the three regions, and the PPY23 exhibits higher discrimination power than the Yfiler set. While summary statistics, AMOVA, and R ST distances show low to moderate levels of population stratification, inferred ancestry derived from Y-STRs reveal clear patterns of geographic variation. The major ancestry in Ecuadorian males is European (61%), followed by an important Native American component (34%); whereas the African ancestry (5%) is mainly concentrated in the Northwest corner of the country. We conclude that classical procedures for measuring population stratification do not have the desirable sensitivity. Statistical inference of ancestry from Y-STRS is a satisfactory alternative for revealing patterns of spatial variation that would pass unnoticed when using popular statistical summary indices. Copyright © 2017 Elsevier B.V. All rights reserved.
Chen, Wenan; McDonnell, Shannon K; Thibodeau, Stephen N; Tillmans, Lori S; Schaid, Daniel J
2016-11-01
Functional annotations have been shown to improve both the discovery power and fine-mapping accuracy in genome-wide association studies. However, the optimal strategy to incorporate the large number of existing annotations is still not clear. In this study, we propose a Bayesian framework to incorporate functional annotations in a systematic manner. We compute the maximum a posteriori solution and use cross validation to find the optimal penalty parameters. By extending our previous fine-mapping method CAVIARBF into this framework, we require only summary statistics as input. We also derived an exact calculation of Bayes factors using summary statistics for quantitative traits, which is necessary when a large proportion of trait variance is explained by the variants of interest, such as in fine mapping expression quantitative trait loci (eQTL). We compared the proposed method with PAINTOR using different strategies to combine annotations. Simulation results show that the proposed method achieves the best accuracy in identifying causal variants among the different strategies and methods compared. We also find that for annotations with moderate effects from a large annotation pool, screening annotations individually and then combining the top annotations can produce overly optimistic results. We applied these methods on two real data sets: a meta-analysis result of lipid traits and a cis-eQTL study of normal prostate tissues. For the eQTL data, incorporating annotations significantly increased the number of potential causal variants with high probabilities. Copyright © 2016 by the Genetics Society of America.
Weir, Christopher J; Butcher, Isabella; Assi, Valentina; Lewis, Stephanie C; Murray, Gordon D; Langhorne, Peter; Brady, Marian C
2018-03-07
Rigorous, informative meta-analyses rely on availability of appropriate summary statistics or individual participant data. For continuous outcomes, especially those with naturally skewed distributions, summary information on the mean or variability often goes unreported. While full reporting of original trial data is the ideal, we sought to identify methods for handling unreported mean or variability summary statistics in meta-analysis. We undertook two systematic literature reviews to identify methodological approaches used to deal with missing mean or variability summary statistics. Five electronic databases were searched, in addition to the Cochrane Colloquium abstract books and the Cochrane Statistics Methods Group mailing list archive. We also conducted cited reference searching and emailed topic experts to identify recent methodological developments. Details recorded included the description of the method, the information required to implement the method, any underlying assumptions and whether the method could be readily applied in standard statistical software. We provided a summary description of the methods identified, illustrating selected methods in example meta-analysis scenarios. For missing standard deviations (SDs), following screening of 503 articles, fifteen methods were identified in addition to those reported in a previous review. These included Bayesian hierarchical modelling at the meta-analysis level; summary statistic level imputation based on observed SD values from other trials in the meta-analysis; a practical approximation based on the range; and algebraic estimation of the SD based on other summary statistics. Following screening of 1124 articles for methods estimating the mean, one approximate Bayesian computation approach and three papers based on alternative summary statistics were identified. Illustrative meta-analyses showed that when replacing a missing SD the approximation using the range minimised loss of precision and generally performed better than omitting trials. When estimating missing means, a formula using the median, lower quartile and upper quartile performed best in preserving the precision of the meta-analysis findings, although in some scenarios, omitting trials gave superior results. Methods based on summary statistics (minimum, maximum, lower quartile, upper quartile, median) reported in the literature facilitate more comprehensive inclusion of randomised controlled trials with missing mean or variability summary statistics within meta-analyses.
Yang, Yi; Tokita, Midori; Ishiguchi, Akira
2018-01-01
A number of studies revealed that our visual system can extract different types of summary statistics, such as the mean and variance, from sets of items. Although the extraction of such summary statistics has been studied well in isolation, the relationship between these statistics remains unclear. In this study, we explored this issue using an individual differences approach. Observers viewed illustrations of strawberries and lollypops varying in size or orientation and performed four tasks in a within-subject design, namely mean and variance discrimination tasks with size and orientation domains. We found that the performances in the mean and variance discrimination tasks were not correlated with each other and demonstrated that extractions of the mean and variance are mediated by different representation mechanisms. In addition, we tested the relationship between performances in size and orientation domains for each summary statistic (i.e. mean and variance) and examined whether each summary statistic has distinct processes across perceptual domains. The results illustrated that statistical summary representations of size and orientation may share a common mechanism for representing the mean and possibly for representing variance. Introspections for each observer performing the tasks were also examined and discussed.
Ruehland, Warren R; O'Donoghue, Fergal J; Pierce, Robert J; Thornton, Andrew T; Singh, Parmjit; Copland, Janet M; Stevens, Bronwyn; Rochford, Peter D
2011-01-01
To examine the impact of using American Academy of Sleep Medicine (AASM) recommended EEG derivations (F4/M1, C4/M1, O2/M1) vs. a single derivation (C4/M1) in polysomnography (PSG) on the measurement of sleep and cortical arousals, including inter- and intra-observer variability. Prospective, non-blinded, randomized comparison. Three Australian tertiary-care hospital clinical sleep laboratories. 30 PSGs from consecutive patients investigated for obstructive sleep apnea (OSA) during December 2007 and January 2008. N/A. To examine the impact of EEG derivations on PSG summary statistics, 3 scorers from different Australian clinical sleep laboratories each scored separate sets of 10 PSGs twice, once using 3 EEG derivations and once using 1 EEG derivation. To examine the impact on inter- and intra-scorer reliability, all 3 scorers scored a subset of 10 PSGs 4 times, twice using each method. All PSGs were de-identified and scored in random order according to the 2007 AASM Manual for the Scoring of Sleep and Associated Events. Using 3 referential EEG derivations during PSG, as recommended in the AASM manual, instead of a single central EEG derivation, as originally suggested by Rechtschaffen and Kales (1968), resulted in a mean ± SE decrease in N1 sleep of 9.6 ± 3.9 min (P = 0.018) and an increase in N3 sleep of 10.6 ± 2.8 min (P = 0.001). No significant differences were observed for any other sleep or arousal scoring summary statistics; nor were any differences observed in inter-scorer or intra-scorer reliability for scoring sleep or cortical arousals. This study provides information for those changing practice to comply with the 2007 AASM recommendations for EEG placement in PSG, for those using portable devices that are unable to comply with the recommendations due to limited channel options, and for the development of future standards for PSG scoring and recording. As the use of multiple EEG derivations only led to small changes in the distribution of derived sleep stages and no significant differences in scoring reliability, this study calls into question the need to use multiple EEG derivations in clinical PSG as suggested in the AASM manual.
PASSALI, D.; CARUSO, G.; ARIGLIANO, L.C.; PASSALI, F.M.; BELLUSSI, L.
2012-01-01
SUMMARY Obstructive sleep apnoea syndrome (OSAS) results from upper airway collapse during sleep. It represents an increasingly recognized pathology associated with many diseases. Herein, we describe a database for patients with OSAS. This has different goals: to facilitate good uniformity in clinical assessment, to allow the use of the application even by non-ENT specialists, to evaluate the results of medical and/or surgical treatments and to enable a statistical meta-analysis derived from the data collected in many OSAS medical centres. PMID:23093815
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
1995-03-01
This volume presents the following appendices: ceramic test specimen drawings and schematics, mixed-mode and biaxial stress fracture of structural ceramics for advanced vehicular heat engines (U. Utah), mode I/mode II fracture toughness and tension/torsion fracture strength of NT154 Si nitride (Brown U.), summary of strength test results and fractography, fractography photographs, derivations of statistical models, Weibull strength plots for fast fracture test specimens, and size functions.
Exploring Marine Corps Officer Quality: An Analysis of Promotion to Lieutenant Colonel
2017-03-01
44 G. DESCRIPTIVE STATISTICS ................................................................44 1. Dependent...Variable Summary Statistics ...................................44 2. Performance...87 4. Further Research .........................................................................88 APPENDIX A. SUMMARY STATISTICS OF FITREP AND
Yang, Yi; Tokita, Midori; Ishiguchi, Akira
2018-01-01
A number of studies revealed that our visual system can extract different types of summary statistics, such as the mean and variance, from sets of items. Although the extraction of such summary statistics has been studied well in isolation, the relationship between these statistics remains unclear. In this study, we explored this issue using an individual differences approach. Observers viewed illustrations of strawberries and lollypops varying in size or orientation and performed four tasks in a within-subject design, namely mean and variance discrimination tasks with size and orientation domains. We found that the performances in the mean and variance discrimination tasks were not correlated with each other and demonstrated that extractions of the mean and variance are mediated by different representation mechanisms. In addition, we tested the relationship between performances in size and orientation domains for each summary statistic (i.e. mean and variance) and examined whether each summary statistic has distinct processes across perceptual domains. The results illustrated that statistical summary representations of size and orientation may share a common mechanism for representing the mean and possibly for representing variance. Introspections for each observer performing the tasks were also examined and discussed. PMID:29399318
2015-12-01
WAIVERS ..............................................................................................49 APPENDIX C. DESCRIPTIVE STATISTICS ... Statistics of Dependent Variables. .............................................23 Table 6. Summary Statistics of Academics Variables...24 Table 7. Summary Statistics of Application Variables ............................................25 Table 8
DOT National Transportation Integrated Search
2001-01-01
Airport Activity Statistics of Certificated Air Carriers: Summary Tables presents summary data for : all scheduled and nonscheduled service by large certificated U.S. air carriersincluding the volume : of passenger, freight, and mail enplanements,...
Network-Level Analysis of Cortical Thickness of the Epileptic Brain
Raj, A; Mueller, S.G; Young, K; Laxer, K.D.; Weiner, M
2010-01-01
Temporal lobe epilepsy (TLE) characterized by an epileptogenic focus in the medial temporal lobe is the most common form of focal epilepsy. However, the seizures are not confined to the temporal lobe but can spread to other, anatomically connected brain regions where they can cause similar structural abnormalities as observed in the focus. The aim of this study was to derive whole brain networks from volumetric data and obtain network-centric measures which can capture cortical thinning characteristic for TLE and can be used for classifying a given MRI into TLE or normal, and to obtain additional summary statistics which relate to the extent and spread of the disease. T1 weighted whole brain images were acquired on a 4T magnet in 13 patients with TLE with mesial temporal lobe sclerosis (TLE-MTS), 14 patients with TLE with normal MRI (TLE-no) and 30 controls. Mean cortical thickness and curvature measurements were obtained using the Freesurfer software. These values were used to derive a graph, or network, for each subject. The nodes of the graph are brain regions, and edges represent disease progression paths. We show how to obtain summary statistics like mean, median and variance defined for these networks and to perform exploratory analyses like correlation and classification. Our results indicate that the proposed network approach can improve accuracy of classifying subjects into 2 groups (control and TLE), from 78% for non-network classifiers to 93% using the proposed approach. We also obtain network “peakiness” values using statistical measures like entropy and complexity - this appears to be a good characterizer of the disease, and may have utility in surgical planning. PMID:20553893
Epidemiological data on US coal miners' pneumoconiosis, 1960 to 1988.
Attfield, M D; Castellan, R M
1992-07-01
Statistics on prevalence of pneumoconiosis among working underground coal miners based on epidemiologic data collected between 1960 and 1988 are presented. The main intent was to examine the time-related trend in prevalence, particularly after 1969, when substantially lower dust levels were mandated by federal act. Data from studies undertaken between 1960 and 1968 were collected and compared. Information for the period 1969 to 1988 was extracted from a large ongoing national epidemiologic study. Tenure-specific prevalence rates and summary statistics derived from the latter data for four consecutive time intervals within the 19-year period were calculated and compared. The results indicate a reduction in pneumoconiosis over time. The trend is similar to that seen in a large radiologic surveillance program of underground miners operated concurrently. Although such factors as x-ray reader variation, changes in x-ray standards, and worker self-selection for examination may have influenced the findings to some extent, adjusted summary rates reveal a reduction in prevalence concurrent with reductions in coal mine dust levels mandated by federal act in 1969.
Replicability of time-varying connectivity patterns in large resting state fMRI samples.
Abrol, Anees; Damaraju, Eswar; Miller, Robyn L; Stephen, Julia M; Claus, Eric D; Mayer, Andrew R; Calhoun, Vince D
2017-12-01
The past few years have seen an emergence of approaches that leverage temporal changes in whole-brain patterns of functional connectivity (the chronnectome). In this chronnectome study, we investigate the replicability of the human brain's inter-regional coupling dynamics during rest by evaluating two different dynamic functional network connectivity (dFNC) analysis frameworks using 7 500 functional magnetic resonance imaging (fMRI) datasets. To quantify the extent to which the emergent functional connectivity (FC) patterns are reproducible, we characterize the temporal dynamics by deriving several summary measures across multiple large, independent age-matched samples. Reproducibility was demonstrated through the existence of basic connectivity patterns (FC states) amidst an ensemble of inter-regional connections. Furthermore, application of the methods to conservatively configured (statistically stationary, linear and Gaussian) surrogate datasets revealed that some of the studied state summary measures were indeed statistically significant and also suggested that this class of null model did not explain the fMRI data fully. This extensive testing of reproducibility of similarity statistics also suggests that the estimated FC states are robust against variation in data quality, analysis, grouping, and decomposition methods. We conclude that future investigations probing the functional and neurophysiological relevance of time-varying connectivity assume critical importance. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Replicability of time-varying connectivity patterns in large resting state fMRI samples
Abrol, Anees; Damaraju, Eswar; Miller, Robyn L.; Stephen, Julia M.; Claus, Eric D.; Mayer, Andrew R.; Calhoun, Vince D.
2018-01-01
The past few years have seen an emergence of approaches that leverage temporal changes in whole-brain patterns of functional connectivity (the chronnectome). In this chronnectome study, we investigate the replicability of the human brain’s inter-regional coupling dynamics during rest by evaluating two different dynamic functional network connectivity (dFNC) analysis frameworks using 7 500 functional magnetic resonance imaging (fMRI) datasets. To quantify the extent to which the emergent functional connectivity (FC) patterns are reproducible, we characterize the temporal dynamics by deriving several summary measures across multiple large, independent age-matched samples. Reproducibility was demonstrated through the existence of basic connectivity patterns (FC states) amidst an ensemble of inter-regional connections. Furthermore, application of the methods to conservatively configured (statistically stationary, linear and Gaussian) surrogate datasets revealed that some of the studied state summary measures were indeed statistically significant and also suggested that this class of null model did not explain the fMRI data fully. This extensive testing of reproducibility of similarity statistics also suggests that the estimated FC states are robust against variation in data quality, analysis, grouping, and decomposition methods. We conclude that future investigations probing the functional and neurophysiological relevance of time-varying connectivity assume critical importance. PMID:28916181
“Plateau”-related summary statistics are uninformative for comparing working memory models
van den Berg, Ronald; Ma, Wei Ji
2014-01-01
Performance on visual working memory tasks decreases as more items need to be remembered. Over the past decade, a debate has unfolded between proponents of slot models and slotless models of this phenomenon. Zhang and Luck (2008) and Anderson, Vogel, and Awh (2011) noticed that as more items need to be remembered, “memory noise” seems to first increase and then reach a “stable plateau.” They argued that three summary statistics characterizing this plateau are consistent with slot models, but not with slotless models. Here, we assess the validity of their methods. We generated synthetic data both from a leading slot model and from a recent slotless model and quantified model evidence using log Bayes factors. We found that the summary statistics provided, at most, 0.15% of the expected model evidence in the raw data. In a model recovery analysis, a total of more than a million trials were required to achieve 99% correct recovery when models were compared on the basis of summary statistics, whereas fewer than 1,000 trials were sufficient when raw data were used. At realistic numbers of trials, plateau-related summary statistics are completely unreliable for model comparison. Applying the same analyses to subject data from Anderson et al. (2011), we found that the evidence in the summary statistics was, at most, 0.12% of the evidence in the raw data and far too weak to warrant any conclusions. These findings call into question claims about working memory that are based on summary statistics. PMID:24719235
Statistical summaries of selected Iowa streamflow data through September 2013
Eash, David A.; O'Shea, Padraic S.; Weber, Jared R.; Nguyen, Kevin T.; Montgomery, Nicholas L.; Simonson, Adrian J.
2016-01-04
Statistical summaries of streamflow data collected at 184 streamgages in Iowa are presented in this report. All streamgages included for analysis have at least 10 years of continuous record collected before or through September 2013. This report is an update to two previously published reports that presented statistical summaries of selected Iowa streamflow data through September 1988 and September 1996. The statistical summaries include (1) monthly and annual flow durations, (2) annual exceedance probabilities of instantaneous peak discharges (flood frequencies), (3) annual exceedance probabilities of high discharges, and (4) annual nonexceedance probabilities of low discharges and seasonal low discharges. Also presented for each streamgage are graphs of the annual mean discharges, mean annual mean discharges, 50-percent annual flow-duration discharges (median flows), harmonic mean flows, mean daily mean discharges, and flow-duration curves. Two sets of statistical summaries are presented for each streamgage, which include (1) long-term statistics for the entire period of streamflow record and (2) recent-term statistics for or during the 30-year period of record from 1984 to 2013. The recent-term statistics are only calculated for streamgages with streamflow records pre-dating the 1984 water year and with at least 10 years of record during 1984–2013. The streamflow statistics in this report are not adjusted for the effects of water use; although some of this water is used consumptively, most of it is returned to the streams.
Multiple Phenotype Association Tests Using Summary Statistics in Genome-Wide Association Studies
Liu, Zhonghua; Lin, Xihong
2017-01-01
Summary We study in this paper jointly testing the associations of a genetic variant with correlated multiple phenotypes using the summary statistics of individual phenotype analysis from Genome-Wide Association Studies (GWASs). We estimated the between-phenotype correlation matrix using the summary statistics of individual phenotype GWAS analyses, and developed genetic association tests for multiple phenotypes by accounting for between-phenotype correlation without the need to access individual-level data. Since genetic variants often affect multiple phenotypes differently across the genome and the between-phenotype correlation can be arbitrary, we proposed robust and powerful multiple phenotype testing procedures by jointly testing a common mean and a variance component in linear mixed models for summary statistics. We computed the p-values of the proposed tests analytically. This computational advantage makes our methods practically appealing in large-scale GWASs. We performed simulation studies to show that the proposed tests maintained correct type I error rates, and to compare their powers in various settings with the existing methods. We applied the proposed tests to a GWAS Global Lipids Genetics Consortium summary statistics data set and identified additional genetic variants that were missed by the original single-trait analysis. PMID:28653391
19 CFR 103.31 - Information on vessel manifests and summary statistical reports.
Code of Federal Regulations, 2011 CFR
2011-04-01
... statistical reports. 103.31 Section 103.31 Customs Duties U.S. CUSTOMS AND BORDER PROTECTION, DEPARTMENT OF... Restricted Access § 103.31 Information on vessel manifests and summary statistical reports. (a) Disclosure to... statistical reports of imports and exports and to copy therefrom for publication information and data subject...
19 CFR 103.31 - Information on vessel manifests and summary statistical reports.
Code of Federal Regulations, 2010 CFR
2010-04-01
... statistical reports. 103.31 Section 103.31 Customs Duties U.S. CUSTOMS AND BORDER PROTECTION, DEPARTMENT OF... Restricted Access § 103.31 Information on vessel manifests and summary statistical reports. (a) Disclosure to... statistical reports of imports and exports and to copy therefrom for publication information and data subject...
Fast and accurate imputation of summary statistics enhances evidence of functional enrichment
Pasaniuc, Bogdan; Zaitlen, Noah; Shi, Huwenbo; Bhatia, Gaurav; Gusev, Alexander; Pickrell, Joseph; Hirschhorn, Joel; Strachan, David P.; Patterson, Nick; Price, Alkes L.
2014-01-01
Motivation: Imputation using external reference panels (e.g. 1000 Genomes) is a widely used approach for increasing power in genome-wide association studies and meta-analysis. Existing hidden Markov models (HMM)-based imputation approaches require individual-level genotypes. Here, we develop a new method for Gaussian imputation from summary association statistics, a type of data that is becoming widely available. Results: In simulations using 1000 Genomes (1000G) data, this method recovers 84% (54%) of the effective sample size for common (>5%) and low-frequency (1–5%) variants [increasing to 87% (60%) when summary linkage disequilibrium information is available from target samples] versus the gold standard of 89% (67%) for HMM-based imputation, which cannot be applied to summary statistics. Our approach accounts for the limited sample size of the reference panel, a crucial step to eliminate false-positive associations, and it is computationally very fast. As an empirical demonstration, we apply our method to seven case–control phenotypes from the Wellcome Trust Case Control Consortium (WTCCC) data and a study of height in the British 1958 birth cohort (1958BC). Gaussian imputation from summary statistics recovers 95% (105%) of the effective sample size (as quantified by the ratio of χ2 association statistics) compared with HMM-based imputation from individual-level genotypes at the 227 (176) published single nucleotide polymorphisms (SNPs) in the WTCCC (1958BC height) data. In addition, for publicly available summary statistics from large meta-analyses of four lipid traits, we publicly release imputed summary statistics at 1000G SNPs, which could not have been obtained using previously published methods, and demonstrate their accuracy by masking subsets of the data. We show that 1000G imputation using our approach increases the magnitude and statistical evidence of enrichment at genic versus non-genic loci for these traits, as compared with an analysis without 1000G imputation. Thus, imputation of summary statistics will be a valuable tool in future functional enrichment analyses. Availability and implementation: Publicly available software package available at http://bogdan.bioinformatics.ucla.edu/software/. Contact: bpasaniuc@mednet.ucla.edu or aprice@hsph.harvard.edu Supplementary information: Supplementary materials are available at Bioinformatics online. PMID:24990607
Ruehland, Warren R.; O'Donoghue, Fergal J.; Pierce, Robert J.; Thornton, Andrew T.; Singh, Parmjit; Copland, Janet M.; Stevens, Bronwyn; Rochford, Peter D.
2011-01-01
Study Objective: To examine the impact of using American Academy of Sleep Medicine (AASM) recommended EEG derivations (F4/M1, C4/M1, O2/M1) vs. a single derivation (C4/M1) in polysomnography (PSG) on the measurement of sleep and cortical arousals, including inter- and intra-observer variability. Design: Prospective, non-blinded, randomized comparison. Setting: Three Australian tertiary-care hospital clinical sleep laboratories. Patients or Participants: 30 PSGs from consecutive patients investigated for obstructive sleep apnea (OSA) during December 2007 and January 2008. Interventions: N/A Measurements and Results: To examine the impact of EEG derivations on PSG summary statistics, 3 scorers from different Australian clinical sleep laboratories each scored separate sets of 10 PSGs twice, once using 3 EEG derivations and once using 1 EEG derivation. To examine the impact on inter- and intra-scorer reliability, all 3 scorers scored a subset of 10 PSGs 4 times, twice using each method. All PSGs were de-identified and scored in random order according to the 2007 AASM Manual for the Scoring of Sleep and Associated Events. Using 3 referential EEG derivations during PSG, as recommended in the AASM manual, instead of a single central EEG derivation, as originally suggested by Rechtschaffen and Kales (1968), resulted in a mean ± SE decrease in N1 sleep of 9.6 ± 3.9 min (P = 0.018) and an increase in N3 sleep of 10.6 ± 2.8 min (P = 0.001). No significant differences were observed for any other sleep or arousal scoring summary statistics; nor were any differences observed in inter-scorer or intra-scorer reliability for scoring sleep or cortical arousals. Conclusion: This study provides information for those changing practice to comply with the 2007 AASM recommendations for EEG placement in PSG, for those using portable devices that are unable to comply with the recommendations due to limited channel options, and for the development of future standards for PSG scoring and recording. As the use of multiple EEG derivations only led to small changes in the distribution of derived sleep stages and no significant differences in scoring reliability, this study calls into question the need to use multiple EEG derivations in clinical PSG as suggested in the AASM manual. Citation: Ruehland WR; O'Donoghue FJ; Pierce RJ; Thornton AT; Singh P; Copland JM; Stevens B; Rochford PD. The 2007 AASM recommendations for EEG electrode placement in polysomnography: impact on sleep and cortical arousal scoring. SLEEP 2011;34(1):73-81. PMID:21203376
"Plateau"-related summary statistics are uninformative for comparing working memory models.
van den Berg, Ronald; Ma, Wei Ji
2014-10-01
Performance on visual working memory tasks decreases as more items need to be remembered. Over the past decade, a debate has unfolded between proponents of slot models and slotless models of this phenomenon (Ma, Husain, Bays (Nature Neuroscience 17, 347-356, 2014). Zhang and Luck (Nature 453, (7192), 233-235, 2008) and Anderson, Vogel, and Awh (Attention, Perception, Psychophys 74, (5), 891-910, 2011) noticed that as more items need to be remembered, "memory noise" seems to first increase and then reach a "stable plateau." They argued that three summary statistics characterizing this plateau are consistent with slot models, but not with slotless models. Here, we assess the validity of their methods. We generated synthetic data both from a leading slot model and from a recent slotless model and quantified model evidence using log Bayes factors. We found that the summary statistics provided at most 0.15 % of the expected model evidence in the raw data. In a model recovery analysis, a total of more than a million trials were required to achieve 99 % correct recovery when models were compared on the basis of summary statistics, whereas fewer than 1,000 trials were sufficient when raw data were used. Therefore, at realistic numbers of trials, plateau-related summary statistics are highly unreliable for model comparison. Applying the same analyses to subject data from Anderson et al. (Attention, Perception, Psychophys 74, (5), 891-910, 2011), we found that the evidence in the summary statistics was at most 0.12 % of the evidence in the raw data and far too weak to warrant any conclusions. The evidence in the raw data, in fact, strongly favored the slotless model. These findings call into question claims about working memory that are based on summary statistics.
Lin, Kao; Li, Haipeng; Schlötterer, Christian; Futschik, Andreas
2011-01-01
Summary statistics are widely used in population genetics, but they suffer from the drawback that no simple sufficient summary statistic exists, which captures all information required to distinguish different evolutionary hypotheses. Here, we apply boosting, a recent statistical method that combines simple classification rules to maximize their joint predictive performance. We show that our implementation of boosting has a high power to detect selective sweeps. Demographic events, such as bottlenecks, do not result in a large excess of false positives. A comparison to other neutrality tests shows that our boosting implementation performs well compared to other neutrality tests. Furthermore, we evaluated the relative contribution of different summary statistics to the identification of selection and found that for recent sweeps integrated haplotype homozygosity is very informative whereas older sweeps are better detected by Tajima's π. Overall, Watterson's θ was found to contribute the most information for distinguishing between bottlenecks and selection. PMID:21041556
Certification Can Count: The Case of Aircraft Mechanics. Issues in Labor Statistics. Summary 02-03.
ERIC Educational Resources Information Center
Bureau of Labor Statistics, Washington, DC.
This document is a summary of aerospace industry technician statistics gathered by the Occupational Employment Statistics Survey for the year 2000 by the Department of Labor, Bureau of Labor Statistics. The data includes the following: (1) a comparison of wages earned by Federal Aviation Administration (FAA) certified and non-FAA certified…
Wilburn, D.R.; Stanley, K.A.
2013-01-01
This summary of international mineral exploration activities for 2012 draws upon information from industry sources, published literature and U.S. Geological Survey (USGS) specialists. The summary provides data on exploration budgets by region and mineral commodity, identifies significant mineral discoveries and areas of mineral exploration, discusses government programs affecting the mineral exploration industry and presents analyses of exploration activities performed by the mineral industry. Three sources of information are reported and analyzed in this annual review of international exploration for 2012: 1) budgetary statistics expressed in U.S. nominal dollars provided by SNL Metals Economics Group (MEG) of Halifax, Nova Scotia; 2) regional and site-specific exploration activities that took place in 2012 as compiled by the USGS and 3) regional events including economic, social and political conditions that affected exploration activities, which were derived from published sources and unpublished discussions with USGS and industry specialists.
ERIC Educational Resources Information Center
Williams, Immanuel James; Williams, Kelley Kim
2016-01-01
Understanding summary statistics and graphical techniques are building blocks to comprehending concepts beyond basic statistics. It's known that motivated students perform better in school. Using examples that students find engaging allows them to understand the concepts at a deeper level.
Hanford Works monthly report, October 1952
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1952-11-20
this document presents a summary of work and progress at the Hanford Engineer works for October 1952. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Works monthly report, February 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1953-03-18
This document presents a summary of work and progress at the Hanford Engineer Works for February 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Service departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Works monthly report, August 1952
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1952-09-24
This document presents a summary of work and progress at the Hanford Engineer Works for August 1952. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department` section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical,Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Real Estatemore » and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Works monthly report, September 1952
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1952-10-20
This document presents a summary of work and progress at the Hanford Engineer Works for September 1952. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Pope, Larry M.; Diaz, A.M.
1982-01-01
Quality-of-water data, collected October 21-23, 1980, and a statistical summary are presented for 42 coal-mined strip pits in Crawford and Cherokee Counties, Southeastern Kansas. The statistical summary includes minimum and maximum observed values , mean, and standard deviation. Simple linear regression equations relating specific conductance, dissolved solids, and acidity to concentrations of dissolved solids, sulfate, calcium, and magnesium, potassium, aluminum, and iron are also presented. (USGS)
Inferring time derivatives including cell growth rates using Gaussian processes
NASA Astrophysics Data System (ADS)
Swain, Peter S.; Stevenson, Keiran; Leary, Allen; Montano-Gutierrez, Luis F.; Clark, Ivan B. N.; Vogel, Jackie; Pilizota, Teuta
2016-12-01
Often the time derivative of a measured variable is of as much interest as the variable itself. For a growing population of biological cells, for example, the population's growth rate is typically more important than its size. Here we introduce a non-parametric method to infer first and second time derivatives as a function of time from time-series data. Our approach is based on Gaussian processes and applies to a wide range of data. In tests, the method is at least as accurate as others, but has several advantages: it estimates errors both in the inference and in any summary statistics, such as lag times, and allows interpolation with the corresponding error estimation. As illustrations, we infer growth rates of microbial cells, the rate of assembly of an amyloid fibril and both the speed and acceleration of two separating spindle pole bodies. Our algorithm should thus be broadly applicable.
Code of Federal Regulations, 2012 CFR
2012-10-01
... Secretary of Transportation PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.111 When and how must a laboratory disclose statistical summaries and other... a report indicating that not enough testing was conducted to warrant a summary. You may transmit the...
Code of Federal Regulations, 2014 CFR
2014-10-01
... Secretary of Transportation PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.111 When and how must a laboratory disclose statistical summaries and other... a report indicating that not enough testing was conducted to warrant a summary. You may transmit the...
Code of Federal Regulations, 2011 CFR
2011-10-01
... Secretary of Transportation PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.111 When and how must a laboratory disclose statistical summaries and other... a report indicating that not enough testing was conducted to warrant a summary. You may transmit the...
Code of Federal Regulations, 2013 CFR
2013-10-01
... Secretary of Transportation PROCEDURES FOR TRANSPORTATION WORKPLACE DRUG AND ALCOHOL TESTING PROGRAMS Drug Testing Laboratories § 40.111 When and how must a laboratory disclose statistical summaries and other... a report indicating that not enough testing was conducted to warrant a summary. You may transmit the...
2012 statistical summaries : FTA grant assistance programs.
DOT National Transportation Integrated Search
2013-12-01
The 2012 Statistical Summaries provides information about the Federal Transit Administrations (FTA) major financial aid programs for : Federal Fiscal Year (FY) 2012. The report covers the following programs: Urbanized Area Formula, Non-urbanized A...
2011 statistical summaries : FTA grant assistance programs.
DOT National Transportation Integrated Search
2013-05-01
The 2011 Statistical Summaries provides information about the Federal Transit Administrations (FTA) major financial aid programs for Federal Fiscal Year (FY) 2011. The report covers the following programs: Urbanized Area Formula, Non-urbanized Are...
2010 statistical summaries : FTA grant assistance programs.
DOT National Transportation Integrated Search
2013-07-01
The 2010 Statistical Summaries provides information about the Federal Transit Administrations (FTA) major financial aid programs for Federal Fiscal Year (FY) 2010. The report covers the following programs: Urbanized Area Formula, Non-urbanized Are...
American Recovery and Reinvestment Act (ARRA) statistical summaries.
DOT National Transportation Integrated Search
2012-05-01
The American Recovery and Reinvestment Act (ARRA) Statistical Summaries provide information about the Federal Transit Administrations (FTA) financial investment programs funded through ARRA.This report covers the Urbanized Area Formula Program and...
Fast and accurate imputation of summary statistics enhances evidence of functional enrichment.
Pasaniuc, Bogdan; Zaitlen, Noah; Shi, Huwenbo; Bhatia, Gaurav; Gusev, Alexander; Pickrell, Joseph; Hirschhorn, Joel; Strachan, David P; Patterson, Nick; Price, Alkes L
2014-10-15
Imputation using external reference panels (e.g. 1000 Genomes) is a widely used approach for increasing power in genome-wide association studies and meta-analysis. Existing hidden Markov models (HMM)-based imputation approaches require individual-level genotypes. Here, we develop a new method for Gaussian imputation from summary association statistics, a type of data that is becoming widely available. In simulations using 1000 Genomes (1000G) data, this method recovers 84% (54%) of the effective sample size for common (>5%) and low-frequency (1-5%) variants [increasing to 87% (60%) when summary linkage disequilibrium information is available from target samples] versus the gold standard of 89% (67%) for HMM-based imputation, which cannot be applied to summary statistics. Our approach accounts for the limited sample size of the reference panel, a crucial step to eliminate false-positive associations, and it is computationally very fast. As an empirical demonstration, we apply our method to seven case-control phenotypes from the Wellcome Trust Case Control Consortium (WTCCC) data and a study of height in the British 1958 birth cohort (1958BC). Gaussian imputation from summary statistics recovers 95% (105%) of the effective sample size (as quantified by the ratio of [Formula: see text] association statistics) compared with HMM-based imputation from individual-level genotypes at the 227 (176) published single nucleotide polymorphisms (SNPs) in the WTCCC (1958BC height) data. In addition, for publicly available summary statistics from large meta-analyses of four lipid traits, we publicly release imputed summary statistics at 1000G SNPs, which could not have been obtained using previously published methods, and demonstrate their accuracy by masking subsets of the data. We show that 1000G imputation using our approach increases the magnitude and statistical evidence of enrichment at genic versus non-genic loci for these traits, as compared with an analysis without 1000G imputation. Thus, imputation of summary statistics will be a valuable tool in future functional enrichment analyses. Publicly available software package available at http://bogdan.bioinformatics.ucla.edu/software/. bpasaniuc@mednet.ucla.edu or aprice@hsph.harvard.edu Supplementary materials are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Hanford Atomic Products Operation monthly report, March 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-04-23
This document presents a summary of work and progress at the Hanford Engineer Works for March 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Service departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, June 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-07-26
This document presents a summary of work and progress at the Hanford Engineer Works for June 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, May 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-06-22
This document presents a summary of work and progress at the Hanford Engineer Works for May 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Science, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, October 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1953-11-20
This document presents a summary of work and progress at the Hanford Engineer Works for October 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services. Employee and Public Relations, and Community Realmore » Estate and Service departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, May 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
This document presents a summary of work and progress at the Hanford Engineer Works for May 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, July 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1953-08-20
This document presents a summary of work and progress at the Hanford Engineer Works for July 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report for September 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-10-25
This document presents a summary of work and progress at the Hanford Engineer Works for September 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, June 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1953-07-22
This document presents a summary of work and progress at the Hanford Engineer Works for June 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Real Estatemore » and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, December 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-01-22
This document presents a summary of work and progress at the Hanford Engineer Works for December 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summaries work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Neandertal admixture in Eurasia confirmed by maximum-likelihood analysis of three genomes.
Lohse, Konrad; Frantz, Laurent A F
2014-04-01
Although there has been much interest in estimating histories of divergence and admixture from genomic data, it has proved difficult to distinguish recent admixture from long-term structure in the ancestral population. Thus, recent genome-wide analyses based on summary statistics have sparked controversy about the possibility of interbreeding between Neandertals and modern humans in Eurasia. Here we derive the probability of full mutational configurations in nonrecombining sequence blocks under both admixture and ancestral structure scenarios. Dividing the genome into short blocks gives an efficient way to compute maximum-likelihood estimates of parameters. We apply this likelihood scheme to triplets of human and Neandertal genomes and compare the relative support for a model of admixture from Neandertals into Eurasian populations after their expansion out of Africa against a history of persistent structure in their common ancestral population in Africa. Our analysis allows us to conclusively reject a model of ancestral structure in Africa and instead reveals strong support for Neandertal admixture in Eurasia at a higher rate (3.4-7.3%) than suggested previously. Using analysis and simulations we show that our inference is more powerful than previous summary statistics and robust to realistic levels of recombination.
Neandertal Admixture in Eurasia Confirmed by Maximum-Likelihood Analysis of Three Genomes
Lohse, Konrad; Frantz, Laurent A. F.
2014-01-01
Although there has been much interest in estimating histories of divergence and admixture from genomic data, it has proved difficult to distinguish recent admixture from long-term structure in the ancestral population. Thus, recent genome-wide analyses based on summary statistics have sparked controversy about the possibility of interbreeding between Neandertals and modern humans in Eurasia. Here we derive the probability of full mutational configurations in nonrecombining sequence blocks under both admixture and ancestral structure scenarios. Dividing the genome into short blocks gives an efficient way to compute maximum-likelihood estimates of parameters. We apply this likelihood scheme to triplets of human and Neandertal genomes and compare the relative support for a model of admixture from Neandertals into Eurasian populations after their expansion out of Africa against a history of persistent structure in their common ancestral population in Africa. Our analysis allows us to conclusively reject a model of ancestral structure in Africa and instead reveals strong support for Neandertal admixture in Eurasia at a higher rate (3.4−7.3%) than suggested previously. Using analysis and simulations we show that our inference is more powerful than previous summary statistics and robust to realistic levels of recombination. PMID:24532731
Utturkar, Sagar M.; Klingeman, Dawn Marie; Land, Miriam L.; ...
2014-06-14
Our motivation with this work was to assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Our results show Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as anmore » additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. As to availability and implementation–all assembly tools except CLC Genomics Workbench are freely available under GNU General Public License.« less
Facts about Newspapers '86: A Statistical Summary of the Newspaper Business.
ERIC Educational Resources Information Center
American Newspaper Publishers Association, Washington, DC.
Attesting to the continuing economic strength and institutional vitality of the newspaper business in 1985, this booklet presents a statistical summary of the industry in the United States and Canada. The statistics cover a wide range of topics, including (1) number of daily newspapers, (2) daily newspaper circulation, (3) daily newspapers by…
Facts about Newspapers '85: A Statistical Summary of the Newspaper Business.
ERIC Educational Resources Information Center
American Newspaper Publishers Association, Washington, DC.
A statistical summary of the newspaper industry for 1984 and previous years is presented in this brochure. Focusing primarily on the United States newspaper industry, the brochure also contains some information on Canadian newspapers. The brochure presents statistics in the following categories: (1) number of daily newspapers, (2) daily newspaper…
32 CFR 865.122 - Summary of statistics for Discharge Review Board.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 32 National Defense 6 2010-07-01 2010-07-01 false Summary of statistics for Discharge Review Board. 865.122 Section 865.122 National Defense Department of Defense (Continued) DEPARTMENT OF THE AIR FORCE... statistics for Discharge Review Board. The Air Force Discharge Review Board shall prepare and provide to the...
Multiple phenotype association tests using summary statistics in genome-wide association studies.
Liu, Zhonghua; Lin, Xihong
2018-03-01
We study in this article jointly testing the associations of a genetic variant with correlated multiple phenotypes using the summary statistics of individual phenotype analysis from Genome-Wide Association Studies (GWASs). We estimated the between-phenotype correlation matrix using the summary statistics of individual phenotype GWAS analyses, and developed genetic association tests for multiple phenotypes by accounting for between-phenotype correlation without the need to access individual-level data. Since genetic variants often affect multiple phenotypes differently across the genome and the between-phenotype correlation can be arbitrary, we proposed robust and powerful multiple phenotype testing procedures by jointly testing a common mean and a variance component in linear mixed models for summary statistics. We computed the p-values of the proposed tests analytically. This computational advantage makes our methods practically appealing in large-scale GWASs. We performed simulation studies to show that the proposed tests maintained correct type I error rates, and to compare their powers in various settings with the existing methods. We applied the proposed tests to a GWAS Global Lipids Genetics Consortium summary statistics data set and identified additional genetic variants that were missed by the original single-trait analysis. © 2017, The International Biometric Society.
Driscoll, Daniel G.; Zogorski, John S.
1990-01-01
The report presents a summary of basin characteristics affecting streamflow, a history of the U.S. Geological Survey 's stream-gaging program, and a compilation of discharge records and statistical summaries for selected sites within the Rapid Creek basin. It is the first in a series which will investigate surface-water/groundwater relations along Rapid Creek. The summary of basin characteristics includes descriptions of the geology and hydrogeology, physiography and climate, land use and vegetation, reservoirs, and water use within the basin. A recounting of the U.S. Geological Survey 's stream-gaging program and a tabulation of historic stream-gaging stations within the basin are furnished. A compilation of monthly and annual mean discharge values for nine currently operated, long-term, continuous-record, streamflow-gaging stations on Rapid Creek is presented. The statistical summary for each site includes summary statistics on monthly and annual mean values, correlation matrix for monthly values, serial correlation for 1 year lag for monthly values, percentile rankings for monthly and annual mean values, low and high value tables, duration curves, and peak-discharge tables. Records of monthend contents for two reservoirs within the basin also are presented. (USGS)
Perception of ensemble statistics requires attention.
Jackson-Nielsen, Molly; Cohen, Michael A; Pitts, Michael A
2017-02-01
To overcome inherent limitations in perceptual bandwidth, many aspects of the visual world are represented as summary statistics (e.g., average size, orientation, or density of objects). Here, we investigated the relationship between summary (ensemble) statistics and visual attention. Recently, it was claimed that one ensemble statistic in particular, color diversity, can be perceived without focal attention. However, a broader debate exists over the attentional requirements of conscious perception, and it is possible that some form of attention is necessary for ensemble perception. To test this idea, we employed a modified inattentional blindness paradigm and found that multiple types of summary statistics (color and size) often go unnoticed without attention. In addition, we found attentional costs in dual-task situations, further implicating a role for attention in statistical perception. Overall, we conclude that while visual ensembles may be processed efficiently, some amount of attention is necessary for conscious perception of ensemble statistics. Copyright © 2016 Elsevier Inc. All rights reserved.
1998 statistical summaries : Federal Transit Administration : grant assistance programs
DOT National Transportation Integrated Search
1999-03-01
The 1998 Statistical Summaries provides information about the Federal Transit Administration's (FTA) major financial aid programs for Federal Fiscal Year (FY) 1998. The report covers the following programs: Urbanized Area Formula, Non-urbanized Area ...
NASA Technical Reports Server (NTRS)
Xu, Kuan-Man
2006-01-01
A new method is proposed to compare statistical differences between summary histograms, which are the histograms summed over a large ensemble of individual histograms. It consists of choosing a distance statistic for measuring the difference between summary histograms and using a bootstrap procedure to calculate the statistical significance level. Bootstrapping is an approach to statistical inference that makes few assumptions about the underlying probability distribution that describes the data. Three distance statistics are compared in this study. They are the Euclidean distance, the Jeffries-Matusita distance and the Kuiper distance. The data used in testing the bootstrap method are satellite measurements of cloud systems called cloud objects. Each cloud object is defined as a contiguous region/patch composed of individual footprints or fields of view. A histogram of measured values over footprints is generated for each parameter of each cloud object and then summary histograms are accumulated over all individual histograms in a given cloud-object size category. The results of statistical hypothesis tests using all three distances as test statistics are generally similar, indicating the validity of the proposed method. The Euclidean distance is determined to be most suitable after comparing the statistical tests of several parameters with distinct probability distributions among three cloud-object size categories. Impacts on the statistical significance levels resulting from differences in the total lengths of satellite footprint data between two size categories are also discussed.
Hanford Works monthly report, December 1952
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1953-01-23
This document presents a summary of work and progress at the Hanford Engineer Works for December 1952. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kollias, Pavlos
2017-08-08
This is a multi-institutional, collaborative project using observations and modeling to study the evolution (e.g. formation and growth) of hydrometeors in continental convective clouds. Our contribution was in data analysis for the generation of high-value cloud and precipitation products and derive cloud statistics for model validation. There are two areas in data analysis that we contributed: i) the development of novel, state-of-the-art dual-wavelength radar algorithms for the retrieval of cloud microphysical properties and ii) the evaluation of large domain, high-resolution models using comprehensive multi-sensor observations. Our research group developed statistical summaries from numerous sensors and developed retrievals of vertical airmore » motion in deep convection.« less
2016-06-01
14 Table 2. Summary of Statistics from GGSS Data ........................................ 35 Table 3. Summary of Statistics from...similar approach are unsurprisingly quite consistent in outcomes within statistical variance. The model is used to estimate the effects of exogenous...of German residents (~82 million), excluding diplomats, foreign military and homeless persons. (German Federal Office of Statistics , 2013, p. 475
Hutchinson, Marie; East, Leah; Stasa, Helen; Jackson, Debra
2014-01-01
Over recent decades, there has been considerable research and debate about essential features of advanced nursing practice and differences among various categories of advanced practice nurses. This study aimed to derive an integrative description of the defining characteristics of advanced practice nursing through a meta-summary of the existing literature. A three-phase approach involved (a) systematic review of the literature to identify the specific activities characterized as advanced practice nursing, (b) qualitative meta-summary of practice characteristics extracted from manuscripts meeting inclusion criteria; and (c) statistical analysis of domains across advanced practice categories and country in which the study was completed. A descriptive framework was distilled using qualitative and quantitative results. Fifty manuscripts met inclusion criteria and were retained for analysis. Seven domains of advanced nursing practice were identified: (a) autonomous or nurse-led extended clinical practice; (b) improving systems of care; (c) developing the practice of others; (d) developing/delivering educational programs/activities; (e) nursing research/scholarship; (f) leadership external to the organization; and (g) administering programs, budgets, and personnel. Domains were similar across categories of advanced nursing practice; the domain of developing/delivering educational programs/activities was more common in Australia than in the United States or United Kingdom. Similarity at the domain level was sufficient to suggest that advanced practice role categories are less distinct than often argued. There is merit in adopting a more integrated and consistent interpretation of advanced practice nursing.
Facts about Newspapers '87: A Statistical Summary of the Newspaper Business.
ERIC Educational Resources Information Center
American Newspaper Publishers Association, Washington, DC.
Attesting to the continuing economic strength and institutional vitality of the newspaper business in 1987, this booklet presents a statistical summary of the industry in the United States and Canada. The statistics cover a wide range of topics, including (1) number of daily newspapers; (2) daily newspaper circulation; (3) single copy sales price;…
2016 Service Academy Gender Relations Survey: Overview Report
2017-02-01
environment within the Academies. This Executive Summary will provide a summary of the methodology used and the top line results from the survey.1 Summary...discussion of the measurement constructs, a description of the survey methodology , and detailed presentation of the results. Each report section...are determined statistically significant at an alpha (α) level of .05.6 Survey Methodology Statistical Design OPA conducts cross-Service surveys that
Linking Arctic plant biodiversity measurements with landscape heterogeneity
NASA Astrophysics Data System (ADS)
Gerber, F.; Schaepman-Strub, G.; Furrer, R.
2016-12-01
Climate warming in the Arctic region triggers changes in the vegetation productivity and species composition of the tundra. To investigate these changes and their feedback to climate, we consider species richness and abundance data of the International Tundra EXperiment (ITEX). As this information is very sparse in time and space, we aim to upscale available records to climatically relevant scales with a remote sensing based characterization of the study sites. More precisely, we relate species richness and evenness derived from the ITEX data to summary statistics describing the landscape heterogeneity, which are derived from an elevation model (ASTER GDEM) and spectral satellite observations (LANDSAT 5 and 7). Preliminary results from the statistical analysis using generalized linear mixed models show that no remote sensing based landscape characterization does significantly explain species richness. Reasons could be a mismatch of the spatial scales, an inappropriate characterization of the test sites through the satellite measurements, incomparable plot measurements from the different test sites and/or too few plot measurements. We are looking forward to presenting our results and getting your inputs.
An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics.
Kim, Junghi; Bai, Yun; Pan, Wei
2015-12-01
We study the problem of testing for single marker-multiple phenotype associations based on genome-wide association study (GWAS) summary statistics without access to individual-level genotype and phenotype data. For most published GWASs, because obtaining summary data is substantially easier than accessing individual-level phenotype and genotype data, while often multiple correlated traits have been collected, the problem studied here has become increasingly important. We propose a powerful adaptive test and compare its performance with some existing tests. We illustrate its applications to analyses of a meta-analyzed GWAS dataset with three blood lipid traits and another with sex-stratified anthropometric traits, and further demonstrate its potential power gain over some existing methods through realistic simulation studies. We start from the situation with only one set of (possibly meta-analyzed) genome-wide summary statistics, then extend the method to meta-analysis of multiple sets of genome-wide summary statistics, each from one GWAS. We expect the proposed test to be useful in practice as more powerful than or complementary to existing methods. © 2015 WILEY PERIODICALS, INC.
Statistical summaries of selected Iowa streamflow data through September 2013.
DOT National Transportation Integrated Search
2015-01-01
Statistical summaries of streamflow data collected at : 184 streamgages in Iowa are presented in this report. All : streamgages included for analysis have at least 10 years of : continuous record collected before or through September : 2013. This rep...
Hanford Atomic Products Operation monthly report, January 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-02-21
This document presents a summary of work and progress at the Hanford Engineer Works for January 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical Design, and Project Sections. Costs for the various departments are presented in the Financial department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report for April 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-05-23
This document presents a summary of work and progress at the Hanford Engineer Works for April 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Monthly report Hanford Atomic Products Operation, July 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-08-20
This document presents a summary of work and progress at the Hanford Engineer Works for July 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services Departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, August 1956
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1956-09-28
This document presents a summary of work and progress at the Hanford Engineer Works for August 1956. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Sciences, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report for May 1956
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1956-06-21
This document presents a summary of work and progress at the Hanford Engineer Works for May, 1956. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, September 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-10-27
This document presents a summary of work and progress at the Hanford Engineer Works for September 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, March 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-04-20
This document presents a summary of work and progress at the Hanford Engineer Works for March 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, November 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-12-30
This document presents a summary of work and progress at the Hanford Engineer Works for November 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, August 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-09-27
This document presents a summary of work and progress at the Hanford Engineer Works for August 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Sciences, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report for December 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1956-01-30
This document presents a summary of work and progress at the Hanford Engineer Works for December 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, October 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-11-30
This document presents a summary of work and progress at the Hanford Engineer works for October, 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products for Operation monthly report, February 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-03-18
This document presents a summary of work and progress at the Hanford Engineer Works for February 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, May 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-06-23
This document presents a summary of work and progress at the Hanford Engineer Works for May 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, July 1955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-08-26
This document presents a summary of work and progress at the Hanford Engineer Works for July 1955. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, October 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-11-24
This document presents a summary of work and progress at the Hanford Engineer Works for October 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, December 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1955-01-25
This document presents a summary of work and progress at the Hanford Engineer Works for December 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, August 1954
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1954-09-17
This document presents a summary of work and progress at the Hanford Engineer Works for August 1954. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department report plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities, and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Hanford Atomic Products Operation monthly report, August 1953
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1953-09-18
This document presents a summary of work and progress at the Hanford Engineer Works for August, 1953. The report is divided into sections by department. A plant wide general summary is included at the beginning of the report, after which the departmental summaries begin. The Manufacturing Department reports plant statistics, and summaries for the Metal Preparation, Reactor and Separation sections. The Engineering Department`s section summarizes work for the Technical, Design, and Project Sections. Costs for the various departments are presented in the Financial Department`s summary. The Medical, Radiological Sciences, Utilities and General Services, Employee and Public Relations, and Community Realmore » Estate and Services departments have sections presenting their monthly statistics, work, progress, and summaries.« less
Self-report of diabetes and claims-based identification of diabetes among Medicare beneficiaries.
Day, Hannah R; Parker, Jennifer D
2013-11-01
This report compares self-reported diabetes in the National Health Interview Survey (NHIS) with diabetes identified using the Medicare Chronic Condition (CC) Summary file. NHIS records have been linked with Medicare data from the Centers for Medicare & Medicaid Services. The CC Summary file, one of several linked files derived from Medicare claims data, contains indicators for chronic conditions based on an established algorithm. This analysis was limited to 2005 NHIS participants aged 65 and over whose records were linked to 2005 Medicare data. Linked NHIS participants had at least 1 month of fee-for-service Medicare coverage in 2005. Concordance between self-reported diabetes and the CC Summary indicator for diabetes is compared and described by demographics, socioeconomic status, health status indicators, and geographic characteristics. Of the Medicare beneficiaries in the 2005 NHIS, 20.0% self-reported diabetes and 27.8% had an indicator for diabetes in the CC Summary file. Of those who self-reported diabetes in NHIS, the percentage with a CC Summary indicator for diabetes was high (93.1%). Of those with a CC Summary indicator for diabetes, the percentage self-reporting diabetes was comparatively lower (67.0%). Statistically significant differences by subgroup existed in the percentage concordance between the two sources. Of those with self-reported diabetes, the percentage with a CC Summary indicator differed by sex and age. Of those with a CC Summary indicator for diabetes, the percentage with self-reported diabetes differed by age, self-rated health, number of self-reported conditions, and geographic location. Among Medicare beneficiaries who self-reported diabetes in NHIS, a high concordance was observed with identification of diabetes in the CC Summary file. However, among Medicare beneficiaries with an indicator for diabetes in the CC Summary file, concordance with self-reported diabetes in NHIS is comparatively lower. Differences exist by subgroup.
Statistical summary of commercial jet aircraft accidents : worldwide operations, 1959-2009
DOT National Transportation Integrated Search
2010-07-01
The accident statistics presented in this summary are confined to worldwide commercial jet airplanes that are heavier than 60,000 pounds maximum gross weight. Within that set of airplanes, there are two groups excluded: : 1) Airplanes manufactured in...
1997 Iowa crash summary : a summary of motor vehicle crash statistics on Iowa roadways
DOT National Transportation Integrated Search
1997-01-01
All information concerning Iowa : traffic crashes was taken from report forms provided by investigating officers and drivers involved in : crashes. : All statistics are gathered and calculated by the Iowa Department of Transportations Office of Dr...
1996 Iowa crash summary : a summary of motor vehicle crash statistics on Iowa roadways
DOT National Transportation Integrated Search
1996-01-01
All information concerning Iowa : traffic crashes was taken from report forms provided by investigating officers and drivers involved in : crashes. : All statistics are gathered and calculated by the Iowa Department of Transportations Office of Dr...
1998 Iowa crash summary : a summary of motor vehicle crash statistics on Iowa roadways
DOT National Transportation Integrated Search
1998-01-01
All information concerning Iowa : traffic crashes was taken from report forms provided by investigating officers and drivers involved in : crashes. : All statistics are gathered and calculated by the Iowa Department of Transportations Office of Dr...
Dissecting the genetics of complex traits using summary association statistics.
Pasaniuc, Bogdan; Price, Alkes L
2017-02-01
During the past decade, genome-wide association studies (GWAS) have been used to successfully identify tens of thousands of genetic variants associated with complex traits and diseases. These studies have produced extensive repositories of genetic variation and trait measurements across large numbers of individuals, providing tremendous opportunities for further analyses. However, privacy concerns and other logistical considerations often limit access to individual-level genetic data, motivating the development of methods that analyse summary association statistics. Here, we review recent progress on statistical methods that leverage summary association data to gain insights into the genetic basis of complex traits and diseases.
Dissecting the genetics of complex traits using summary association statistics
Pasaniuc, Bogdan; Price, Alkes L.
2017-01-01
During the past decade, genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with complex traits and diseases. These studies have produced extensive repositories of genetic variation and trait measurements across large numbers of individuals, providing tremendous opportunities for further analyses. However, privacy concerns and other logistical considerations often limit access to individual-level genetic data, motivating the development of methods that analyze summary association statistics. Here we review recent progress on statistical methods that leverage summary association data to gain insights into the genetic basis of complex traits and diseases. PMID:27840428
Lee, L.; Helsel, D.
2005-01-01
Trace contaminants in water, including metals and organics, often are measured at sufficiently low concentrations to be reported only as values below the instrument detection limit. Interpretation of these "less thans" is complicated when multiple detection limits occur. Statistical methods for multiply censored, or multiple-detection limit, datasets have been developed for medical and industrial statistics, and can be employed to estimate summary statistics or model the distributions of trace-level environmental data. We describe S-language-based software tools that perform robust linear regression on order statistics (ROS). The ROS method has been evaluated as one of the most reliable procedures for developing summary statistics of multiply censored data. It is applicable to any dataset that has 0 to 80% of its values censored. These tools are a part of a software library, or add-on package, for the R environment for statistical computing. This library can be used to generate ROS models and associated summary statistics, plot modeled distributions, and predict exceedance probabilities of water-quality standards. ?? 2005 Elsevier Ltd. All rights reserved.
Multi-trait analysis of genome-wide association summary statistics using MTAG.
Turley, Patrick; Walters, Raymond K; Maghzian, Omeed; Okbay, Aysu; Lee, James J; Fontana, Mark Alan; Nguyen-Viet, Tuan Anh; Wedow, Robbee; Zacher, Meghan; Furlotte, Nicholas A; Magnusson, Patrik; Oskarsson, Sven; Johannesson, Magnus; Visscher, Peter M; Laibson, David; Cesarini, David; Neale, Benjamin M; Benjamin, Daniel J
2018-02-01
We introduce multi-trait analysis of GWAS (MTAG), a method for joint analysis of summary statistics from genome-wide association studies (GWAS) of different traits, possibly from overlapping samples. We apply MTAG to summary statistics for depressive symptoms (N eff = 354,862), neuroticism (N = 168,105), and subjective well-being (N = 388,538). As compared to the 32, 9, and 13 genome-wide significant loci identified in the single-trait GWAS (most of which are themselves novel), MTAG increases the number of associated loci to 64, 37, and 49, respectively. Moreover, association statistics from MTAG yield more informative bioinformatics analyses and increase the variance explained by polygenic scores by approximately 25%, matching theoretical expectations.
1992-10-01
N=8) and Results of 44 Statistical Analyses for Impact Test Performed on Forefoot of Unworn Footwear A-2. Summary Statistics (N=8) and Results of...on Forefoot of Worn Footwear Vlll Tables (continued) Table Page B-2. Summary Statistics (N=4) and Results of 76 Statistical Analyses for Impact...used tests to assess heel and forefoot shock absorption, upper and sole durability, and flexibility (Cavanagh, 1978). Later, the number of tests was
U.S. Marine Corps Study of Establishing Time Criteria for Logistics Tasks
2004-09-30
STATISTICS FOR REQUESTS PER DAY FOR TWO BATTALIONS II-25 II-6 SUMMARY STATISTICS IN HOURS FOR RESOURCE REQUIREMENTS PER DAY FOR TWO BATTALIONS II-26 II-7...SUMMARY STATISTICS FOR INDIVIDUALS FOR RESOURCE REQUIREMENTS PER DAY FOR TWO BATTALIONS II-27 Study of Establishing Time Criteria for Logistics...developed and run to provide statistical information for analysis. In Task Four, the study team used Task Three findings to determine data requirements
Weighted Statistical Binning: Enabling Statistically Consistent Genome-Scale Phylogenetic Analyses
Bayzid, Md Shamsuzzoha; Mirarab, Siavash; Boussau, Bastien; Warnow, Tandy
2015-01-01
Because biological processes can result in different loci having different evolutionary histories, species tree estimation requires multiple loci from across multiple genomes. While many processes can result in discord between gene trees and species trees, incomplete lineage sorting (ILS), modeled by the multi-species coalescent, is considered to be a dominant cause for gene tree heterogeneity. Coalescent-based methods have been developed to estimate species trees, many of which operate by combining estimated gene trees, and so are called "summary methods". Because summary methods are generally fast (and much faster than more complicated coalescent-based methods that co-estimate gene trees and species trees), they have become very popular techniques for estimating species trees from multiple loci. However, recent studies have established that summary methods can have reduced accuracy in the presence of gene tree estimation error, and also that many biological datasets have substantial gene tree estimation error, so that summary methods may not be highly accurate in biologically realistic conditions. Mirarab et al. (Science 2014) presented the "statistical binning" technique to improve gene tree estimation in multi-locus analyses, and showed that it improved the accuracy of MP-EST, one of the most popular coalescent-based summary methods. Statistical binning, which uses a simple heuristic to evaluate "combinability" and then uses the larger sets of genes to re-calculate gene trees, has good empirical performance, but using statistical binning within a phylogenomic pipeline does not have the desirable property of being statistically consistent. We show that weighting the re-calculated gene trees by the bin sizes makes statistical binning statistically consistent under the multispecies coalescent, and maintains the good empirical performance. Thus, "weighted statistical binning" enables highly accurate genome-scale species tree estimation, and is also statistically consistent under the multi-species coalescent model. New data used in this study are available at DOI: http://dx.doi.org/10.6084/m9.figshare.1411146, and the software is available at https://github.com/smirarab/binning. PMID:26086579
Dai, Mingwei; Ming, Jingsi; Cai, Mingxuan; Liu, Jin; Yang, Can; Wan, Xiang; Xu, Zongben
2017-09-15
Results from genome-wide association studies (GWAS) suggest that a complex phenotype is often affected by many variants with small effects, known as 'polygenicity'. Tens of thousands of samples are often required to ensure statistical power of identifying these variants with small effects. However, it is often the case that a research group can only get approval for the access to individual-level genotype data with a limited sample size (e.g. a few hundreds or thousands). Meanwhile, summary statistics generated using single-variant-based analysis are becoming publicly available. The sample sizes associated with the summary statistics datasets are usually quite large. How to make the most efficient use of existing abundant data resources largely remains an open question. In this study, we propose a statistical approach, IGESS, to increasing statistical power of identifying risk variants and improving accuracy of risk prediction by i ntegrating individual level ge notype data and s ummary s tatistics. An efficient algorithm based on variational inference is developed to handle the genome-wide analysis. Through comprehensive simulation studies, we demonstrated the advantages of IGESS over the methods which take either individual-level data or summary statistics data as input. We applied IGESS to perform integrative analysis of Crohns Disease from WTCCC and summary statistics from other studies. IGESS was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.2% ( ±0.4% ) to 69.4% ( ±0.1% ) using about 240 000 variants. The IGESS software is available at https://github.com/daviddaigithub/IGESS . zbxu@xjtu.edu.cn or xwan@comp.hkbu.edu.hk or eeyang@hkbu.edu.hk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Summary Statistics, Educational Achievement Gaps and the Ecological Fallacy
ERIC Educational Resources Information Center
Connolly, Paul
2006-01-01
Summary statistics continue to play an important role in identifying and monitoring patterns and trends in educational inequalities between differing groups of pupils over time. However, this article argues that their uncritical use can also encourage the labelling of whole groups of pupils as "underachievers" or…
One of the main uses of biomarker measurements is to compare different populations to each other and to assess risk in comparison to established parameters. This is most often done using summary statistics such as central tendency, variance components, confidence intervals, excee...
Fully Bayesian tests of neutrality using genealogical summary statistics.
Drummond, Alexei J; Suchard, Marc A
2008-10-31
Many data summary statistics have been developed to detect departures from neutral expectations of evolutionary models. However questions about the neutrality of the evolution of genetic loci within natural populations remain difficult to assess. One critical cause of this difficulty is that most methods for testing neutrality make simplifying assumptions simultaneously about the mutational model and the population size model. Consequentially, rejecting the null hypothesis of neutrality under these methods could result from violations of either or both assumptions, making interpretation troublesome. Here we harness posterior predictive simulation to exploit summary statistics of both the data and model parameters to test the goodness-of-fit of standard models of evolution. We apply the method to test the selective neutrality of molecular evolution in non-recombining gene genealogies and we demonstrate the utility of our method on four real data sets, identifying significant departures of neutrality in human influenza A virus, even after controlling for variation in population size. Importantly, by employing a full model-based Bayesian analysis, our method separates the effects of demography from the effects of selection. The method also allows multiple summary statistics to be used in concert, thus potentially increasing sensitivity. Furthermore, our method remains useful in situations where analytical expectations and variances of summary statistics are not available. This aspect has great potential for the analysis of temporally spaced data, an expanding area previously ignored for limited availability of theory and methods.
Adapt-Mix: learning local genetic correlation structure improves summary statistics-based analyses
Park, Danny S.; Brown, Brielin; Eng, Celeste; Huntsman, Scott; Hu, Donglei; Torgerson, Dara G.; Burchard, Esteban G.; Zaitlen, Noah
2015-01-01
Motivation: Approaches to identifying new risk loci, training risk prediction models, imputing untyped variants and fine-mapping causal variants from summary statistics of genome-wide association studies are playing an increasingly important role in the human genetics community. Current summary statistics-based methods rely on global ‘best guess’ reference panels to model the genetic correlation structure of the dataset being studied. This approach, especially in admixed populations, has the potential to produce misleading results, ignores variation in local structure and is not feasible when appropriate reference panels are missing or small. Here, we develop a method, Adapt-Mix, that combines information across all available reference panels to produce estimates of local genetic correlation structure for summary statistics-based methods in arbitrary populations. Results: We applied Adapt-Mix to estimate the genetic correlation structure of both admixed and non-admixed individuals using simulated and real data. We evaluated our method by measuring the performance of two summary statistics-based methods: imputation and joint-testing. When using our method as opposed to the current standard of ‘best guess’ reference panels, we observed a 28% decrease in mean-squared error for imputation and a 73.7% decrease in mean-squared error for joint-testing. Availability and implementation: Our method is publicly available in a software package called ADAPT-Mix available at https://github.com/dpark27/adapt_mix. Contact: noah.zaitlen@ucsf.edu PMID:26072481
Bevans, Hugh E.; Diaz, Arthur M.
1980-01-01
Summaries of descriptive statistics are compiled for 14 data-collection sites located on streams draining areas that have been shaft mined and strip mined for coal in Cherokee and Crawford Counties in southeastern Kansas. These summaries include water-quality data collected from October 1976 through April 1979. Regression equations relating specific conductance and instantaneous streamflow to concentrations of bicarbonate, sulfate, chloride, fluoride, calcium, magnesium, sodium, potassium, silica, and dissolved solids are presented.
Statistical Summary of Missouri Higher Education, 1999-2000.
ERIC Educational Resources Information Center
Missouri State Coordinating Board for Higher Education, Jefferson City.
This report provides a statistical summary of higher education in Missouri for the 1999-2000 academic year. More than 74 tables provide data on: advanced placement enrollment in secondary schools, American College Testing program scores by institutional sector, high school rankings by institutional sector, the Missouri Coordinating Board for…
This statistical summary reports data from the Environmental Monitoring and Assessment Program (EMAP) Western Pilot (EMAP-W). EMAP-W was a sample survey (or probability survey, often simply called 'random') of streams and rivers in 12 states of the western U.S. (Arizona, Californ...
Analysis of Variance with Summary Statistics in Microsoft® Excel®
ERIC Educational Resources Information Center
Larson, David A.; Hsu, Ko-Cheng
2010-01-01
Students regularly are asked to solve Single Factor Analysis of Variance problems given only the sample summary statistics (number of observations per category, category means, and corresponding category standard deviations). Most undergraduate students today use Excel for data analysis of this type. However, Excel, like all other statistical…
Summary travel characteristics : Hawaii
DOT National Transportation Integrated Search
1997-10-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Massachusetts
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : New Jersey
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : New Hampshire
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : New York
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : New Mexico
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : South Dakota
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Virgina
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : South Carolina
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Florida
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Kansas
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : California
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Texas
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : North Carolina
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Illinois
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Montana
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Kentucky
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Wisconsin
DOT National Transportation Integrated Search
1997-10-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Maryland
DOT National Transportation Integrated Search
1997-09-19
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Nevada
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Iowa
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Washington
DOT National Transportation Integrated Search
1997-10-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Alabama
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Nebraska
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Oregon
DOT National Transportation Integrated Search
1997-10-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Maine
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Utah
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Michigan
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Missouri
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Colorado
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Alaska
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Oklahoma
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Indiana
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Idaho
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Tennessee
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : West Virginia
DOT National Transportation Integrated Search
1997-10-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Georgia
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Delaware
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Ohio
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Arizona
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : North Dakota
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Wyoming
DOT National Transportation Integrated Search
1997-01-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Minnesota
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Vermont
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Louisiana
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Mississippi
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Connecticut
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Rhode Island
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Pennsylvania
DOT National Transportation Integrated Search
1997-09-30
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
Summary travel characteristics : Arkansas
DOT National Transportation Integrated Search
1997-09-29
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
The capacity limitations of orientation summary statistics
Attarha, Mouna; Moore, Cathleen M.
2015-01-01
The simultaneous–sequential method was used to test the processing capacity of establishing mean orientation summaries. Four clusters of oriented Gabor patches were presented in the peripheral visual field. One of the clusters had a mean orientation that was tilted either left or right while the mean orientations of the other three clusters were roughly vertical. All four clusters were presented at the same time in the simultaneous condition whereas the clusters appeared in temporal subsets of two in the sequential condition. Performance was lower when the means of all four clusters had to be processed concurrently than when only two had to be processed in the same amount of time. The advantage for establishing fewer summaries at a given time indicates that the processing of mean orientation engages limited-capacity processes (Experiment 1). This limitation cannot be attributed to crowding, low target-distractor discriminability, or a limited-capacity comparison process (Experiments 2 and 3). In contrast to the limitations of establishing multiple summary representations, establishing a single summary representation unfolds without interference (Experiment 4). When interpreted in the context of recent work on the capacity of summary statistics, these findings encourage reevaluation of the view that early visual perception consists of summary statistic representations that unfold independently across multiple areas of the visual field. PMID:25810160
NWS Weather Fatality, Injury and Damage Statistics
government web resources and services. Natural Hazard Statistics Statistics U.S. Summaries Online The U.S. Natural Hazard Statistics provide statistical information on fatalities, injuries and
A novel approach for choosing summary statistics in approximate Bayesian computation.
Aeschbacher, Simon; Beaumont, Mark A; Futschik, Andreas
2012-11-01
The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an approach for choosing summary statistics based on boosting, a technique from the machine-learning literature. We consider different types of boosting and compare them to partial least-squares regression as an alternative. To mitigate the lack of sufficiency, we also propose an approach for choosing summary statistics locally, in the putative neighborhood of the true parameter value. We study a demographic model motivated by the reintroduction of Alpine ibex (Capra ibex) into the Swiss Alps. The parameters of interest are the mean and standard deviation across microsatellites of the scaled ancestral mutation rate (θ(anc) = 4N(e)u) and the proportion of males obtaining access to matings per breeding season (ω). By simulation, we assess the properties of the posterior distribution obtained with the various methods. According to our criteria, ABC with summary statistics chosen locally via boosting with the L(2)-loss performs best. Applying that method to the ibex data, we estimate θ(anc)≈ 1.288 and find that most of the variation across loci of the ancestral mutation rate u is between 7.7 × 10(-4) and 3.5 × 10(-3) per locus per generation. The proportion of males with access to matings is estimated as ω≈ 0.21, which is in good agreement with recent independent estimates.
NASA Astrophysics Data System (ADS)
Erfanifard, Y.; Rezayan, F.
2014-10-01
Vegetation heterogeneity biases second-order summary statistics, e.g., Ripley's K-function, applied for spatial pattern analysis in ecology. Second-order investigation based on Ripley's K-function and related statistics (i.e., L- and pair correlation function g) is widely used in ecology to develop hypothesis on underlying processes by characterizing spatial patterns of vegetation. The aim of this study was to demonstrate effects of underlying heterogeneity of wild pistachio (Pistacia atlantica Desf.) trees on the second-order summary statistics of point pattern analysis in a part of Zagros woodlands, Iran. The spatial distribution of 431 wild pistachio trees was accurately mapped in a 40 ha stand in the Wild Pistachio & Almond Research Site, Fars province, Iran. Three commonly used second-order summary statistics (i.e., K-, L-, and g-functions) were applied to analyse their spatial pattern. The two-sample Kolmogorov-Smirnov goodness-of-fit test showed that the observed pattern significantly followed an inhomogeneous Poisson process null model in the study region. The results also showed that heterogeneous pattern of wild pistachio trees biased the homogeneous form of K-, L-, and g-functions, demonstrating a stronger aggregation of the trees at the scales of 0-50 m than actually existed and an aggregation at scales of 150-200 m, while regularly distributed. Consequently, we showed that heterogeneity of point patterns may bias the results of homogeneous second-order summary statistics and we also suggested applying inhomogeneous summary statistics with related null models for spatial pattern analysis of heterogeneous vegetations.
A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
Aeschbacher, Simon; Beaumont, Mark A.; Futschik, Andreas
2012-01-01
The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an approach for choosing summary statistics based on boosting, a technique from the machine-learning literature. We consider different types of boosting and compare them to partial least-squares regression as an alternative. To mitigate the lack of sufficiency, we also propose an approach for choosing summary statistics locally, in the putative neighborhood of the true parameter value. We study a demographic model motivated by the reintroduction of Alpine ibex (Capra ibex) into the Swiss Alps. The parameters of interest are the mean and standard deviation across microsatellites of the scaled ancestral mutation rate (θanc = 4Neu) and the proportion of males obtaining access to matings per breeding season (ω). By simulation, we assess the properties of the posterior distribution obtained with the various methods. According to our criteria, ABC with summary statistics chosen locally via boosting with the L2-loss performs best. Applying that method to the ibex data, we estimate θ^anc≈1.288 and find that most of the variation across loci of the ancestral mutation rate u is between 7.7 × 10−4 and 3.5 × 10−3 per locus per generation. The proportion of males with access to matings is estimated as ω^≈0.21, which is in good agreement with recent independent estimates. PMID:22960215
1994 summary : public transportation systems in Washington state
DOT National Transportation Integrated Search
1995-08-01
The Washington State Department of Transportation (WSDOT) prepares the annual : transit statistical summary. The intent for this summary is to provide uniform : data to transit providers, the Legislative Transportation Committee, and local : and regi...
Summary travel characteristics : District of Columbia
DOT National Transportation Integrated Search
1997-10-01
The Summary Travel Characteristics publication series contains summary tables of travel statistics for census regions and divisions, States, and metropolitan areas. The tables in this report provide an overview of the findings of the American Travel ...
The Economy and Enlisted Retention in the Navy. Volume 2: Technical Appendixes
2014-06-01
Appendix C 13 Table 1. Selected summary statistics of Zone A sailors Statistic Men Women SRB 1.2 0.7 Months of sea duty in past 24 months 19.3 10.7...525,535 89,925 Appendix C 14 Table 2. Selected summary statistics of Zone B sailors Statistic Men Women SRB 1.0 0.5 Months of sea duty in...relationship holds for Zone A women as well as Zone B men and women as that found for Zone A men. Table 5. Changes in the unemployment and Treasury
A comparison of methods using optical coherence tomography to detect demineralized regions in teeth
Sowa, Michael G.; Popescu, Dan P.; Friesen, Jeri R.; Hewko, Mark D.; Choo-Smith, Lin-P’ing
2013-01-01
Optical coherence tomography (OCT) is a three- dimensional optical imaging technique that can be used to identify areas of early caries formation in dental enamel. The OCT signal at 850 nm back-reflected from sound enamel is attenuated stronger than the signal back-reflected from demineralized regions. To quantify this observation, the OCT signal as a function of depth into the enamel (also known as the A-scan intensity), the histogram of the A-scan intensities and three summary parameters derived from the A-scan are defined and their diagnostic potential compared. A total of 754 OCT A-scans were analyzed. The three summary parameters derived from the A-scans, the OCT attenuation coefficient as well as the mean and standard deviation of the lognormal fit to the histogram of the A-scan ensemble show statistically significant differences (p < 0.01) when comparing parameters from sound enamel and caries. Furthermore, these parameters only show a modest correlation. Based on the area under the curve (AUC) of the receiver operating characteristics (ROC) plot, the OCT attenuation coefficient shows higher discriminatory capacity (AUC=0.98) compared to the parameters derived from the lognormal fit to the histogram of the A-scan. However, direct analysis of the A-scans or the histogram of A-scan intensities using linear support vector machine classification shows diagnostic discrimination (AUC = 0.96) comparable to that achieved using the attenuation coefficient. These findings suggest that either direct analysis of the A-scan, its intensity histogram or the attenuation coefficient derived from the descending slope of the OCT A-scan have high capacity to discriminate between regions of caries and sound enamel. PMID:22052833
U.S. Virgin Islands 1983-84 School Statistical Summary.
ERIC Educational Resources Information Center
Romain, Louise
The 1984 edition of the United States Virgin Islands School Statistical Summary presents narratives, 48 tables, and 7 graphic illustrations of education in public and private elementary and secondary schools during the 1983-84 school year. Tables provide data on the U.S. Virgin Islands demography and economy, enrollment and average daily…
Characteristics of SUN Learners (First Five Offerings). Statistical Summary No. 4.
ERIC Educational Resources Information Center
Bryan, Donna; Forman, David C.
Based on a period of two and one-half years, during which the State University of Nebraska (SUN) has offered 15 multimedia courses to Nebraska learners, this report offers a statistical summary of student characteristics. The courses offered include: Accounting I, Accounting II, Adams Chronicles, American Economy, Anyone for Tennyson, Classic…
32 CFR 865.122 - Summary of statistics for Discharge Review Board.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 32 National Defense 6 2011-07-01 2011-07-01 false Summary of statistics for Discharge Review Board. 865.122 Section 865.122 National Defense Department of Defense (Continued) DEPARTMENT OF THE AIR FORCE... Deputy Assistant Secretary of Defense (Military Personnel and Force Management) DASD(MP&FM), Office of...
32 CFR 865.122 - Summary of statistics for Discharge Review Board.
Code of Federal Regulations, 2013 CFR
2013-07-01
... 32 National Defense 6 2013-07-01 2013-07-01 false Summary of statistics for Discharge Review Board. 865.122 Section 865.122 National Defense Department of Defense (Continued) DEPARTMENT OF THE AIR FORCE... Deputy Assistant Secretary of Defense (Military Personnel and Force Management) DASD(MP&FM), Office of...
Simple Data Sets for Distinct Basic Summary Statistics
ERIC Educational Resources Information Center
Lesser, Lawrence M.
2011-01-01
It is important to avoid ambiguity with numbers because unfortunate choices of numbers can inadvertently make it possible for students to form misconceptions or make it difficult for teachers to tell if students obtained the right answer for the right reason. Therefore, it is important to make sure when introducing basic summary statistics that…
The Misdirection of Public Policy: Comparing and Combining Standardised Effect Sizes
ERIC Educational Resources Information Center
Simpson, Adrian
2017-01-01
Increased attention on "what works" in education has led to an emphasis on developing policy from evidence based on comparing and combining a particular statistical summary of intervention studies: the standardised effect size. It is assumed that this statistical summary provides an estimate of the educational impact of interventions and…
1996 summary : public transportation systems in Washington state
DOT National Transportation Integrated Search
1997-09-01
The Washington State Department of Transportation (WSDOT) prepares the annual transit statistical summary. The intent for this summary, required by Section 35.58.2796 RCW, is to provide uniform data to transit providers, the Legislative Transportatio...
Summary of Notifiable Infectious Diseases and Conditions - United States, 2015.
Adams, Deborah A; Thomas, Kimberly R; Jajosky, Ruth Ann; Foster, Loretta; Baroi, Gitangali; Sharp, Pearl; Onweh, Diana H; Schley, Alan W; Anderson, Willie J
2017-08-11
The Summary of Notifiable Infectious Diseases and Conditions - United States, 2015 (hereafter referred to as the summary) contains the official statistics, in tabular and graphical form, for the reported occurrence of nationally notifiable infectious diseases and conditions in the United States for 2015. Unless otherwise noted, data are final totals for 2015 reported as of June 30, 2016. These statistics are collected and compiled from reports sent by U.S. state and territories, New York City, and District of Columbia health departments to the National Notifiable Diseases Surveillance System (NNDSS), which is operated by CDC in collaboration with the Council of State and Territorial Epidemiologists (CSTE). This summary is available at https://www.cdc.gov/MMWR/MMWR_nd/index.html. This site also includes summary publications from previous years.
Summary of Notifiable Infectious Diseases and Conditions - United States, 2013.
Adams, Deborah; Fullerton, Kathleen; Jajosky, Ruth; Sharp, Pearl; Onweh, Diana; Schley, Alan; Anderson, Willie; Faulkner, Amanda; Kugeler, Kiersten
2015-10-23
The Summary of Notifiable Infectious Diseases and Condition-United States, 2013 (hereafter referred to as the summary) contains the official statistics, in tabular and graphic form, for the reported occurrence of nationally notifiable infectious diseases and conditions in the United States for 2013. Unless otherwise noted, data are final totals for 2013 reported as of June 30, 2014. These statistics are collected and compiled from reports sent by U.S. state and territory, New York City, and District of Columbia health departments to the National Notifiable Diseases Surveillance System (NNDSS), which is operated by CDC in collaboration with the Council of State and Territorial Epidemiologists (CSTE). This summary is available at http://www.cdc.gov/mmwr/mmwr_nd/index.html. This site also includes summary publications from previous years.
Summary of Notifiable Infectious Diseases and Conditions - United States, 2014.
Adams, Deborah A; Thomas, Kimberly R; Jajosky, Ruth Ann; Foster, Loretta; Sharp, Pearl; Onweh, Diana H; Schley, Alan W; Anderson, Willie J
2016-10-14
The Summary of Notifiable Infectious Diseases and Conditions-United States, 2014 (hereafter referred to as the summary) contains the official statistics, in tabular and graphic form, for the reported occurrence of nationally notifiable infectious diseases and conditions in the United States for 2014. Unless otherwise noted, data are final totals for 2014 reported as of June 30, 2015. These statistics are collected and compiled from reports sent by U.S. state and territory, New York City, and District of Columbia health departments to the National Notifiable Diseases Surveillance System (NNDSS), which is operated by CDC in collaboration with the Council of State and Territorial Epidemiologists (CSTE). This summary is available at http://www.cdc.gov/mmwr/mmwr_nd/index.html. This site also includes summary publications from previous years.
Using Electronic Data Interchange to Report Product Quality
1993-03-01
Numbers 0 31.1 S........................ . . . . ........... .... . .--- . ... N/U 140 SPS Sampling Parameters for Summary Statistics 0 1 N/U 150 REF...DTM Date/Time Reference 0 1 N/U 190 REF Reference Numbers 021 .................................. .......... .. ... NAU 200 STA Statistics 0 1 N/U 210...Measurements 0 1 N/U 120 DTM Date/Time Reference 0 >1 N/U 130 REF Reference Numbers 0 >1 :LOOIV f-SPS N/U 140 SPS Sampling Parameters for Summary Statistics 0 1
Understanding Statistics - Cancer Statistics
Annual reports of U.S. cancer statistics including new cases, deaths, trends, survival, prevalence, lifetime risk, and progress toward Healthy People targets, plus statistical summaries for a number of common cancer types.
Extending existing structural identifiability analysis methods to mixed-effects models.
Janzén, David L I; Jirstrand, Mats; Chappell, Michael J; Evans, Neil D
2018-01-01
The concept of structural identifiability for state-space models is expanded to cover mixed-effects state-space models. Two methods applicable for the analytical study of the structural identifiability of mixed-effects models are presented. The two methods are based on previously established techniques for non-mixed-effects models; namely the Taylor series expansion and the input-output form approach. By generating an exhaustive summary, and by assuming an infinite number of subjects, functions of random variables can be derived which in turn determine the distribution of the system's observation function(s). By considering the uniqueness of the analytical statistical moments of the derived functions of the random variables, the structural identifiability of the corresponding mixed-effects model can be determined. The two methods are applied to a set of examples of mixed-effects models to illustrate how they work in practice. Copyright © 2017 Elsevier Inc. All rights reserved.
USING LINKED MICROMAP PLOTS TO CHARACTERIZE OMERNIK ECOREGIONS
The paper introduces linked micromap (LM plots for presenting environmental summaries. The LM template includes parallel sequences of micromap, able, and statistical summary graphics panels with attention paid to perceptual grouping, sorting and linking of the summary components...
Transportation statistics annual report 1995
DOT National Transportation Integrated Search
1995-01-01
The summary of transportation statistics : programs and many of the tables and : graphs pioneered in last years Transportation : Statistics Annual Report have : been incorporated into the companion volume, : National Transportation Statistics. The...
Giambartolomei, Claudia; Vukcevic, Damjan; Schadt, Eric E; Franke, Lude; Hingorani, Aroon D; Wallace, Chris; Plagnol, Vincent
2014-05-01
Genetic association studies, in particular the genome-wide association study (GWAS) design, have provided a wealth of novel insights into the aetiology of a wide range of human diseases and traits, in particular cardiovascular diseases and lipid biomarkers. The next challenge consists of understanding the molecular basis of these associations. The integration of multiple association datasets, including gene expression datasets, can contribute to this goal. We have developed a novel statistical methodology to assess whether two association signals are consistent with a shared causal variant. An application is the integration of disease scans with expression quantitative trait locus (eQTL) studies, but any pair of GWAS datasets can be integrated in this framework. We demonstrate the value of the approach by re-analysing a gene expression dataset in 966 liver samples with a published meta-analysis of lipid traits including >100,000 individuals of European ancestry. Combining all lipid biomarkers, our re-analysis supported 26 out of 38 reported colocalisation results with eQTLs and identified 14 new colocalisation results, hence highlighting the value of a formal statistical test. In three cases of reported eQTL-lipid pairs (SYPL2, IFT172, TBKBP1) for which our analysis suggests that the eQTL pattern is not consistent with the lipid association, we identify alternative colocalisation results with SORT1, GCKR, and KPNB1, indicating that these genes are more likely to be causal in these genomic intervals. A key feature of the method is the ability to derive the output statistics from single SNP summary statistics, hence making it possible to perform systematic meta-analysis type comparisons across multiple GWAS datasets (implemented online at http://coloc.cs.ucl.ac.uk/coloc/). Our methodology provides information about candidate causal genes in associated intervals and has direct implications for the understanding of complex diseases as well as the design of drugs to target disease pathways.
Summary of Key Operating Statistics: Data Collected from the 2009 Annual Institutional Report
ERIC Educational Resources Information Center
Accrediting Council for Independent Colleges and Schools, 2010
2010-01-01
The Accrediting Council for Independent Colleges and Schools (ACICS) provides the Summary of Key Operating Statistics (KOS) as an annual review of the performance and key measurements of the more than 800 private post-secondary institutions we accredit. This edition of the KOS contains information based on the 2009 Annual Institutional Reports…
The non-equilibrium allele frequency spectrum in a Poisson random field framework.
Kaj, Ingemar; Mugal, Carina F
2016-10-01
In population genetic studies, the allele frequency spectrum (AFS) efficiently summarizes genome-wide polymorphism data and shapes a variety of allele frequency-based summary statistics. While existing theory typically features equilibrium conditions, emerging methodology requires an analytical understanding of the build-up of the allele frequencies over time. In this work, we use the framework of Poisson random fields to derive new representations of the non-equilibrium AFS for the case of a Wright-Fisher population model with selection. In our approach, the AFS is a scaling-limit of the expectation of a Poisson stochastic integral and the representation of the non-equilibrium AFS arises in terms of a fixation time probability distribution. The known duality between the Wright-Fisher diffusion process and a birth and death process generalizing Kingman's coalescent yields an additional representation. The results carry over to the setting of a random sample drawn from the population and provide the non-equilibrium behavior of sample statistics. Our findings are consistent with and extend a previous approach where the non-equilibrium AFS solves a partial differential forward equation with a non-traditional boundary condition. Moreover, we provide a bridge to previous coalescent-based work, and hence tie several frameworks together. Since frequency-based summary statistics are widely used in population genetics, for example, to identify candidate loci of adaptive evolution, to infer the demographic history of a population, or to improve our understanding of the underlying mechanics of speciation events, the presented results are potentially useful for a broad range of topics. Copyright © 2016 Elsevier Inc. All rights reserved.
The Use of Summaries in Studying Texts.
ERIC Educational Resources Information Center
Duchastel, Philippe C.
1983-01-01
Presents a scheme for comparing the text-learning outcomes derivable from study of either text or a summary of the text and considers some practical study strategies students might adopt when summaries are available and when they are not. The value of summaries in instructional situations is discussed. (MBR)
Statistical Compression for Climate Model Output
NASA Astrophysics Data System (ADS)
Hammerling, D.; Guinness, J.; Soh, Y. J.
2017-12-01
Numerical climate model simulations run at high spatial and temporal resolutions generate massive quantities of data. As our computing capabilities continue to increase, storing all of the data is not sustainable, and thus is it important to develop methods for representing the full datasets by smaller compressed versions. We propose a statistical compression and decompression algorithm based on storing a set of summary statistics as well as a statistical model describing the conditional distribution of the full dataset given the summary statistics. We decompress the data by computing conditional expectations and conditional simulations from the model given the summary statistics. Conditional expectations represent our best estimate of the original data but are subject to oversmoothing in space and time. Conditional simulations introduce realistic small-scale noise so that the decompressed fields are neither too smooth nor too rough compared with the original data. Considerable attention is paid to accurately modeling the original dataset-one year of daily mean temperature data-particularly with regard to the inherent spatial nonstationarity in global fields, and to determining the statistics to be stored, so that the variation in the original data can be closely captured, while allowing for fast decompression and conditional emulation on modest computers.
Veturi, Yogasudha; Ritchie, Marylyn D
2018-01-01
Transcriptome-wide association studies (TWAS) have recently been employed as an approach that can draw upon the advantages of genome-wide association studies (GWAS) and gene expression studies to identify genes associated with complex traits. Unlike standard GWAS, summary level data suffices for TWAS and offers improved statistical power. Two popular TWAS methods include either (a) imputing the cis genetic component of gene expression from smaller sized studies (using multi-SNP prediction or MP) into much larger effective sample sizes afforded by GWAS - TWAS-MP or (b) using summary-based Mendelian randomization - TWAS-SMR. Although these methods have been effective at detecting functional variants, it remains unclear how extensive variability in the genetic architecture of complex traits and diseases impacts TWAS results. Our goal was to investigate the different scenarios under which these methods yielded enough power to detect significant expression-trait associations. In this study, we conducted extensive simulations based on 6000 randomly chosen, unrelated Caucasian males from Geisinger's MyCode population to compare the power to detect cis expression-trait associations (within 500 kb of a gene) using the above-described approaches. To test TWAS across varying genetic backgrounds we simulated gene expression and phenotype using different quantitative trait loci per gene and cis-expression /trait heritability under genetic models that differentiate the effect of causality from that of pleiotropy. For each gene, on a training set ranging from 100 to 1000 individuals, we either (a) estimated regression coefficients with gene expression as the response using five different methods: LASSO, elastic net, Bayesian LASSO, Bayesian spike-slab, and Bayesian ridge regression or (b) performed eQTL analysis. We then sampled with replacement 50,000, 150,000, and 300,000 individuals respectively from the testing set of the remaining 5000 individuals and conducted GWAS on each set. Subsequently, we integrated the GWAS summary statistics derived from the testing set with the weights (or eQTLs) derived from the training set to identify expression-trait associations using (a) TWAS-MP (b) TWAS-SMR (c) eQTL-based GWAS, or (d) standalone GWAS. Finally, we examined the power to detect functionally relevant genes using the different approaches under the considered simulation scenarios. In general, we observed great similarities among TWAS-MP methods although the Bayesian methods resulted in improved power in comparison to LASSO and elastic net as the trait architecture grew more complex while training sample sizes and expression heritability remained small. Finally, we observed high power under causality but very low to moderate power under pleiotropy.
Asquith, William H.; Barbie, Dana L.
2014-01-01
Selected summary statistics (L-moments) and estimates of respective sampling variances were computed for the 35 streamgages lacking statistically significant trends. From the L-moments and estimated sampling variances, weighted means or regional values were computed for each L-moment. An example application is included demonstrating how the L-moments could be used to evaluate the magnitude and frequency of annual mean streamflow.
Kakourou, Alexia; Vach, Werner; Nicolardi, Simone; van der Burgt, Yuri; Mertens, Bart
2016-10-01
Mass spectrometry based clinical proteomics has emerged as a powerful tool for high-throughput protein profiling and biomarker discovery. Recent improvements in mass spectrometry technology have boosted the potential of proteomic studies in biomedical research. However, the complexity of the proteomic expression introduces new statistical challenges in summarizing and analyzing the acquired data. Statistical methods for optimally processing proteomic data are currently a growing field of research. In this paper we present simple, yet appropriate methods to preprocess, summarize and analyze high-throughput MALDI-FTICR mass spectrometry data, collected in a case-control fashion, while dealing with the statistical challenges that accompany such data. The known statistical properties of the isotopic distribution of the peptide molecules are used to preprocess the spectra and translate the proteomic expression into a condensed data set. Information on either the intensity level or the shape of the identified isotopic clusters is used to derive summary measures on which diagnostic rules for disease status allocation will be based. Results indicate that both the shape of the identified isotopic clusters and the overall intensity level carry information on the class outcome and can be used to predict the presence or absence of the disease.
Rolland, Jennifer M; Apostolou, Effie; Deckert, Kirsten; de Leon, Maria P; Douglass, Jo A; Glaspole, Ian N; Bailey, Michael; Stockley, Creina S; O'Hehir, Robyn E
2006-09-01
Recent Australian and international legislation requires labeling of wines made by using the potentially allergenic food proteins casein, milk, egg white, or isinglass (fish-derived) where "there is a detectable residual processing aid." We investigated whether wines fined using these proteins or non-grape-derived tannins (tree-nut derived) can provoke significant clinical allergic reactions (anaphylaxis) in patients with confirmed immunoglobulin E-mediated relevant food allergy. A double-blind, placebo-controlled trial was performed to determine whether allergic reactions followed consumption of Australian commercial wines fined using one or more of the legislation-targeted food proteins. In addition, allergenicity of a larger panel of these wines was evaluated by blood basophil activation. No anaphylaxis was induced by wine consumption. Three mild clinical reactions to protein-fined wine and two mild reactions to unfined wine occurred, but there was no statistically significant difference in reaction parameters between subject groups or between processing aids. No pattern of basophil activation correlated with wine type, processing aid, or subject group. Wines fined with egg white, isinglass, or non-grape-derived tannins present an extremely low risk of anaphylaxis to fish-, egg-, or peanut-allergic consumers. Although consumption of milk protein-fined wine did not induce anaphylaxis, there were insufficient subjects to determine statistically whether wines fined with milk proteins present a risk to the very rare milk-allergic consumers. In summary, the observed lack of anaphylaxis and basophil activation induced by wines made using the legislation-targeted food proteins according to good manufacturing practice suggests negligible residual food allergens in these wines.
77 FR 31302 - Advisory Committee on Agriculture Statistics
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-25
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Advisory Committee on Agriculture Statistics AGENCY: National Agricultural Statistics Service, USDA. ACTION: Notice of Renewal of the Charter for the Advisory Committee on Agriculture Statistics. SUMMARY: The U.S. Department of...
Across-cohort QC analyses of GWAS summary statistics from complex traits.
Chen, Guo-Bo; Lee, Sang Hong; Robinson, Matthew R; Trzaskowski, Maciej; Zhu, Zhi-Xiang; Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Kutalik, Zoltán; Loos, Ruth J F; Frayling, Timothy M; Hirschhorn, Joel N; Yang, Jian; Wray, Naomi R; Visscher, Peter M
2016-01-01
Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics F st statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy.
Across-cohort QC analyses of GWAS summary statistics from complex traits
Chen, Guo-Bo; Lee, Sang Hong; Robinson, Matthew R; Trzaskowski, Maciej; Zhu, Zhi-Xiang; Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Kutalik, Zoltán; Loos, Ruth J F; Frayling, Timothy M; Hirschhorn, Joel N; Yang, Jian; Wray, Naomi R; Visscher, Peter M
2017-01-01
Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics Fst statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy. PMID:27552965
DISTMIX: direct imputation of summary statistics for unmeasured SNPs from mixed ethnicity cohorts.
Lee, Donghyung; Bigdeli, T Bernard; Williamson, Vernell S; Vladimirov, Vladimir I; Riley, Brien P; Fanous, Ayman H; Bacanu, Silviu-Alin
2015-10-01
To increase the signal resolution for large-scale meta-analyses of genome-wide association studies, genotypes at unmeasured single nucleotide polymorphisms (SNPs) are commonly imputed using large multi-ethnic reference panels. However, the ever increasing size and ethnic diversity of both reference panels and cohorts makes genotype imputation computationally challenging for moderately sized computer clusters. Moreover, genotype imputation requires subject-level genetic data, which unlike summary statistics provided by virtually all studies, is not publicly available. While there are much less demanding methods which avoid the genotype imputation step by directly imputing SNP statistics, e.g. Directly Imputing summary STatistics (DIST) proposed by our group, their implicit assumptions make them applicable only to ethnically homogeneous cohorts. To decrease computational and access requirements for the analysis of cosmopolitan cohorts, we propose DISTMIX, which extends DIST capabilities to the analysis of mixed ethnicity cohorts. The method uses a relevant reference panel to directly impute unmeasured SNP statistics based only on statistics at measured SNPs and estimated/user-specified ethnic proportions. Simulations show that the proposed method adequately controls the Type I error rates. The 1000 Genomes panel imputation of summary statistics from the ethnically diverse Psychiatric Genetic Consortium Schizophrenia Phase 2 suggests that, when compared to genotype imputation methods, DISTMIX offers comparable imputation accuracy for only a fraction of computational resources. DISTMIX software, its reference population data, and usage examples are publicly available at http://code.google.com/p/distmix. dlee4@vcu.edu Supplementary Data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
ERIC Educational Resources Information Center
Bureau of the Census (DOC), Suitland, MD.
This report presents a summary of recent trends in school and college enrollment based on the October 1977 Current Population Survey (CPS) and earlier surveys. Enrollment statistics representing growth and decline at various educational levels are evaluated in written summaries. Comparative and distributive enrollment statistics of the population…
Major Railroad Accidents Involving Hazardous Materials Release, Composite Summaries 1969-1978
DOT National Transportation Integrated Search
1980-07-31
This report presents composite summaries describing 75 major railroad accidents in which hazardous materials were released. The selected accidents occurred during the years 1969-1978. The data contained in the individual summaries were derived from v...
Property Data Summaries for Advanced Materials
National Institute of Standards and Technology Data Gateway
SRD 150 NIST Property Data Summaries for Advanced Materials (Web, free access) Property Data Summaries are topical collections of property values derived from surveys of published data. Thermal, mechanical, structural, and chemical properties are included in the collections.
Exclusion probabilities and likelihood ratios with applications to kinship problems.
Slooten, Klaas-Jan; Egeland, Thore
2014-05-01
In forensic genetics, DNA profiles are compared in order to make inferences, paternity cases being a standard example. The statistical evidence can be summarized and reported in several ways. For example, in a paternity case, the likelihood ratio (LR) and the probability of not excluding a random man as father (RMNE) are two common summary statistics. There has been a long debate on the merits of the two statistics, also in the context of DNA mixture interpretation, and no general consensus has been reached. In this paper, we show that the RMNE is a certain weighted average of inverse likelihood ratios. This is true in any forensic context. We show that the likelihood ratio in favor of the correct hypothesis is, in expectation, bigger than the reciprocal of the RMNE probability. However, with the exception of pathological cases, it is also possible to obtain smaller likelihood ratios. We illustrate this result for paternity cases. Moreover, some theoretical properties of the likelihood ratio for a large class of general pairwise kinship cases, including expected value and variance, are derived. The practical implications of the findings are discussed and exemplified.
John, Majnu; Lencz, Todd; Malhotra, Anil K; Correll, Christoph U; Zhang, Jian-Ping
2018-06-01
Meta-analysis of genetic association studies is being increasingly used to assess phenotypic differences between genotype groups. When the underlying genetic model is assumed to be dominant or recessive, assessing the phenotype differences based on summary statistics, reported for individual studies in a meta-analysis, is a valid strategy. However, when the genetic model is additive, a similar strategy based on summary statistics will lead to biased results. This fact about the additive model is one of the things that we establish in this paper, using simulations. The main goal of this paper is to present an alternate strategy for the additive model based on simulating data for the individual studies. We show that the alternate strategy is far superior to the strategy based on summary statistics.
77 FR 2345 - Advisory Council on Transportation Statistics; Request for Nominations
Federal Register 2010, 2011, 2012, 2013, 2014
2012-01-17
... Transportation Statistics Advisory Council on Transportation Statistics; Request for Nominations AGENCY: Research and Innovative Technology Administration (RITA), Bureau of Transportation Statistics (BTS), DOT. ACTION: Request for Nominations to the Advisory Council on Transportation Statistics (ACTS). SUMMARY: The...
2016 Annual Disability Statistics Compendium
ERIC Educational Resources Information Center
Lauer, E. A.; Houtenville, A. J.
2017-01-01
The "Annual Disability Statistics Compendium" is a publication of statistics about people with disabilities and about the government programs which serve them. The "Compendium" is designed to serve as a summary of government statistics. The 2016 "Annual Disability Statistics Compendium" was substantially revised and…
Stone, M.A.J.; Mann, Larry J.; Kjelstrom, L.C.
1993-01-01
Statistical summaries and graphs of streamflow data were prepared for 13 gaging stations with 5 or more years of continuous record on and near the Idaho National Engineering Laboratory. Statistical summaries of streamflow data for the Big and Little Lost Rivers and Birch Creek were analyzed as a requisite for a comprehensive evaluation of the potential for flooding of facilities at the Idaho National Engineering Laboratory. The type of statistical analyses performed depended on the length of streamflow record for a gaging station. Streamflow statistics generated for stations with 5 to 9 years of record were: (1) magnitudes of monthly and annual flows; (2) duration of daily mean flows; and (3) maximum, median, and minimum daily mean flows. Streamflow statistics generated for stations with 10 or more years of record were: (1) magnitudes of monthly and annual flows; (2) magnitudes and frequencies of daily low, high, instantaneous peak (flood frequency), and annual mean flows; (3) duration of daily mean flows; (4) exceedance probabilities of annual low, high, instantaneous peak, and mean annual flows; (5) maximum, median, and minimum daily mean flows; and (6) annual mean and mean annual flows.
Generation Y and Blood Donation: The Impact of Altruistic Help in a Darwiportunistic Scenario
Scholz, Christian
2010-01-01
Summary This article focuses on the members of Generation Y and their willingness to offer voluntary (unpaid) blood donations. Using statistics from various sources, a three-stage model is developed to explain blood donation behaviour especially of this generation. It consists of i) developing altruism, ii) raising the willingness to donate blood, and iii) activating actual blood donation behaviour. Members of Generation Y live in a Darwinistic society. They also to some degree act opportunistically, but not in contradiction to altruism. For that reason, the article positions itself in the theoretical framework of Darwi-portunism and derives practical suggestions as well as implications for research. PMID:21048826
DOT National Transportation Integrated Search
2015-01-01
Statistical summaries of streamflow data collected at 184 streamgages in Iowa are presented in this report. All streamgages included for analysis have at least 10 years of continuous record collected before or through September 2013. This report is a...
Goss, Richard L.
1987-01-01
As part of the statistical summaries, trend tests were conducted. Several small uptrends were detected for total nitrogen, total organic nitrogen, total ammonia nitrogen, total nitrite nitrogen, total nitrate nitrogen, total organic plus ammonia nitrogen, total nitrite plus nitrate nitrogen, and total phosphorus. Small downtrends were detected for biochemical oxygen demand and dissolved magnesium.
2017 Annual Disability Statistics Compendium
ERIC Educational Resources Information Center
Lauer, E. A.; Houtenville, A. J.
2018-01-01
The "Annual Disability Statistics Compendium" and its compliment, the "Annual Disability Statistics Supplement," are publications of statistics about people with disabilities and about the government programs which serve them. The "Compendium" and "Supplement" are designed to serve as a summary of government…
The Performance and Retention of Female Navy Officers with a Military Spouse
2017-03-01
5 2. Female Officer Retention and Dual-Military Couples ...............7 3. Demographic Statistics ...23 III. DATA DESCRIPTION AND STATISTICS ...28 2. Independent Variables.................................................................31 C. SUMMARY STATISTICS
Jiang, Wei; Yu, Weichuan
2017-02-15
In genome-wide association studies (GWASs) of common diseases/traits, we often analyze multiple GWASs with the same phenotype together to discover associated genetic variants with higher power. Since it is difficult to access data with detailed individual measurements, summary-statistics-based meta-analysis methods have become popular to jointly analyze datasets from multiple GWASs. In this paper, we propose a novel summary-statistics-based joint analysis method based on controlling the joint local false discovery rate (Jlfdr). We prove that our method is the most powerful summary-statistics-based joint analysis method when controlling the false discovery rate at a certain level. In particular, the Jlfdr-based method achieves higher power than commonly used meta-analysis methods when analyzing heterogeneous datasets from multiple GWASs. Simulation experiments demonstrate the superior power of our method over meta-analysis methods. Also, our method discovers more associations than meta-analysis methods from empirical datasets of four phenotypes. The R-package is available at: http://bioinformatics.ust.hk/Jlfdr.html . eeyu@ust.hk. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Polygenic scores via penalized regression on summary statistics.
Mak, Timothy Shin Heng; Porsch, Robert Milan; Choi, Shing Wan; Zhou, Xueya; Sham, Pak Chung
2017-09-01
Polygenic scores (PGS) summarize the genetic contribution of a person's genotype to a disease or phenotype. They can be used to group participants into different risk categories for diseases, and are also used as covariates in epidemiological analyses. A number of possible ways of calculating PGS have been proposed, and recently there is much interest in methods that incorporate information available in published summary statistics. As there is no inherent information on linkage disequilibrium (LD) in summary statistics, a pertinent question is how we can use LD information available elsewhere to supplement such analyses. To answer this question, we propose a method for constructing PGS using summary statistics and a reference panel in a penalized regression framework, which we call lassosum. We also propose a general method for choosing the value of the tuning parameter in the absence of validation data. In our simulations, we showed that pseudovalidation often resulted in prediction accuracy that is comparable to using a dataset with validation phenotype and was clearly superior to the conservative option of setting the tuning parameter of lassosum to its lowest value. We also showed that lassosum achieved better prediction accuracy than simple clumping and P-value thresholding in almost all scenarios. It was also substantially faster and more accurate than the recently proposed LDpred. © 2017 WILEY PERIODICALS, INC.
2013-01-01
Background The advent of genome-wide association studies has led to many novel disease-SNP associations, opening the door to focused study on their biological underpinnings. Because of the importance of analyzing these associations, numerous statistical methods have been devoted to them. However, fewer methods have attempted to associate entire genes or genomic regions with outcomes, which is potentially more useful knowledge from a biological perspective and those methods currently implemented are often permutation-based. Results One property of some permutation-based tests is that their power varies as a function of whether significant markers are in regions of linkage disequilibrium (LD) or not, which we show from a theoretical perspective. We therefore develop two methods for quantifying the degree of association between a genomic region and outcome, both of whose power does not vary as a function of LD structure. One method uses dimension reduction to “filter” redundant information when significant LD exists in the region, while the other, called the summary-statistic test, controls for LD by scaling marker Z-statistics using knowledge of the correlation matrix of markers. An advantage of this latter test is that it does not require the original data, but only their Z-statistics from univariate regressions and an estimate of the correlation structure of markers, and we show how to modify the test to protect the type 1 error rate when the correlation structure of markers is misspecified. We apply these methods to sequence data of oral cleft and compare our results to previously proposed gene tests, in particular permutation-based ones. We evaluate the versatility of the modification of the summary-statistic test since the specification of correlation structure between markers can be inaccurate. Conclusion We find a significant association in the sequence data between the 8q24 region and oral cleft using our dimension reduction approach and a borderline significant association using the summary-statistic based approach. We also implement the summary-statistic test using Z-statistics from an already-published GWAS of Chronic Obstructive Pulmonary Disorder (COPD) and correlation structure obtained from HapMap. We experiment with the modification of this test because the correlation structure is assumed imperfectly known. PMID:24199751
Swanson, David M; Blacker, Deborah; Alchawa, Taofik; Ludwig, Kerstin U; Mangold, Elisabeth; Lange, Christoph
2013-11-07
The advent of genome-wide association studies has led to many novel disease-SNP associations, opening the door to focused study on their biological underpinnings. Because of the importance of analyzing these associations, numerous statistical methods have been devoted to them. However, fewer methods have attempted to associate entire genes or genomic regions with outcomes, which is potentially more useful knowledge from a biological perspective and those methods currently implemented are often permutation-based. One property of some permutation-based tests is that their power varies as a function of whether significant markers are in regions of linkage disequilibrium (LD) or not, which we show from a theoretical perspective. We therefore develop two methods for quantifying the degree of association between a genomic region and outcome, both of whose power does not vary as a function of LD structure. One method uses dimension reduction to "filter" redundant information when significant LD exists in the region, while the other, called the summary-statistic test, controls for LD by scaling marker Z-statistics using knowledge of the correlation matrix of markers. An advantage of this latter test is that it does not require the original data, but only their Z-statistics from univariate regressions and an estimate of the correlation structure of markers, and we show how to modify the test to protect the type 1 error rate when the correlation structure of markers is misspecified. We apply these methods to sequence data of oral cleft and compare our results to previously proposed gene tests, in particular permutation-based ones. We evaluate the versatility of the modification of the summary-statistic test since the specification of correlation structure between markers can be inaccurate. We find a significant association in the sequence data between the 8q24 region and oral cleft using our dimension reduction approach and a borderline significant association using the summary-statistic based approach. We also implement the summary-statistic test using Z-statistics from an already-published GWAS of Chronic Obstructive Pulmonary Disorder (COPD) and correlation structure obtained from HapMap. We experiment with the modification of this test because the correlation structure is assumed imperfectly known.
ParallABEL: an R library for generalized parallelization of genome-wide association studies.
Sangket, Unitsa; Mahasirimongkol, Surakameth; Chantratita, Wasun; Tandayya, Pichaya; Aulchenko, Yurii S
2010-04-29
Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous acquiring the necessary programming skills to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC) includes 2,062 individuals with 545,080, SNPs' genotyping, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance, and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL.
NASA Technical Reports Server (NTRS)
1983-01-01
The following subject areas are covered: summary of the NASA program goals and objectives; major mission performance; USSR spaceflights; summary comparisons of the USA and USSR space records; and selected technical, financial, and manpower data.
ICAP - An Interactive Cluster Analysis Procedure for analyzing remotely sensed data
NASA Technical Reports Server (NTRS)
Wharton, S. W.; Turner, B. J.
1981-01-01
An Interactive Cluster Analysis Procedure (ICAP) was developed to derive classifier training statistics from remotely sensed data. ICAP differs from conventional clustering algorithms by allowing the analyst to optimize the cluster configuration by inspection, rather than by manipulating process parameters. Control of the clustering process alternates between the algorithm, which creates new centroids and forms clusters, and the analyst, who can evaluate and elect to modify the cluster structure. Clusters can be deleted, or lumped together pairwise, or new centroids can be added. A summary of the cluster statistics can be requested to facilitate cluster manipulation. The principal advantage of this approach is that it allows prior information (when available) to be used directly in the analysis, since the analyst interacts with ICAP in a straightforward manner, using basic terms with which he is more likely to be familiar. Results from testing ICAP showed that an informed use of ICAP can improve classification, as compared to an existing cluster analysis procedure.
Buonaccorsi, G A; Rose, C J; O'Connor, J P B; Roberts, C; Watson, Y; Jackson, A; Jayson, G C; Parker, G J M
2010-01-01
Clinical trials of anti-angiogenic and vascular-disrupting agents often use biomarkers derived from DCE-MRI, typically reporting whole-tumor summary statistics and so overlooking spatial parameter variations caused by tissue heterogeneity. We present a data-driven segmentation method comprising tracer-kinetic model-driven registration for motion correction, conversion from MR signal intensity to contrast agent concentration for cross-visit normalization, iterative principal components analysis for imputation of missing data and dimensionality reduction, and statistical outlier detection using the minimum covariance determinant to obtain a robust Mahalanobis distance. After applying these techniques we cluster in the principal components space using k-means. We present results from a clinical trial of a VEGF inhibitor, using time-series data selected because of problems due to motion and outlier time series. We obtained spatially-contiguous clusters that map to regions with distinct microvascular characteristics. This methodology has the potential to uncover localized effects in trials using DCE-MRI-based biomarkers.
Antweiler, Ronald C.; Taylor, Howard E.
2008-01-01
The main classes of statistical treatment of below-detection limit (left-censored) environmental data for the determination of basic statistics that have been used in the literature are substitution methods, maximum likelihood, regression on order statistics (ROS), and nonparametric techniques. These treatments, along with using all instrument-generated data (even those below detection), were evaluated by examining data sets in which the true values of the censored data were known. It was found that for data sets with less than 70% censored data, the best technique overall for determination of summary statistics was the nonparametric Kaplan-Meier technique. ROS and the two substitution methods of assigning one-half the detection limit value to censored data or assigning a random number between zero and the detection limit to censored data were adequate alternatives. The use of these two substitution methods, however, requires a thorough understanding of how the laboratory censored the data. The technique of employing all instrument-generated data - including numbers below the detection limit - was found to be less adequate than the above techniques. At high degrees of censoring (greater than 70% censored data), no technique provided good estimates of summary statistics. Maximum likelihood techniques were found to be far inferior to all other treatments except substituting zero or the detection limit value to censored data.
Lambing, J.H.
1990-01-01
Water quality sampling was conducted at eight sites on the Clark Fork and selected tributaries from Galen to Missoula, from October 1988 through September 1989. This report presents tabulations and statistical summaries of the water quality data. Included are tabulations of streamflow, onsite water quality, and concentrations of trace elements and suspended sediment for periodic samples. Also included are tables and hydrographs of daily mean values for streamflow, suspended-sediment concentration, and suspended-sediment discharge at three mainstem stations and one tributary. Statistical summaries are presented for periodic water quality data collected from March 1985 through September 1989. Selected data are illustrated by graphs showing median concentrations of trace elements in water, relation of trace-element concentrations to suspended-sediment concentrations, and median concentrations of trace elements in suspended sediment. (USGS)
Lambing, John H.
1989-01-01
Water quality sampling was conducted at eight sites on the Clark Fork and selected tributaries from Galen to Missoula, Mont., from October 1987 through September 1988. This report presents tabulations and statistical summaries of the water quality data. Included in this report are tabulations of streamflow, onsite water quality, and concentrations of trace elements and suspended sediment for periodic samples. Also included are tables and hydrographs of daily mean values for streamflow, suspended-sediment concentration, and suspended-sediment discharge at three mainstream stations and one tributary. Statistical summaries are presented for periodic water quality data collected from March 1985 through September 1988. Selected data are illustrated by graphs showing median concentrations of trace elements in water, relation of trace element concentrations to suspended-sediment concentrations, and median concentrations of trace elements in suspended sediments. (USGS)
Computer program to perform cost and weight analysis of transport aircraft. Volume 1: Summary
NASA Technical Reports Server (NTRS)
1973-01-01
A digital computer program for evaluating the weight and costs of advanced transport designs was developed. The resultant program, intended for use at the preliminary design level, incorporates both batch mode and interactive graphics run capability. The basis of the weight and cost estimation method developed is a unique way of predicting the physical design of each detail part of a vehicle structure at a time when only configuration concept drawings are available. In addition, the technique relies on methods to predict the precise manufacturing processes and the associated material required to produce each detail part. Weight data are generated in four areas of the program. Overall vehicle system weights are derived on a statistical basis as part of the vehicle sizing process. Theoretical weights, actual weights, and the weight of the raw material to be purchased are derived as part of the structural synthesis and part definition processes based on the computed part geometry.
These summaries provide statistics for common cancer types. The statistics include incidence, mortality, survival, stage, prevalence, and lifetime risk. Links to additional resources are included. Updated annually.
NASA Technical Reports Server (NTRS)
1980-01-01
MATHPAC image-analysis library is collection of general-purpose mathematical and statistical routines and special-purpose data-analysis and pattern-recognition routines for image analysis. MATHPAC library consists of Linear Algebra, Optimization, Statistical-Summary, Densities and Distribution, Regression, and Statistical-Test packages.
micromap: A Package for Linked Micromaps
The R package micromap is used to create linked micromaps, which display statistical summaries associated with areal units, or polygons. Linked micromaps provide a means to simultaneously summarize and display both statistical and geographic distributions by linking statistical ...
The 2002 RPA Plot Summary database users manual
Patrick D. Miles; John S. Vissage; W. Brad Smith
2004-01-01
Describes the structure of the RPA 2002 Plot Summary database and provides information on generating estimates of forest statistics from these data. The RPA 2002 Plot Summary database provides a consistent framework for storing forest inventory data across all ownerships across the entire United States. The data represents the best available data as of October 2001....
Zhu, Xiaofeng; Feng, Tao; Tayo, Bamidele O; Liang, Jingjing; Young, J Hunter; Franceschini, Nora; Smith, Jennifer A; Yanek, Lisa R; Sun, Yan V; Edwards, Todd L; Chen, Wei; Nalls, Mike; Fox, Ervin; Sale, Michele; Bottinger, Erwin; Rotimi, Charles; Liu, Yongmei; McKnight, Barbara; Liu, Kiang; Arnett, Donna K; Chakravati, Aravinda; Cooper, Richard S; Redline, Susan
2015-01-08
Genome-wide association studies (GWASs) have identified many genetic variants underlying complex traits. Many detected genetic loci harbor variants that associate with multiple-even distinct-traits. Most current analysis approaches focus on single traits, even though the final results from multiple traits are evaluated together. Such approaches miss the opportunity to systemically integrate the phenome-wide data available for genetic association analysis. In this study, we propose a general approach that can integrate association evidence from summary statistics of multiple traits, either correlated, independent, continuous, or binary traits, which might come from the same or different studies. We allow for trait heterogeneity effects. Population structure and cryptic relatedness can also be controlled. Our simulations suggest that the proposed method has improved statistical power over single-trait analysis in most of the cases we studied. We applied our method to the Continental Origins and Genetic Epidemiology Network (COGENT) African ancestry samples for three blood pressure traits and identified four loci (CHIC2, HOXA-EVX1, IGFBP1/IGFBP3, and CDH17; p < 5.0 × 10(-8)) associated with hypertension-related traits that were missed by a single-trait analysis in the original report. Six additional loci with suggestive association evidence (p < 5.0 × 10(-7)) were also observed, including CACNA1D and WNT3. Our study strongly suggests that analyzing multiple phenotypes can improve statistical power and that such analysis can be executed with the summary statistics from GWASs. Our method also provides a way to study a cross phenotype (CP) association by using summary statistics from GWASs of multiple phenotypes. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Hazing DEOCS 4.1 Construct Validity Summary
2017-08-01
Hazing DEOCS 4.1 Construct Validity Summary DEFENSE EQUAL OPPORTUNITY MANAGEMENT INSTITUTE DIRECTORATE OF...the analysis. Tables 4 – 6 provide additional information regarding the descriptive statistics and reliability of the Hazing items. Table 7 provides
Teaching the Meaning of Statistical Techniques with Microcomputer Simulation.
ERIC Educational Resources Information Center
Lee, Motoko Y.; And Others
Students in an introductory statistics course are often preoccupied with learning the computational routines of specific summary statistics and thereby fail to develop an understanding of the meaning of those statistics or their conceptual basis. To help students develop a better understanding of the meaning of three frequently used statistics,…
Statistical Abstract of the United States: 2012. 131st Edition
ERIC Educational Resources Information Center
US Census Bureau, 2011
2011-01-01
"The Statistical Abstract of the United States," published from 1878 to 2012, is the authoritative and comprehensive summary of statistics on the social, political, and economic organization of the United States. It is designed to serve as a convenient volume for statistical reference, and as a guide to other statistical publications and…
Energy conservation indicators. 1982 annual report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Belzer, D.B.
A series of Energy Conservation Indicators were developed for the Department of Energy to assist in the evaluation of current and proposed conservation strategies. As descriptive statistics that signify current conditions and trends related to efficiency of energy use, indicators provide a way of measuring, monitoring, or inferring actual responses by consumers in markets for energy services. Related sets of indicators are presented in some 40 one-page indicator summaries. Indicators are shown graphically, followed by several paragraphs that explain their derivation and highlight key findings. Indicators are classified according to broad end-use sectors: Aggregate (economy), Residential, Commercial, Industrial, Transportation andmore » Electric Utilities. In most cases annual time series information is presented covering the period 1960 through 1981.« less
Relationships Between Photospheric Flows and Solar Flares
NASA Astrophysics Data System (ADS)
Welsch, B. T.; Li, Y.
2013-12-01
Fourier Local Correlation Tracking (FLCT) has been applied to the entire database of 96-minute cadence line-of-sight (LOS) magnetograms from the SOHO/MDI mission, to derive photospheric transverse velocities (u_x,u_y). In a previous study, we applied FLCT to a few dozen active regions (ARs), and found that the "proxy Poynting flux" (PPF) --- the product u B^2, where u is the FLCT flow speed and B is the LOS field divided by the cosine of viewing angle, integrated over each AR --- was statistically related to flare activity. We will present preliminary results of our investigation of the relationship between PPF and flare activity from NOAA's GOES catalog for several hundred ARs identified in NOAA's daily Solar Region Summaries.
Surface topography of the Greenland Ice Sheet from satellite radar altimetry
NASA Technical Reports Server (NTRS)
Bindschadler, Robert A.; Zwally, H. Jay; Major, Judith A.; Brenner, Anita C.
1989-01-01
Surface elevation maps of the southern half of the Greenland subcontinent are produced from radar altimeter data acquired by the Seasat satellite. A summary of the processing procedure and examples of return waveform data are given. The elevation data are used to generate a regular grid which is then computer contoured to provide an elevation contour map. Ancillary maps show the statistical quality of the elevation data and various characteristics of the surface. The elevation map is used to define ice flow directions and delineate the major drainage basins. Regular maps of the Jakobshavns Glacier drainage basin and the ice divide in the vicinity of Crete Station are presented. Altimeter derived elevations are compared with elevations measured both by satellite geoceivers and optical surveying.
Michigan Library Statistical Report. 1997 Edition.
ERIC Educational Resources Information Center
Michigan Library, Lansing.
This 1997 edition focuses on statistical data supplied by Michigan public libraries, public library cooperatives, and those public libraries which serve as regional or subregional outlets for blind and physically handicapped patrons. Statistics on academic libraries are also presented in this edition, and summary statistics for prior fiscal years…
Statistical methods to detect novel genetic variants using publicly available GWAS summary data.
Guo, Bin; Wu, Baolin
2018-03-01
We propose statistical methods to detect novel genetic variants using only genome-wide association studies (GWAS) summary data without access to raw genotype and phenotype data. With more and more summary data being posted for public access in the post GWAS era, the proposed methods are practically very useful to identify additional interesting genetic variants and shed lights on the underlying disease mechanism. We illustrate the utility of our proposed methods with application to GWAS meta-analysis results of fasting glucose from the international MAGIC consortium. We found several novel genome-wide significant loci that are worth further study. Copyright © 2018 Elsevier Ltd. All rights reserved.
Thomas, Kimberly; Jajosky, Ruth; Coates, Ralph J; Calvert, Geoffrey M; Dewey-Mattia, Daniel; Raymond, Jaime; Singh, Simple D
2017-08-11
The Summary of Notifiable Noninfectious Conditions and Disease Outbreaks: Surveillance Data Published Between April 1, 2016 and January 31, 2017 - United States, herein referred to as the Summary (Noninfectious), contains official statistics for nationally notifiable noninfectious conditions and disease outbreaks. This Summary (Noninfectious) is being published in the same volume of MMWR as the annual Summary of Notifiable Infectious Diseases and Conditions (1). Data on notifiable noninfectious conditions and disease outbreaks from prior years have been published previously (2,3).
Song, Rui; Kosorok, Michael R.; Cai, Jianwen
2009-01-01
Summary Recurrent events data are frequently encountered in clinical trials. This article develops robust covariate-adjusted log-rank statistics applied to recurrent events data with arbitrary numbers of events under independent censoring and the corresponding sample size formula. The proposed log-rank tests are robust with respect to different data-generating processes and are adjusted for predictive covariates. It reduces to the Kong and Slud (1997, Biometrika 84, 847–862) setting in the case of a single event. The sample size formula is derived based on the asymptotic normality of the covariate-adjusted log-rank statistics under certain local alternatives and a working model for baseline covariates in the recurrent event data context. When the effect size is small and the baseline covariates do not contain significant information about event times, it reduces to the same form as that of Schoenfeld (1983, Biometrics 39, 499–503) for cases of a single event or independent event times within a subject. We carry out simulations to study the control of type I error and the comparison of powers between several methods in finite samples. The proposed sample size formula is illustrated using data from an rhDNase study. PMID:18162107
Two-Bin Kanban: Ordering Impact at Navy Medical Center San Diego
2016-06-01
book cost $2.94 on December 12, 2012. This same book cost $3.76 on June 30, 2015. These two costs were averaged to $3.35 in both the pretest (2013...with summary statistics based on those observations (Kabacoff, 2011, p. 112). Replacing the groups of observations with summary statistics allows the...ABSTRACT (maximum 200 words) One of the most important aspects of hospital administration is the medical consumable inventory process. The Navy Bureau of
NASA Technical Reports Server (NTRS)
Nastrom, G. D.; Jasperson, W. H.
1983-01-01
Temperature data obtained by the Global Atmospheric Sampling Program (GASP) during the period March 1975 to July 1979 are compiled to form flight summaries of static air temperature and a geographic temperature climatology. The flight summaries include the height and location of the coldest observed temperature and the mean flight level, temperature and the standard deviation of temperature for each flight as well as for flight segments. These summaries are ordered by route and month. The temperature climatology was computed for all statistically independent temperture data for each flight. The grid used consists of 5 deg latitude, 30 deg longitude and 2000 feet vertical resolution from FL270 to FL430 for each month of the year. The number of statistically independent observations, their mean, standard deviation and the empirical 98, 50, 16, 2 and .3 probability percentiles are presented.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-30
... for OMB Review; Comment Request; Mass Layoff Statistics Program ACTION: Notice. SUMMARY: The... request (ICR) titled, ``Mass Layoff Statistics Program,'' to the Office of Management and Budget (OMB) for... Statistics (BLS). Title of Collection: Mass Layoff Statistics Program. OMB Control Number: 1220-0090...
Streamflow statistics for selected streams in North Dakota, Minnesota, Manitoba, and Saskatchewan
Williams-Sether, Tara
2012-01-01
Statistical summaries of streamflow data for the periods of record through water year 2009 for selected active and discontinued U.S. Geological Survey streamflow-gaging stations in North Dakota, Minnesota, Manitoba, and Saskatchewan were compiled. The summaries for each streamflow-gaging station include a brief station description, a graph of the annual peak and annual mean discharge for the period of record, statistics of monthly and annual mean discharges, monthly and annual flow durations, probability of occurrence of annual high discharges, annual peak discharge and corresponding gage height for the period of record, and monthly and annual mean discharges for the period of record.
Statistical summaries of water-quality data for two coal areas of Jackson County, Colorado
Kuhn, Gerhard
1982-01-01
Statistical summaries of water-quality data are compiled for eight streams in two separate coal areas of Jackson County, Colo. The quality-of-water data were collected from October 1976 to September 1980. For inorganic constituents, the maximum, minimum, and mean concentrations, as well as other statistics are presented; for minor elements, only the maximum, minimum, and mean values are included. Least-squares equations (regressions) are also given relating specific conductance of the streams to the concentration of the major ions. The observed range of specific conductance was 85 to 1,150 micromhos per centimeter for the eight sites. (USGS)
Analysis of Professional and Pre-Accession Characteristics and Junior Naval Officer Performance
2018-03-01
REVIEW .............................................5 A. NAVY PERFORMANCE EVALUATION SYSTEM ............................5 B. PROFESSIONAL...17 A. DATA DESCRIPTION ...........................................................................17 B. SUMMARY...STATISTICS ......................................................................24 C. DESCRIPTIVE STATISTICS
Zhang, Han; Wheeler, William; Hyland, Paula L; Yang, Yifan; Shi, Jianxin; Chatterjee, Nilanjan; Yu, Kai
2016-06-01
Meta-analysis of multiple genome-wide association studies (GWAS) has become an effective approach for detecting single nucleotide polymorphism (SNP) associations with complex traits. However, it is difficult to integrate the readily accessible SNP-level summary statistics from a meta-analysis into more powerful multi-marker testing procedures, which generally require individual-level genetic data. We developed a general procedure called Summary based Adaptive Rank Truncated Product (sARTP) for conducting gene and pathway meta-analysis that uses only SNP-level summary statistics in combination with genotype correlation estimated from a panel of individual-level genetic data. We demonstrated the validity and power advantage of sARTP through empirical and simulated data. We conducted a comprehensive pathway-based meta-analysis with sARTP on type 2 diabetes (T2D) by integrating SNP-level summary statistics from two large studies consisting of 19,809 T2D cases and 111,181 controls with European ancestry. Among 4,713 candidate pathways from which genes in neighborhoods of 170 GWAS established T2D loci were excluded, we detected 43 T2D globally significant pathways (with Bonferroni corrected p-values < 0.05), which included the insulin signaling pathway and T2D pathway defined by KEGG, as well as the pathways defined according to specific gene expression patterns on pancreatic adenocarcinoma, hepatocellular carcinoma, and bladder carcinoma. Using summary data from 8 eastern Asian T2D GWAS with 6,952 cases and 11,865 controls, we showed 7 out of the 43 pathways identified in European populations remained to be significant in eastern Asians at the false discovery rate of 0.1. We created an R package and a web-based tool for sARTP with the capability to analyze pathways with thousands of genes and tens of thousands of SNPs.
Zhang, Han; Wheeler, William; Hyland, Paula L.; Yang, Yifan; Shi, Jianxin; Chatterjee, Nilanjan; Yu, Kai
2016-01-01
Meta-analysis of multiple genome-wide association studies (GWAS) has become an effective approach for detecting single nucleotide polymorphism (SNP) associations with complex traits. However, it is difficult to integrate the readily accessible SNP-level summary statistics from a meta-analysis into more powerful multi-marker testing procedures, which generally require individual-level genetic data. We developed a general procedure called Summary based Adaptive Rank Truncated Product (sARTP) for conducting gene and pathway meta-analysis that uses only SNP-level summary statistics in combination with genotype correlation estimated from a panel of individual-level genetic data. We demonstrated the validity and power advantage of sARTP through empirical and simulated data. We conducted a comprehensive pathway-based meta-analysis with sARTP on type 2 diabetes (T2D) by integrating SNP-level summary statistics from two large studies consisting of 19,809 T2D cases and 111,181 controls with European ancestry. Among 4,713 candidate pathways from which genes in neighborhoods of 170 GWAS established T2D loci were excluded, we detected 43 T2D globally significant pathways (with Bonferroni corrected p-values < 0.05), which included the insulin signaling pathway and T2D pathway defined by KEGG, as well as the pathways defined according to specific gene expression patterns on pancreatic adenocarcinoma, hepatocellular carcinoma, and bladder carcinoma. Using summary data from 8 eastern Asian T2D GWAS with 6,952 cases and 11,865 controls, we showed 7 out of the 43 pathways identified in European populations remained to be significant in eastern Asians at the false discovery rate of 0.1. We created an R package and a web-based tool for sARTP with the capability to analyze pathways with thousands of genes and tens of thousands of SNPs. PMID:27362418
2000 Iowa crash facts : a summary of motor vehicle crash statistics on Iowa roadways
DOT National Transportation Integrated Search
2000-01-01
All statistics are gathered and calculated by the Iowa Department of Transportations Office of Driver Services. National statistics : are obtained from Traffic Safety Facts 2000 published by the U.S. Department of Transportations National...
2004 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2005-06-27
The following summary of traffic accidents represents only those accidents that have occurred on the State Highway : System of Missouri in 2004. The information contained in this publication is a summary of the accident reports : provided to the Miss...
1999 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2001-01-17
The following summary of traffic accidents represents only those accidents that have occurred on the State : Highway System of Missouri in 1999. The information contained in this publication is a summary of the accident : reports provided to the Miss...
2001 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2002-09-30
The following summary of traffic accidents represents only those accidents that have occurred on the State : Highway System of Missouri in 2001. The information contained in this publication is a summary of the accident : reports provided to the Miss...
2000 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2001-10-25
The following summary of traffic accidents represents only those accidents that have occurred on the State : Highway System of Missouri in 2000. The information contained in this publication is a summary of the accident : reports provided to the Miss...
2002 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2003-07-02
The following summary of traffic accidents represents only those accidents that have occurred on the State Highway : System of Missouri in 2002. The information contained in this publication is a summary of the accident reports : provided to the Miss...
2005 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2006-08-31
The following summary of traffic accidents represents only those accidents that have occurred on the State Highway : System of Missouri in 2005. The information contained in this publication is a summary of the accident reports : provided to the Miss...
2003 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2004-08-12
The following summary of traffic accidents represents only those accidents that have occurred on the State Highway : System of Missouri in 2003. The information contained in this publication is a summary of the accident reports : provided to the Miss...
Wildlife strikes to Canadian aircraft : 2008 summary report
DOT National Transportation Integrated Search
2008-01-01
This report provides a summary of Canadian wildlife strike statistics for 2008. It is : intended for the use of all stakeholders involved with Airport Bird and Mammal Control : Programs. Included in this group are pilots, airfield staff, airline main...
Lu, Qiongshi; Li, Boyang; Ou, Derek; Erlendsdottir, Margret; Powles, Ryan L; Jiang, Tony; Hu, Yiming; Chang, David; Jin, Chentian; Dai, Wei; He, Qidu; Liu, Zefeng; Mukherjee, Shubhabrata; Crane, Paul K; Zhao, Hongyu
2017-12-07
Despite the success of large-scale genome-wide association studies (GWASs) on complex traits, our understanding of their genetic architecture is far from complete. Jointly modeling multiple traits' genetic profiles has provided insights into the shared genetic basis of many complex traits. However, large-scale inference sets a high bar for both statistical power and biological interpretability. Here we introduce a principled framework to estimate annotation-stratified genetic covariance between traits using GWAS summary statistics. Through theoretical and numerical analyses, we demonstrate that our method provides accurate covariance estimates, thereby enabling researchers to dissect both the shared and distinct genetic architecture across traits to better understand their etiologies. Among 50 complex traits with publicly accessible GWAS summary statistics (N total ≈ 4.5 million), we identified more than 170 pairs with statistically significant genetic covariance. In particular, we found strong genetic covariance between late-onset Alzheimer disease (LOAD) and amyotrophic lateral sclerosis (ALS), two major neurodegenerative diseases, in single-nucleotide polymorphisms (SNPs) with high minor allele frequencies and in SNPs located in the predicted functional genome. Joint analysis of LOAD, ALS, and other traits highlights LOAD's correlation with cognitive traits and hints at an autoimmune component for ALS. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Sarwate, Anand D; Plis, Sergey M; Turner, Jessica A; Arbabshirani, Mohammad R; Calhoun, Vince D
2014-01-01
The growth of data sharing initiatives for neuroimaging and genomics represents an exciting opportunity to confront the "small N" problem that plagues contemporary neuroimaging studies while further understanding the role genetic markers play in the function of the brain. When it is possible, open data sharing provides the most benefits. However, some data cannot be shared at all due to privacy concerns and/or risk of re-identification. Sharing other data sets is hampered by the proliferation of complex data use agreements (DUAs) which preclude truly automated data mining. These DUAs arise because of concerns about the privacy and confidentiality for subjects; though many do permit direct access to data, they often require a cumbersome approval process that can take months. An alternative approach is to only share data derivatives such as statistical summaries-the challenges here are to reformulate computational methods to quantify the privacy risks associated with sharing the results of those computations. For example, a derived map of gray matter is often as identifiable as a fingerprint. Thus alternative approaches to accessing data are needed. This paper reviews the relevant literature on differential privacy, a framework for measuring and tracking privacy loss in these settings, and demonstrates the feasibility of using this framework to calculate statistics on data distributed at many sites while still providing privacy.
DOT National Transportation Integrated Search
2005-01-01
2005 SELECTED STATISTICS provides a summary of recent : transportation-related data collected and reported by the Kansas : Department of Transportation (KDOT). : Information regarding the following modes of transportation in the : State of Kansas -- ...
More Cancer Types - SEER Cancer Stat Facts
Cancer Statistical Fact Sheets are summaries of common cancer types developed to provide an overview of frequently-requested cancer statistics including incidence, mortality, survival, stage, prevalence, and lifetime risk.
Measuring older adults' sedentary time: reliability, validity, and responsiveness.
Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville
2011-11-01
With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is sufficiently responsive to change suggesting that it is suitable for use in interventions with older adults.
DOT National Transportation Integrated Search
1997-10-01
In order to provide waterborne commerce information as soon as possible, the Waterborne Commerce Statistics Center (WCSC) has prepared this summary document of estimated waterborne commerce statistics for calendar year 1996. The foreign import and ex...
DOT National Transportation Integrated Search
1999-07-30
In order to provide waterborne commerce information as soon as possible, the Waterborne Commerce Statistics Center(WCSC) has prepared this summary document of estimated waterborne commerce statistics for calendar year 1998. The foreign import and exp...
Financial statistics major US publicly owned electric utilities 1996
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
1998-03-01
The 1996 edition of The Financial Statistics of Major US Publicly Owned Electric Utilities publication presents 5 years (1992 through 1996) of summary financial data and current year detailed financial data on the major publicly owned electric utilities. The objective of the publication is to provide Federal and State governments, industry, and the general public with current and historical data that can be used for policymaking and decision making purposes related to publicly owned electric utility issues. Generator and nongenerator summaries are presented in this publication. Five years of summary financial data are provided. Summaries of generators for fiscal yearsmore » ending June 30 and December 31, nongenerators for fiscal years ending June 30 and December 31, and summaries of all respondents are provided. The composite tables present aggregates of income statement and balance sheet data, as well as financial indicators. Composite tables also display electric operation and maintenance expenses, electric utility plant, number of consumers, sales of electricity, and operating revenue, and electric energy account data. 2 figs., 32 tabs.« less
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-01
... Statistics Relating to Competitive Need Limitations AGENCY: Office of the United States Trade Representative. ACTION: Notice. SUMMARY: This notice is to inform the public of the availability of import statistics for... System of Preferences (GSP) program. These import statistics identify some articles for which the 2011...
Statistical Report of Kentucky Public Libraries, Fiscal Year 1997-1998.
ERIC Educational Resources Information Center
Bank, Jay, Comp.
This report contains statistical information on Kentucky public libraries for fiscal year 1997-1998 taken from the Annual Report of Public Libraries. The report is separated into seven sections: summary of library statistics for the most recent year (1998) and comparisons with the three prior years; graphs showing statistical trends in library…
Ector, Hugo
2010-12-01
I still remember my first book on statistics: "Elementary statistics with applications in medicine and the biological sciences" by Frederick E. Croxton. For me, it has been the start of pursuing understanding statistics in daily life and in medical practice. It was the first volume in a long row of books. In his introduction, Croxton pretends that"nearly everyone involved in any aspect of medicine needs to have some knowledge of statistics". The reality is that for many clinicians, statistics are limited to a "P < 0.05 = ok". I do not blame my colleagues who omit the paragraph on statistical methods. They have never had the opportunity to learn concise and clear descriptions of the key features. I have experienced how some authors can describe difficult methods in a well understandable language. Others fail completely. As a teacher, I tell my students that life is impossible without a basic knowledge of statistics. This feeling has resulted in an annual seminar of 90 minutes. This tutorial is the summary of this seminar. It is a summary and a transcription of the best pages I have detected.
2007 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2008-08-25
The following summary of traffic crashes represents only those crashes that have occurred on the State Highway : System of Missouri in 2007. The information contained in this publication is a summary of the crash reports : provided to the Missouri De...
2009 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2010-08-03
The following summary of traffic crashes represents only those crashes that have occurred on the State Highway : System of Missouri in 2009. The information contained in this publication is a summary of the crash reports : provided to the Missouri De...
2008 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2009-08-13
The following summary of traffic crashes represents only those crashes that have occurred on the State Highway : System of Missouri in 2008. The information contained in this publication is a summary of the crash reports : provided to the Missouri De...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-08-21
... data and summary statistics, addressing the program's current design and coverage; a summary of testing... identified in the OMB Inventory of Approved ICR Burdens. This decrease is due to (1) a mathematical error...
2006 Missouri state highway system : traffic accident statistics
DOT National Transportation Integrated Search
2007-07-18
The following summary of traffic crashes represents only those crashes that have occurred on the State Highway : System of Missouri in 2006. The information contained in this publication is a summary of the crash reports : provided to the Missouri De...
DOT National Transportation Integrated Search
1995-12-01
This publication, 1995 SELECTED STATISTICS, was designed to provide a summary of transportation-related data collected and reported by the Kansas Department of Transportation (KDOT). Information regarding the following modes of transportation in the ...
Lambing, J.H.
1988-01-01
Water quality sampling was conducted at seven sites on the Clark Fork and selected tributaries from Deer Lodge to Missoula, Montana, from July 1986 through September 1987. This report presents tabulations and statistical summaries of the water quality data. The data presented in this report supplement previous data collected from March 1985 through June 1986 for six of the seven sites. Included in this report are tabulations of instantaneous values of streamflow, onsite water quality, hardness, and concentrations of trace elements and suspended sediment for periodic samples. Also included are tables and hydrographs of daily mean values for streamflow, suspended-sediment concentration, and suspended-sediment discharge at three mainstream stations and one tributary. Statistical summaries are presented for periodic water quality data collected from March 1986 through September 1987. Selected data are illustrated by graphs showing median concentrations to suspended-sediment concentrations, and median concentrations of trace elements in suspended sediment. (USGS)
Pare, Guillaume; Mao, Shihong; Deng, Wei Q
2016-06-08
Despite considerable efforts, known genetic associations only explain a small fraction of predicted heritability. Regional associations combine information from multiple contiguous genetic variants and can improve variance explained at established association loci. However, regional associations are not easily amenable to estimation using summary association statistics because of sensitivity to linkage disequilibrium (LD). We now propose a novel method, LD Adjusted Regional Genetic Variance (LARGV), to estimate phenotypic variance explained by regional associations using summary statistics while accounting for LD. Our method is asymptotically equivalent to a multiple linear regression model when no interaction or haplotype effects are present. It has several applications, such as ranking of genetic regions according to variance explained or comparison of variance explained by two or more regions. Using height and BMI data from the Health Retirement Study (N = 7,776), we show that most genetic variance lies in a small proportion of the genome and that previously identified linkage peaks have higher than expected regional variance.
Pare, Guillaume; Mao, Shihong; Deng, Wei Q.
2016-01-01
Despite considerable efforts, known genetic associations only explain a small fraction of predicted heritability. Regional associations combine information from multiple contiguous genetic variants and can improve variance explained at established association loci. However, regional associations are not easily amenable to estimation using summary association statistics because of sensitivity to linkage disequilibrium (LD). We now propose a novel method, LD Adjusted Regional Genetic Variance (LARGV), to estimate phenotypic variance explained by regional associations using summary statistics while accounting for LD. Our method is asymptotically equivalent to a multiple linear regression model when no interaction or haplotype effects are present. It has several applications, such as ranking of genetic regions according to variance explained or comparison of variance explained by two or more regions. Using height and BMI data from the Health Retirement Study (N = 7,776), we show that most genetic variance lies in a small proportion of the genome and that previously identified linkage peaks have higher than expected regional variance. PMID:27273519
Education Statistics Quarterly, Spring 2001.
ERIC Educational Resources Information Center
Education Statistics Quarterly, 2001
2001-01-01
The "Education Statistics Quarterly" gives a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications, data products and funding opportunities developed over a 3-month period. Each issue…
Statistical Techniques to Analyze Pesticide Data Program Food Residue Observations.
Szarka, Arpad Z; Hayworth, Carol G; Ramanarayanan, Tharacad S; Joseph, Robert S I
2018-06-26
The U.S. EPA conducts dietary-risk assessments to ensure that levels of pesticides on food in the U.S. food supply are safe. Often these assessments utilize conservative residue estimates, maximum residue levels (MRLs), and a high-end estimate derived from registrant-generated field-trial data sets. A more realistic estimate of consumers' pesticide exposure from food may be obtained by utilizing residues from food-monitoring programs, such as the Pesticide Data Program (PDP) of the U.S. Department of Agriculture. A substantial portion of food-residue concentrations in PDP monitoring programs are below the limits of detection (left-censored), which makes the comparison of regulatory-field-trial and PDP residue levels difficult. In this paper, we present a novel adaption of established statistical techniques, the Kaplan-Meier estimator (K-M), the robust regression on ordered statistic (ROS), and the maximum-likelihood estimator (MLE), to quantify the pesticide-residue concentrations in the presence of heavily censored data sets. The examined statistical approaches include the most commonly used parametric and nonparametric methods for handling left-censored data that have been used in the fields of medical and environmental sciences. This work presents a case study in which data of thiamethoxam residue on bell pepper generated from registrant field trials were compared with PDP-monitoring residue values. The results from the statistical techniques were evaluated and compared with commonly used simple substitution methods for the determination of summary statistics. It was found that the maximum-likelihood estimator (MLE) is the most appropriate statistical method to analyze this residue data set. Using the MLE technique, the data analyses showed that the median and mean PDP bell pepper residue levels were approximately 19 and 7 times lower, respectively, than the corresponding statistics of the field-trial residues.
Illinois crash facts and statistics, 2008
DOT National Transportation Integrated Search
2008-01-01
The 2008 Illinois Crash Facts & Statistics includes : data that illustrate Illinois safety accomplishments and : provides information about key events in the history of : traffic-safety related legislation. Summaries of safety : belt usage, ...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-05-26
... Sentenced Population Movement--National Prisoner Statistics, Extension and Revision of Existing Collection...) Title of the Form/Collection: Summary of Sentenced Population Movement--National Prisoner Statistics (3...
Saiki, Jun; Holcombe, Alex O
2012-03-06
Sudden change of every object in a display is typically conspicuous. We find however that in the presence of a secondary task, with a display of moving dots, it can be difficult to detect a sudden change in color of all the dots. A field of 200 dots, half red and half green, half moving rightward and half moving leftward, gave the appearance of two surfaces. When all 200 dots simultaneously switched color between red and green, performance in detecting the switch was very poor. A key display characteristic was that the color proportions on each surface (summary statistics) were not affected by the color switch. When the color switch is accompanied by a change in these summary statistics, people perform well in detecting the switch, suggesting that the secondary task does not disrupt the availability of this statistical information. These findings suggest that when the change is missed, the old and new colors were represented, but the color-location pattern (binding of colors to locations) was not represented or not compared. Even after extended viewing, changes to the individual color-location pattern are not available, suggesting that the feeling of seeing these details is misleading.
Meng, Xiang-He; Shen, Hui; Chen, Xiang-Ding; Xiao, Hong-Mei; Deng, Hong-Wen
2018-03-01
Genome-wide association studies (GWAS) have successfully identified numerous genetic variants associated with diverse complex phenotypes and diseases, and provided tremendous opportunities for further analyses using summary association statistics. Recently, Pickrell et al. developed a robust method for causal inference using independent putative causal SNPs. However, this method may fail to infer the causal relationship between two phenotypes when only a limited number of independent putative causal SNPs identified. Here, we extended Pickrell's method to make it more applicable for the general situations. We extended the causal inference method by replacing the putative causal SNPs with the lead SNPs (the set of the most significant SNPs in each independent locus) and tested the performance of our extended method using both simulation and empirical data. Simulations suggested that when the same number of genetic variants is used, our extended method had similar distribution of test statistic under the null model as well as comparable power under the causal model compared with the original method by Pickrell et al. But in practice, our extended method would generally be more powerful because the number of independent lead SNPs was often larger than the number of independent putative causal SNPs. And including more SNPs, on the other hand, would not cause more false positives. By applying our extended method to summary statistics from GWAS for blood metabolites and femoral neck bone mineral density (FN-BMD), we successfully identified ten blood metabolites that may causally influence FN-BMD. We extended a causal inference method for inferring putative causal relationship between two phenotypes using summary statistics from GWAS, and identified a number of potential causal metabolites for FN-BMD, which may provide novel insights into the pathophysiological mechanisms underlying osteoporosis.
DOT National Transportation Integrated Search
1995-05-01
This report provides a summary on the state of the national mass transit industry by highlighting aggregate financial and operational characteristics and trend information for key statistics and performance indicators. These aggregate data represent ...
Illinois crash facts and statistics, 2007
DOT National Transportation Integrated Search
2007-01-01
The 2007 Illinois Crash Facts & Statistics includes : data that illustrate these accomplishments and also : provides information about key events in the history of : traffic safety-related legislation. Also included, are : summaries of motorcyc...
State transportation profile : summary
DOT National Transportation Integrated Search
2003-12-01
The Bureau of Transportation Statistics (BTS) presents a statistical : profile of transportation in the 50 states and the District of Columbia. : This document supplements a previously published series of individual : state profiles. Like the individ...
Energetic Electron Populations in the Magnetosphere During Geomagnetic Storms and Substorms
NASA Technical Reports Server (NTRS)
McKenzie, David L.; Anderson, Phillip C.
2002-01-01
This report summarizes the scientific work performed by the Aerospace Corporation under NASA Grant NAG5-10278, 'Energetic Electron Populations in the Magnetosphere during Geomagnetic Storms and Subsisting.' The period of performance for the Grant was March 1, 2001 to February 28, 2002. The following is a summary of the Statement of Work for this Grant. Use data from the PIXIE instrument on the Polar spacecraft from September 1998 onward to derive the statistical relationship between particle precipitation patterns and various geomagnetic activity indices. We are particularly interested in the occurrence of substorms during storm main phase and the efficacy of storms and substorms in injecting ring-current particles. We will compare stormtime simulations of the diffuse aurora using the models of Chen and Schulz with stormtime PIXIE measurements.
Analysis of the Tanana River Basin using LANDSAT data
NASA Technical Reports Server (NTRS)
Morrissey, L. A.; Ambrosia, V. G.; Carson-Henry, C.
1981-01-01
Digital image classification techniques were used to classify land cover/resource information in the Tanana River Basin of Alaska. Portions of four scenes of LANDSAT digital data were analyzed using computer systems at Ames Research Center in an unsupervised approach to derive cluster statistics. The spectral classes were identified using the IDIMS display and color infrared photography. Classification errors were corrected using stratification procedures. The classification scheme resulted in the following eleven categories; sedimented/shallow water, clear/deep water, coniferous forest, mixed forest, deciduous forest, shrub and grass, bog, alpine tundra, barrens, snow and ice, and cultural features. Color coded maps and acreage summaries of the major land cover categories were generated for selected USGS quadrangles (1:250,000) which lie within the drainage basin. The project was completed within six months.
Petkovic, Jennifer; Welch, Vivian; Jacob, Maria Helena; Yoganathan, Manosila; Ayala, Ana Patricia; Cunningham, Heather; Tugwell, Peter
2016-12-09
Systematic reviews are important for decision makers. They offer many potential benefits but are often written in technical language, are too long, and do not contain contextual details which make them hard to use for decision-making. There are many organizations that develop and disseminate derivative products, such as evidence summaries, from systematic reviews for different populations or subsets of decision makers. This systematic review aimed to (1) assess the effectiveness of evidence summaries on policymakers' use of the evidence and (2) identify the most effective summary components for increasing policymakers' use of the evidence. We present an overview of the available evidence on systematic review derivative products. We included studies of policymakers at all levels as well as health system managers. We included studies examining any type of "evidence summary," "policy brief," or other products derived from systematic reviews that presented evidence in a summarized form. The primary outcomes were the (1) use of systematic review summaries in decision-making (e.g., self-reported use of the evidence in policymaking and decision-making) and (2) policymakers' understanding, knowledge, and/or beliefs (e.g., changes in knowledge scores about the topic included in the summary). We also assessed perceived relevance, credibility, usefulness, understandability, and desirability (e.g., format) of the summaries. Our database search combined with our gray literature search yielded 10,113 references after removal of duplicates. From these, 54 were reviewed in full text, and we included six studies (reported in seven papers) as well as protocols from two ongoing studies. Two studies assessed the use of evidence summaries in decision-making and found little to no difference in effect. There was also little to no difference in effect for knowledge, understanding or beliefs (four studies), and perceived usefulness or usability (three studies). Summary of findings tables and graded entry summaries were perceived as slightly easier to understand compared to complete systematic reviews. Two studies assessed formatting changes and found that for summary of findings tables, certain elements, such as reporting study event rates and absolute differences, were preferred as well as avoiding the use of footnotes. Evidence summaries are likely easier to understand than complete systematic reviews. However, their ability to increase the use of systematic review evidence in policymaking is unclear. The protocol was published in the journal Systematic Reviews (2015;4:122).
ParallABEL: an R library for generalized parallelization of genome-wide association studies
2010-01-01
Background Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous acquiring the necessary programming skills to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Results Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC) includes 2,062 individuals with 545,080, SNPs' genotyping, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Conclusions Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance, and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL. PMID:20429914
Federal Register 2010, 2011, 2012, 2013, 2014
2011-01-27
... Operating Statistics for Large Certificated Air Carriers AGENCY: Research & Innovative Technology Administration (RITA), Bureau of Transportation Statistics (BTS), DOT. ACTION: Notice. SUMMARY: In compliance with the Paperwork Reduction Act of 1995, Public Law 104-13, the Bureau of Transportation Statistics...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-29
... for OMB Review; Comment Request; Local Area Unemployment Statistics Program ACTION: Notice. SUMMARY... collection request (ICR) titled, ``Local Area Unemployment Statistics Program,'' to the Office of Management... of Collection: Local Area Unemployment Statistics Program. OMB Control Number: 1220-0017. Affected...
Summary of National Transportation Statistics (1973)
DOT National Transportation Integrated Search
1975-01-01
This report is a compendium of selected national-level transportation statistics. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following moes: air carrier, genral aviation, automobile, bus, t...
Summary of National Transportation Statistics (1974)
DOT National Transportation Integrated Search
1976-06-01
This report is a compendium of selected national-level transportation statistics. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: air carrier, general aviation, automobile, bus,...
Summary of National Transportation Statistics (1972)
DOT National Transportation Integrated Search
1974-06-01
This report is a compendium of selected national-level transportation statistics. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: air carrier, general aviation, automobile, bus,...
Peer Review Documents Related to the Evaluation of ...
BMDS is one of the Agency's premier tools for estimating risk assessments, therefore the validity and reliability of its statistical models are of paramount importance. This page provides links to peer review and expert summaries of the BMDS application and its models as they were developed and eventually released documenting the rigorous review process taken to provide the best science tools available for statistical modeling. This page provides links to peer reviews and expert summaries of the BMDS applications and its models as they were developed and eventually released.
Dyess AFB, Texas. Revised Uniform Summary of Surface Weather Observations (RUSSWO). Parts A-F.
1988-01-01
Observations (RUSSWO); Dyess AFB TX; Texas; Abilene TX; Army Airfield Abilene TX; USTX722665. 19 Abstract: A six-part statistical data summary of...ELAT. AND S TANDARD Di-V I AtIONjS PEEESNTCVIS [’j ,T INCLUDE INCOMPLETE MONTHS. FOUR OR MORE MONTHS ARE NEEDED TO ADMILTE THE SE STATISTIC S AND...TA L NLMMYt (,F OPSIRW8IONS: 93" 6LOfAL CLPUATOLOGV FRANC " PERCENTAGE FPEiUtICY OF OCCURRENCE OF SURFACE WIND DIRECTION VERSUS WIND SPEED LiSAF7 I
Federal Register 2010, 2011, 2012, 2013, 2014
2010-06-21
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-10-25
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-03-19
... Transportation Statistics [Docket: RITA 2008-0002 BTS Paperwork Reduction Notice] Agency Information Collection... of Transportation Statistics (BTS), DOT. ACTION: Notice. SUMMARY: In compliance with the Paperwork Reduction Act of 1995, Public Law 104-13, the Bureau of Transportation Statistics invites the general public...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-07-27
... for OMB Review; Comment Request; Report on Current Employment Statistics ACTION: Notice. SUMMARY: The Department of Labor (DOL) is submitting the revised Bureau of Labor Statistics (BLS) sponsored information collection request (ICR) titled, ``Report on Current Employment Statistics,'' to the Office of Management and...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-06-20
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-02-14
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995 this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-02-14
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent to Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-10-18
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent to Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-02-06
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-10-30
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-02-14
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics Service...
78 FR 26611 - Notice of Intent To Seek Approval To Conduct an Information Collection
Federal Register 2010, 2011, 2012, 2013, 2014
2013-05-07
... Statistics Service Notice of Intent To Seek Approval To Conduct an Information Collection AGENCY: National Agricultural Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention of the National Agricultural Statistics...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-16
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Request... Statistics Service, USDA. ACTION: Notice and request for comments. SUMMARY: In accordance with the Paperwork Reduction Act of 1995, this notice announces the intention the National Agricultural Statistics Service...
Yelland, LN; Gajewski, BJ; Colombo, J; Gibson, RA; Makrides, M; Carlson, SE
2016-01-01
SUMMARY The DHA to Optimize Mother Infant Outcome (DOMInO) and Kansas DHA Outcomes Study (KUDOS) were randomized controlled trials that supplemented mothers with 800 and 600 mg DHA/day, respectively, or a placebo during pregnancy. DOMInO was conducted in Australia and KUDOS in the United States. Both trials found an unanticipated and statistically significant reduction in early preterm birth (ePTB; i.e., birth before 34 weeks gestation). However, in each trial, the number of ePTBs were small. We used a novel Bayesian approach and an arbitrary sample of 120,000 pregnancies to estimate statistically derived low, moderate or high risk for ePTB, and to test for differences between the DHA and placebo groups. In both trials, the model predicted DHA would significantly reduce the expected proportion of deliveries in the high risk group under the trial conditions of the parent studies. From these proportions we estimated the number of ePTB that could be prevented. PMID:27637340
The causal meaning of Fisher’s average effect
LEE, JAMES J.; CHOW, CARSON C.
2013-01-01
Summary In order to formulate the Fundamental Theorem of Natural Selection, Fisher defined the average excess and average effect of a gene substitution. Finding these notions to be somewhat opaque, some authors have recommended reformulating Fisher’s ideas in terms of covariance and regression, which are classical concepts of statistics. We argue that Fisher intended his two averages to express a distinction between correlation and causation. On this view, the average effect is a specific weighted average of the actual phenotypic changes that result from physically changing the allelic states of homologous genes. We show that the statistical and causal conceptions of the average effect, perceived as inconsistent by Falconer, can be reconciled if certain relationships between the genotype frequencies and non-additive residuals are conserved. There are certain theory-internal considerations favouring Fisher’s original formulation in terms of causality; for example, the frequency-weighted mean of the average effects equaling zero at each locus becomes a derivable consequence rather than an arbitrary constraint. More broadly, Fisher’s distinction between correlation and causation is of critical importance to gene-trait mapping studies and the foundations of evolutionary biology. PMID:23938113
Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti
2016-07-01
A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness.Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Code is available at https://github.com/aalto-ics-kepaco anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J.; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T.; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti
2016-01-01
Motivation: A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. Results: We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness. Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Availability and implementation: Code is available at https://github.com/aalto-ics-kepaco Contacts: anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153689
Statistical analysis of aerosol species, trace gasses, and meteorology in Chicago.
Binaku, Katrina; O'Brien, Timothy; Schmeling, Martina; Fosco, Tinamarie
2013-09-01
Both canonical correlation analysis (CCA) and principal component analysis (PCA) were applied to atmospheric aerosol and trace gas concentrations and meteorological data collected in Chicago during the summer months of 2002, 2003, and 2004. Concentrations of ammonium, calcium, nitrate, sulfate, and oxalate particulate matter, as well as, meteorological parameters temperature, wind speed, wind direction, and humidity were subjected to CCA and PCA. Ozone and nitrogen oxide mixing ratios were also included in the data set. The purpose of statistical analysis was to determine the extent of existing linear relationship(s), or lack thereof, between meteorological parameters and pollutant concentrations in addition to reducing dimensionality of the original data to determine sources of pollutants. In CCA, the first three canonical variate pairs derived were statistically significant at the 0.05 level. Canonical correlation between the first canonical variate pair was 0.821, while correlations of the second and third canonical variate pairs were 0.562 and 0.461, respectively. The first canonical variate pair indicated that increasing temperatures resulted in high ozone mixing ratios, while the second canonical variate pair showed wind speed and humidity's influence on local ammonium concentrations. No new information was uncovered in the third variate pair. Canonical loadings were also interpreted for information regarding relationships between data sets. Four principal components (PCs), expressing 77.0 % of original data variance, were derived in PCA. Interpretation of PCs suggested significant production and/or transport of secondary aerosols in the region (PC1). Furthermore, photochemical production of ozone and wind speed's influence on pollutants were expressed (PC2) along with overall measure of local meteorology (PC3). In summary, CCA and PCA results combined were successful in uncovering linear relationships between meteorology and air pollutants in Chicago and aided in determining possible pollutant sources.
Petkovic, Jennifer; Welch, Vivian; Tugwell, Peter
2015-09-28
Systematic reviews are important for decision-makers. They offer many potential benefits but are often written in technical language, are too long, and do not contain contextual details which makes them hard to use for decision-making. There are many organizations that develop and disseminate derivative products, such as evidence summaries, from systematic reviews for different populations or subsets of decision-makers. This systematic review will assess the effectiveness of systematic review summaries on increasing policymakers' use of systematic review evidence and to identify the components or features of these summaries that are most effective. We will include studies of policy-makers at all levels as well as health-system managers. We will include studies examining any type of "evidence summary," "policy brief," or other products derived from systematic reviews that present evidence in a summarized form. The primary outcomes are the following: (1) use of systematic review summaries decision-making (e.g., self-reported use of the evidence in policy-making, decision-making) and (2) policy-maker understanding, knowledge, and/or beliefs (e.g., changes in knowledge scores about the topic included in the summary). We will conduct a systematic review of randomized controlled trials (RCTs), non-randomized controlled trials (NRCTs), controlled before-after studies (CBA), and interrupted time series (ITS) studies. The results of this review will inform the development of future systematic review summaries to ensure that systematic review evidence is accessible to and used by policy-makers making health-related decisions.
National Transportation Statistics (Annual Report, 1975)
DOT National Transportation Integrated Search
1977-01-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ai...
National Transportation Statistics (Annual Report, 1976)
DOT National Transportation Integrated Search
1978-01-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ai...
National Transportation Statistics (Annual Report, 1980)
DOT National Transportation Integrated Search
1980-01-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ai...
National Transportation Statistics (Annual Report, 1983)
DOT National Transportation Integrated Search
1983-01-01
This report is a summary of selected national transportation statistics from a wide variety of government and private resources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ...
National Transportation Statistics (Annual Report, 1981)
DOT National Transportation Integrated Search
1981-01-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ai...
National Transportation Statistics (Annual Report, 1982)
DOT National Transportation Integrated Search
1982-11-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ai...
National Transportation Statistics (Annual Report, 1984)
DOT National Transportation Integrated Search
1984-08-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Included are cost, inventory, and performance data describing the passenger and cargo operations of the following modes: ai...
University of Alaska 1984 Statistical Summary.
ERIC Educational Resources Information Center
Spargo, Frank R.; Gaylord, Thomas A.
Designed to inform decisions about the University of Alaska's (UA's) budget, direction, scope, and academic thrusts, this report provides statewide, unit, and campus data for the two- and four-year colleges in the university system. First, a systemwide summary offers information on finances, enrollments, student loan program participation,…
Review of literature and practices for incident management programs : technical report.
DOT National Transportation Integrated Search
2016-06-01
The project team examined project evaluations, best practice summaries, and synthesis documents, and derived a summary of key elements of programs to speed the time to find and clear stalled vehicles and crashes from freeway shoulders and main lanes....
40 CFR Appendix IV to Part 264 - Cochran's Approximation to the Behrens-Fisher Students' t-test
Code of Federal Regulations, 2011 CFR
2011-07-01
... summary measures to calculate a t-statistic (t*) and a comparison t-statistic (tc). The t* value is compared to the tc value and a conclusion reached as to whether there has been a statistically significant... made in collecting the background data. The t-statistic (tc), against which t* will be compared...
40 CFR Appendix IV to Part 264 - Cochran's Approximation to the Behrens-Fisher Students' t-test
Code of Federal Regulations, 2010 CFR
2010-07-01
... summary measures to calculate a t-statistic (t*) and a comparison t-statistic (tc). The t* value is compared to the tc value and a conclusion reached as to whether there has been a statistically significant... made in collecting the background data. The t-statistic (tc), against which t* will be compared...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-02
... Statistical Area 630 in the Gulf of Alaska AGENCY: National Marine Fisheries Service (NMFS), National Oceanic.... SUMMARY: NMFS is opening directed fishing for pollock in Statistical Area 630 of the Gulf of Alaska (GOA... catch (TAC) of pollock in Statistical Area 630 of the GOA. DATES: Effective 1200 hrs, Alaska local time...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-02-28
... Statistical Area 610 in the Gulf of Alaska AGENCY: National Marine Fisheries Service (NMFS), National Oceanic.... SUMMARY: NMFS is opening directed fishing for pollock in Statistical Area 610 of the Gulf of Alaska (GOA... pollock in Statistical Area 610 of the GOA. DATES: Effective 1200 hrs, Alaska local time (A.l.t...
NASA Astrophysics Data System (ADS)
Hincks, Ian; Granade, Christopher; Cory, David G.
2018-01-01
The analysis of photon count data from the standard nitrogen vacancy (NV) measurement process is treated as a statistical inference problem. This has applications toward gaining better and more rigorous error bars for tasks such as parameter estimation (e.g. magnetometry), tomography, and randomized benchmarking. We start by providing a summary of the standard phenomenological model of the NV optical process in terms of Lindblad jump operators. This model is used to derive random variables describing emitted photons during measurement, to which finite visibility, dark counts, and imperfect state preparation are added. NV spin-state measurement is then stated as an abstract statistical inference problem consisting of an underlying biased coin obstructed by three Poisson rates. Relevant frequentist and Bayesian estimators are provided, discussed, and quantitatively compared. We show numerically that the risk of the maximum likelihood estimator is well approximated by the Cramér-Rao bound, for which we provide a simple formula. Of the estimators, we in particular promote the Bayes estimator, owing to its slightly better risk performance, and straightforward error propagation into more complex experiments. This is illustrated on experimental data, where quantum Hamiltonian learning is performed and cross-validated in a fully Bayesian setting, and compared to a more traditional weighted least squares fit.
NASA Technical Reports Server (NTRS)
Campbell, John P; Mckinney, Marion O
1952-01-01
A summary of methods for making dynamic lateral stability and response calculations and for estimating the aerodynamic stability derivatives required for use in these calculations is presented. The processes of performing calculations of the time histories of lateral motions, of the period and damping of these motions, and of the lateral stability boundaries are presented as a series of simple straightforward steps. Existing methods for estimating the stability derivatives are summarized and, in some cases, simple new empirical formulas are presented. Detailed estimation methods are presented for low-subsonic-speed conditions but only a brief discussion and a list of references are given for transonic and supersonic speed conditions.
The perceptual processing capacity of summary statistics between and within feature dimensions
Attarha, Mouna; Moore, Cathleen M.
2015-01-01
The simultaneous–sequential method was used to test the processing capacity of statistical summary representations both within and between feature dimensions. Sixteen gratings varied with respect to their size and orientation. In Experiment 1, the gratings were equally divided into four separate smaller sets, one of which with a mean size that was larger or smaller than the other three sets, and one of which with a mean orientation that was tilted more leftward or rightward. The task was to report the mean size and orientation of the oddball sets. This therefore required four summary representations for size and another four for orientation. The sets were presented at the same time in the simultaneous condition or across two temporal frames in the sequential condition. Experiment 1 showed evidence of a sequential advantage, suggesting that the system may be limited with respect to establishing multiple within-feature summaries. Experiment 2 eliminates the possibility that some aspect of the task, other than averaging, was contributing to this observed limitation. In Experiment 3, the same 16 gratings appeared as one large superset, and therefore the task only required one summary representation for size and another one for orientation. Equal simultaneous–sequential performance indicated that between-feature summaries are capacity free. These findings challenge the view that within-feature summaries drive a global sense of visual continuity across areas of the peripheral visual field, and suggest a shift in focus to seeking an understanding of how between-feature summaries in one area of the environment control behavior. PMID:26360153
Angelini, Sabrina; Bermejo, Justo Lorenzo; Ravegnini, Gloria; Sammarini, Giulia; Hrelia, Patrizia
The lymphocyte cytokinesis-block micronucleus (CBMN) assay is applied in many different in vivo biomonitoring studies of human exposure to genotoxic chemicals. Among extensively chemicals investigated, we identified petroleum and its derivatives, in particular benzene and the most common mixture of benzene, toluene, and xylene. Although conflicting results have been reported on the effects of benzene exposure, the number of positive findings in independent studies suggests that occupational exposure to benzene causes DNA damage in peripheral blood lymphocytes. To assess current evidence on this hypothesis, we conducted a meta-analysis. Our aim was to evaluate the effect of benzene exposure on genetic damage, quantified using the CBMN assay on individuals occupationally exposed to petroleum and its derivatives. Statistical analyses were conducted using the rmeta package from the free Software Environment for Statistical Computing R. Combined study results indicated that benzene exposure is associated with an increased level of genetic damage in peripheral blood lymphocytes, as reflected by an increased MN frequency. The summary mean difference in MN frequency between exposed and unexposed individuals was 1.64 (95% CI: 0.80-2.47). Overall, this finding points to MN frequency as a sensitive biomarker which could be used to evaluate genetic damage induced by occupational - industrial or environmental - exposure to benzene. This review also identified some important knowledge gaps as well as the need of large, well-designed studies. In particular, it is fundamental to accurately characterize the investigated population, including dietary habits and genetic variability which could modulate MN frequency in both exposed individuals and unexposed controls. In conclusion, according to present findings the use of the CBMN assay in biomonitoring studies could provide objective evidence to guide prioritization of preventive interventions in subjects occupationally exposed to petroleum derivatives, and in particular benzene. Copyright © 2016 Elsevier B.V. All rights reserved.
National Transportation Statistics (Annual Report, 1985)
DOT National Transportation Integrated Search
1985-06-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Featured in the report are cost, inventory, and performance data describing the passenger and cargo operations of the follo...
National Transportation Statistics (Annual Report, 1986)
DOT National Transportation Integrated Search
1986-07-01
This report is a summary of selected national transportation statistics from a wide variety of government and private sources. Featured in the report are cost, inventory, and performance data describing the passenger and cargo operations of the follo...
The Georgia Health Education Study: A Summary Report.
ERIC Educational Resources Information Center
Georgia Univ., Athens. Dept. of Health and Safety.
This summary review of the Georgia Health Education Study is a statistical presentation of scores achieved by over four thousand freshman college students in the university system of Georgia to questions on health knowledge. Data compiled from the administration of the Fast-Tyson Health Knowledge Test (1975 revision) indicates that subject…
Distinct encoding of risk and value in economic choice between multiple risky options☆
Wright, Nicholas D.; Symmonds, Mkael; Dolan, Raymond J.
2013-01-01
Neural encoding of value-based stimuli is suggested to involve representations of summary statistics, including risk and expected value (EV). A more complex, but ecologically more common, context is when multiple risky options are evaluated together. However, it is unknown whether encoding related to option evaluation in these situations involves similar principles. Here we employed fMRI during a task that parametrically manipulated EV and risk in two simultaneously presented lotteries, both of which contained either gains or losses. We found representations of EV in medial prefrontal cortex and anterior insula, an encoding that was dependent on which option was chosen (i.e. chosen and unchosen EV) and whether the choice was over gains or losses. Parietal activity reflected whether the riskier or surer option was selected, whilst activity in a network of regions that also included parietal cortex reflected both combined risk and difference in risk for the two options. Our findings provide support for the idea that summary statistics underpin a representation of value-based stimuli, and further that these summary statistics undergo distinct forms of encoding. PMID:23684860
Zelt, R.B.; Jordan, P.R.
1993-01-01
Among the first activities undertaken in each National Water-Quality Assessment (NAWQA) program study-unit investigation are compilation, screening, and statistical summary of available data concerning recent, general water-quality conditions in the study unit. This report (1) identifies which of the existing water-quality data are suitable for characterizing general conditions in a nationally consistent manner and (2) describes, to the extent possible, recent, general water-quality conditions in the Central Nebraska Basins. The study unit con- sists of the area drained by the Platte River between the confluence of the North Platte and South Platte Rivers near North Platte downstream to its confluence with the Missouri River south of Omaha. The report includes (1) a description of the sources and characteristics of water-quality data that are available, (2) a description of the approach used for screening data to identify a subset of the data suitable for summary and comparisons, (3) a presen- tation of the results of statistical and graphical summaries of recent, general water-quality con- ditions, and (4) comparisons of recent, general water-quality conditions to established national water-quality criteria, where applicable. Stream- and lake-water data are summarized for selected sampling sites, and data are summarized by major subunits of the study unit (the Sandhills, Loess Hills, Glaciated Area, and Platte Valley subunits) for streambed-sediment, fish-tissue, aquatic- ecological, and ground-water data. The summaries focus on the central tendencies and typical variation in the data and use nonparametric statistics such as frequencies and percentile values.
Benner, Christian; Havulinna, Aki S; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ripatti, Samuli; Pirinen, Matti
2017-10-05
During the past few years, various novel statistical methods have been developed for fine-mapping with the use of summary statistics from genome-wide association studies (GWASs). Although these approaches require information about the linkage disequilibrium (LD) between variants, there has not been a comprehensive evaluation of how estimation of the LD structure from reference genotype panels performs in comparison with that from the original individual-level GWAS data. Using population genotype data from Finland and the UK Biobank, we show here that a reference panel of 1,000 individuals from the target population is adequate for a GWAS cohort of up to 10,000 individuals, whereas smaller panels, such as those from the 1000 Genomes Project, should be avoided. We also show, both theoretically and empirically, that the size of the reference panel needs to scale with the GWAS sample size; this has important consequences for the application of these methods in ongoing GWAS meta-analyses and large biobank studies. We conclude by providing software tools and by recommending practices for sharing LD information to more efficiently exploit summary statistics in genetics research. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Statistical Literacy: Data Tell a Story
ERIC Educational Resources Information Center
Sole, Marla A.
2016-01-01
Every day, students collect, organize, and analyze data to make decisions. In this data-driven world, people need to assess how much trust they can place in summary statistics. The results of every survey and the safety of every drug that undergoes a clinical trial depend on the correct application of appropriate statistics. Recognizing the…
A statistical approach to instrument calibration
Robert R. Ziemer; David Strauss
1978-01-01
Summary - It has been found that two instruments will yield different numerical values when used to measure identical points. A statistical approach is presented that can be used to approximate the error associated with the calibration of instruments. Included are standard statistical tests that can be used to determine if a number of successive calibrations of the...
Miami-Dade County Public Schools Statistical Abstract 2006-2007
ERIC Educational Resources Information Center
Research Services, Miami-Dade County Public Schools, 2007
2007-01-01
The purpose of this document is to present, in summary fashion, statistical information on the status of public education in Miami-Dade County. Information is provided in the areas of organization, educational programs and services, achievement, and other outcomes of schooling. Also included are multi-year statistics on student population,…
Miami-Dade County Public Schools Statistical Abstract 2005-2006
ERIC Educational Resources Information Center
Research Services, Miami-Dade County Public Schools, 2006
2006-01-01
The purpose of this document is to present, in summary fashion, statistical information on the status of public education in Miami-Dade County. Information is provided in the areas of organization, educational programs and services, achievement, and other outcomes of schooling. Also included are multi-year statistics on student population,…
Miami-Dade County Public Schools Statistical Abstract 2004-2005
ERIC Educational Resources Information Center
Research Services, Miami-Dade County Public Schools, 2005
2005-01-01
The purpose of this document is to present, in summary fashion, statistical information on the status of public education in Miami-Dade County. Information is provided in the areas of organization, educational programs and services, achievement, and other outcomes of schooling. Also included are multi-year statistics on student population,…
Miami-Dade County Public Schools Statistical Abstract 2007-2008
ERIC Educational Resources Information Center
Research Services, Miami-Dade County Public Schools, 2008
2008-01-01
The purpose of this document is to present, in summary fashion, statistical information on the status of public education in Miami-Dade County. Information is provided in the areas of organization, educational programs and services, achievement, and other outcomes of schooling. Also included are multi-year statistics on student population,…
Using R in Introductory Statistics Courses with the pmg Graphical User Interface
ERIC Educational Resources Information Center
Verzani, John
2008-01-01
The pmg add-on package for the open source statistics software R is described. This package provides a simple to use graphical user interface (GUI) that allows introductory statistics students, without advanced computing skills, to quickly create the graphical and numeric summaries expected of them. (Contains 9 figures.)
Improving Statistics Education through Simulations: The Case of the Sampling Distribution.
ERIC Educational Resources Information Center
Earley, Mark A.
This paper presents a summary of action research investigating statistics students' understandings of the sampling distribution of the mean. With four sections of an introductory Statistics in Education course (n=98 students), a computer simulation activity (R. delMas, J. Garfield, and B. Chance, 1999) was implemented and evaluated to show…
Omnibus Risk Assessment via Accelerated Failure Time Kernel Machine Modeling
Sinnott, Jennifer A.; Cai, Tianxi
2013-01-01
Summary Integrating genomic information with traditional clinical risk factors to improve the prediction of disease outcomes could profoundly change the practice of medicine. However, the large number of potential markers and possible complexity of the relationship between markers and disease make it difficult to construct accurate risk prediction models. Standard approaches for identifying important markers often rely on marginal associations or linearity assumptions and may not capture non-linear or interactive effects. In recent years, much work has been done to group genes into pathways and networks. Integrating such biological knowledge into statistical learning could potentially improve model interpretability and reliability. One effective approach is to employ a kernel machine (KM) framework, which can capture nonlinear effects if nonlinear kernels are used (Scholkopf and Smola, 2002; Liu et al., 2007, 2008). For survival outcomes, KM regression modeling and testing procedures have been derived under a proportional hazards (PH) assumption (Li and Luan, 2003; Cai et al., 2011). In this paper, we derive testing and prediction methods for KM regression under the accelerated failure time model, a useful alternative to the PH model. We approximate the null distribution of our test statistic using resampling procedures. When multiple kernels are of potential interest, it may be unclear in advance which kernel to use for testing and estimation. We propose a robust Omnibus Test that combines information across kernels, and an approach for selecting the best kernel for estimation. The methods are illustrated with an application in breast cancer. PMID:24328713
Federal Register 2010, 2011, 2012, 2013, 2014
2012-03-23
... Statistical Area 630 in the Gulf of Alaska AGENCY: National Marine Fisheries Service (NMFS), National Oceanic.... SUMMARY: NMFS is opening directed fishing for pollock in Statistical Area 630 of the Gulf of Alaska (GOA... catch of pollock in Statistical Area 630 of the GOA. DATES: Effective 1200 hrs, Alaska local time (A.l.t...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-03-25
... Statistical Area 630 in the Gulf of Alaska AGENCY: National Marine Fisheries Service (NMFS), National Oceanic.... SUMMARY: NMFS is opening directed fishing for pollock in Statistical Area 630 of the Gulf of Alaska (GOA... Statistical Area 630 of the GOA. DATES: Effective 1200 hrs, Alaska local time (A.l.t.), March 22, 2013...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-10
... Statistical Area 630 in the Gulf of Alaska AGENCY: National Marine Fisheries Service (NMFS), National Oceanic.... SUMMARY: NMFS is opening directed fishing for pollock in Statistical Area 630 of the Gulf of Alaska (GOA... pollock in Statistical Area 630 of the GOA. DATES: Effective 1200 hrs, Alaska local time (A.l.t.), March 7...
Energy Statistics : A Supplement to the Summary of National Transportation Statistics
DOT National Transportation Integrated Search
1973-09-01
This annual report is a compendium of selected time-series data describing the transportation, production, processing, and consumption of energy. The report is divided into three main sections. The first, entitled Energy Transport, contains such item...
Statistical and operational summaries
NASA Technical Reports Server (NTRS)
Disalvo, J.
1972-01-01
Statistical progress indicator forms are presented on the financial management of the research allocations. Promotional activities, conference participants, and services are tabulated. The staffing and activity levels are also discussed, as well as the fee schedule revision and the standard interest profile offerings.
PERFORMANCE OF TRICKLING FILTER PLANTS: RELIABILITY, STABILITY, VARIABILITY
Effluent quality variability from trickling filters was examined in this study by statistically analyzing daily effluent BOD5 and suspended solids data from 11 treatment plants. Summary statistics (mean, standard deviation, etc.) were examined to determine the general characteris...
Energy Statistics : A Supplement to the Summary of Transportation Statistics
DOT National Transportation Integrated Search
1974-08-01
This annual report is a compendium of selected time-series data describing the transportation, production, processing, and consumption of energy. The report is divided into three main sections. The first, entitled Energy Transport, contains such item...
Giesinger, Johannes M; Kieffer, Jacobien M; Fayers, Peter M; Groenvold, Mogens; Petersen, Morten Aa; Scott, Neil W; Sprangers, Mirjam A G; Velikova, Galina; Aaronson, Neil K
2016-01-01
To further evaluate the higher order measurement structure of the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Questionnaire Core 30 (QLQ-C30), with the aim of generating a summary score. Using pretreatment QLQ-C30 data (N = 3,282), we conducted confirmatory factor analyses to test seven previously evaluated higher order models. We compared the summary score(s) derived from the best performing higher order model with the original QLQ-C30 scale scores, using tumor stage, performance status, and change over time (N = 244) as grouping variables. Although all models showed acceptable fit, we continued in the interest of parsimony with known-groups validity and responsiveness analyses using a summary score derived from the single higher order factor model. The validity and responsiveness of this QLQ-C30 summary score was equal to, and in many cases superior to the original, underlying QLQ-C30 scale scores. Our results provide empirical support for a measurement model for the QLQ-C30 yielding a single summary score. The availability of this summary score can avoid problems with potential type I errors that arise because of multiple testing when making comparisons based on the 15 outcomes generated by this questionnaire and may reduce sample size requirements for health-related quality of life studies using the QLQ-C30 questionnaire when an overall summary score is a relevant primary outcome. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
A summary of transition probabilities for atomic absorption lines formed in low-density clouds
NASA Technical Reports Server (NTRS)
Morton, D. C.; Smith, W. H.
1973-01-01
A table of wavelengths, statistical weights, and excitation energies is given for 944 atomic spectral lines in 221 multiplets whose lower energy levels lie below 0.275 eV. Oscillator strengths were adopted for 635 lines in 155 multiplets from the available experimental and theoretical determinations. Radiation damping constants also were derived for most of these lines. This table contains the lines most likely to be observed in absorption in interstellar clouds, circumstellar shells, and the clouds in the direction of quasars where neither the particle density nor the radiation density is high enough to populate the higher levels. All ions of all elements from hydrogen to zinc are included which have resonance lines longward of 912 A, although a number of weaker lines of neutrals and first ions have been omitted.
Geodetic positioning using a global positioning system of satellites
NASA Technical Reports Server (NTRS)
Fell, P. J.
1980-01-01
Geodetic positioning using range, integrated Doppler, and interferometric observations from a constellation of twenty-four Global Positioning System satellites is analyzed. A summary of the proposals for geodetic positioning and baseline determination is given which includes a description of measurement techniques and comments on rank deficiency and error sources. An analysis of variance comparison of range, Doppler, and interferometric time delay to determine their relative geometric strength for baseline determination is included. An analytic examination to the effect of a priori constraints on positioning using simultaneous observations from two stations is presented. Dynamic point positioning and baseline determination using range and Doppler is examined in detail. Models for the error sources influencing dynamic positioning are developed. Included is a discussion of atomic clock stability, and range and Doppler observation error statistics based on random correlated atomic clock error are derived.
An Introductory Summary of Various Effect Size Choices.
ERIC Educational Resources Information Center
Cromwell, Susan
This paper provides a tutorial summary of some of the many effect size choices so that members of the Southwest Educational Research Association would be better able to follow the recommendations of the American Psychological Association (APA) publication manual, the APA Task Force on Statistical Inference, and the publication requirements of some…
Certificates Awarded by Oregon's Degree Granting Colleges and Universities, 1993-94.
ERIC Educational Resources Information Center
Oregon State Dept. of Education, Salem. Office of Educational Policy and Planning.
This document presents statistical data in summary form on the certificates awarded by institutions of higher education in Oregon. These data were obtained from a completions survey, part of the national Integrated Postsecondary Education Data System (IPEDS). Summary tables are arranged by institution and by program area, followed by tables…
Degrees Awarded by Oregon's Degree-Granting Colleges and Universities, 1993-94.
ERIC Educational Resources Information Center
Oregon State Dept. of Education, Salem. Office of Educational Policy and Planning.
This document presents statistical data in summary form on the associate, Bachelor's, Master's, doctoral, and first professional degrees awarded by institutions of higher education in Oregon. These data were obtained from a completions survey which is part of the national Integrated Postsecondary Education Data System (IPEDS). Summary tables are…
STATISTICAL SUMMARY: EMAP-ESTUARIES LOUISIANIAN PROVINCE - 1993
This statistical summmary of the ecological condition of the estuarine resources is based on the results of the 1993 Louisianian Province Demonstration Project. The population of estuarine resources with the Louisianian Province consists of all estuarine areas located along the c...
1999 Iowa crash facts : a summary of motor vehicle crash statistics on Iowa roadways
DOT National Transportation Integrated Search
1999-01-01
All information concerning Iowa traffic crashes was taken from report forms : provided by investigating officers and drivers involved in crashes. : All statistics are gathered and calculated by the Iowa Department of Transportations Office of Driv...
Inventory of Electric Utility Power Plants in the United States
2002-01-01
Final issue of this report. Provides detailed statistics on existing generating units operated by electric utilities as of December 31, 2000, and certain summary statistics about new generators planned for operation by electric utilities during the next 5 years.
75 FR 57440 - Performance Review Board Membership
Federal Register 2010, 2011, 2012, 2013, 2014
2010-09-21
... DEPARTMENT OF COMMERCE Performance Review Board Membership AGENCY: Economics and Statistics Administration, Commerce. ACTION: Notice. SUMMARY: Below is a listing of individuals who are eligible to serve on the Performance Review Board in accordance with the Economics and Statistics Administration's Senior...
76 FR 57712 - Performance Review Board Membership
Federal Register 2010, 2011, 2012, 2013, 2014
2011-09-16
... DEPARTMENT OF COMMERCE Performance Review Board Membership AGENCY: Economics and Statistics Administration, Commerce. ACTION: Notice. SUMMARY: Below is a listing of individuals who are eligible to serve on the Performance Review Board (PRB) in accordance with the Economics and Statistics Administration's...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-04-23
... DEPARTMENT OF COMMERCE National Oceanic and Atmospheric Administration Proposed Information Collection; Comment Request; Marine Recreational Fisheries Statistics Survey AGENCY: National Oceanic and Atmospheric Administration (NOAA), Commerce. ACTION: Notice. SUMMARY: The Department of Commerce, as part of...
The Higher Education System in Israel: Statistical Abstract and Analysis.
ERIC Educational Resources Information Center
Herskovic, Shlomo
This edition of a statistical abstract published every few years on the higher education system in Israel presents the most recent data available through 1990-91. The data were gathered through the cooperation of the Central Bureau of Statistics and institutions of higher education. Chapter 1 presents a summary of principal findings covering the…
2016-02-02
23 Descriptive Statistics for Enlisted Service Applicants and Accessions...33 Summary Statistics for Applicants and Accessions for Enlisted Service ..................................... 36 Applicants and...utilization among Soldiers screened using TAPAS. Section 2 of this report includes the descriptive statistics AMSARA compiles and publishes
2017-03-01
53 ix LIST OF TABLES Table 1. Descriptive Statistics for Control Variables by... Statistics for Control Variables by Gender (Random Subsample with Complete Survey) ............................................................30 Table...empirical analysis. Chapter IV describes the summary statistics and results. Finally, Chapter V offers concluding thoughts, study limitations, and
19 CFR 191.73 - Export summary procedure.
Code of Federal Regulations, 2014 CFR
2014-04-01
... drawback, as well as for drawback involving the substitution of finished petroleum derivatives (19 U.S.C. 1313(a), (b), (c), (j), or (p)). It is intended to improve administrative efficiency. (b) Format of... Chronological Summary of Exports to the appropriate documentary evidence of exportation (for example, Bill of...
19 CFR 191.73 - Export summary procedure.
Code of Federal Regulations, 2011 CFR
2011-04-01
... drawback, as well as for drawback involving the substitution of finished petroleum derivatives (19 U.S.C. 1313(a), (b), (c), (j), or (p)). It is intended to improve administrative efficiency. (b) Format of... Chronological Summary of Exports to the appropriate documentary evidence of exportation (for example, Bill of...
19 CFR 191.73 - Export summary procedure.
Code of Federal Regulations, 2013 CFR
2013-04-01
... drawback, as well as for drawback involving the substitution of finished petroleum derivatives (19 U.S.C. 1313(a), (b), (c), (j), or (p)). It is intended to improve administrative efficiency. (b) Format of... Chronological Summary of Exports to the appropriate documentary evidence of exportation (for example, Bill of...
19 CFR 191.73 - Export summary procedure.
Code of Federal Regulations, 2012 CFR
2012-04-01
... drawback, as well as for drawback involving the substitution of finished petroleum derivatives (19 U.S.C. 1313(a), (b), (c), (j), or (p)). It is intended to improve administrative efficiency. (b) Format of... Chronological Summary of Exports to the appropriate documentary evidence of exportation (for example, Bill of...
2011 statistical abstract of the United States
Krisanda, Joseph M.
2011-01-01
The Statistical Abstract of the United States, published since 1878, is the authoritative and comprehensive summary of statistics on the social, political, and economic organization of the United States.Use the Abstract as a convenient volume for statistical reference, and as a guide to sources of more information both in print and on the Web.Sources of data include the Census Bureau, Bureau of Labor Statistics, Bureau of Economic Analysis, and many other Federal agencies and private organizations.
Ewing Sarcoma Treatment (PDQ®)—Health Professional Version
Ewing sarcoma is derived from a primordial bone marrow–derived mesenchymal stem cell. Get comprehensive information about the presentation, genomics, diagnostic evaluation, prognosis, and treatment of newly diagnosed and recurrent Ewing sarcoma in this summary for clinicians.
Zheng, Jie; Rodriguez, Santiago; Laurin, Charles; Baird, Denis; Trela-Larsen, Lea; Erzurumluoglu, Mesut A; Zheng, Yi; White, Jon; Giambartolomei, Claudia; Zabaneh, Delilah; Morris, Richard; Kumari, Meena; Casas, Juan P; Hingorani, Aroon D; Evans, David M; Gaunt, Tom R; Day, Ian N M
2017-01-01
Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients ([Formula: see text]) of the variants. However, haplotypes rather than pairwise [Formula: see text], are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this article, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel. Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits height data, HAPRAP performs well with a small training sample size (N < 2000) while other methods become suboptimal. Moreover, HAPRAP's performance is not affected substantially by single nucleotide polymorphisms (SNPs) with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization). The HAPRAP package and documentation are available at http://apps.biocompute.org.uk/haprap/ CONTACT: : jie.zheng@bristol.ac.uk or tom.gaunt@bristol.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
78 FR 29162 - Submission for OMB Review; Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2013-05-17
... list and trade a new derivative securities product without submitting a proposed rule change pursuant... derivative securities products traded on the SROs, Rule 19b-4(e) requires an SRO to file a summary form, Form 19b- 4(e), to notify the Commission when the SRO begins trading a new derivative securities product...
Energy Statistics : A Supplement to the Summary of National Transportation Statistics
DOT National Transportation Integrated Search
1976-08-01
This report is a compendium of selected time-series data describing the transportation, production, processing, and consumption of energy. It contains such items as the revenues and expenses of oil pipeline companies, number and capacities of U.S. ta...
Energy Statistics : A Supplement to the Summary of National Transportation Statistics
DOT National Transportation Integrated Search
1975-08-01
This report is a compendium of selected time-series data describing the transportation, production, processing, and consumption of energy. It discusses such items as the revenues and expenses of oil pipeline companies, number and capacities of U.S. t...
Riley, Richard D; Ahmed, Ikhlaaq; Debray, Thomas P A; Willis, Brian H; Noordzij, J Pieter; Higgins, Julian P T; Deeks, Jonathan J
2015-06-15
Following a meta-analysis of test accuracy studies, the translation of summary results into clinical practice is potentially problematic. The sensitivity, specificity and positive (PPV) and negative (NPV) predictive values of a test may differ substantially from the average meta-analysis findings, because of heterogeneity. Clinicians thus need more guidance: given the meta-analysis, is a test likely to be useful in new populations, and if so, how should test results inform the probability of existing disease (for a diagnostic test) or future adverse outcome (for a prognostic test)? We propose ways to address this. Firstly, following a meta-analysis, we suggest deriving prediction intervals and probability statements about the potential accuracy of a test in a new population. Secondly, we suggest strategies on how clinicians should derive post-test probabilities (PPV and NPV) in a new population based on existing meta-analysis results and propose a cross-validation approach for examining and comparing their calibration performance. Application is made to two clinical examples. In the first example, the joint probability that both sensitivity and specificity will be >80% in a new population is just 0.19, because of a low sensitivity. However, the summary PPV of 0.97 is high and calibrates well in new populations, with a probability of 0.78 that the true PPV will be at least 0.95. In the second example, post-test probabilities calibrate better when tailored to the prevalence in the new population, with cross-validation revealing a probability of 0.97 that the observed NPV will be within 10% of the predicted NPV. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
Gene- and pathway-based association tests for multiple traits with GWAS summary statistics.
Kwak, Il-Youp; Pan, Wei
2017-01-01
To identify novel genetic variants associated with complex traits and to shed new insights on underlying biology, in addition to the most popular single SNP-single trait association analysis, it would be useful to explore multiple correlated (intermediate) traits at the gene- or pathway-level by mining existing single GWAS or meta-analyzed GWAS data. For this purpose, we present an adaptive gene-based test and a pathway-based test for association analysis of multiple traits with GWAS summary statistics. The proposed tests are adaptive at both the SNP- and trait-levels; that is, they account for possibly varying association patterns (e.g. signal sparsity levels) across SNPs and traits, thus maintaining high power across a wide range of situations. Furthermore, the proposed methods are general: they can be applied to mixed types of traits, and to Z-statistics or P-values as summary statistics obtained from either a single GWAS or a meta-analysis of multiple GWAS. Our numerical studies with simulated and real data demonstrated the promising performance of the proposed methods. The methods are implemented in R package aSPU, freely and publicly available at: https://cran.r-project.org/web/packages/aSPU/ CONTACT: weip@biostat.umn.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Petroleum supply monthly, June 1999, with data for April 1999
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
Data presented in the Petroleum Supply Monthly (PSM) describe the supply and disposition of petroleum products in the US and major US geographic regions. The data series describe production, imports and exports, inter-Petroleum Administration for Defense (PAD) District movements, and inventories by the primary suppliers of petroleum products in the US (50 States and the District of Columbia). The reporting universe includes those petroleum sectors in primary supply. Included are: petroleum refiners, motor gasoline blenders, operators of natural gas processing plants and fractionators, inter-PAD transporters, importers, and major inventory holders of petroleum products and crude oil. When aggregated, the datamore » reported by these sectors approximately represent the consumption of petroleum products in the US. Data presented in the PSM are divided into two sections: Summary Statistics and Detailed Statistics. The tables and figures in the Summary Statistics section of the PSM present a time series of selected petroleum data on a US level. The Detailed Statistics tables of the PSM present statistics for the most current month available as well as year-to-date. 16 figs., 66 tabs.« less
Petroleum supply monthly, February 1999, with data for December 1998
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
Data presented in the Petroleum Supply Monthly (PSM) describes the supply and disposition of petroleum products in the United States and major US geographic regions. The data series describe production, imports and exports, inter-Petroleum Administration for Defense (PAD) District movements, and inventories by the primary suppliers of petroleum products in the United States (50 States and the District of Columbia). The reporting universe includes those petroleum sectors in primary supply. Included are: petroleum refiners, motor gasoline blenders, operators of natural gas processing plants and fractionators, inter-PAD transporters, importers, and major inventory holders of petroleum products and crude oil. When aggregated,more » the data reported by these sectors approximately represent the consumption of petroleum products in the United States. Data presented in the PSM are divided into two sections: Summary Statistics and Detailed Statistics. The tables and figures in the Summary Statistics section of the PSM present a time series of selected petroleum data on a US level. The Detailed Statistics tables of the PSM present statistics for the most current month available as well as year-to-date. 16 figs., 66 tabs.« less
Streamflow characteristics of streams in southeastern Afghanistan
Vining, Kevin C.
2010-01-01
Statistical summaries of streamflow data for all historical streamgaging stations that have available data in the southeastern Afghanistan provinces of Ghazni, Khost, Logar, Paktya, and Wardak, and a portion of Kabul Province are presented in this report. The summaries for each streamgaging station include a station desciption, table of statistics of monthly and annual mean discharges, table of monthly and annual flow duration, table of probability of occurrence of annual high discharges, table of probability of occurrence of annual low discharges, table of annual peak discharge and corresponding gage height for the period of record, and table of monthly and annual mean discharges for the period of record.
Streamflow characteristics at streamgages in northern Afghanistan and selected locations
Olson, Scott A.; Williams-Sether, Tara
2010-01-01
Statistical summaries of streamflow data for 79 historical streamgages in Northern Afghanistan and other selected historical streamgages are presented in this report. The summaries for each streamgage include (1) station description, (2) graph of the annual mean discharge for the period of record, (3) statistics of monthly and annual mean discharges, (4) monthly and annual flow duration, (5) probability of occurrence of annual high discharges, (6) probability of occurrence of annual low discharges, (7) probability of occurrence of seasonal low discharges, (8) annual peak discharges for the period of record, and (9) monthly and annual mean discharges for the period of record.
Connectedness DEOCS 4.1 Construct Validity Summary
2017-08-01
Connectedness DEOCS 4.1 Construct Validity Summary DEFENSE EQUAL OPPORTUNITY MANAGEMENT INSTITUTE DIRECTORATE OF...appropriate statistical method to analyze these data. This EFA yielded a single factor solution. Refer to Table 6 for more information . Table 6...top management team perceptions of CEO charisma. Academy of Management Journal, 49(1), 161-174. Assessment to Solutions. (2016). Retrieved from https
Hierarchical models and bayesian analysis of bird survey information
John R. Sauer; William A. Link; J. Andrew Royle
2005-01-01
Summary of bird survey information is a critical component of conservation activities, but often our summaries rely on statistical methods that do not accommodate the limitations of the information. Prioritization of species requires ranking and analysis of species by magnitude of population trend, but often magnitude of trend is a misleading measure of actual decline...
The National Evaluation of School Nutrition Programs. Final Report - Executive Summary.
ERIC Educational Resources Information Center
Radzikowski, Jack
This is a summary of the final report of a study (begun in 1979) of the National School Lunch, School Breakfast, and Special Milk Programs. The major objectives of the evaluation were to (1) identify existing information on the school nutrition programs; (2) identify determinants of participation in the programs and develop statistical models for…
Doctorate Recipients from United States Universities. Summary Report, 1984.
ERIC Educational Resources Information Center
Coyle, Susan L.; Syverson, Peter D.
A statistical and narrative summary of the results of the 1983-1984 Survey of Earned Doctorates is presented. Basic information, such as sex, field, institution, and year of Ph.D., is presented for all of the 31,253 doctorate recipients; complete questionnaire data are included for the 29,713 Ph.D. recipients who responded to the questionnaire,…
2011 statistical abstract of the United States
Krisanda, Joseph M.
2011-01-01
The Statistical Abstract of the United States, published since 1878, is the authoritative and comprehensive summary of statistics on the social, political, and economic organization of the United States.
Use the Abstract as a convenient volume for statistical reference, and as a guide to sources of more information both in print and on the Web.
Sources of data include the Census Bureau, Bureau of Labor Statistics, Bureau of Economic Analysis, and many other Federal agencies and private organizations.
Statistics of Private High Schools and Academies, 1919-20. Bulletin, 1922, No. 9
ERIC Educational Resources Information Center
Bonner, H. R.
1922-01-01
The included tables present the statistics of 2,093 private high schools and academies in the continental United States and of 4 such schools in Hawaii and Puerto Rico. Throughout the summary tables the totals for the United States do not include the statistics of these 4 schools in the outlying possessions. No reports from private high schools…
Revised Perturbation Statistics for the Global Scale Atmospheric Model
NASA Technical Reports Server (NTRS)
Justus, C. G.; Woodrum, A.
1975-01-01
Magnitudes and scales of atmospheric perturbations about the monthly mean for the thermodynamic variables and wind components are presented by month at various latitudes. These perturbation statistics are a revision of the random perturbation data required for the global scale atmospheric model program and are from meteorological rocket network statistical summaries in the 22 to 65 km height range and NASA grenade and pitot tube data summaries in the region up to 90 km. The observed perturbations in the thermodynamic variables were adjusted to make them consistent with constraints required by the perfect gas law and the hydrostatic equation. Vertical scales were evaluated by Buell's depth of pressure system equation and from vertical structure function analysis. Tables of magnitudes and vertical scales are presented for each month at latitude 10, 30, 50, 70, and 90 degrees.
Combinatorial interpretation of Haldane-Wu fractional exclusion statistics.
Aringazin, A K; Mazhitov, M I
2002-08-01
Assuming that the maximal allowed number of identical particles in a state is an integer parameter, q, we derive the statistical weight and analyze the associated equation that defines the statistical distribution. The derived distribution covers Fermi-Dirac and Bose-Einstein ones in the particular cases q=1 and q--> infinity (n(i)/q-->1), respectively. We show that the derived statistical weight provides a natural combinatorial interpretation of Haldane-Wu fractional exclusion statistics, and present exact solutions of the distribution equation.
Crawford, John R; Garthwaite, Paul H; Denham, Annie K; Chelune, Gordon J
2012-12-01
Regression equations have many useful roles in psychological assessment. Moreover, there is a large reservoir of published data that could be used to build regression equations; these equations could then be employed to test a wide variety of hypotheses concerning the functioning of individual cases. This resource is currently underused because (a) not all psychologists are aware that regression equations can be built not only from raw data but also using only basic summary data for a sample, and (b) the computations involved are tedious and prone to error. In an attempt to overcome these barriers, Crawford and Garthwaite (2007) provided methods to build and apply simple linear regression models using summary statistics as data. In the present study, we extend this work to set out the steps required to build multiple regression models from sample summary statistics and the further steps required to compute the associated statistics for drawing inferences concerning an individual case. We also develop, describe, and make available a computer program that implements these methods. Although there are caveats associated with the use of the methods, these need to be balanced against pragmatic considerations and against the alternative of either entirely ignoring a pertinent data set or using it informally to provide a clinical "guesstimate." Upgraded versions of earlier programs for regression in the single case are also provided; these add the point and interval estimates of effect size developed in the present article.
Data Recovery from SCATHA Satellite
NASA Technical Reports Server (NTRS)
Fennell, J. F.; Boyd, G. M.; Redding, M. T.; McNab, M. C.
1997-01-01
This document gives a brief description of the SCATHA (P78-2) satellite and consolidates into one location information relevant to the generation of the SCATHA Summary Data parameters for the European Space Agency (ESA), under ESTEC Contract No. 11006/94/NL/CC, and the National Aeronautics and Space Administration (NASA), under Grant No. NAGW-414 1. Included are descriptions of the instruments from which the Summary Data parameters are generated, their derivation, and archival. Any questions pertaining to the Summary Data parameters should be directed to Dr. Joseph Fennell.
Nakagawa, Shinichi; Johnson, Paul C D; Schielzeth, Holger
2017-09-01
The coefficient of determination R 2 quantifies the proportion of variance explained by a statistical model and is an important summary statistic of biological interest. However, estimating R 2 for generalized linear mixed models (GLMMs) remains challenging. We have previously introduced a version of R 2 that we called [Formula: see text] for Poisson and binomial GLMMs, but not for other distributional families. Similarly, we earlier discussed how to estimate intra-class correlation coefficients (ICCs) using Poisson and binomial GLMMs. In this paper, we generalize our methods to all other non-Gaussian distributions, in particular to negative binomial and gamma distributions that are commonly used for modelling biological data. While expanding our approach, we highlight two useful concepts for biologists, Jensen's inequality and the delta method, both of which help us in understanding the properties of GLMMs. Jensen's inequality has important implications for biologically meaningful interpretation of GLMMs, whereas the delta method allows a general derivation of variance associated with non-Gaussian distributions. We also discuss some special considerations for binomial GLMMs with binary or proportion data. We illustrate the implementation of our extension by worked examples from the field of ecology and evolution in the R environment. However, our method can be used across disciplines and regardless of statistical environments. © 2017 The Author(s).
NASA Technical Reports Server (NTRS)
Wharton, S. W.
1980-01-01
An Interactive Cluster Analysis Procedure (ICAP) was developed to derive classifier training statistics from remotely sensed data. The algorithm interfaces the rapid numerical processing capacity of a computer with the human ability to integrate qualitative information. Control of the clustering process alternates between the algorithm, which creates new centroids and forms clusters and the analyst, who evaluate and elect to modify the cluster structure. Clusters can be deleted or lumped pairwise, or new centroids can be added. A summary of the cluster statistics can be requested to facilitate cluster manipulation. The ICAP was implemented in APL (A Programming Language), an interactive computer language. The flexibility of the algorithm was evaluated using data from different LANDSAT scenes to simulate two situations: one in which the analyst is assumed to have no prior knowledge about the data and wishes to have the clusters formed more or less automatically; and the other in which the analyst is assumed to have some knowledge about the data structure and wishes to use that information to closely supervise the clustering process. For comparison, an existing clustering method was also applied to the two data sets.
Evaluation of methods for managing censored results when calculating the geometric mean.
Mikkonen, Hannah G; Clarke, Bradley O; Dasika, Raghava; Wallis, Christian J; Reichman, Suzie M
2018-01-01
Currently, there are conflicting views on the best statistical methods for managing censored environmental data. The method commonly applied by environmental science researchers and professionals is to substitute half the limit of reporting for derivation of summary statistics. This approach has been criticised by some researchers, raising questions around the interpretation of historical scientific data. This study evaluated four complete soil datasets, at three levels of simulated censorship, to test the accuracy of a range of censored data management methods for calculation of the geometric mean. The methods assessed included removal of censored results, substitution of a fixed value (near zero, half the limit of reporting and the limit of reporting), substitution by nearest neighbour imputation, maximum likelihood estimation, regression on order substitution and Kaplan-Meier/survival analysis. This is the first time such a comprehensive range of censored data management methods have been applied to assess the accuracy of calculation of the geometric mean. The results of this study show that, for describing the geometric mean, the simple method of substitution of half the limit of reporting is comparable or more accurate than alternative censored data management methods, including nearest neighbour imputation methods. Copyright © 2017 Elsevier Ltd. All rights reserved.
SIG-VISA: Signal-based Vertically Integrated Seismic Monitoring
NASA Astrophysics Data System (ADS)
Moore, D.; Mayeda, K. M.; Myers, S. C.; Russell, S.
2013-12-01
Traditional seismic monitoring systems rely on discrete detections produced by station processing software; however, while such detections may constitute a useful summary of station activity, they discard large amounts of information present in the original recorded signal. We present SIG-VISA (Signal-based Vertically Integrated Seismic Analysis), a system for seismic monitoring through Bayesian inference on seismic signals. By directly modeling the recorded signal, our approach incorporates additional information unavailable to detection-based methods, enabling higher sensitivity and more accurate localization using techniques such as waveform matching. SIG-VISA's Bayesian forward model of seismic signal envelopes includes physically-derived models of travel times and source characteristics as well as Gaussian process (kriging) statistical models of signal properties that combine interpolation of historical data with extrapolation of learned physical trends. Applying Bayesian inference, we evaluate the model on earthquakes as well as the 2009 DPRK test event, demonstrating a waveform matching effect as part of the probabilistic inference, along with results on event localization and sensitivity. In particular, we demonstrate increased sensitivity from signal-based modeling, in which the SIGVISA signal model finds statistical evidence for arrivals even at stations for which the IMS station processing failed to register any detection.
Visualization of Spatio-Temporal Relations in Movement Event Using Multi-View
NASA Astrophysics Data System (ADS)
Zheng, K.; Gu, D.; Fang, F.; Wang, Y.; Liu, H.; Zhao, W.; Zhang, M.; Li, Q.
2017-09-01
Spatio-temporal relations among movement events extracted from temporally varying trajectory data can provide useful information about the evolution of individual or collective movers, as well as their interactions with their spatial and temporal contexts. However, the pure statistical tools commonly used by analysts pose many difficulties, due to the large number of attributes embedded in multi-scale and multi-semantic trajectory data. The need for models that operate at multiple scales to search for relations at different locations within time and space, as well as intuitively interpret what these relations mean, also presents challenges. Since analysts do not know where or when these relevant spatio-temporal relations might emerge, these models must compute statistical summaries of multiple attributes at different granularities. In this paper, we propose a multi-view approach to visualize the spatio-temporal relations among movement events. We describe a method for visualizing movement events and spatio-temporal relations that uses multiple displays. A visual interface is presented, and the user can interactively select or filter spatial and temporal extents to guide the knowledge discovery process. We also demonstrate how this approach can help analysts to derive and explain the spatio-temporal relations of movement events from taxi trajectory data.
Statistical energy analysis computer program, user's guide
NASA Technical Reports Server (NTRS)
Trudell, R. W.; Yano, L. I.
1981-01-01
A high frequency random vibration analysis, (statistical energy analysis (SEA) method) is examined. The SEA method accomplishes high frequency prediction of arbitrary structural configurations. A general SEA computer program is described. A summary of SEA theory, example problems of SEA program application, and complete program listing are presented.
Education Statistics Quarterly, Summer 2002.
ERIC Educational Resources Information Center
Dillow, Sally, Ed.
2002-01-01
This publication provides a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications, data products, and funding opportunities developed over a 3-month period. Each issue also contains a message…
Education Statistics Quarterly, Spring 2002.
ERIC Educational Resources Information Center
Dillow, Sally, Ed.
2002-01-01
This publication provides a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications, data products, and funding opportunities developed over a 3-month period. Each issue also contains a message…
Computational Approaches to Chemical Hazard Assessment
Luechtefeld, Thomas; Hartung, Thomas
2018-01-01
Summary Computational prediction of toxicity has reached new heights as a result of decades of growth in the magnitude and diversity of biological data. Public packages for statistics and machine learning make model creation faster. New theory in machine learning and cheminformatics enables integration of chemical structure, toxicogenomics, simulated and physical data in the prediction of chemical health hazards, and other toxicological information. Our earlier publications have characterized a toxicological dataset of unprecedented scale resulting from the European REACH legislation (Registration Evaluation Authorisation and Restriction of Chemicals). These publications dove into potential use cases for regulatory data and some models for exploiting this data. This article analyzes the options for the identification and categorization of chemicals, moves on to the derivation of descriptive features for chemicals, discusses different kinds of targets modeled in computational toxicology, and ends with a high-level perspective of the algorithms used to create computational toxicology models. PMID:29101769
Probability distribution of extreme share returns in Malaysia
NASA Astrophysics Data System (ADS)
Zin, Wan Zawiah Wan; Safari, Muhammad Aslam Mohd; Jaaman, Saiful Hafizah; Yie, Wendy Ling Shin
2014-09-01
The objective of this study is to investigate the suitable probability distribution to model the extreme share returns in Malaysia. To achieve this, weekly and monthly maximum daily share returns are derived from share prices data obtained from Bursa Malaysia over the period of 2000 to 2012. The study starts with summary statistics of the data which will provide a clue on the likely candidates for the best fitting distribution. Next, the suitability of six extreme value distributions, namely the Gumbel, Generalized Extreme Value (GEV), Generalized Logistic (GLO) and Generalized Pareto (GPA), the Lognormal (GNO) and the Pearson (PE3) distributions are evaluated. The method of L-moments is used in parameter estimation. Based on several goodness of fit tests and L-moment diagram test, the Generalized Pareto distribution and the Pearson distribution are found to be the best fitted distribution to represent the weekly and monthly maximum share returns in Malaysia stock market during the studied period, respectively.
Stream-temperature characteristics in Georgia
Dyar, T.R.; Alhadeff, S. Jack
1997-01-01
Stream-temperature measurements for 198 periodic and 22 daily record stations were analyzed using a harmonic curve-fitting procedure. Statistics of data from 78 selected stations were used to compute a statewide stream-temperature harmonic equation, derived using latitude, drainage area, and altitude for natural streams having drainage areas greater than about 40 square miles. Based on the 1955-84 reference period, the equation may be used to compute long-term natural harmonic stream-temperature coefficients to within an on average of about 0.4? C. Basin-by-basin summaries of observed long-term stream-temperature characteristics are included for selected stations and river reaches, particularly along Georgia's mainstem streams. Changes in the stream- temperature regimen caused by the effects of development, principally impoundments and thermal power plants, are shown by comparing harmonic curves and coefficients from the estimated natural values to the observed modified-condition values.
Automatic Summarization as a Combinatorial Optimization Problem
NASA Astrophysics Data System (ADS)
Hirao, Tsutomu; Suzuki, Jun; Isozaki, Hideki
We derived the oracle summary with the highest ROUGE score that can be achieved by integrating sentence extraction with sentence compression from the reference abstract. The analysis results of the oracle revealed that summarization systems have to assign an appropriate compression rate for each sentence in the document. In accordance with this observation, this paper proposes a summarization method as a combinatorial optimization: selecting the set of sentences that maximize the sum of the sentence scores from the pool which consists of the sentences with various compression rates, subject to length constrains. The score of the sentence is defined by its compression rate, content words and positional information. The parameters for the compression rates and positional information are optimized by minimizing the loss between score of oracles and that of candidates. The results obtained from TSC-2 corpus showed that our method outperformed the previous systems with statistical significance.
Two Paradoxes in Linear Regression Analysis
FENG, Ge; PENG, Jing; TU, Dongke; ZHENG, Julia Z.; FENG, Changyong
2016-01-01
Summary Regression is one of the favorite tools in applied statistics. However, misuse and misinterpretation of results from regression analysis are common in biomedical research. In this paper we use statistical theory and simulation studies to clarify some paradoxes around this popular statistical method. In particular, we show that a widely used model selection procedure employed in many publications in top medical journals is wrong. Formal procedures based on solid statistical theory should be used in model selection. PMID:28638214
Statistics for NAEG: past efforts, new results, and future plans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilbert, R.O.; Simpson, J.C.; Kinnison, R.R.
A brief review of Nevada Applied Ecology Group (NAEG) objectives is followed by a summary of past statistical analyses conducted by Pacific Northwest Laboratory for the NAEG. Estimates of spatial pattern of radionuclides and other statistical analyses at NS's 201, 219 and 221 are reviewed as background for new analyses presented in this paper. Suggested NAEG activities and statistical analyses needed for the projected termination date of NAEG studies in March 1986 are given.
Survey of statistical techniques used in validation studies of air pollution prediction models
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bornstein, R D; Anderson, S F
1979-03-01
Statistical techniques used by meteorologists to validate predictions made by air pollution models are surveyed. Techniques are divided into the following three groups: graphical, tabular, and summary statistics. Some of the practical problems associated with verification are also discussed. Characteristics desired in any validation program are listed and a suggested combination of techniques that possesses many of these characteristics is presented.
Compendium of Methods for Applying Measured Data to Vibration and Acoustic Problems
1985-10-01
statistical energy analysis , finite element models, transfer function...Procedures for the Modal Analysis Method .............................................. 8-22 8.4 Summary of the Procedures for the Statistical Energy Analysis Method... statistical energy analysis . 8-1 • o + . . i... "_+,A" L + "+..• •+A ’! i, + +.+ +• o.+ -ore -+. • -..- , .%..% ". • 2 -".-2- ;.-.’, . o . It is helpful
In vivo Comet assay--statistical analysis and power calculations of mice testicular cells.
Hansen, Merete Kjær; Sharma, Anoop Kumar; Dybdahl, Marianne; Boberg, Julie; Kulahci, Murat
2014-11-01
The in vivo Comet assay is a sensitive method for evaluating DNA damage. A recurrent concern is how to analyze the data appropriately and efficiently. A popular approach is to summarize the raw data into a summary statistic prior to the statistical analysis. However, consensus on which summary statistic to use has yet to be reached. Another important consideration concerns the assessment of proper sample sizes in the design of Comet assay studies. This study aims to identify a statistic suitably summarizing the % tail DNA of mice testicular samples in Comet assay studies. A second aim is to provide curves for this statistic outlining the number of animals and gels to use. The current study was based on 11 compounds administered via oral gavage in three doses to male mice: CAS no. 110-26-9, CAS no. 512-56-1, CAS no. 111873-33-7, CAS no. 79-94-7, CAS no. 115-96-8, CAS no. 598-55-0, CAS no. 636-97-5, CAS no. 85-28-9, CAS no. 13674-87-8, CAS no. 43100-38-5 and CAS no. 60965-26-6. Testicular cells were examined using the alkaline version of the Comet assay and the DNA damage was quantified as % tail DNA using a fully automatic scoring system. From the raw data 23 summary statistics were examined. A linear mixed-effects model was fitted to the summarized data and the estimated variance components were used to generate power curves as a function of sample size. The statistic that most appropriately summarized the within-sample distributions was the median of the log-transformed data, as it most consistently conformed to the assumptions of the statistical model. Power curves for 1.5-, 2-, and 2.5-fold changes of the highest dose group compared to the control group when 50 and 100 cells were scored per gel are provided to aid in the design of future Comet assay studies on testicular cells. Copyright © 2014 Elsevier B.V. All rights reserved.
Cloud Compute for Global Climate Station Summaries
NASA Astrophysics Data System (ADS)
Baldwin, R.; May, B.; Cogbill, P.
2017-12-01
Global Climate Station Summaries are simple indicators of observational normals which include climatic data summarizations and frequency distributions. These typically are statistical analyses of station data over 5-, 10-, 20-, 30-year or longer time periods. The summaries are computed from the global surface hourly dataset. This dataset totaling over 500 gigabytes is comprised of 40 different types of weather observations with 20,000 stations worldwide. NCEI and the U.S. Navy developed these value added products in the form of hourly summaries from many of these observations. Enabling this compute functionality in the cloud is the focus of the project. An overview of approach and challenges associated with application transition to the cloud will be presented.
Synthesizing Risk from Summary Evidence Across Multiple Risk Factors.
Shrier, Ian; Colditz, Graham A; Steele, Russell J
2018-07-01
Although meta-analyses provide summary effect estimates that help advise patient care, patients often want to compare their overall health to the general population. The Harvard Cancer Risk Index was published in 2004 and uses risk ratio estimates and prevalence estimates from original studies across many risk factors to provide an answer to this question. However, the published version of the formula only uses dichotomous risk factors and its derivation was not provided. The objective of this brief report was to provide the derivation of a more general form of the equation that allows the incorporation of risk factors with three or more levels.
Statistical Supplement to the Annual Report, Fiscal Year 1987.
ERIC Educational Resources Information Center
Texas Coll. and Univ. System, Austin. Coordinating Board.
This report offers statistical data for fiscal year 1987 on student enrollments, faculty, semester credit hours, physical facilities appropriations, and state loan and grant programs for Texas institutions of higher education. The following enrollment data are presented: 5-year (1982-86) summaries of headcount for public senior colleges and…
76 FR 28730 - Notice of Intent To Suspend the Agricultural Labor Survey and Farm Labor Reports
Federal Register 2010, 2011, 2012, 2013, 2014
2011-05-18
... DEPARTMENT OF AGRICULTURE National Agricultural Statistics Service Notice of Intent To Suspend the Agricultural Labor Survey and Farm Labor Reports AGENCY: National Agricultural Statistics Service, USDA. ACTION: Notice of suspension of data collection and publication. SUMMARY: This notice announces the intention of...
Education Statistics Quarterly, Fall 2002.
ERIC Educational Resources Information Center
Dillow, Sally, Ed.
2003-01-01
This publication provides a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications and data products released in a 3-month period. Each issue also contains a message from the NCES on a timely…
Timber resource statistics for western Oregon, 1997.
David L. Azuma; Larry F. Bednar; Bruce A. Hiserote; Charles F. Veneklase
2004-01-01
This report is a summary of timber resource statistics for western Oregon, which includes Benton, Clackamas, Clatsop, Columbia, Coos, Curry, Douglas, Hood River, Jackson, Josephine, Lane, Lincoln, Linn, Marion, Multnomah, Polk, Tillamook, Washington, and Yamhill Counties. Data were collected as part of a statewide multiresource inventory. The inventory sampled all...
Education Statistics Quarterly, Fall 2001.
ERIC Educational Resources Information Center
Dillow, Sally, Ed.
2001-01-01
The publication gives a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications, data products, and funding opportunities developed over a 3-month period. Each issue also contains a message from…
Education Statistics Quarterly. Volume 5, Issue 1.
ERIC Educational Resources Information Center
Dillow, Sally, Ed.
2003-01-01
This publication provides a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications, data product, and funding opportunities developed over a 3-month period. Each issue also contains a message…
Education Statistics Quarterly, Winter 2001.
ERIC Educational Resources Information Center
Dillow, Sally, Ed.
2002-01-01
This publication provides a comprehensive overview of work done across all parts of the National Center for Education Statistics (NCES). Each issue contains short publications, summaries, and descriptions that cover all NCES publications and data products released in a 3-month period. Each issue also contains a message from the NCES on a timely…
Early estimate of motor vehicle traffic fatalities in 2009 : a brief statistical summary
DOT National Transportation Integrated Search
2010-03-01
statistical projection of traffic fatalities in 2009 shows that an estimated 33,963 people died in motor vehicle traffic crashes. This represents a decline of about 8.9 percent as compared to the 37,261 fatalities that occurred in 2008, as shown in T...
This technical report presents a summary of indoor air studies that measured background concentrations of VOCs in the indoor air of thousands of North American residences and an evaluation and compilation of their reported statistical information.
Timber resource statistics for eastern Oregon, 1999.
David L. Azuma; Paul A. Dunham; Bruce A. Hiserote; Charles F. Veneklase
2004-01-01
This report is a summary of timber resource statistics for eastern Oregon, which includes Baker, Crook, Deschutes, Gilliam, Grant, Harney, Jefferson, Klamath, Lake, Malheur, Morrow, Sherman, Umatilla, Union, Wallowa, Wasco, and Wheeler Counties. Data were collected as part of a statewide multiresource inventory. The inventory sampled all private and public lands except...
Derivative Free Optimization of Complex Systems with the Use of Statistical Machine Learning Models
2015-09-12
AFRL-AFOSR-VA-TR-2015-0278 DERIVATIVE FREE OPTIMIZATION OF COMPLEX SYSTEMS WITH THE USE OF STATISTICAL MACHINE LEARNING MODELS Katya Scheinberg...COMPLEX SYSTEMS WITH THE USE OF STATISTICAL MACHINE LEARNING MODELS 5a. CONTRACT NUMBER 5b. GRANT NUMBER FA9550-11-1-0239 5c. PROGRAM ELEMENT...developed, which has been the focus of our research. 15. SUBJECT TERMS optimization, Derivative-Free Optimization, Statistical Machine Learning 16. SECURITY
Misuse of statistical methods: critical assessment of articles in BMJ from January to March 1976.
Gore, S M; Jones, I G; Rytter, E C
1977-01-01
Sixty-two reports that appeared as Papers and Originals (excluding short reports) in 13 consecutive issues of the British Medical journal included statistical analysis. Thirty-two had statistical errors of one kind or another; in 18 fairly serious faults were discovered. The summaries of five reports made some claim that was unsupportable on re-examination of the data. Medical investigators should consult with people who have a real understanding of statistical methods throughout their projects. PMID:832023
Ensor, Joie; Riley, Richard D.
2016-01-01
Meta‐analysis using individual participant data (IPD) obtains and synthesises the raw, participant‐level data from a set of relevant studies. The IPD approach is becoming an increasingly popular tool as an alternative to traditional aggregate data meta‐analysis, especially as it avoids reliance on published results and provides an opportunity to investigate individual‐level interactions, such as treatment‐effect modifiers. There are two statistical approaches for conducting an IPD meta‐analysis: one‐stage and two‐stage. The one‐stage approach analyses the IPD from all studies simultaneously, for example, in a hierarchical regression model with random effects. The two‐stage approach derives aggregate data (such as effect estimates) in each study separately and then combines these in a traditional meta‐analysis model. There have been numerous comparisons of the one‐stage and two‐stage approaches via theoretical consideration, simulation and empirical examples, yet there remains confusion regarding when each approach should be adopted, and indeed why they may differ. In this tutorial paper, we outline the key statistical methods for one‐stage and two‐stage IPD meta‐analyses, and provide 10 key reasons why they may produce different summary results. We explain that most differences arise because of different modelling assumptions, rather than the choice of one‐stage or two‐stage itself. We illustrate the concepts with recently published IPD meta‐analyses, summarise key statistical software and provide recommendations for future IPD meta‐analyses. © 2016 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:27747915
Improved score statistics for meta-analysis in single-variant and gene-level association studies.
Yang, Jingjing; Chen, Sai; Abecasis, Gonçalo
2018-06-01
Meta-analysis is now an essential tool for genetic association studies, allowing them to combine large studies and greatly accelerating the pace of genetic discovery. Although the standard meta-analysis methods perform equivalently as the more cumbersome joint analysis under ideal settings, they result in substantial power loss under unbalanced settings with various case-control ratios. Here, we investigate the power loss problem by the standard meta-analysis methods for unbalanced studies, and further propose novel meta-analysis methods performing equivalently to the joint analysis under both balanced and unbalanced settings. We derive improved meta-score-statistics that can accurately approximate the joint-score-statistics with combined individual-level data, for both linear and logistic regression models, with and without covariates. In addition, we propose a novel approach to adjust for population stratification by correcting for known population structures through minor allele frequencies. In the simulated gene-level association studies under unbalanced settings, our method recovered up to 85% power loss caused by the standard methods. We further showed the power gain of our methods in gene-level tests with 26 unbalanced studies of age-related macular degeneration . In addition, we took the meta-analysis of three unbalanced studies of type 2 diabetes as an example to discuss the challenges of meta-analyzing multi-ethnic samples. In summary, our improved meta-score-statistics with corrections for population stratification can be used to construct both single-variant and gene-level association studies, providing a useful framework for ensuring well-powered, convenient, cross-study analyses. © 2018 WILEY PERIODICALS, INC.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1978-10-03
This report is a six-part statistical summary of surface weather observations for Torrejon AB, Madrid Spain. It contains the following parts: (A) Weather Conditions; Atmospheric Phenomena; (B) Precipitation, Snowfall and Snow Depth (daily amounts and extreme values); (C) Surface winds; (D) Ceiling Versus Visibility; Sky Cover; (E) Psychrometric Summaries (daily maximum and minimum temperatures, extreme maximum and minimum temperatures, psychrometric summary of wet-bulb temperature depression versus dry-bulb temperature, means and standard deviations of dry-bulb, wet-bulb and dew-point temperatures and relative humidity); and (F) Pressure Summary (means, standard, deviations, and observation counts of station pressure and sea-level pressure). Data in thismore » report are presented in tabular form, in most cases in percentage frequency of occurrence or cumulative percentage frequency of occurrence tables.« less
ERIC Educational Resources Information Center
Foster, Emily M.
1942-01-01
The U.S. Office of Education is required by law to collect statistics to show the condition and progress of education. Statistics can be made available, on a national scale, to the extent that school administrators, principals, and college officials cooperate on a voluntary basis with the Office of Education in making the facts available. This…
ERIC Educational Resources Information Center
Pleis, J. R.; Ward, B. W.; Lucas, J. W.
2010-01-01
Objectives: This report presents health statistics from the 2009 National Health Interview Survey (NHIS) for the civilian noninstitutionalized adult population, classified by sex, age, race and ethnicity, education, family income, poverty status, health insurance coverage, marital status, and place and region of residence. Estimates are presented…
Comments on statistical issues in numerical modeling for underground nuclear test monitoring
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nicholson, W.L.; Anderson, K.K.
1993-11-01
The Symposium concluded with prepared summaries by four experts in the involved disciplines. These experts made no mention of statistics and/or the statistical content of issues. The first author contributed an extemporaneous statement at the Symposium because there are important issues associated with conducting and evaluating numerical modeling that are familiar to statisticians and often treated successfully by them. This note expands upon these extemporaneous remarks.
Powerful Statistical Inference for Nested Data Using Sufficient Summary Statistics
Dowding, Irene; Haufe, Stefan
2018-01-01
Hierarchically-organized data arise naturally in many psychology and neuroscience studies. As the standard assumption of independent and identically distributed samples does not hold for such data, two important problems are to accurately estimate group-level effect sizes, and to obtain powerful statistical tests against group-level null hypotheses. A common approach is to summarize subject-level data by a single quantity per subject, which is often the mean or the difference between class means, and treat these as samples in a group-level t-test. This “naive” approach is, however, suboptimal in terms of statistical power, as it ignores information about the intra-subject variance. To address this issue, we review several approaches to deal with nested data, with a focus on methods that are easy to implement. With what we call the sufficient-summary-statistic approach, we highlight a computationally efficient technique that can improve statistical power by taking into account within-subject variances, and we provide step-by-step instructions on how to apply this approach to a number of frequently-used measures of effect size. The properties of the reviewed approaches and the potential benefits over a group-level t-test are quantitatively assessed on simulated data and demonstrated on EEG data from a simulated-driving experiment. PMID:29615885
Balas, Benjamin
2016-11-01
Peripheral visual perception is characterized by reduced information about appearance due to constraints on how image structure is represented. Visual crowding is a consequence of excessive integration in the visual periphery. Basic phenomenology of visual crowding and other tasks have been successfully accounted for by a summary-statistic model of pooling, suggesting that texture-like processing is useful for how information is reduced in peripheral vision. I attempt to extend the scope of this model by examining a property of peripheral vision: reduced perceived numerosity in the periphery. I demonstrate that a summary-statistic model of peripheral appearance accounts for reduced numerosity in peripherally viewed arrays of randomly placed dots, but does not account for observed effects of dot clustering within such arrays. The model thus offers a limited account of how numerosity is perceived in the visual periphery. I also demonstrate that the model predicts that numerosity estimation is sensitive to element shape, which represents a novel prediction regarding the phenomenology of peripheral numerosity perception. Finally, I discuss ways to extend the model to a broader range of behavior and the potential for using the model to make further predictions about how number is perceived in untested scenarios in peripheral vision.
GWAMA: software for genome-wide association meta-analysis.
Mägi, Reedik; Morris, Andrew P
2010-05-28
Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. The GWAMA (Genome-Wide Association Meta-Analysis) software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.
A summary of modulus of elasticity and knot size surveys for laminating grades of lumber
R. W. Wolfe; R. C. Moody
1981-01-01
A summary of modulus of elasticity (MOE) and knot data is presented for grades of lumber commonly used to manufacture glued-laminated (glulam) timber by the laminating Industry. Tabulated values represent 30 different studies covering a time span of over 16 years. Statistical estimates of average and near-maximum knot sizes as well as mean and coefficient of variation...
1982-11-01
N* I. Khednipin.CeldegN. CG-D-Y4-82 4. Id ed I.ld j. D* NIUVin 1982U /LIAC WH ,. Peb- olm..,, Cede 9. Pwmln,, OWe, .omee . en Adoe., 10 . Wei Uat Me... 10 PRE-SURVEY PLANNING ..a-C ..hin.at.......................... 11 Overviewon of S ..... ........... 1......... ... .16 R eWaypoint Defintion...TDSS Statistics Summary.. ....................... 29 9 Example of Range-Range Waypoint Calculation.................30 10 Summary of Range-Range Waypoint
ERIC Educational Resources Information Center
Kellermann, Arthur L.; Fuqua-Whitley, Dawna S.; Rivara, Frederick P.
This summary explaining the results of evaluations of programs to prevent youth violence is an attempt to fill the gap in information about what works and what does not. An effort is made to place the problem of youth violence in perspective, using information largely taken from Bureau of Justice statistics. The existing programs are divided into…
Computational methods in the pricing and risk management of modern financial derivatives
NASA Astrophysics Data System (ADS)
Deutsch, Hans-Peter
1999-09-01
In the last 20 years modern finance has developed into a complex mathematically challenging field. Very complicated risks exist in financial markets which need very advanced methods to measure and/or model them. The financial instruments invented by the market participants to trade these risk, the so called derivatives are usually even more complicated than the risks themselves and also sometimes generate new riks. Topics like random walks, stochastic differential equations, martingale measures, time series analysis, implied correlations, etc. are of common use in the field. This is why more and more people with a science background, such as physicists, mathematicians, or computer scientists, are entering the field of finance. The measurement and management of all theses risks is the key to the continuing success of banks. This talk gives insight into today's common methods of modern market risk management such as variance-covariance, historical simulation, Monte Carlo, “Greek” ratios, etc., including the statistical concepts on which they are based. Derivatives are at the same time the main reason for and the most effective means of conducting risk management. As such, they stand at the beginning and end of risk management. The valuation of derivatives and structured financial instruments is therefore the prerequisite, the condition sine qua non, for all risk management. This talk introduces some of the important valuation methods used in modern derivatives pricing such as present value, Black-Scholes, binomial trees, Monte Carlo, etc. In summary this talk highlights an area outside physics where there is a lot of interesting work to do, especially for physicists. Or as one of our consultants said: The fascinating thing about this job is that Arthur Andersen hired me not ALTHOUGH I am a physicist but BECAUSE I am a physicist.
Adult Illiterates and Adult Literacy Programs: A Summary of Descriptive Data.
ERIC Educational Resources Information Center
McGrail, Janet
A portrait of illiterates and literacy programs in the United States in the 1980s is derived from this summary of the most up-to-date, valid information that could be obtained from a literature review. The first section on adult illiterates identifies data sources, numbers of illiterates, and characteristics of the five main groups (the elderly,…
Dicarboxylic esters: Useful tools for the biocatalyzed synthesis of hybrid compounds and polymers
Bassanini, Ivan; Hult, Karl
2015-01-01
Summary Dicarboxylic acids and their derivatives (esters and anhydrides) have been used as acylating agents in lipase-catalyzed reactions in organic solvents. The synthetic outcomes have been dimeric or hybrid derivatives of bioactive natural compounds as well as functionalized polyesters. PMID:26664578
NASA Technical Reports Server (NTRS)
Westwater, E. R.; Snider, J. B.; Falls, M. J.; Fionda, E.
1990-01-01
Two seasons of thermal emission measurements, running from December 1987 through February 1988 and from June through August 1988 of thermal emission measurements, taken by a multi-channel, ground-based microwave radiometer, are used to derive single-station zenith attenuation statistics at 20.6 and 31.65 GHz. For the summer period, statistics are also derived for 52.85 GHz. In addition, data from two dual-channel radiometers, separated from Denver by baseline distances of 49 and 168 km, are used to derive two-station attenuation diversity statistics at 20.6 and 31.65 GHz. The multi-channel radiometer is operated at Denver, Colorado; the dual-channel devices are operated at Platteville and Flagler, Colorado. The diversity statistics are presented by cumulative distributions of maximum and minimum attenuation.
A comprehensive review of arsenic levels in the semiconductor manufacturing industry.
Park, Donguk; Yang, Haengsun; Jeong, Jeeyeon; Ha, Kwonchul; Choi, Sangjun; Kim, Chinyon; Yoon, Chungsik; Park, Dooyong; Paek, Domyung
2010-11-01
This paper presents a summary of arsenic level statistics from air and wipe samples taken from studies conducted in fabrication operations. The main objectives of this study were not only to describe arsenic measurement data but also, through a literature review, to categorize fabrication workers in accordance with observed arsenic levels. All airborne arsenic measurements reported were included in the summary statistics for analysis of the measurement data. The arithmetic mean was estimated assuming a lognormal distribution from the geometric mean and the geometric standard deviation or the range. In addition, weighted arithmetic means (WAMs) were calculated based on the number of measurements reported for each mean. Analysis of variance (ANOVA) was employed to compare arsenic levels classified according to several categories such as the year, sampling type, location sampled, operation type, and cleaning technique. Nine papers were found reporting airborne arsenic measurement data from maintenance workers or maintenance areas in semiconductor chip-making plants. A total of 40 statistical summaries from seven articles were identified that represented a total of 423 airborne arsenic measurements. Arsenic exposure levels taken during normal operating activities in implantation operations (WAM = 1.6 μg m⁻³, no. of samples = 77, no. of statistical summaries = 2) were found to be lower than exposure levels of engineers who were involved in maintenance works (7.7 μg m⁻³, no. of samples = 181, no. of statistical summaries = 19). The highest level (WAM = 218.6 μg m⁻³) was associated with various maintenance works performed inside an ion implantation chamber. ANOVA revealed no significant differences in the WAM arsenic levels among the categorizations based on operation and sampling characteristics. Arsenic levels (56.4 μg m⁻³) recorded during maintenance works performed in dry conditions were found to be much higher than those from maintenance works in wet conditions (0.6 μg m⁻³). Arsenic levels from wipe samples in process areas after maintenance activities ranged from non-detectable to 146 μg cm⁻², indicating the potential for dispersion into the air and hence inhalation. We conclude that workers who are regularly or occasionally involved in maintenance work have higher potential for occupational exposure than other employees who are in charge of routine production work. In addition, fabrication workers can be classified into two groups based on the reviewed arsenic exposure levels: operators with potential for low levels of exposure and maintenance engineers with high levels of exposure. These classifications could be used as a basis for a qualitative ordinal ranking of exposure in an epidemiological study.
Elhadad, N.; Claassen, J.; Perotte, R.; Goldstein, A.; Hripcsak, G.
2018-01-01
We study the question of how to represent or summarize raw laboratory data taken from an electronic health record (EHR) using parametric model selection to reduce or cope with biases induced through clinical care. It has been previously demonstrated that the health care process (Hripcsak and Albers, 2012, 2013), as defined by measurement context (Hripcsak and Albers, 2013; Albers et al., 2012) and measurement patterns (Albers and Hripcsak, 2010, 2012), can influence how EHR data are distributed statistically (Kohane and Weber, 2013; Pivovarov et al., 2014). We construct an algorithm, PopKLD, which is based on information criterion model selection (Burnham and Anderson, 2002; Claeskens and Hjort, 2008), is intended to reduce and cope with health care process biases and to produce an intuitively understandable continuous summary. The PopKLD algorithm can be automated and is designed to be applicable in high-throughput settings; for example, the output of the PopKLD algorithm can be used as input for phenotyping algorithms. Moreover, we develop the PopKLD-CAT algorithm that transforms the continuous PopKLD summary into a categorical summary useful for applications that require categorical data such as topic modeling. We evaluate our methodology in two ways. First, we apply the method to laboratory data collected in two different health care contexts, primary versus intensive care. We show that the PopKLD preserves known physiologic features in the data that are lost when summarizing the data using more common laboratory data summaries such as mean and standard deviation. Second, for three disease-laboratory measurement pairs, we perform a phenotyping task: we use the PopKLD and PopKLD-CAT algorithms to define high and low values of the laboratory variable that are used for defining a disease state. We then compare the relationship between the PopKLD-CAT summary disease predictions and the same predictions using empirically estimated mean and standard deviation to a gold standard generated by clinical review of patient records. We find that the PopKLD laboratory data summary is substantially better at predicting disease state. The PopKLD or PopKLD-CAT algorithms are not meant to be used as phenotyping algorithms, but we use the phenotyping task to show what information can be gained when using a more informative laboratory data summary. In the process of evaluation our method we show that the different clinical contexts and laboratory measurements necessitate different statistical summaries. Similarly, leveraging the principle of maximum entropy we argue that while some laboratory data only have sufficient information to estimate a mean and standard deviation, other laboratory data captured in an EHR contain substantially more information than can be captured in higher-parameter models. PMID:29369797
Albers, D J; Elhadad, N; Claassen, J; Perotte, R; Goldstein, A; Hripcsak, G
2018-02-01
We study the question of how to represent or summarize raw laboratory data taken from an electronic health record (EHR) using parametric model selection to reduce or cope with biases induced through clinical care. It has been previously demonstrated that the health care process (Hripcsak and Albers, 2012, 2013), as defined by measurement context (Hripcsak and Albers, 2013; Albers et al., 2012) and measurement patterns (Albers and Hripcsak, 2010, 2012), can influence how EHR data are distributed statistically (Kohane and Weber, 2013; Pivovarov et al., 2014). We construct an algorithm, PopKLD, which is based on information criterion model selection (Burnham and Anderson, 2002; Claeskens and Hjort, 2008), is intended to reduce and cope with health care process biases and to produce an intuitively understandable continuous summary. The PopKLD algorithm can be automated and is designed to be applicable in high-throughput settings; for example, the output of the PopKLD algorithm can be used as input for phenotyping algorithms. Moreover, we develop the PopKLD-CAT algorithm that transforms the continuous PopKLD summary into a categorical summary useful for applications that require categorical data such as topic modeling. We evaluate our methodology in two ways. First, we apply the method to laboratory data collected in two different health care contexts, primary versus intensive care. We show that the PopKLD preserves known physiologic features in the data that are lost when summarizing the data using more common laboratory data summaries such as mean and standard deviation. Second, for three disease-laboratory measurement pairs, we perform a phenotyping task: we use the PopKLD and PopKLD-CAT algorithms to define high and low values of the laboratory variable that are used for defining a disease state. We then compare the relationship between the PopKLD-CAT summary disease predictions and the same predictions using empirically estimated mean and standard deviation to a gold standard generated by clinical review of patient records. We find that the PopKLD laboratory data summary is substantially better at predicting disease state. The PopKLD or PopKLD-CAT algorithms are not meant to be used as phenotyping algorithms, but we use the phenotyping task to show what information can be gained when using a more informative laboratory data summary. In the process of evaluation our method we show that the different clinical contexts and laboratory measurements necessitate different statistical summaries. Similarly, leveraging the principle of maximum entropy we argue that while some laboratory data only have sufficient information to estimate a mean and standard deviation, other laboratory data captured in an EHR contain substantially more information than can be captured in higher-parameter models. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Factors That Attenuate the Correlation Coefficient and Its Analogs.
ERIC Educational Resources Information Center
Dolenz, Beverly
The correlation coefficient is an integral part of many other statistical techniques (analysis of variance, t-tests, etc.), since all analytic methods are actually correlational (G. V. Glass and K. D. Hopkins, 1984). The correlation coefficient is a statistical summary that represents the degree and direction of relationship between two variables.…
An Analysis of Research Trends in Dissertations and Theses Studying Blended Learning
ERIC Educational Resources Information Center
Drysdale, Jeffery S.; Graham, Charles R.; Spring, Kristian J.; Halverson, Lisa R.
2013-01-01
This article analyzes the research of 205 doctoral dissertations and masters' theses in the domain of blended learning. A summary of trends regarding the growth and context of blended learning research is presented. Methodological trends are described in terms of qualitative, inferential statistics, descriptive statistics, and combined approaches…
A Data Analysis of Naval Air Systems Command Funding Documents
2017-06-01
Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302, and to the Office of Management ...Business & Financial Managers 15. NUMBER OF PAGES 75 16. PRICE CODE 17. SECURITY CLASSIFICATION OF REPORT Unclassified 18. SECURITY...Summary Statistics for Regressions with a Statistically Significant Relationship
In Search of the Most Likely Value
ERIC Educational Resources Information Center
Letkowski, Jerzy
2014-01-01
Descripting Statistics provides methodology and tools for user-friendly presentation of random data. Among the summary measures that describe focal tendencies in random data, the mode is given the least amount of attention and it is frequently misinterpreted in many introductory textbooks on statistics. The purpose of the paper is to provide a…