SPSS macros to compare any two fitted values from a regression model.
Weaver, Bruce; Dubois, Sacha
2012-12-01
In regression models with first-order terms only, the coefficient for a given variable is typically interpreted as the change in the fitted value of Y for a one-unit increase in that variable, with all other variables held constant. Therefore, each regression coefficient represents the difference between two fitted values of Y. But the coefficients represent only a fraction of the possible fitted value comparisons that might be of interest to researchers. For many fitted value comparisons that are not captured by any of the regression coefficients, common statistical software packages do not provide the standard errors needed to compute confidence intervals or carry out statistical tests-particularly in more complex models that include interactions, polynomial terms, or regression splines. We describe two SPSS macros that implement a matrix algebra method for comparing any two fitted values from a regression model. The !OLScomp and !MLEcomp macros are for use with models fitted via ordinary least squares and maximum likelihood estimation, respectively. The output from the macros includes the standard error of the difference between the two fitted values, a 95% confidence interval for the difference, and a corresponding statistical test with its p-value.
Modified Regression Correlation Coefficient for Poisson Regression Model
NASA Astrophysics Data System (ADS)
Kaengthong, Nattacha; Domthong, Uthumporn
2017-09-01
This study gives attention to indicators in predictive power of the Generalized Linear Model (GLM) which are widely used; however, often having some restrictions. We are interested in regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, and defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was modifying regression correlation coefficient for Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables, and having multicollinearity in independent variables. The result shows that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient based on Bias and the Root Mean Square Error (RMSE).
Shrinkage regression-based methods for microarray missing value imputation.
Wang, Hsiuying; Chiu, Chia-Chun; Wu, Yi-Ching; Wu, Wei-Sheng
2013-01-01
Missing values commonly occur in the microarray data, which usually contain more than 5% missing values with up to 90% of genes affected. Inaccurate missing value estimation results in reducing the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, the regression-based methods are very popular and have been shown to perform better than the other types of methods in many testing microarray datasets. To further improve the performances of the regression-based methods, we propose shrinkage regression-based methods. Our methods take the advantage of the correlation structure in the microarray data and select similar genes for the target gene by Pearson correlation coefficients. Besides, our methods incorporate the least squares principle, utilize a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the new coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation in six testing microarray datasets than the existing regression-based methods do. Imputation of missing values is a very important aspect of microarray data analyses because most of the downstream analyses require a complete dataset. Therefore, exploring accurate and efficient methods for estimating missing values has become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
Biases and Standard Errors of Standardized Regression Coefficients
ERIC Educational Resources Information Center
Yuan, Ke-Hai; Chan, Wai
2011-01-01
The paper obtains consistent standard errors (SE) and biases of order O(1/n) for the sample standardized regression coefficients with both random and given predictors. Analytical results indicate that the formulas for SEs given in popular text books are consistent only when the population value of the regression coefficient is zero. The sample…
Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa
2008-01-01
This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.
Kitagawa, Yasuhisa; Teramoto, Tamio; Daida, Hiroyuki
2012-01-01
We evaluated the impact of adherence to preferable behavior on serum lipid control assessed by a self-reported questionnaire in high-risk patients taking pravastatin for primary prevention of coronary artery disease. High-risk patients taking pravastatin were followed for 2 years. Questionnaire surveys comprising 21 questions, including 18 questions concerning awareness of health, and current status of diet, exercise, and drug therapy, were conducted at baseline and after 1 year. Potential domains were established by factor analysis from the results of questionnaires, and adherence scores were calculated in each domain. The relationship between adherence scores and lipid values during the 1-year treatment period was analyzed by each domain using multiple regression analysis. A total of 5,792 patients taking pravastatin were included in the analysis. Multiple regression analysis showed a significant correlation in terms of "Intake of high fat/cholesterol/sugar foods" (regression coefficient -0.58, p=0.0105) and "Adherence to instructions for drug therapy" (regression coefficient -6.61, p<0.0001). Low-density lipoprotein cholesterol (LDL-C) values were significantly lower in patients who had an increase in the adherence score in the "Awareness of health" domain compared with those with a decreased score. There was a significant correlation between high-density lipoprotein (HDL-C) values and "Awareness of health" (regression coefficient 0.26; p= 0.0037), "Preferable dietary behaviors" (regression coefficient 0.75; p<0.0001), and "Exercise" (regression coefficient 0.73; p= 0.0002). Similar relations were seen with triglycerides. In patients who have a high awareness of their health, a positive attitude toward lipid-lowering treatment including diet, exercise, and high adherence to drug therapy, is related with favorable overall lipid control even in patients under treatment with pravastatin.
2014-01-01
Background Support vector regression (SVR) and Gaussian process regression (GPR) were used for the analysis of electroanalytical experimental data to estimate diffusion coefficients. Results For simulated cyclic voltammograms based on the EC, Eqr, and EqrC mechanisms these regression algorithms in combination with nonlinear kernel/covariance functions yielded diffusion coefficients with higher accuracy as compared to the standard approach of calculating diffusion coefficients relying on the Nicholson-Shain equation. The level of accuracy achieved by SVR and GPR is virtually independent of the rate constants governing the respective reaction steps. Further, the reduction of high-dimensional voltammetric signals by manual selection of typical voltammetric peak features decreased the performance of both regression algorithms compared to a reduction by downsampling or principal component analysis. After training on simulated data sets, diffusion coefficients were estimated by the regression algorithms for experimental data comprising voltammetric signals for three organometallic complexes. Conclusions Estimated diffusion coefficients closely matched the values determined by the parameter fitting method, but reduced the required computational time considerably for one of the reaction mechanisms. The automated processing of voltammograms according to the regression algorithms yields better results than the conventional analysis of peak-related data. PMID:24987463
Biostatistics Series Module 6: Correlation and Linear Regression.
Hazra, Avijit; Gogtay, Nithya
2016-01-01
Correlation and linear regression are the most commonly used techniques for quantifying the association between two numeric variables. Correlation quantifies the strength of the linear relationship between paired variables, expressing this as a correlation coefficient. If both variables x and y are normally distributed, we calculate Pearson's correlation coefficient ( r ). If normality assumption is not met for one or both variables in a correlation analysis, a rank correlation coefficient, such as Spearman's rho (ρ) may be calculated. A hypothesis test of correlation tests whether the linear relationship between the two variables holds in the underlying population, in which case it returns a P < 0.05. A 95% confidence interval of the correlation coefficient can also be calculated for an idea of the correlation in the population. The value r 2 denotes the proportion of the variability of the dependent variable y that can be attributed to its linear relation with the independent variable x and is called the coefficient of determination. Linear regression is a technique that attempts to link two correlated variables x and y in the form of a mathematical equation ( y = a + bx ), such that given the value of one variable the other may be predicted. In general, the method of least squares is applied to obtain the equation of the regression line. Correlation and linear regression analysis are based on certain assumptions pertaining to the data sets. If these assumptions are not met, misleading conclusions may be drawn. The first assumption is that of linear relationship between the two variables. A scatter plot is essential before embarking on any correlation-regression analysis to show that this is indeed the case. Outliers or clustering within data sets can distort the correlation coefficient value. Finally, it is vital to remember that though strong correlation can be a pointer toward causation, the two are not synonymous.
Biostatistics Series Module 6: Correlation and Linear Regression
Hazra, Avijit; Gogtay, Nithya
2016-01-01
Correlation and linear regression are the most commonly used techniques for quantifying the association between two numeric variables. Correlation quantifies the strength of the linear relationship between paired variables, expressing this as a correlation coefficient. If both variables x and y are normally distributed, we calculate Pearson's correlation coefficient (r). If normality assumption is not met for one or both variables in a correlation analysis, a rank correlation coefficient, such as Spearman's rho (ρ) may be calculated. A hypothesis test of correlation tests whether the linear relationship between the two variables holds in the underlying population, in which case it returns a P < 0.05. A 95% confidence interval of the correlation coefficient can also be calculated for an idea of the correlation in the population. The value r2 denotes the proportion of the variability of the dependent variable y that can be attributed to its linear relation with the independent variable x and is called the coefficient of determination. Linear regression is a technique that attempts to link two correlated variables x and y in the form of a mathematical equation (y = a + bx), such that given the value of one variable the other may be predicted. In general, the method of least squares is applied to obtain the equation of the regression line. Correlation and linear regression analysis are based on certain assumptions pertaining to the data sets. If these assumptions are not met, misleading conclusions may be drawn. The first assumption is that of linear relationship between the two variables. A scatter plot is essential before embarking on any correlation-regression analysis to show that this is indeed the case. Outliers or clustering within data sets can distort the correlation coefficient value. Finally, it is vital to remember that though strong correlation can be a pointer toward causation, the two are not synonymous. PMID:27904175
The microcomputer scientific software series 2: general linear model--regression.
Harold M. Rauscher
1983-01-01
The general linear model regression (GLMR) program provides the microcomputer user with a sophisticated regression analysis capability. The output provides a regression ANOVA table, estimators of the regression model coefficients, their confidence intervals, confidence intervals around the predicted Y-values, residuals for plotting, a check for multicollinearity, a...
Zhao, Yang; Zhang, Xue Qing; Bian, Xiao Dong
2018-01-01
To investigate the early supplementary processes of fishre sources in the Bohai Sea, the geographically weighted regression (GWR) was introduced to the habitat suitability index (HSI) model. The Bohai Sea larval Japanese Halfbeak HSI GWR model was established with four environmental variables, including sea surface temperature (SST), sea surface salinity (SSS), water depth (DEP), and chlorophyll a concentration (Chl a). Results of the simulation showed that the four variables had different performances in August 2015. SST and Chl a were global variables, and had little impacts on HSI, with the regression coefficients of -0.027 and 0.006, respectively. SSS and DEP were local variables, and had larger impacts on HSI, while the average values of absolute values of their regression coefficients were 0.075 and 0.129, respectively. In the central Bohai Sea, SSS showed a negative correlation with HSI, and the most negative correlation coefficient was -0.3. In contrast, SSS was correlated positively but weakly with HSI in the three bays of Bohai Sea, and the largest correlation coefficient was 0.1. In particular, DEP and HSI were negatively correlated in the entire Bohai Sea, while they were more negatively correlated in the three bays of Bohai than in the central Bohai Sea, and the most negative correlation coefficient was -0.16 in the three bays. The Poisson regression coefficient of the HSI GWR model was 0.705, consistent with field measurements. Therefore, it could provide a new method for the research on fish habitats in the future.
NASA Astrophysics Data System (ADS)
Mansouri, Edris; Feizi, Faranak; Jafari Rad, Alireza; Arian, Mehran
2018-03-01
This paper uses multivariate regression to create a mathematical model for iron skarn exploration in the Sarvian area, central Iran, using multivariate regression for mineral prospectivity mapping (MPM). The main target of this paper is to apply multivariate regression analysis (as an MPM method) to map iron outcrops in the northeastern part of the study area in order to discover new iron deposits in other parts of the study area. Two types of multivariate regression models using two linear equations were employed to discover new mineral deposits. This method is one of the reliable methods for processing satellite images. ASTER satellite images (14 bands) were used as unique independent variables (UIVs), and iron outcrops were mapped as dependent variables for MPM. According to the results of the probability value (p value), coefficient of determination value (R2) and adjusted determination coefficient (Radj2), the second regression model (which consistent of multiple UIVs) fitted better than other models. The accuracy of the model was confirmed by iron outcrops map and geological observation. Based on field observation, iron mineralization occurs at the contact of limestone and intrusive rocks (skarn type).
Nakatsuka, Haruo; Chiba, Keiko; Watanabe, Takao; Sawatari, Hideyuki; Seki, Takako
2016-11-01
Iodine intake by adults in farming districts in Northeastern Japan was evaluated by two methods: (1) government-approved food composition tables based calculation and (2) instrumental measurement. The correlation between these two values and a regression model for the calibration of calculated values was presented. Iodine intake was calculated, using the values in the Japan Standard Tables of Food Composition (FCT), through the analysis of duplicate samples of complete 24-h food consumption for 90 adult subjects. In cases where the value for iodine content was not available in the FCT, it was assumed to be zero for that food item (calculated values). Iodine content was also measured by ICP-MS (measured values). Calculated and measured values rendered geometric means (GM) of 336 and 279 μg/day, respectively. There was no statistically significant (p > 0.05) difference between calculated and measured values. The correlation coefficient was 0.646 (p < 0.05). With this high correlation coefficient, a simple regression line can be applied to estimate measured value from calculated value. A survey of the literature suggests that the values in this study were similar to values that have been reported to date for Japan, and higher than those for other countries in Asia. Iodine intake of Japanese adults was 336 μg/day (GM, calculated) and 279 μg/day (GM, measured). Both values correlated so well, with a correlation coefficient of 0.646, that a regression model (Y = 130.8 + 1.9479X, where X and Y are measured and calculated values, respectively) could be used to calibrate calculated values.
Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results
ERIC Educational Resources Information Center
Warne, Russell T.
2011-01-01
Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…
Detection of Cutting Tool Wear using Statistical Analysis and Regression Model
NASA Astrophysics Data System (ADS)
Ghani, Jaharah A.; Rizal, Muhammad; Nuawi, Mohd Zaki; Haron, Che Hassan Che; Ramli, Rizauddin
2010-10-01
This study presents a new method for detecting the cutting tool wear based on the measured cutting force signals. A statistical-based method called Integrated Kurtosis-based Algorithm for Z-Filter technique, called I-kaz was used for developing a regression model and 3D graphic presentation of I-kaz 3D coefficient during machining process. The machining tests were carried out using a CNC turning machine Colchester Master Tornado T4 in dry cutting condition. A Kistler 9255B dynamometer was used to measure the cutting force signals, which were transmitted, analyzed, and displayed in the DasyLab software. Various force signals from machining operation were analyzed, and each has its own I-kaz 3D coefficient. This coefficient was examined and its relationship with flank wear lands (VB) was determined. A regression model was developed due to this relationship, and results of the regression model shows that the I-kaz 3D coefficient value decreases as tool wear increases. The result then is used for real time tool wear monitoring.
Innovating patient care delivery: DSRIP's interrupted time series analysis paradigm.
Shenoy, Amrita G; Begley, Charles E; Revere, Lee; Linder, Stephen H; Daiger, Stephen P
2017-12-08
Adoption of Medicaid Section 1115 waiver is one of the many ways of innovating healthcare delivery system. The Delivery System Reform Incentive Payment (DSRIP) pool, one of the two funding pools of the waiver has four categories viz. infrastructure development, program innovation and redesign, quality improvement reporting and lastly, bringing about population health improvement. A metric of the fourth category, preventable hospitalization (PH) rate was analyzed in the context of eight conditions for two time periods, pre-reporting years (2010-2012) and post-reporting years (2013-2015) for two hospital cohorts, DSRIP participating and non-participating hospitals. The study explains how DSRIP impacted Preventable Hospitalization (PH) rates of eight conditions for both hospital cohorts within two time periods. Eight PH rates were regressed as the dependent variable with time, intervention and post-DSRIP Intervention as independent variables. PH rates of eight conditions were then consolidated into one rate for regressing with the above independent variables to evaluate overall impact of DSRIP. An interrupted time series regression was performed after accounting for auto-correlation, stationarity and seasonality in the dataset. In the individual regression model, PH rates showed statistically significant coefficients for seven out of eight conditions in DSRIP participating hospitals. In the combined regression model, the coefficient of the PH rate showed a statistically significant decrease with negative p-values for regression coefficients in DSRIP participating hospitals compared to positive/increased p-values for regression coefficients in DSRIP non-participating hospitals. Several macro- and micro-level factors may have likely contributed DSRIP hospitals outperforming DSRIP non-participating hospitals. Healthcare organization/provider collaboration, support from healthcare professionals, DSRIP's design, state reimbursement and coordination in care delivery methods may have led to likely success of DSRIP. IV, a retrospective cohort study based on longitudinal data. Copyright © 2017 Elsevier Inc. All rights reserved.
Prediction of kinase-inhibitor binding affinity using energetic parameters
Usha, Singaravelu; Selvaraj, Samuel
2016-01-01
The combination of physicochemical properties and energetic parameters derived from protein-ligand complexes play a vital role in determining the biological activity of a molecule. In the present work, protein-ligand interaction energy along with logP values was used to predict the experimental log (IC50) values of 25 different kinase-inhibitors using multiple regressions which gave a correlation coefficient of 0.93. The regression equation obtained was tested on 93 kinase-inhibitor complexes and an average deviation of 0.92 from the experimental log IC50 values was shown. The same set of descriptors was used to predict binding affinities for a test set of five individual kinase families, with correlation values > 0.9. We show that the protein-ligand interaction energies and partition coefficient values form the major deterministic factors for binding affinity of the ligand for its receptor. PMID:28149052
Sabetghadam, Samaneh; Ahmadi-Givi, Farhang
2014-01-01
Light extinction, which is the extent of attenuation of light signal for every distance traveled by light in the absence of special weather conditions (e.g., fog and rain), can be expressed as the sum of scattering and absorption effects of aerosols. In this paper, diurnal and seasonal variations of the extinction coefficient are investigated for the urban areas of Tehran from 2007 to 2009. Cases of visibility impairment that were concurrent with reports of fog, mist, precipitation, or relative humidity above 90% are filtered. The mean value and standard deviation of daily extinction are 0.49 and 0.39 km(-1), respectively. The average is much higher than that in many other large cities in the world, indicating the rather poor air quality over Tehran. The extinction coefficient shows obvious diurnal variations in each season, with a peak in the morning that is more pronounced in the wintertime. Also, there is a very slight increasing trend in the annual variations of atmospheric extinction coefficient, which suggests that air quality has regressed since 2007. The horizontal extinction coefficient decreased from January to July in each year and then increased between July and December, with the maximum value in the winter. Diurnal variation of extinction is often associated with small values for low relative humidity (RH), but increases significantly at higher RH. Annual correlation analysis shows that there is a positive correlation between the extinction coefficient and RH, CO, PM10, SO2, and NO2 concentration, while negative correlation exists between the extinction and T, WS, and O3, implying their unfavorable impact on extinction variation. The extinction budget was derived from multiple regression equations using the regression coefficients. On average, 44% of the extinction is from suspended particles, 3% is from air molecules, about 5% is from NO2 absorption, 0.35% is from RH, and approximately 48% is unaccounted for, which may represent errors in the data as well as contribution of other atmospheric constituents omitted from the analysis. Stronger regression equation is achieved in the summer, meaning that the extinction is more predictable in this season using pollutant concentrations.
Yoneoka, Daisuke; Henmi, Masayuki
2017-11-30
Recently, the number of clinical prediction models sharing the same regression task has increased in the medical literature. However, evidence synthesis methodologies that use the results of these regression models have not been sufficiently studied, particularly in meta-analysis settings where only regression coefficients are available. One of the difficulties lies in the differences between the categorization schemes of continuous covariates across different studies. In general, categorization methods using cutoff values are study specific across available models, even if they focus on the same covariates of interest. Differences in the categorization of covariates could lead to serious bias in the estimated regression coefficients and thus in subsequent syntheses. To tackle this issue, we developed synthesis methods for linear regression models with different categorization schemes of covariates. A 2-step approach to aggregate the regression coefficient estimates is proposed. The first step is to estimate the joint distribution of covariates by introducing a latent sampling distribution, which uses one set of individual participant data to estimate the marginal distribution of covariates with categorization. The second step is to use a nonlinear mixed-effects model with correction terms for the bias due to categorization to estimate the overall regression coefficients. Especially in terms of precision, numerical simulations show that our approach outperforms conventional methods, which only use studies with common covariates or ignore the differences between categorization schemes. The method developed in this study is also applied to a series of WHO epidemiologic studies on white blood cell counts. Copyright © 2017 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Rock, N. M. S.; Duffy, T. R.
REGRES allows a range of regression equations to be calculated for paired sets of data values in which both variables are subject to error (i.e. neither is the "independent" variable). Nonparametric regressions, based on medians of all possible pairwise slopes and intercepts, are treated in detail. Estimated slopes and intercepts are output, along with confidence limits, Spearman and Kendall rank correlation coefficients. Outliers can be rejected with user-determined stringency. Parametric regressions can be calculated for any value of λ (the ratio of the variances of the random errors for y and x)—including: (1) major axis ( λ = 1); (2) reduced major axis ( λ = variance of y/variance of x); (3) Y on Xλ = infinity; or (4) X on Y ( λ = 0) solutions. Pearson linear correlation coefficients also are output. REGRES provides an alternative to conventional isochron assessment techniques where bivariate normal errors cannot be assumed, or weighting methods are inappropriate.
Improvement of Storm Forecasts Using Gridded Bayesian Linear Regression for Northeast United States
NASA Astrophysics Data System (ADS)
Yang, J.; Astitha, M.; Schwartz, C. S.
2017-12-01
Bayesian linear regression (BLR) is a post-processing technique in which regression coefficients are derived and used to correct raw forecasts based on pairs of observation-model values. This study presents the development and application of a gridded Bayesian linear regression (GBLR) as a new post-processing technique to improve numerical weather prediction (NWP) of rain and wind storm forecasts over northeast United States. Ten controlled variables produced from ten ensemble members of the National Center for Atmospheric Research (NCAR) real-time prediction system are used for a GBLR model. In the GBLR framework, leave-one-storm-out cross-validation is utilized to study the performances of the post-processing technique in a database composed of 92 storms. To estimate the regression coefficients of the GBLR, optimization procedures that minimize the systematic and random error of predicted atmospheric variables (wind speed, precipitation, etc.) are implemented for the modeled-observed pairs of training storms. The regression coefficients calculated for meteorological stations of the National Weather Service are interpolated back to the model domain. An analysis of forecast improvements based on error reductions during the storms will demonstrate the value of GBLR approach. This presentation will also illustrate how the variances are optimized for the training partition in GBLR and discuss the verification strategy for grid points where no observations are available. The new post-processing technique is successful in improving wind speed and precipitation storm forecasts using past event-based data and has the potential to be implemented in real-time.
Prediction of anthropometric foot characteristics in children.
Morrison, Stewart C; Durward, Brian R; Watt, Gordon F; Donaldson, Malcolm D C
2009-01-01
The establishment of growth reference values is needed in pediatric practice where pathologic conditions can have a detrimental effect on the growth and development of the pediatric foot. This study aims to use multiple regression to evaluate the effects of multiple predictor variables (height, age, body mass, and gender) on anthropometric characteristics of the peripubescent foot. Two hundred children aged 9 to 12 years were recruited, and three anthropometric measurements of the pediatric foot were recorded (foot length, forefoot width, and navicular height). Multiple regression analysis was conducted, and coefficients for gender, height, and body mass all had significant relationships for the prediction of forefoot width and foot length (P < or = .05, r > or = 0.7). The coefficients for gender and body mass were not significant for the prediction of navicular height (P > or = .05), whereas height was (P < or = .05). Normative growth reference values and prognostic regression equations are presented for the peripubescent foot.
The solar wind effect on cosmic rays and solar activity
NASA Technical Reports Server (NTRS)
Fujimoto, K.; Kojima, H.; Murakami, K.
1985-01-01
The relation of cosmic ray intensity to solar wind velocity is investigated, using neutron monitor data from Kiel and Deep River. The analysis shows that the regression coefficient of the average intensity for a time interval to the corresponding average velocity is negative and that the absolute effect increases monotonously with the interval of averaging, tau, that is, from -0.5% per 100km/s for tau = 1 day to -1.1% per 100km/s for tau = 27 days. For tau 27 days the coefficient becomes almost constant independently of the value of tau. The analysis also shows that this tau-dependence of the regression coefficiently is varying with the solar activity.
Linear regression metamodeling as a tool to summarize and present simulation model results.
Jalal, Hawre; Dowd, Bryan; Sainfort, François; Kuntz, Karen M
2013-10-01
Modelers lack a tool to systematically and clearly present complex model results, including those from sensitivity analyses. The objective was to propose linear regression metamodeling as a tool to increase transparency of decision analytic models and better communicate their results. We used a simplified cancer cure model to demonstrate our approach. The model computed the lifetime cost and benefit of 3 treatment options for cancer patients. We simulated 10,000 cohorts in a probabilistic sensitivity analysis (PSA) and regressed the model outcomes on the standardized input parameter values in a set of regression analyses. We used the regression coefficients to describe measures of sensitivity analyses, including threshold and parameter sensitivity analyses. We also compared the results of the PSA to deterministic full-factorial and one-factor-at-a-time designs. The regression intercept represented the estimated base-case outcome, and the other coefficients described the relative parameter uncertainty in the model. We defined simple relationships that compute the average and incremental net benefit of each intervention. Metamodeling produced outputs similar to traditional deterministic 1-way or 2-way sensitivity analyses but was more reliable since it used all parameter values. Linear regression metamodeling is a simple, yet powerful, tool that can assist modelers in communicating model characteristics and sensitivity analyses.
Examination of the Relation between the Values of Adolescents and Virtual Sensitiveness
ERIC Educational Resources Information Center
Yilmaz, Hasan
2013-01-01
The aim of this study is to examine the relation between the values adolescents have and virtual sensitiveness. The study is carried out on 447 adolescents, 160 of whom are female, 287 males. The Humanistic Values Scale and Virtual Sensitiveness scale were used. Pearson Product Moment Coefficient and multiple regression analysis techniques were…
Analysis of oscillatory motion of a light airplane at high values of lift coefficient
NASA Technical Reports Server (NTRS)
Batterson, J. G.
1983-01-01
A modified stepwise regression is applied to flight data from a light research air-plane operating at high angles at attack. The well-known phenomenon referred to as buckling or porpoising is analyzed and modeled using both power series and spline expansions of the aerodynamic force and moment coefficients associated with the longitudinal equations of motion.
Testing a single regression coefficient in high dimensional linear models
Zhong, Ping-Shou; Li, Runze; Wang, Hansheng; Tsai, Chih-Ling
2017-01-01
In linear regression models with high dimensional data, the classical z-test (or t-test) for testing the significance of each single regression coefficient is no longer applicable. This is mainly because the number of covariates exceeds the sample size. In this paper, we propose a simple and novel alternative by introducing the Correlated Predictors Screening (CPS) method to control for predictors that are highly correlated with the target covariate. Accordingly, the classical ordinary least squares approach can be employed to estimate the regression coefficient associated with the target covariate. In addition, we demonstrate that the resulting estimator is consistent and asymptotically normal even if the random errors are heteroscedastic. This enables us to apply the z-test to assess the significance of each covariate. Based on the p-value obtained from testing the significance of each covariate, we further conduct multiple hypothesis testing by controlling the false discovery rate at the nominal level. Then, we show that the multiple hypothesis testing achieves consistent model selection. Simulation studies and empirical examples are presented to illustrate the finite sample performance and the usefulness of the proposed method, respectively. PMID:28663668
Testing a single regression coefficient in high dimensional linear models.
Lan, Wei; Zhong, Ping-Shou; Li, Runze; Wang, Hansheng; Tsai, Chih-Ling
2016-11-01
In linear regression models with high dimensional data, the classical z -test (or t -test) for testing the significance of each single regression coefficient is no longer applicable. This is mainly because the number of covariates exceeds the sample size. In this paper, we propose a simple and novel alternative by introducing the Correlated Predictors Screening (CPS) method to control for predictors that are highly correlated with the target covariate. Accordingly, the classical ordinary least squares approach can be employed to estimate the regression coefficient associated with the target covariate. In addition, we demonstrate that the resulting estimator is consistent and asymptotically normal even if the random errors are heteroscedastic. This enables us to apply the z -test to assess the significance of each covariate. Based on the p -value obtained from testing the significance of each covariate, we further conduct multiple hypothesis testing by controlling the false discovery rate at the nominal level. Then, we show that the multiple hypothesis testing achieves consistent model selection. Simulation studies and empirical examples are presented to illustrate the finite sample performance and the usefulness of the proposed method, respectively.
Vereecken, H; Vanderborght, J; Kasteel, R; Spiteller, M; Schäffer, A; Close, M
2011-01-01
In this study, we analyzed sorption parameters for pesticides that were derived from batch and column or batch and field experiments. The batch experiments analyzed in this study were run with the same pesticide and soil as in the column and field experiments. We analyzed the relationship between the pore water velocity of the column and field experiments, solute residence times, and sorption parameters, such as the organic carbon normalized distribution coefficient ( ) and the mass exchange coefficient in kinetic models, as well as the predictability of sorption parameters from basic soil properties. The batch/column analysis included 38 studies with a total of 139 observations. The batch/field analysis included five studies, resulting in a dataset of 24 observations. For the batch/column data, power law relationships between pore water velocity, residence time, and sorption constants were derived. The unexplained variability in these equations was reduced, taking into account the saturation status and the packing status (disturbed-undisturbed) of the soil sample. A new regression equation was derived that allows estimating the values derived from column experiments using organic matter and bulk density with an value of 0.56. Regression analysis of the batch/column data showed that the relationship between batch- and column-derived values depends on the saturation status and packing of the soil column. Analysis of the batch/field data showed that as the batch-derived value becomes larger, field-derived values tend to be lower than the corresponding batch-derived values, and vice versa. The present dataset also showed that the variability in the ratio of batch- to column-derived value increases with increasing pore water velocity, with a maximum value approaching 3.5. American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dierauf, Timothy; Kurtz, Sarah; Riley, Evan
This paper provides a recommended method for evaluating the AC capacity of a photovoltaic (PV) generating station. It also presents companion guidance on setting the facilitys capacity guarantee value. This is a principles-based approach that incorporates plant fundamental design parameters such as loss factors, module coefficients, and inverter constraints. This method has been used to prove contract guarantees for over 700 MW of installed projects. The method is transparent, and the results are deterministic. In contrast, current industry practices incorporate statistical regression where the empirical coefficients may only characterize the collected data. Though these methods may work well when extrapolationmore » is not required, there are other situations where the empirical coefficients may not adequately model actual performance.This proposed Fundamentals Approach method provides consistent results even where regression methods start to lose fidelity.« less
[From clinical judgment to linear regression model.
Palacios-Cruz, Lino; Pérez, Marcela; Rivas-Ruiz, Rodolfo; Talavera, Juan O
2013-01-01
When we think about mathematical models, such as linear regression model, we think that these terms are only used by those engaged in research, a notion that is far from the truth. Legendre described the first mathematical model in 1805, and Galton introduced the formal term in 1886. Linear regression is one of the most commonly used regression models in clinical practice. It is useful to predict or show the relationship between two or more variables as long as the dependent variable is quantitative and has normal distribution. Stated in another way, the regression is used to predict a measure based on the knowledge of at least one other variable. Linear regression has as it's first objective to determine the slope or inclination of the regression line: Y = a + bx, where "a" is the intercept or regression constant and it is equivalent to "Y" value when "X" equals 0 and "b" (also called slope) indicates the increase or decrease that occurs when the variable "x" increases or decreases in one unit. In the regression line, "b" is called regression coefficient. The coefficient of determination (R 2 ) indicates the importance of independent variables in the outcome.
Measuring Productivity of Depot-Level Aircraft Maintenance in the Air Force Logistics Command.
1985-09-01
of Figures...... . . . . . . . . . . . . vi List of Tables . . . . . . . . . ............ vii Abstract . . . ...................... viii I...59 6. DEA Efficiency Values (Third DEA Model) . .... 62 7. DMU 5 Input Efficiencies ................ 64 vi F "-’ List of Tables Table Page I. DEA...Regression Results for 20 Months . . . ..... 68 V. Regression Results for 7 Quarters . . ..... 70 VI . Coefficients of Correlation (Using Quarterly Data
Investigating bias in squared regression structure coefficients
Nimon, Kim F.; Zientek, Linda R.; Thompson, Bruce
2015-01-01
The importance of structure coefficients and analogs of regression weights for analysis within the general linear model (GLM) has been well-documented. The purpose of this study was to investigate bias in squared structure coefficients in the context of multiple regression and to determine if a formula that had been shown to correct for bias in squared Pearson correlation coefficients and coefficients of determination could be used to correct for bias in squared regression structure coefficients. Using data from a Monte Carlo simulation, this study found that squared regression structure coefficients corrected with Pratt's formula produced less biased estimates and might be more accurate and stable estimates of population squared regression structure coefficients than estimates with no such corrections. While our findings are in line with prior literature that identified multicollinearity as a predictor of bias in squared regression structure coefficients but not coefficients of determination, the findings from this study are unique in that the level of predictive power, number of predictors, and sample size were also observed to contribute bias in squared regression structure coefficients. PMID:26217273
A critical re-evaluation of the regression model specification in the US D1 EQ-5D value function
2012-01-01
Background The EQ-5D is a generic health-related quality of life instrument (five dimensions with three levels, 243 health states), used extensively in cost-utility/cost-effectiveness analyses. EQ-5D health states are assigned values on a scale anchored in perfect health (1) and death (0). The dominant procedure for defining values for EQ-5D health states involves regression modeling. These regression models have typically included a constant term, interpreted as the utility loss associated with any movement away from perfect health. The authors of the United States EQ-5D valuation study replaced this constant with a variable, D1, which corresponds to the number of impaired dimensions beyond the first. The aim of this study was to illustrate how the use of the D1 variable in place of a constant is problematic. Methods We compared the original D1 regression model with a mathematically equivalent model with a constant term. Comparisons included implications for the magnitude and statistical significance of the coefficients, multicollinearity (variance inflation factors, or VIFs), number of calculation steps needed to determine tariff values, and consequences for tariff interpretation. Results Using the D1 variable in place of a constant shifted all dummy variable coefficients away from zero by the value of the constant, greatly increased the multicollinearity of the model (maximum VIF of 113.2 vs. 21.2), and increased the mean number of calculation steps required to determine health state values. Discussion Using the D1 variable in place of a constant constitutes an unnecessary complication of the model, obscures the fact that at least two of the main effect dummy variables are statistically nonsignificant, and complicates and biases interpretation of the tariff algorithm. PMID:22244261
A critical re-evaluation of the regression model specification in the US D1 EQ-5D value function.
Rand-Hendriksen, Kim; Augestad, Liv A; Dahl, Fredrik A
2012-01-13
The EQ-5D is a generic health-related quality of life instrument (five dimensions with three levels, 243 health states), used extensively in cost-utility/cost-effectiveness analyses. EQ-5D health states are assigned values on a scale anchored in perfect health (1) and death (0).The dominant procedure for defining values for EQ-5D health states involves regression modeling. These regression models have typically included a constant term, interpreted as the utility loss associated with any movement away from perfect health. The authors of the United States EQ-5D valuation study replaced this constant with a variable, D1, which corresponds to the number of impaired dimensions beyond the first. The aim of this study was to illustrate how the use of the D1 variable in place of a constant is problematic. We compared the original D1 regression model with a mathematically equivalent model with a constant term. Comparisons included implications for the magnitude and statistical significance of the coefficients, multicollinearity (variance inflation factors, or VIFs), number of calculation steps needed to determine tariff values, and consequences for tariff interpretation. Using the D1 variable in place of a constant shifted all dummy variable coefficients away from zero by the value of the constant, greatly increased the multicollinearity of the model (maximum VIF of 113.2 vs. 21.2), and increased the mean number of calculation steps required to determine health state values. Using the D1 variable in place of a constant constitutes an unnecessary complication of the model, obscures the fact that at least two of the main effect dummy variables are statistically nonsignificant, and complicates and biases interpretation of the tariff algorithm.
Expression profiles of loneliness-associated genes for survival prediction in cancer patients.
You, Liang-Fu; Yeh, Jia-Rong; Su, Mu-Chun
2014-01-01
Influence of loneliness on human survival has been established epidemiologically, but genomic research remains undeveloped. We identified 34 loneliness-associated genes which were statistically significant for high- lonely and low-lonely individuals. With the univariate Cox proportional hazards regression model, we obtained corresponding regression coefficients for loneliness-associated genes fo individual cancer patients. Furthermore, risk scores could be generated with the combination of gene expression level multiplied by corresponding regression coefficients of loneliness-associated genes. We verified that high-risk score cancer patients had shorter mean survival time than their low-risk score counterparts. Then we validated the loneliness-associated gene signature in three independent brain cancer cohorts with Kaplan-Meier survival curves (n=77, 85 and 191), significantly separable by log-rank test with hazard ratios (HR) >1 and p-values <0.0001 (HR=2.94, 3.82, and 1.78). Moreover, we validated the loneliness-associated gene signature in bone cancer (HR=5.10, p-value=4.69e-3), lung cancer (HR=2.86, p-value=4.71e-5), ovarian cancer (HR=1.97, p-value=3.11e-5), and leukemia (HR=2.06, p-value=1.79e-4) cohorts. The last lymphoma cohort proved to have an HR=3.50, p-value=1.15e-7. Loneliness- associated genes had good survival prediction for cancer patients, especially bone cancer patients. Our study provided the first indication that expression of loneliness-associated genes are related to survival time of cancer patients.
Statistical procedures for evaluating daily and monthly hydrologic model predictions
Coffey, M.E.; Workman, S.R.; Taraba, J.L.; Fogle, A.W.
2004-01-01
The overall study objective was to evaluate the applicability of different qualitative and quantitative methods for comparing daily and monthly SWAT computer model hydrologic streamflow predictions to observed data, and to recommend statistical methods for use in future model evaluations. Statistical methods were tested using daily streamflows and monthly equivalent runoff depths. The statistical techniques included linear regression, Nash-Sutcliffe efficiency, nonparametric tests, t-test, objective functions, autocorrelation, and cross-correlation. None of the methods specifically applied to the non-normal distribution and dependence between data points for the daily predicted and observed data. Of the tested methods, median objective functions, sign test, autocorrelation, and cross-correlation were most applicable for the daily data. The robust coefficient of determination (CD*) and robust modeling efficiency (EF*) objective functions were the preferred methods for daily model results due to the ease of comparing these values with a fixed ideal reference value of one. Predicted and observed monthly totals were more normally distributed, and there was less dependence between individual monthly totals than was observed for the corresponding predicted and observed daily values. More statistical methods were available for comparing SWAT model-predicted and observed monthly totals. The 1995 monthly SWAT model predictions and observed data had a regression Rr2 of 0.70, a Nash-Sutcliffe efficiency of 0.41, and the t-test failed to reject the equal data means hypothesis. The Nash-Sutcliffe coefficient and the R r2 coefficient were the preferred methods for monthly results due to the ability to compare these coefficients to a set ideal value of one.
AMINI, Payam; AHMADINIA, Hasan; POOROLAJAL, Jalal; MOQADDASI AMIRI, Mohammad
2016-01-01
Background: We aimed to assess the high-risk group for suicide using different classification methods includinglogistic regression (LR), decision tree (DT), artificial neural network (ANN), and support vector machine (SVM). Methods: We used the dataset of a study conducted to predict risk factors of completed suicide in Hamadan Province, the west of Iran, in 2010. To evaluate the high-risk groups for suicide, LR, SVM, DT and ANN were performed. The applied methods were compared using sensitivity, specificity, positive predicted value, negative predicted value, accuracy and the area under curve. Cochran-Q test was implied to check differences in proportion among methods. To assess the association between the observed and predicted values, Ø coefficient, contingency coefficient, and Kendall tau-b were calculated. Results: Gender, age, and job were the most important risk factors for fatal suicide attempts in common for four methods. SVM method showed the highest accuracy 0.68 and 0.67 for training and testing sample, respectively. However, this method resulted in the highest specificity (0.67 for training and 0.68 for testing sample) and the highest sensitivity for training sample (0.85), but the lowest sensitivity for the testing sample (0.53). Cochran-Q test resulted in differences between proportions in different methods (P<0.001). The association of SVM predictions and observed values, Ø coefficient, contingency coefficient, and Kendall tau-b were 0.239, 0.232 and 0.239, respectively. Conclusion: SVM had the best performance to classify fatal suicide attempts comparing to DT, LR and ANN. PMID:27957463
Morikawa, Go; Suzuka, Chihiro; Shoji, Atsushi; Shibusawa, Yoichi; Yanagida, Akio
2016-01-05
A high-throughput method for determining the octanol/water partition coefficient (P(o/w)) of a large variety of compounds exhibiting a wide range in hydrophobicity was established. The method combines a simple shake-flask method with a novel two-phase solvent system comprising an acetonitrile-phosphate buffer (0.1 M, pH 7.4)-1-octanol (25:25:4, v/v/v; AN system). The AN system partition coefficients (K(AN)) of 51 standard compounds for which log P(o/w) (at pH 7.4; log D) values had been reported were determined by single two-phase partitioning in test tubes, followed by measurement of the solute concentration in both phases using an automatic flow injection-ultraviolet detection system. The log K(AN) values were closely related to reported log D values, and the relationship could be expressed by the following linear regression equation: log D=2.8630 log K(AN) -0.1497(n=51). The relationship reveals that log D values (+8 to -8) for a large variety of highly hydrophobic and/or hydrophilic compounds can be estimated indirectly from the narrow range of log K(AN) values (+3 to -3) determined using the present method. Furthermore, log K(AN) values for highly polar compounds for which no log D values have been reported, such as amino acids, peptides, proteins, nucleosides, and nucleotides, can be estimated using the present method. The wide-ranging log D values (+5.9 to -7.5) of these molecules were estimated for the first time from their log K(AN) values and the above regression equation. Copyright © 2015 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Pecorella, Patricia A.; Bowers, David G.
Multiple regression in a double cross-validated design was used to predict two performance measures (total variable expense and absence rate) by multi-month period in five industrial firms. The regressions do cross-validate, and produce multiple coefficients which display both concurrent and predictive effects, peaking 18 months to two years…
Simple and multiple linear regression: sample size considerations.
Hanley, James A
2016-11-01
The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright © 2016 Elsevier Inc. All rights reserved.
Boligon, A A; Baldi, F; Mercadante, M E Z; Lobo, R B; Pereira, R J; Albuquerque, L G
2011-06-28
We quantified the potential increase in accuracy of expected breeding value for weights of Nelore cattle, from birth to mature age, using multi-trait and random regression models on Legendre polynomials and B-spline functions. A total of 87,712 weight records from 8144 females were used, recorded every three months from birth to mature age from the Nelore Brazil Program. For random regression analyses, all female weight records from birth to eight years of age (data set I) were considered. From this general data set, a subset was created (data set II), which included only nine weight records: at birth, weaning, 365 and 550 days of age, and 2, 3, 4, 5, and 6 years of age. Data set II was analyzed using random regression and multi-trait models. The model of analysis included the contemporary group as fixed effects and age of dam as a linear and quadratic covariable. In the random regression analyses, average growth trends were modeled using a cubic regression on orthogonal polynomials of age. Residual variances were modeled by a step function with five classes. Legendre polynomials of fourth and sixth order were utilized to model the direct genetic and animal permanent environmental effects, respectively, while third-order Legendre polynomials were considered for maternal genetic and maternal permanent environmental effects. Quadratic polynomials were applied to model all random effects in random regression models on B-spline functions. Direct genetic and animal permanent environmental effects were modeled using three segments or five coefficients, and genetic maternal and maternal permanent environmental effects were modeled with one segment or three coefficients in the random regression models on B-spline functions. For both data sets (I and II), animals ranked differently according to expected breeding value obtained by random regression or multi-trait models. With random regression models, the highest gains in accuracy were obtained at ages with a low number of weight records. The results indicate that random regression models provide more accurate expected breeding values than the traditionally finite multi-trait models. Thus, higher genetic responses are expected for beef cattle growth traits by replacing a multi-trait model with random regression models for genetic evaluation. B-spline functions could be applied as an alternative to Legendre polynomials to model covariance functions for weights from birth to mature age.
Optimization of Regression Models of Experimental Data Using Confirmation Points
NASA Technical Reports Server (NTRS)
Ulbrich, N.
2010-01-01
A new search metric is discussed that may be used to better assess the predictive capability of different math term combinations during the optimization of a regression model of experimental data. The new search metric can be determined for each tested math term combination if the given experimental data set is split into two subsets. The first subset consists of data points that are only used to determine the coefficients of the regression model. The second subset consists of confirmation points that are exclusively used to test the regression model. The new search metric value is assigned after comparing two values that describe the quality of the fit of each subset. The first value is the standard deviation of the PRESS residuals of the data points. The second value is the standard deviation of the response residuals of the confirmation points. The greater of the two values is used as the new search metric value. This choice guarantees that both standard deviations are always less or equal to the value that is used during the optimization. Experimental data from the calibration of a wind tunnel strain-gage balance is used to illustrate the application of the new search metric. The new search metric ultimately generates an optimized regression model that was already tested at regression model independent confirmation points before it is ever used to predict an unknown response from a set of regressors.
Mulhern, Brendan; Shah, Koonal; Janssen, Mathieu F Bas; Longworth, Louise; Ibbotson, Rachel
2016-01-01
Health states defined by multiattribute instruments such as the EuroQol five-dimensional questionnaire with five response levels (EQ-5D-5L) can be valued using time trade-off (TTO) or discrete choice experiment (DCE) methods. A key feature of the tasks is the order in which the health state dimensions are presented. Respondents may use various heuristics to complete the tasks, and therefore the order of the dimensions may impact on the importance assigned to particular states. To assess the impact of different EQ-5D-5L dimension orders on health state values. Preferences for EQ-5D-5L health states were elicited from a broadly representative sample of members of the UK general public. Respondents valued EQ-5D-5L health states using TTO and DCE methods across one of three dimension orderings via face-to-face computer-assisted personal interviews. Differences in mean values and the size of the health dimension coefficients across the arms were compared using difference testing and regression analyses. Descriptive analysis suggested some differences between the mean TTO health state values across the different dimension orderings, but these were not systematic. Regression analysis suggested that the magnitude of the dimension coefficients differs across the different dimension orderings (for both TTO and DCE), but there was no clear pattern. There is some evidence that the order in which the dimensions are presented impacts on the coefficients, which may impact on the health state values provided. The order of dimensions is a key consideration in the design of health state valuation studies. Copyright © 2016. Published by Elsevier Inc.
THE DISTRIBUTION OF COOK’S D STATISTIC
Muller, Keith E.; Mok, Mario Chen
2013-01-01
Cook (1977) proposed a diagnostic to quantify the impact of deleting an observation on the estimated regression coefficients of a General Linear Univariate Model (GLUM). Simulations of models with Gaussian response and predictors demonstrate that his suggestion of comparing the diagnostic to the median of the F for overall regression captures an erratically varying proportion of the values. We describe the exact distribution of Cook’s statistic for a GLUM with Gaussian predictors and response. We also present computational forms, simple approximations, and asymptotic results. A simulation supports the accuracy of the results. The methods allow accurate evaluation of a single value or the maximum value from a regression analysis. The approximations work well for a single value, but less well for the maximum. In contrast, the cut-point suggested by Cook provides widely varying tail probabilities. As with all diagnostics, the data analyst must use scientific judgment in deciding how to treat highlighted observations. PMID:24363487
Standards for Standardized Logistic Regression Coefficients
ERIC Educational Resources Information Center
Menard, Scott
2011-01-01
Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…
Carreon, Leah Y; Bratcher, Kelly R; Das, Nandita; Nienhuis, Jacob B; Glassman, Steven D
2014-09-01
The Neck Disability Index (NDI) and numeric rating scales (0 to 10) for neck pain and arm pain are widely used cervical spine disease-specific measures. Recent studies have shown that there is a strong relationship between the SF-6D and the NDI such that using a simple linear regression allows for the estimation of an SF-6D value from the NDI alone. Due to ease of administration and scoring, the EQ-5D is increasingly being used as a measure of utility in the clinical setting. The purpose of this study is to determine if the EQ-5D values can be estimated from commonly available cervical spine disease-specific health-related quality of life measures, much like the SF-6D. The EQ-5D, NDI, neck pain score, and arm pain score were prospectively collected in 3732 patients who presented to the authors' clinic with degenerative cervical spine disorders. Correlation coefficients for paired observations from multiple time points between the NDI, neck pain and arm pain scores, and EQ-5D were determined. Regression models were built to estimate the EQ-5D values from the NDI, neck pain, and arm pain scores. The mean age of the 3732 patients was 53.3 ± 12.2 years, and 43% were male. Correlations between the EQ-5D and the NDI, neck pain score, and arm pain score were statistically significant (p < 0.0001), with correlation coefficients of -0.77, -0.62, and -0.50, respectively. The regression equation 0.98947 + (-0.00705 × NDI) + (-0.00875 × arm pain score) + (-0.00877 × neck pain score) to predict EQ-5D had an R-square of 0.62 and a root mean square error (RMSE) of 0.146. The model using NDI alone had an R-square of 0.59 and a RMSE of 0.150. The model using the individual NDI items had an R-square of 0.46 and an RMSE of 0.172. The correlation coefficient between the observed and estimated EQ-5D scores was 0.79. There was no statistically significant difference between the actual EQ-5D score (0.603 ± 0.235) and the estimated EQ-5D score (0.603 ± 0.185) using the NDI, neck pain score, and arm pain score regression model. However, rounding off the coefficients to fewer than 5 decimal places produced less accurate results. The regression model estimating the EQ-5D from the NDI, neck pain score, and arm pain score accounted for 60% of the variability of the EQ-5D with a relatively large RMSE. This regression model may not be sufficient to accurately or reliably estimate actual EQ-5D values.
Practical Session: Simple Linear Regression
NASA Astrophysics Data System (ADS)
Clausel, M.; Grégoire, G.
2014-12-01
Two exercises are proposed to illustrate the simple linear regression. The first one is based on the famous Galton's data set on heredity. We use the lm R command and get coefficients estimates, standard error of the error, R2, residuals …In the second example, devoted to data related to the vapor tension of mercury, we fit a simple linear regression, predict values, and anticipate on multiple linear regression. This pratical session is an excerpt from practical exercises proposed by A. Dalalyan at EPNC (see Exercises 1 and 2 of http://certis.enpc.fr/~dalalyan/Download/TP_ENPC_4.pdf).
Identification of transplanting stage of rice using Sentinel-1 data
NASA Astrophysics Data System (ADS)
Hongo, C.; Tosa, T.; Tamura, E.; Sigit, G.; Barus, B.
2017-12-01
As the adaptation of climate change, the Government of Indonesia has launched agricultural insurance program for damage of rice by drought, flood and pest and disease. For assessment of the damage ratio and calculation of indemnity, extraction of paddy field and identification of transplanting stage are key issues. In this research, we conducted identification of rice transplanting stage in dry season of 2015, using data from Sentinel-1, for paddy in Cianjur, West Java, Indonesia. As the first step, time series order of backscattering coefficient was analyzed about paddy, forest, villages and fish farming ponds with use of Sentinel-1 data acquired on April 1, April 13, April 25, May 7, May 19, June 24, July 18 and August 11. The result shows that the backscattering coefficient of paddy substantially decreased from data on May 7 and reached minimum value and then after increased toward June. A paddy area showing this change was almost the same area where rice was at harvesting stage and we did field investigation work from August 11 to 13. Considering a growth period of rice in our research site was about 110 days, so the result supported the fact that transplantation of rice was done around May 7. On the other hand, backscattering coefficient of forest, villages and fish farming ponds was constant and showed clear difference from the coefficient of paddy. As the next step, minimum and maximum value of backscattering coefficient were extracted from the data of May 7, May 19 and June 24, respectively. Then increase amount was calculated by deducting the minimum value from the maximum. Finally, using the minimum value of backscattering coefficient and the increased amount, a classification of image was made to identify transplanting stage through maximum likelihood method, decision tree method and threshold setting method (regression analysis by 3σ-rule). As the result, the maximum likelihood method made the most accurate distinguishment about transplanting stage while the decision tree method showed tendency to underestimate a paddy area already planted. As to the threshold setting method (regression analysis by 3σ-rule), its distinguishment accuracy was better than those of other methods about a paddy area adjacent to forest and villages of which backscattering coefficient was influenced by other sources' coefficients.
NASA Astrophysics Data System (ADS)
Ari, I. R. D.; Hasyim, A. W.; Pratama, B. A.; Helmy, M.; Sheilla, M. N.
2017-06-01
Poverty is a problem that requires attention from the government especially in developing countries such as Indonesia. This Research takes Place at Kasembon District because it has 53,19% family below poverty line in the region. The purpose of this research is to measure poverty based on 3 poverty indicators published by World Bank and 1 multidimensional poverty index. Furthermore, this research invesitigas the relationship between poverty with social and infrastructure in Kasembon District. This study using social network analysis, hot spots analysis, and regression analysis with ordinary least squares. From the poverty indicators known that Pondokagung Village has the highest poverty rate compared to another region. Results from regression model indicate that social and infrastructure affecting poverty in Kasembon District. Social parameter that affecting poverty is density. Infrastructure parameter that affecting poverty is length of paved road. Coefficient value of density is the largest in the model. Therefore it can be concluded that social factors can give more opportunity to reduce poverty rates in Kasembon District. In the local model of paved road coefficient, it is known that the coefficient for each village has not much different value from the global model.
Freitas, Alex A; Limbu, Kriti; Ghafourian, Taravat
2015-01-01
Volume of distribution is an important pharmacokinetic property that indicates the extent of a drug's distribution in the body tissues. This paper addresses the problem of how to estimate the apparent volume of distribution at steady state (Vss) of chemical compounds in the human body using decision tree-based regression methods from the area of data mining (or machine learning). Hence, the pros and cons of several different types of decision tree-based regression methods have been discussed. The regression methods predict Vss using, as predictive features, both the compounds' molecular descriptors and the compounds' tissue:plasma partition coefficients (Kt:p) - often used in physiologically-based pharmacokinetics. Therefore, this work has assessed whether the data mining-based prediction of Vss can be made more accurate by using as input not only the compounds' molecular descriptors but also (a subset of) their predicted Kt:p values. Comparison of the models that used only molecular descriptors, in particular, the Bagging decision tree (mean fold error of 2.33), with those employing predicted Kt:p values in addition to the molecular descriptors, such as the Bagging decision tree using adipose Kt:p (mean fold error of 2.29), indicated that the use of predicted Kt:p values as descriptors may be beneficial for accurate prediction of Vss using decision trees if prior feature selection is applied. Decision tree based models presented in this work have an accuracy that is reasonable and similar to the accuracy of reported Vss inter-species extrapolations in the literature. The estimation of Vss for new compounds in drug discovery will benefit from methods that are able to integrate large and varied sources of data and flexible non-linear data mining methods such as decision trees, which can produce interpretable models. Graphical AbstractDecision trees for the prediction of tissue partition coefficient and volume of distribution of drugs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brink, Carsten, E-mail: carsten.brink@rsyd.dk; Laboratory of Radiation Physics, Odense University Hospital; Bernchou, Uffe
2014-07-15
Purpose: Large interindividual variations in volume regression of non-small cell lung cancer (NSCLC) are observable on standard cone beam computed tomography (CBCT) during fractionated radiation therapy. Here, a method for automated assessment of tumor volume regression is presented and its potential use in response adapted personalized radiation therapy is evaluated empirically. Methods and Materials: Automated deformable registration with calculation of the Jacobian determinant was applied to serial CBCT scans in a series of 99 patients with NSCLC. Tumor volume at the end of treatment was estimated on the basis of the first one third and two thirds of the scans.more » The concordance between estimated and actual relative volume at the end of radiation therapy was quantified by Pearson's correlation coefficient. On the basis of the estimated relative volume, the patients were stratified into 2 groups having volume regressions below or above the population median value. Kaplan-Meier plots of locoregional disease-free rate and overall survival in the 2 groups were used to evaluate the predictive value of tumor regression during treatment. Cox proportional hazards model was used to adjust for other clinical characteristics. Results: Automatic measurement of the tumor regression from standard CBCT images was feasible. Pearson's correlation coefficient between manual and automatic measurement was 0.86 in a sample of 9 patients. Most patients experienced tumor volume regression, and this could be quantified early into the treatment course. Interestingly, patients with pronounced volume regression had worse locoregional tumor control and overall survival. This was significant on patient with non-adenocarcinoma histology. Conclusions: Evaluation of routinely acquired CBCT images during radiation therapy provides biological information on the specific tumor. This could potentially form the basis for personalized response adaptive therapy.« less
Ranade, A K; Pandey, M; Datta, D
2013-01-01
A study was conducted to evaluate the absorbed rate coefficient of (238)U, (232)Th, (40)K and (137)Cs present in soil. A total of 31 soil samples and the corresponding terrestrial dose rates at 1 m from different locations were taken around the Anushaktinagar region, where the litho-logy is dominated by red soil. A linear regression model was developed for the estimation of these factors. The estimated coefficients (nGy h(-1) Bq(-1) kg(-1)) were 0.454, 0.586, 0.035 and 0.392, respectively. The factors calculated were in good agreement with the literature values.
NASA Astrophysics Data System (ADS)
He, Anhua; Singh, Ramesh P.; Sun, Zhaohua; Ye, Qing; Zhao, Gang
2016-07-01
The earth tide, atmospheric pressure, precipitation and earthquake fluctuations, especially earthquake greatly impacts water well levels, thus anomalous co-seismic changes in ground water levels have been observed. In this paper, we have used four different models, simple linear regression (SLR), multiple linear regression (MLR), principal component analysis (PCA) and partial least squares (PLS) to compute the atmospheric pressure and earth tidal effects on water level. Furthermore, we have used the Akaike information criterion (AIC) to study the performance of various models. Based on the lowest AIC and sum of squares for error values, the best estimate of the effects of atmospheric pressure and earth tide on water level is found using the MLR model. However, MLR model does not provide multicollinearity between inputs, as a result the atmospheric pressure and earth tidal response coefficients fail to reflect the mechanisms associated with the groundwater level fluctuations. On the premise of solving serious multicollinearity of inputs, PLS model shows the minimum AIC value. The atmospheric pressure and earth tidal response coefficients show close response with the observation using PLS model. The atmospheric pressure and the earth tidal response coefficients are found to be sensitive to the stress-strain state using the observed data for the period 1 April-8 June 2008 of Chuan 03# well. The transient enhancement of porosity of rock mass around Chuan 03# well associated with the Wenchuan earthquake (Mw = 7.9 of 12 May 2008) that has taken its original pre-seismic level after 13 days indicates that the co-seismic sharp rise of water well could be induced by static stress change, rather than development of new fractures.
NASA Astrophysics Data System (ADS)
Darmayasa, J. B.; Wahyudin; Mulyana, T.
2018-01-01
Ethnomathematics may be the connecting bridge between culture and technology and arts. Therefore, the exploration of mathematics values that intersects with cultural anthropology should be significantly conducted. One case containing such issue is the construction of Traditional House of Saka Roras in Bali. Thus, this research aimed to explore the mathematic concept adopted in the construction of such traditional Bale (house) located in Songan Village, Kintamani, Bali. Specifically, this research also aimed to investigate the selection of linear regression coefficient for the saka (pillar) in the Bale. This research applied Embedded Mix-Method Design. Meanwhile, the data collection was conducted by interview, observation and measurement of pillars of 32 Bale Saka Roras. The result of this research revealed that the connection between the width and height of pillars was stated in the formula Y = 26,3 + 18,2X, where X acted as stimulus variable. The coefficient value amounted to 18.2 showed that most preceding architects in Songan Village were more likely to use 19 as the coefficient towards the pillar width than the other coefficients such as 17, 20 and 21 as mentioned in book/palm-leaf manuscript entitled Kosala-Kosali. The last but not least, the researchers also figured out that the pillar width depended on the length of the house-owner candidate’s index finger.
Aziz, Shamsul Akmar Ab; Nuawi, Mohd Zaki; Nor, Mohd Jailani Mohd
2015-01-01
The objective of this study was to present a new method for determination of hand-arm vibration (HAV) in Malaysian Army (MA) three-tonne truck steering wheels based on changes in vehicle speed using regression model and the statistical analysis method known as Integrated Kurtosis-Based Algorithm for Z-Notch Filter Technique Vibro (I-kaz Vibro). The test was conducted for two different road conditions, tarmac and dirt roads. HAV exposure was measured using a Brüel & Kjær Type 3649 vibration analyzer, which is capable of recording HAV exposures from steering wheels. The data was analyzed using I-kaz Vibro to determine the HAV values in relation to varying speeds of a truck and to determine the degree of data scattering for HAV data signals. Based on the results obtained, HAV experienced by drivers can be determined using the daily vibration exposure A(8), I-kaz Vibro coefficient (Ƶ(v)(∞)), and the I-kaz Vibro display. The I-kaz Vibro displays also showed greater scatterings, indicating that the values of Ƶ(v)(∞) and A(8) were increasing. Prediction of HAV exposure was done using the developed regression model and graphical representations of Ƶ(v)(∞). The results of the regression model showed that Ƶ(v)(∞) increased when the vehicle speed and HAV exposure increased. For model validation, predicted and measured noise exposures were compared, and high coefficient of correlation (R(2)) values were obtained, indicating that good agreement was obtained between them. By using the developed regression model, we can easily predict HAV exposure from steering wheels for HAV exposure monitoring.
Determination of suitable drying curve model for bread moisture loss during baking
NASA Astrophysics Data System (ADS)
Soleimani Pour-Damanab, A. R.; Jafary, A.; Rafiee, S.
2013-03-01
This study presents mathematical modelling of bread moisture loss or drying during baking in a conventional bread baking process. In order to estimate and select the appropriate moisture loss curve equation, 11 different models, semi-theoretical and empirical, were applied to the experimental data and compared according to their correlation coefficients, chi-squared test and root mean square error which were predicted by nonlinear regression analysis. Consequently, of all the drying models, a Page model was selected as the best one, according to the correlation coefficients, chi-squared test, and root mean square error values and its simplicity. Mean absolute estimation error of the proposed model by linear regression analysis for natural and forced convection modes was 2.43, 4.74%, respectively.
The Study of Rain Specific Attenuation for the Prediction of Satellite Propagation in Malaysia
NASA Astrophysics Data System (ADS)
Mandeep, J. S.; Ng, Y. Y.; Abdullah, H.; Abdullah, M.
2010-06-01
Specific attenuation is the fundamental quantity in the calculation of rain attenuation for terrestrial path and slant paths representing as rain attenuation per unit distance (dB/km). Specific attenuation is an important element in developing the predicted rain attenuation model. This paper deals with the empirical determination of the power law coefficients which allow calculating the specific attenuation in dB/km from the knowledge of the rain rate in mm/h. The main purpose of the paper is to obtain the coefficients of k and α of power law relationship between specific attenuation. Three years (from 1st January 2006 until 31st December 2008) rain gauge and beacon data taken from USM, Nibong Tebal have been used to do the empirical procedure analysis of rain specific attenuation. The data presented are semi-empirical in nature. A year-to-year variation of the coefficients has been indicated and the empirical measured data was compared with ITU-R provided regression coefficient. The result indicated that the USM empirical measured data was significantly vary from ITU-R predicted value. Hence, ITU-R recommendation for regression coefficients of rain specific attenuation is not suitable for predicting rain attenuation at Malaysia.
Ridge: a computer program for calculating ridge regression estimates
Donald E. Hilt; Donald W. Seegrist
1977-01-01
Least-squares coefficients for multiple-regression models may be unstable when the independent variables are highly correlated. Ridge regression is a biased estimation procedure that produces stable estimates of the coefficients. Ridge regression is discussed, and a computer program for calculating the ridge coefficients is presented.
Revisiting crash spatial heterogeneity: A Bayesian spatially varying coefficients approach.
Xu, Pengpeng; Huang, Helai; Dong, Ni; Wong, S C
2017-01-01
This study was performed to investigate the spatially varying relationships between crash frequency and related risk factors. A Bayesian spatially varying coefficients model was elaborately introduced as a methodological alternative to simultaneously account for the unstructured and spatially structured heterogeneity of the regression coefficients in predicting crash frequencies. The proposed method was appealing in that the parameters were modeled via a conditional autoregressive prior distribution, which involved a single set of random effects and a spatial correlation parameter with extreme values corresponding to pure unstructured or pure spatially correlated random effects. A case study using a three-year crash dataset from the Hillsborough County, Florida, was conducted to illustrate the proposed model. Empirical analysis confirmed the presence of both unstructured and spatially correlated variations in the effects of contributory factors on severe crash occurrences. The findings also suggested that ignoring spatially structured heterogeneity may result in biased parameter estimates and incorrect inferences, while assuming the regression coefficients to be spatially clustered only is probably subject to the issue of over-smoothness. Copyright © 2016 Elsevier Ltd. All rights reserved.
Effects of integration time on in-water radiometric profiles.
D'Alimonte, Davide; Zibordi, Giuseppe; Kajiyama, Tamito
2018-03-05
This work investigates the effects of integration time on in-water downward irradiance E d , upward irradiance E u and upwelling radiance L u profile data acquired with free-fall hyperspectral systems. Analyzed quantities are the subsurface value and the diffuse attenuation coefficient derived by applying linear and non-linear regression schemes. Case studies include oligotrophic waters (Case-1), as well as waters dominated by Colored Dissolved Organic Matter (CDOM) and Non-Algal Particles (NAP). Assuming a 24-bit digitization, measurements resulting from the accumulation of photons over integration times varying between 8 and 2048ms are evaluated at depths corresponding to: 1) the beginning of each integration interval (Fst); 2) the end of each integration interval (Lst); 3) the averages of Fst and Lst values (Avg); and finally 4) the values weighted accounting for the diffuse attenuation coefficient of water (Wgt). Statistical figures show that the effects of integration time can bias results well above 5% as a function of the depth definition. Results indicate the validity of the Wgt depth definition and the fair applicability of the Avg one. Instead, both the Fst and Lst depths should not be adopted since they may introduce pronounced biases in E u and L u regression products for highly absorbing waters. Finally, the study reconfirms the relevance of combining multiple radiometric casts into a single profile to increase precision of regression products.
NASA Astrophysics Data System (ADS)
Gasperini, Paolo; Lolli, Barbara
2014-01-01
The argument proposed by Wason et al. that the conversion of magnitudes from a scale (e.g. Ms or mb) to another (e.g. Mw), using the coefficients computed by the general orthogonal regression method (Fuller) is biased if the observed values of the predictor (independent) variable are used in the equation as well as the methodology they suggest to estimate the supposedly true values of the predictor variable are wrong for a number of theoretical and empirical reasons. Hence, we advise against the use of such methodology for magnitude conversions.
Prediction of dimethyl disulfide levels from biosolids using statistical modeling.
Gabriel, Steven A; Vilalai, Sirapong; Arispe, Susanna; Kim, Hyunook; McConnell, Laura L; Torrents, Alba; Peot, Christopher; Ramirez, Mark
2005-01-01
Two statistical models were used to predict the concentration of dimethyl disulfide (DMDS) released from biosolids produced by an advanced wastewater treatment plant (WWTP) located in Washington, DC, USA. The plant concentrates sludge from primary sedimentation basins in gravity thickeners (GT) and sludge from secondary sedimentation basins in dissolved air flotation (DAF) thickeners. The thickened sludge is pumped into blending tanks and then fed into centrifuges for dewatering. The dewatered sludge is then conditioned with lime before trucking out from the plant. DMDS, along with other volatile sulfur and nitrogen-containing chemicals, is known to contribute to biosolids odors. These models identified oxidation/reduction potential (ORP) values of a GT and DAF, the amount of sludge dewatered by centrifuges, and the blend ratio between GT thickened sludge and DAF thickened sludge in blending tanks as control variables. The accuracy of the developed regression models was evaluated by checking the adjusted R2 of the regression as well as the signs of coefficients associated with each variable. In general, both models explained observed DMDS levels in sludge headspace samples. The adjusted R2 value of the regression models 1 and 2 were 0.79 and 0.77, respectively. Coefficients for each regression model also had the correct sign. Using the developed models, plant operators can adjust the controllable variables to proactively decrease this odorant. Therefore, these models are a useful tool in biosolids management at WWTPs.
Huang, Hairong; Xu, Zanzan; Shao, Xianhong; Wismeijer, Daniel; Sun, Ping; Wang, Jingxiao
2017-01-01
Objectives This study identified potential general influencing factors for a mathematical prediction of implant stability quotient (ISQ) values in clinical practice. Methods We collected the ISQ values of 557 implants from 2 different brands (SICace and Osstem) placed by 2 surgeons in 336 patients. Surgeon 1 placed 329 SICace implants, and surgeon 2 placed 113 SICace implants and 115 Osstem implants. ISQ measurements were taken at T1 (immediately after implant placement) and T2 (before dental restoration). A multivariate linear regression model was used to analyze the influence of the following 11 candidate factors for stability prediction: sex, age, maxillary/mandibular location, bone type, immediate/delayed implantation, bone grafting, insertion torque, I-stage or II-stage healing pattern, implant diameter, implant length and T1-T2 time interval. Results The need for bone grafting as a predictor significantly influenced ISQ values in all three groups at T1 (weight coefficients ranging from -4 to -5). In contrast, implant diameter consistently influenced the ISQ values in all three groups at T2 (weight coefficients ranging from 3.4 to 4.2). Other factors, such as sex, age, I/II-stage implantation and bone type, did not significantly influence ISQ values at T2, and implant length did not significantly influence ISQ values at T1 or T2. Conclusions These findings provide a rational basis for mathematical models to quantitatively predict the ISQ values of implants in clinical practice. PMID:29084260
Determination of drying kinetics and convective heat transfer coefficients of ginger slices
NASA Astrophysics Data System (ADS)
Akpinar, Ebru Kavak; Toraman, Seda
2016-10-01
In the present work, the effects of some parametric values on convective heat transfer coefficients and the thin layer drying process of ginger slices were investigated. Drying was done in the laboratory by using cyclone type convective dryer. The drying air temperature was varied as 40, 50, 60 and 70 °C and the air velocity is 0.8, 1.5 and 3 m/s. All drying experiments had only falling rate period. The drying data were fitted to the twelve mathematical models and performance of these models was investigated by comparing the determination of coefficient ( R 2), reduced Chi-square ( χ 2) and root mean square error between the observed and predicted moisture ratios. The effective moisture diffusivity and activation energy were calculated using an infinite series solution of Fick's diffusion equation. The average effective moisture diffusivity values and activation energy values varied from 2.807 × 10-10 to 6.977 × 10-10 m2/s and 19.313-22.722 kJ/mol over the drying air temperature and velocity range, respectively. Experimental data was used to evaluate the values of constants in Nusselt number expression by using linear regression analysis and consequently, convective heat transfer coefficients were determined in forced convection mode. Convective heat transfer coefficient of ginger slices showed changes in ranges 0.33-2.11 W/m2 °C.
Determination of sedimentation coefficients for small peptides.
Schuck, P; MacPhee, C E; Howlett, G J
1998-01-01
Direct fitting of sedimentation velocity data with numerical solutions of the Lamm equations has been exploited to obtain sedimentation coefficients for single solutes under conditions where solvent and solution plateaus are either not available or are transient. The calculated evolution was initialized with the first experimental scan and nonlinear regression was employed to obtain best-fit values for the sedimentation and diffusion coefficients. General properties of the Lamm equations as data analysis tools were examined. This method was applied to study a set of small peptides containing amphipathic heptad repeats with the general structure Ac-YS-(AKEAAKE)nGAR-NH2, n = 2, 3, or 4. Sedimentation velocity analysis indicated single sedimenting species with sedimentation coefficients (s(20,w) values) of 0.37, 0.45, and 0.52 S, respectively, in good agreement with sedimentation coefficients predicted by hydrodynamic theory. The described approach can be applied to synthetic boundary and conventional loading experiments, and can be extended to analyze sedimentation data for both large and small macromolecules in order to define shape, heterogeneity, and state of association. PMID:9449347
Influence of soil pH on the sorption of ionizable chemicals: modeling advances.
Franco, Antonio; Fu, Wenjing; Trapp, Stefan
2009-03-01
The soil-water distribution coefficient of ionizable chemicals (K(d)) depends on the soil acidity, mainly because the pH governs speciation. Using pH-specific K(d) values normalized to organic carbon (K(OC)) from the literature, a method was developed to estimate the K(OC) of monovalent organic acids and bases. The regression considers pH-dependent speciation and species-specific partition coefficients, calculated from the dissociation constant (pK(a)) and the octanol-water partition coefficient of the neutral molecule (log P(n)). Probably because of the lower pH near the organic colloid-water interface, the optimal pH to model dissociation was lower than the bulk soil pH. The knowledge of the soil pH allows calculation of the fractions of neutral and ionic molecules in the system, thus improving the existing regression for acids. The same approach was not successful with bases, for which the impact of pH on the total sorption is contrasting. In fact, the shortcomings of the model assumptions affect the predictive power for acids and for bases differently. We evaluated accuracy and limitations of the regressions for their use in the environmental fate assessment of ionizable chemicals.
Modelling the co-evolution of indirect genetic effects and inherited variability.
Marjanovic, Jovana; Mulder, Han A; Rönnegård, Lars; Bijma, Piter
2018-03-28
When individuals interact, their phenotypes may be affected not only by their own genes but also by genes in their social partners. This phenomenon is known as Indirect Genetic Effects (IGEs). In aquaculture species and some plants, however, competition not only affects trait levels of individuals, but also inflates variability of trait values among individuals. In the field of quantitative genetics, the variability of trait values has been studied as a quantitative trait in itself, and is often referred to as inherited variability. Such studies, however, consider only the genetic effect of the focal individual on trait variability and do not make a connection to competition. Although the observed phenotypic relationship between competition and variability suggests an underlying genetic relationship, the current quantitative genetic models of IGE and inherited variability do not allow for such a relationship. The lack of quantitative genetic models that connect IGEs to inherited variability limits our understanding of the potential of variability to respond to selection, both in nature and agriculture. Models of trait levels, for example, show that IGEs may considerably change heritable variation in trait values. Currently, we lack the tools to investigate whether this result extends to variability of trait values. Here we present a model that integrates IGEs and inherited variability. In this model, the target phenotype, say growth rate, is a function of the genetic and environmental effects of the focal individual and of the difference in trait value between the social partner and the focal individual, multiplied by a regression coefficient. The regression coefficient is a genetic trait, which is a measure of cooperation; a negative value indicates competition, a positive value cooperation, and an increasing value due to selection indicates the evolution of cooperation. In contrast to the existing quantitative genetic models, our model allows for co-evolution of IGEs and variability, as the regression coefficient can respond to selection. Our simulations show that the model results in increased variability of body weight with increasing competition. When competition decreases, i.e., cooperation evolves, variability becomes significantly smaller. Hence, our model facilitates quantitative genetic studies on the relationship between IGEs and inherited variability. Moreover, our findings suggest that we may have been overlooking an entire level of genetic variation in variability, the one due to IGEs.
Bowen, Stephen R; Chappell, Richard J; Bentzen, Søren M; Deveau, Michael A; Forrest, Lisa J; Jeraj, Robert
2012-01-01
Purpose To quantify associations between pre-radiotherapy and post-radiotherapy PET parameters via spatially resolved regression. Materials and methods Ten canine sinonasal cancer patients underwent PET/CT scans of [18F]FDG (FDGpre), [18F]FLT (FLTpre), and [61Cu]Cu-ATSM (Cu-ATSMpre). Following radiotherapy regimens of 50 Gy in 10 fractions, veterinary patients underwent FDG PET/CT scans at three months (FDGpost). Regression of standardized uptake values in baseline FDGpre, FLTpre and Cu-ATSMpre tumour voxels to those in FDGpost images was performed for linear, log-linear, generalized-linear and mixed-fit linear models. Goodness-of-fit in regression coefficients was assessed by R2. Hypothesis testing of coefficients over the patient population was performed. Results Multivariate linear model fits of FDGpre to FDGpost were significantly positive over the population (FDGpost~0.17 FDGpre, p=0.03), and classified slopes of RECIST non-responders and responders to be different (0.37 vs. 0.07, p=0.01). Generalized-linear model fits related FDGpre to FDGpost by a linear power law (FDGpost~FDGpre0.93, p<0.001). Univariate mixture model fits of FDGpre improved R2 from 0.17 to 0.52. Neither baseline FLT PET nor Cu-ATSM PET uptake contributed statistically significant multivariate regression coefficients. Conclusions Spatially resolved regression analysis indicates that pre-treatment FDG PET uptake is most strongly associated with three-month post-treatment FDG PET uptake in this patient population, though associations are histopathology-dependent. PMID:22682748
NASA Astrophysics Data System (ADS)
Gusriani, N.; Firdaniza
2018-03-01
The existence of outliers on multiple linear regression analysis causes the Gaussian assumption to be unfulfilled. If the Least Square method is forcedly used on these data, it will produce a model that cannot represent most data. For that, we need a robust regression method against outliers. This paper will compare the Minimum Covariance Determinant (MCD) method and the TELBS method on secondary data on the productivity of phytoplankton, which contains outliers. Based on the robust determinant coefficient value, MCD method produces a better model compared to TELBS method.
Modeling of adipose/blood partition coefficient for environmental chemicals.
Papadaki, K C; Karakitsios, S P; Sarigiannis, D A
2017-12-01
A Quantitative Structure Activity Relationship (QSAR) model was developed in order to predict the adipose/blood partition coefficient of environmental chemical compounds. The first step of QSAR modeling was the collection of inputs. Input data included the experimental values of adipose/blood partition coefficient and two sets of molecular descriptors for 67 organic chemical compounds; a) the descriptors from Linear Free Energy Relationship (LFER) and b) the PaDEL descriptors. The datasets were split to training and prediction set and were analysed using two statistical methods; Genetic Algorithm based Multiple Linear Regression (GA-MLR) and Artificial Neural Networks (ANN). The models with LFER and PaDEL descriptors, coupled with ANN, produced satisfying performance results. The fitting performance (R 2 ) of the models, using LFER and PaDEL descriptors, was 0.94 and 0.96, respectively. The Applicability Domain (AD) of the models was assessed and then the models were applied to a large number of chemical compounds with unknown values of adipose/blood partition coefficient. In conclusion, the proposed models were checked for fitting, validity and applicability. It was demonstrated that they are stable, reliable and capable to predict the values of adipose/blood partition coefficient of "data poor" chemical compounds that fall within the applicability domain. Copyright © 2017. Published by Elsevier Ltd.
Background stratified Poisson regression analysis of cohort data.
Richardson, David B; Langholz, Bryan
2012-03-01
Background stratified Poisson regression is an approach that has been used in the analysis of data derived from a variety of epidemiologically important studies of radiation-exposed populations, including uranium miners, nuclear industry workers, and atomic bomb survivors. We describe a novel approach to fit Poisson regression models that adjust for a set of covariates through background stratification while directly estimating the radiation-disease association of primary interest. The approach makes use of an expression for the Poisson likelihood that treats the coefficients for stratum-specific indicator variables as 'nuisance' variables and avoids the need to explicitly estimate the coefficients for these stratum-specific parameters. Log-linear models, as well as other general relative rate models, are accommodated. This approach is illustrated using data from the Life Span Study of Japanese atomic bomb survivors and data from a study of underground uranium miners. The point estimate and confidence interval obtained from this 'conditional' regression approach are identical to the values obtained using unconditional Poisson regression with model terms for each background stratum. Moreover, it is shown that the proposed approach allows estimation of background stratified Poisson regression models of non-standard form, such as models that parameterize latency effects, as well as regression models in which the number of strata is large, thereby overcoming the limitations of previously available statistical software for fitting background stratified Poisson regression models.
Structural features that predict real-value fluctuations of globular proteins
Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke
2012-01-01
It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics trajectories of non-homologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real-value of residue fluctuations using the support vector regression. It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in molecular dynamics trajectories. Moreover, support vector regression that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson’s correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed for the prediction by the Gaussian network model. An advantage of the developed method over the Gaussian network models is that the former predicts the real-value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. PMID:22328193
NASA Astrophysics Data System (ADS)
Ben Shabat, Yael; Shitzer, Avraham
2012-07-01
Facial heat exchange convection coefficients were estimated from experimental data in cold and windy ambient conditions applicable to wind chill calculations. Measured facial temperature datasets, that were made available to this study, originated from 3 separate studies involving 18 male and 6 female subjects. Most of these data were for a -10°C ambient environment and wind speeds in the range of 0.2 to 6 m s-1. Additional single experiments were for -5°C, 0°C and 10°C environments and wind speeds in the same range. Convection coefficients were estimated for all these conditions by means of a numerical facial heat exchange model, applying properties of biological tissues and a typical facial diameter of 0.18 m. Estimation was performed by adjusting the guessed convection coefficients in the computed facial temperatures, while comparing them to measured data, to obtain a satisfactory fit ( r 2 > 0.98, in most cases). In one of the studies, heat flux meters were additionally used. Convection coefficients derived from these meters closely approached the estimated values for only the male subjects. They differed significantly, by about 50%, when compared to the estimated female subjects' data. Regression analysis was performed for just the -10°C ambient temperature, and the range of experimental wind speeds, due to the limited availability of data for other ambient temperatures. The regressed equation was assumed in the form of the equation underlying the "new" wind chill chart. Regressed convection coefficients, which closely duplicated the measured data, were consistently higher than those calculated by this equation, except for one single case. The estimated and currently used convection coefficients are shown to diverge exponentially from each other, as wind speed increases. This finding casts considerable doubts on the validity of the convection coefficients that are used in the computation of the "new" wind chill chart and their applicability to humans in cold and windy environments.
Ben Shabat, Yael; Shitzer, Avraham
2012-07-01
Facial heat exchange convection coefficients were estimated from experimental data in cold and windy ambient conditions applicable to wind chill calculations. Measured facial temperature datasets, that were made available to this study, originated from 3 separate studies involving 18 male and 6 female subjects. Most of these data were for a -10°C ambient environment and wind speeds in the range of 0.2 to 6 m s(-1). Additional single experiments were for -5°C, 0°C and 10°C environments and wind speeds in the same range. Convection coefficients were estimated for all these conditions by means of a numerical facial heat exchange model, applying properties of biological tissues and a typical facial diameter of 0.18 m. Estimation was performed by adjusting the guessed convection coefficients in the computed facial temperatures, while comparing them to measured data, to obtain a satisfactory fit (r(2) > 0.98, in most cases). In one of the studies, heat flux meters were additionally used. Convection coefficients derived from these meters closely approached the estimated values for only the male subjects. They differed significantly, by about 50%, when compared to the estimated female subjects' data. Regression analysis was performed for just the -10°C ambient temperature, and the range of experimental wind speeds, due to the limited availability of data for other ambient temperatures. The regressed equation was assumed in the form of the equation underlying the "new" wind chill chart. Regressed convection coefficients, which closely duplicated the measured data, were consistently higher than those calculated by this equation, except for one single case. The estimated and currently used convection coefficients are shown to diverge exponentially from each other, as wind speed increases. This finding casts considerable doubts on the validity of the convection coefficients that are used in the computation of the "new" wind chill chart and their applicability to humans in cold and windy environments.
Comparison of Nimbus-7 SMMR and GOES-1 VISSR Atmospheric Liquid Water Content.
NASA Astrophysics Data System (ADS)
Lojou, Jean-Yves; Frouin, Robert; Bernard, René
1991-02-01
Vertically integrated atmospheric liquid water content derived from Nimbus-7 Scanning Multichannel Microwave Radiometer (SMMR) brightness temperatures and from GOES-1 Visible and Infrared Spin-Scan Radiometer (VISSR) radiances in the visible are compared over the Indian Ocean during MONEX (monsoon experiment). In the retrieval procedure, Wilheit and Chang' algorithm and Stephens' parameterization schemes are applied to the SMMR and VISSR data, respectively. The results indicate that in the 0-100 mg cm2 range of liquid water content considered, the correlation coefficient between the two types of estimates is 0.83 (0.81- 0.85 at the 99 percent confidence level). The Wilheit and Chang algorithm, however, yields values lower than those obtained with Stephens's schemes by 24.5 mg cm2 on the average, and occasionally the SMMR-based values are negative. Alternative algorithms are proposed for use with SMMR data, which eliminate the bias, augment the correlation coefficient, and reduce the rms difference. These algorithms include using the Witheit and Chang formula with modified coefficients (multilinear regression), the Wilheit and Chang formula with the same coefficients but different equivalent atmospheric temperatures for each channel (temperature bias adjustment), and a second-order polynomial in brightness temperatures at 18, 21, and 37 GHz (polynomial development). When applied to a dataset excluded from the regressionn dataset, the multilinear regression algorithm provides the best results, namely a 0.91 correlation coefficient, a 5.2 mg cm2 (residual) difference, and a 2.9 mg cm2 bias. Simply shifting the liquid water content predicted by the Wilheit and Chang algorithm does not yield as good comparison statistics, indicating that the occasional negative values are not due only to a bias. The more accurate SMMR-derived liquid water content allows one to better evaluate cloud transmittance in the solar spectrum, at least in the area and during the period analyzed. Combining this cloud transmittance with a clear sky model would provide ocean surface insulation estimates from SMMR data alone.
Regression analysis for solving diagnosis problem of children's health
NASA Astrophysics Data System (ADS)
Cherkashina, Yu A.; Gerget, O. M.
2016-04-01
The paper includes results of scientific researches. These researches are devoted to the application of statistical techniques, namely, regression analysis, to assess the health status of children in the neonatal period based on medical data (hemostatic parameters, parameters of blood tests, the gestational age, vascular-endothelial growth factor) measured at 3-5 days of children's life. In this paper a detailed description of the studied medical data is given. A binary logistic regression procedure is discussed in the paper. Basic results of the research are presented. A classification table of predicted values and factual observed values is shown, the overall percentage of correct recognition is determined. Regression equation coefficients are calculated, the general regression equation is written based on them. Based on the results of logistic regression, ROC analysis was performed, sensitivity and specificity of the model are calculated and ROC curves are constructed. These mathematical techniques allow carrying out diagnostics of health of children providing a high quality of recognition. The results make a significant contribution to the development of evidence-based medicine and have a high practical importance in the professional activity of the author.
Zhu, Yu; Xia, Jie-lai; Wang, Jing
2009-09-01
Application of the 'single auto regressive integrated moving average (ARIMA) model' and the 'ARIMA-generalized regression neural network (GRNN) combination model' in the research of the incidence of scarlet fever. Establish the auto regressive integrated moving average model based on the data of the monthly incidence on scarlet fever of one city, from 2000 to 2006. The fitting values of the ARIMA model was used as input of the GRNN, and the actual values were used as output of the GRNN. After training the GRNN, the effect of the single ARIMA model and the ARIMA-GRNN combination model was then compared. The mean error rate (MER) of the single ARIMA model and the ARIMA-GRNN combination model were 31.6%, 28.7% respectively and the determination coefficient (R(2)) of the two models were 0.801, 0.872 respectively. The fitting efficacy of the ARIMA-GRNN combination model was better than the single ARIMA, which had practical value in the research on time series data such as the incidence of scarlet fever.
2009-07-16
0.25 0.26 -0.85 1 SSR SSE R SSTO SSTO = = − 2 2 ˆ( ) : Regression sum of square, ˆwhere : mean value, : value from the fitted line ˆ...Error sum of square : Total sum of square i i i i SSR Y Y Y Y SSE Y Y SSTO SSE SSR = − = − = + ∑ ∑ Statistical analysis: Coefficient of correlation
Linear Regression between CIE-Lab Color Parameters and Organic Matter in Soils of Tea Plantations
NASA Astrophysics Data System (ADS)
Chen, Yonggen; Zhang, Min; Fan, Dongmei; Fan, Kai; Wang, Xiaochang
2018-02-01
To quantify the relationship between the soil organic matter and color parameters using the CIE-Lab system, 62 soil samples (0-10 cm, Ferralic Acrisols) from tea plantations were collected from southern China. After air-drying and sieving, numerical color information and reflectance spectra of soil samples were measured under laboratory conditions using an UltraScan VIS (HunterLab) spectrophotometer equipped with CIE-Lab color models. We found that soil total organic carbon (TOC) and nitrogen (TN) contents were negatively correlated with the L* value (lightness) ( r = -0.84 and -0.80, respectively), a* value (correlation coefficient r = -0.51 and -0.46, respectively) and b* value ( r = -0.76 and -0.70, respectively). There were also linear regressions between TOC and TN contents with the L* value and b* value. Results showed that color parameters from a spectrophotometer equipped with CIE-Lab color models can predict TOC contents well for soils in tea plantations. The linear regression model between color values and soil organic carbon contents showed it can be used as a rapid, cost-effective method to evaluate content of soil organic matter in Chinese tea plantations.
Relationship Between Earthquake b-Values and Crustal Stresses in a Young Orogenic Belt
NASA Astrophysics Data System (ADS)
Wu, Yih-Min; Chen, Sean Kuanhsiang; Huang, Ting-Chung; Huang, Hsin-Hua; Chao, Wei-An; Koulakov, Ivan
2018-02-01
It has been reported that earthquake b-values decrease linearly with the differential stresses in the continental crust and subduction zones. Here we report a regression-derived relation between earthquake b-values and crustal stresses using the Anderson fault parameter (Aϕ) in a young orogenic belt of Taiwan. This regression relation is well established by using a large and complete earthquake catalog for Taiwan. The data set consists of b-values and Aϕ values derived from relocated earthquakes and focal mechanisms, respectively. Our results show that b-values decrease linearly with the Aϕ values at crustal depths with a high correlation coefficient of -0.9. Thus, b-values could be used as stress indicators for orogenic belts. However, the state of stress is relatively well correlated with the surface geological setting with respect to earthquake b-values in Taiwan. Temporal variations in the b-value could constitute one of the main reasons for the spatial heterogeneity of b-values. We therefore suggest that b-values could be highly sensitive to temporal stress variations.
ERIC Educational Resources Information Center
Dolan, Conor V.; Wicherts, Jelte M.; Molenaar, Peter C. M.
2004-01-01
We consider the question of how variation in the number and reliability of indicators affects the power to reject the hypothesis that the regression coefficients are zero in latent linear regression analysis. We show that power remains constant as long as the coefficient of determination remains unchanged. Any increase in the number of indicators…
Assessing LULC changes over Chilika Lake watershed in Eastern India using Driving Force Analysis
NASA Astrophysics Data System (ADS)
Jadav, S.; Syed, T. H.
2017-12-01
Rapid population growth and industrial development has brought about significant changes in Land Use Land Cover (LULC) of many developing countries in the world. This study investigates LULC changes in the Chilika Lake watershed of Eastern India for the period of 1988 to 2016. The methodology involves pre-processing and classification of Landsat satellite images using support vector machine (SVM) supervised classification algorithm. Results reveal that `Cropland', `Emergent Vegetation' and `Settlement' has expanded over the study period by 284.61 km², 106.83 km² and 98.83 km² respectively. Contemporaneously, `Lake Area', `Vegetation' and `Scrub Land' have decreased by 121.62 km², 96.05 km² and 80.29 km² respectively. This study also analyzes five major driving force variables of socio-economic and climatological factors triggering LULC changes through a bivariate logistic regression model. The outcome gives credible relative operating characteristics (ROC) value of 0.76 that indicate goodness fit of logistic regression model. In addition, independent variables like distance to drainage network and average annual rainfall have negative regression coefficient values that represent decreased rate of dependent variable (changed LULC) whereas independent variables (population density, distance to road and distance to railway) have positive regression coefficient indicates increased rate of changed LULC . Results from this study will be crucial for planning and restoration of this vital lake water body that has major implications over the society and environment at large.
Raj, Retheep; Sivanandan, K S
2017-01-01
Estimation of elbow dynamics has been the object of numerous investigations. In this work a solution is proposed for estimating elbow movement velocity and elbow joint angle from Surface Electromyography (SEMG) signals. Here the Surface Electromyography signals are acquired from the biceps brachii muscle of human hand. Two time-domain parameters, Integrated EMG (IEMG) and Zero Crossing (ZC), are extracted from the Surface Electromyography signal. The relationship between the time domain parameters, IEMG and ZC with elbow angular displacement and elbow angular velocity during extension and flexion of the elbow are studied. A multiple input-multiple output model is derived for identifying the kinematics of elbow. A Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural network (MLPNN) model is proposed for the estimation of elbow joint angle and elbow angular velocity. The proposed NARX MLPNN model is trained using Levenberg-marquardt based algorithm. The proposed model is estimating the elbow joint angle and elbow movement angular velocity with appreciable accuracy. The model is validated using regression coefficient value (R). The average regression coefficient value (R) obtained for elbow angular displacement prediction is 0.9641 and for the elbow anglular velocity prediction is 0.9347. The Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural networks (MLPNN) model can be used for the estimation of angular displacement and movement angular velocity of the elbow with good accuracy.
Li, Dapeng; Yue, Jiawei; Jiang, Lu; Huang, Yonghui; Sun, Jifu; Wu, Yan
2017-04-22
BACKGROUND Degrading enzymes play an important role in the process of disc degeneration. The objective of this study was to investigate the correlation between the expression of high temperature requirement serine protease A1 (HtrA1) in the nucleus pulposus and the T2 value of the nucleus pulposus region in magnetic resonance imaging (MRI). MATERIAL AND METHODS Thirty-six patients who had undergone surgical excision of the nucleus pulposus were examined by MRI before surgery. Pfirrmann grading of the target intervertebral disc was performed according to the sagittal T2-weighted imaging, and the T2 value of the target nucleus pulposus was measured according to the median sagittal T2 mapping. The correlation between the Pfirrmann grade and the T2 value was analyzed. The expression of HtrA1 in the nucleus pulposus was analyzed by RT-PCR and Western blot. The correlation between the expression of HtrA1 and the T2 value was analyzed. RESULTS The T2 value of the nucleus pulposus region was 33.11-167.91 ms, with an average of 86.64±38.73 ms. According to Spearman correlation analysis, there was a rank correlation between T2 value and Pfirrmann grade (P<0.0001), and the correlation coefficient (rs)=-0.93617. There was a linear correlation between the mRNA level of HtrA1 and T2 value in nucleus pulposus tissues (a=3.88, b=-0.019, F=112.63, P<0.0001), normalized regression coefficient=-0.88. There was a linear correlation between the expression level of HtrA1 protein and the T2 value in the nucleus pulposus tissues (a=3.30, b=-0.016, F=93.15, P<0.0001) and normalized regression coefficient=-0.86. CONCLUSIONS The expression of HtrA1 was strongly related to the T2 value, suggesting that HtrA1 plays an important role in the pathological process of intervertebral disc degeneration.
MANCOVA for one way classification with homogeneity of regression coefficient vectors
NASA Astrophysics Data System (ADS)
Mokesh Rayalu, G.; Ravisankar, J.; Mythili, G. Y.
2017-11-01
The MANOVA and MANCOVA are the extensions of the univariate ANOVA and ANCOVA techniques to multidimensional or vector valued observations. The assumption of a Gaussian distribution has been replaced with the Multivariate Gaussian distribution for the vectors data and residual term variables in the statistical models of these techniques. The objective of MANCOVA is to determine if there are statistically reliable mean differences that can be demonstrated between groups later modifying the newly created variable. When randomization assignment of samples or subjects to groups is not possible, multivariate analysis of covariance (MANCOVA) provides statistical matching of groups by adjusting dependent variables as if all subjects scored the same on the covariates. In this research article, an extension has been made to the MANCOVA technique with more number of covariates and homogeneity of regression coefficient vectors is also tested.
Advanced Statistical Analyses to Reduce Inconsistency of Bond Strength Data.
Minamino, T; Mine, A; Shintani, A; Higashi, M; Kawaguchi-Uemura, A; Kabetani, T; Hagino, R; Imai, D; Tajiri, Y; Matsumoto, M; Yatani, H
2017-11-01
This study was designed to clarify the interrelationship of factors that affect the value of microtensile bond strength (µTBS), focusing on nondestructive testing by which information of the specimens can be stored and quantified. µTBS test specimens were prepared from 10 noncarious human molars. Six factors of µTBS test specimens were evaluated: presence of voids at the interface, X-ray absorption coefficient of resin, X-ray absorption coefficient of dentin, length of dentin part, size of adhesion area, and individual differences of teeth. All specimens were observed nondestructively by optical coherence tomography and micro-computed tomography before µTBS testing. After µTBS testing, the effect of these factors on µTBS data was analyzed by the general linear model, linear mixed effects regression model, and nonlinear regression model with 95% confidence intervals. By the general linear model, a significant difference in individual differences of teeth was observed ( P < 0.001). A significantly positive correlation was shown between µTBS and length of dentin part ( P < 0.001); however, there was no significant nonlinearity ( P = 0.157). Moreover, a significantly negative correlation was observed between µTBS and size of adhesion area ( P = 0.001), with significant nonlinearity ( P = 0.014). No correlation was observed between µTBS and X-ray absorption coefficient of resin ( P = 0.147), and there was no significant nonlinearity ( P = 0.089). Additionally, a significantly positive correlation was observed between µTBS and X-ray absorption coefficient of dentin ( P = 0.022), with significant nonlinearity ( P = 0.036). A significant difference was also observed between the presence and absence of voids by linear mixed effects regression analysis. Our results showed correlations between various parameters of tooth specimens and µTBS data. To evaluate the performance of the adhesive more precisely, the effect of tooth variability and a method to reduce variation in bond strength values should also be considered.
Validation of MODIS Aerosol Optical Depth Retrievals over a Tropical Urban Site, Pune, India
NASA Technical Reports Server (NTRS)
More, Sanjay; Kuman, P. Pradeep; Gupta, Pawan; Devara, P. C. S.; Aher, G. R.
2011-01-01
In the present paper, MODIS (Terra and Aqua; level 2, collection 5) derived aerosoloptical depths (AODs) are compared with the ground-based measurements obtained from AERONET (level 2.0) and Microtops - II sun-photometer over a tropical urban station, Pune (18 deg 32'N; 73 deg 49'E, 559 m amsl). This is the first ever systematic validation of the MODIS aerosol products over Pune. Analysis of the data indicates that the Terra and Aqua MODIS AOD retrievals at 550 nm have good correlations with the AERONET and Microtops - II sun-photometer AOD measurements. During winter the linear regression correlation coefficients for MODIS products against AERONET measurements are 0.79 for Terra and 0.62 for Aqua; however for premonsoon, the corresponding coefficients are 0.78 and 0.74. Similarly, the linear regression correlation coefficients for Microtops measurements against MODIS products are 0.72 and 0.93 for Terra and Aqua data respectively during winter and are 0.78 and 0.75 during pre-monsoon. On yearly basis in 2008-2009, correlation coefficients for MODIS products against AERONET measurements are 0.80 and 0.78 for Terra and Aqua respectively while the corresponding coefficients are 0.70 and 0.73 during 2009-2010. The regressed intercepts with MODIS vs. AERONET are 0.09 for Terra and 0.05 for Aqua during winter whereas their values are 0.04 and 0.07 during pre-monsoon. However, MODIS AODs are found to underestimate during winter and overestimate during pre-monsoon with respect to AERONET and Microtops measurements having slopes 0.63 (Terra) and 0.74 (Aqua) during winter and 0.97 (Terra) and 0.94 (Aqua) during pre-monsoon. Wavelength dependency of Single Scattering Albedo (SSA) shows presence of absorbing and scattering aerosol particles. For winter, SSA decreases with wavelength with the values 0.86 +/- 0.03 at 440 nm and 0.82 +/- 0.04 at 1020nm. In pre-monsoon, it increases with wavelength (SSA is 0.87 +/- 0.02 at 440nm; and 0.88 +/-0.04 at 1020 nm).
Bi, Qiu; Xiao, Zhibo; Lv, Fajin; Liu, Yao; Zou, Chunxia; Shen, Yiqing
2018-02-05
The objective of this study was to find clinical parameters and qualitative and quantitative magnetic resonance imaging (MRI) features for differentiating uterine sarcoma from atypical leiomyoma (ALM) preoperatively and to calculate predictive values for uterine sarcoma. Data from 60 patients with uterine sarcoma and 88 patients with ALM confirmed by surgery and pathology were collected. Clinical parameters, qualitative MRI features, diffusion-weighted imaging with apparent diffusion coefficient values, and quantitative parameters of dynamic contrast-enhanced MRI of these two tumor types were compared. Predictive values for uterine sarcoma were calculated using multivariable logistic regression. Patient clinical manifestations, tumor locations, margins, T2-weighted imaging signals, mean apparent diffusion coefficient values, minimum apparent diffusion coefficient values, and time-signal intensity curves of solid tumor components were obvious significant parameters for distinguishing between uterine sarcoma and ALM (all P <.001). Abnormal vaginal bleeding, tumors located mainly in the uterine cavity, ill-defined tumor margins, and mean apparent diffusion coefficient values of <1.272 × 10 -3 mm 2 /s were significant preoperative predictors of uterine sarcoma. When the overall scores of these four predictors were greater than or equal to 7 points, the sensitivity, the specificity, the accuracy, and the positive and negative predictive values were 88.9%, 99.9%, 95.7%, 97.0%, and 95.1%, respectively. The use of clinical parameters and multiparametric MRI as predictive factors was beneficial for diagnosing uterine sarcoma preoperatively. These findings could be helpful for guiding treatment decisions. Copyright © 2018 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.
Yamazaki, Takeshi; Takeda, Hisato; Hagiya, Koichi; Yamaguchi, Satoshi; Sasaki, Osamu
2018-03-13
Because lactation periods in dairy cows lengthen with increasing total milk production, it is important to predict individual productivities after 305 days in milk (DIM) to determine the optimal lactation period. We therefore examined whether the random regression (RR) coefficient from 306 to 450 DIM (M2) can be predicted from those during the first 305 DIM (M1) by using a random regression model. We analyzed test-day milk records from 85690 Holstein cows in their first lactations and 131727 cows in their later (second to fifth) lactations. Data in M1 and M2 were analyzed separately by using different single-trait RR animal models. We then performed a multiple regression analysis of the RR coefficients of M2 on those of M1 during the first and later lactations. The first-order Legendre polynomials were practical covariates of random regression for the milk yields of M2. All RR coefficients for the additive genetic (AG) effect and the intercept for the permanent environmental (PE) effect of M2 had moderate to strong correlations with the intercept for the AG effect of M1. The coefficients of determination for multiple regression of the combined intercepts for the AG and PE effects of M2 on the coefficients for the AG effect of M1 were moderate to high. The daily milk yields of M2 predicted by using the RR coefficients for the AG effect of M1 were highly correlated with those obtained by using the coefficients of M2. Milk production after 305 DIM can be predicted by using the RR coefficient estimates of the AG effect during the first 305 DIM.
[Culture and quality of life assessment in Chinese populations].
Xia, Ping; Li, Ning-Xiu; Liu, Chao-Jie; Lü, Yu-Bo; Zhang, Qiang; Ou, Ai-Hua
2010-07-01
To investigate the impact of cultural factors on quality of life (QOL) and to identify appropriate ways of dividing sub-populations for population norm-based quality of life assessment. The WHOQOL-BREF was used as a QOL instrument. Another questionnaire was developed to assess cultural values. A cross-sectional survey was undertaken in 1090 Guangzhou residents, which included 635 respondents from communities and 455 patients who visited outpatient departments of hospitals. Cronbach's a coefficients and item-domain correlation coefficients were calculated to test the reliability and validity of the WHOQOL-BREF, respectively. Student t test, ANOVA and stepwise multiple linear regression analysis were performed to identify the variables that might have an impact on the QOL. Two regression models with and without including cultural variables were constructed, and the extent of impact exerted by the cultural factors was assessed through a comparison of the change of adjusted R square values. A total of 1052 (96%) valid questionnaire were returned. The Cronbach's alpha coefficients of the WHOQOL-BREF ranged from 0.67 to 0.78. Age, education, occupation and family income were correlated with all of the domains of the WHOQOL-BREF. Chronic condition was correlated with physical, psychological, and social relationship domains of the WHOQOL-BREF. Gender was correlated with physical and psychological domains of the WHOQOL-BREF. The multiple regression analysis showed that social and demographic factors contributed to 6.3%, 13.6%, 10.4% and 8.7% of the predicted variances for the physical, psychological, social relationship, and environment domains, respectively. Social support, horizontal collectivism, vertical individualism, escape acceptance, fear of death, health value, supernatural belief had a significant impact on QOL. However, social support was the only one factor that had an impact on all of the four QOL domains. It is necessary to divide sub-cultural populations for population norm-based QOL assessment. Further research is needed to develop a practical approach to the sub-cultural population division.
Dennison, Jessica L; Stack, Jim; Beatty, Stephen; Nolan, John M
2013-11-01
This study compares in vivo measurements of macular pigment (MP) obtained using customized heterochromatic flicker photometry (cHFP; Macular Metrics Densitometer(™)), dual-wavelength fundus autofluorescence (Heidelberg Spectralis(®) HRA + OCT MultiColor) and single-wavelength fundus reflectance (Zeiss Visucam(®) 200). MP was measured in one eye of 62 subjects on each device. Data from 49 subjects (79%) was suitable for analysis. Agreement between the Densitometer and Spectralis was investigated at various eccentricities using a variety of quantitative and graphical methods, including: Pearson correlation coefficient to measure degree of scatter (precision), accuracy coefficient, concordance correlation coefficient (ccc), paired t-test, scatter and Bland-Altman plots. The relationship between max MP from the Visucam and central MP from the Spectralis and Densitometer was investigated using regression methods. Agreement was strong between the Densitometer and Spectralis at all central eccentricities (e.g. at 0.25° eccentricity: accuracy = 0.97, precision = 0.90, ccc = 0.87). Regression analysis showed a very weak relationship between the Visucam and Densitometer (e.g. Visucam max on Densitometer central MP: R(2) = 0.008, p = 0.843). Regression analysis also demonstrated a weak relationship between MP measured by the Spectralis and Visucam (e.g. Visucam max on Spectralis central MP: R(2) = 0.047, p = 0.348). MP values obtained using the Heidelberg Spectralis are comparable to MP values obtained using the Densitometer. In contrast, MP values obtained using the Zeiss Visucam are not comparable with either the Densitometer or the Spectralis MP measuring devices. Taking cHFP as the current standard to which other MP measuring devices should be compared, the Spectralis is suitable for use in a clinical and research setting, whereas the Visucam is not. Copyright © 2013 Elsevier Ltd. All rights reserved.
Interpreting Regression Results: beta Weights and Structure Coefficients are Both Important.
ERIC Educational Resources Information Center
Thompson, Bruce
Various realizations have led to less frequent use of the "OVA" methods (analysis of variance--ANOVA--among others) and to more frequent use of general linear model approaches such as regression. However, too few researchers understand all the various coefficients produced in regression. This paper explains these coefficients and their…
Soil sail content estimation in the yellow river delta with satellite hyperspectral data
Weng, Yongling; Gong, Peng; Zhu, Zhi-Liang
2008-01-01
Soil salinization is one of the most common land degradation processes and is a severe environmental hazard. The primary objective of this study is to investigate the potential of predicting salt content in soils with hyperspectral data acquired with EO-1 Hyperion. Both partial least-squares regression (PLSR) and conventional multiple linear regression (MLR), such as stepwise regression (SWR), were tested as the prediction model. PLSR is commonly used to overcome the problem caused by high-dimensional and correlated predictors. Chemical analysis of 95 samples collected from the top layer of soils in the Yellow River delta area shows that salt content was high on average, and the dominant chemicals in the saline soil were NaCl and MgCl2. Multivariate models were established between soil contents and hyperspectral data. Our results indicate that the PLSR technique with laboratory spectral data has a strong prediction capacity. Spectral bands at 1487-1527, 1971-1991, 2032-2092, and 2163-2355 nm possessed large absolute values of regression coefficients, with the largest coefficient at 2203 nm. We obtained a root mean squared error (RMSE) for calibration (with 61 samples) of RMSEC = 0.753 (R2 = 0.893) and a root mean squared error for validation (with 30 samples) of RMSEV = 0.574. The prediction model was applied on a pixel-by-pixel basis to a Hyperion reflectance image to yield a quantitative surface distribution map of soil salt content. The result was validated successfully from 38 sampling points. We obtained an RMSE estimate of 1.037 (R2 = 0.784) for the soil salt content map derived by the PLSR model. The salinity map derived from the SWR model shows that the predicted value is higher than the true value. These results demonstrate that the PLSR method is a more suitable technique than stepwise regression for quantitative estimation of soil salt content in a large area. ?? 2008 CASI.
NASA Astrophysics Data System (ADS)
Uca; Toriman, Ekhwan; Jaafar, Othman; Maru, Rosmini; Arfan, Amal; Saleh Ahmar, Ansari
2018-01-01
Prediction of suspended sediment discharge in a catchments area is very important because it can be used to evaluation the erosion hazard, management of its water resources, water quality, hydrology project management (dams, reservoirs, and irrigation) and to determine the extent of the damage that occurred in the catchments. Multiple Linear Regression analysis and artificial neural network can be used to predict the amount of daily suspended sediment discharge. Regression analysis using the least square method, whereas artificial neural networks using Radial Basis Function (RBF) and feedforward multilayer perceptron with three learning algorithms namely Levenberg-Marquardt (LM), Scaled Conjugate Descent (SCD) and Broyden-Fletcher-Goldfarb-Shanno Quasi-Newton (BFGS). The number neuron of hidden layer is three to sixteen, while in output layer only one neuron because only one output target. The mean absolute error (MAE), root mean square error (RMSE), coefficient of determination (R2 ) and coefficient of efficiency (CE) of the multiple linear regression (MLRg) value Model 2 (6 input variable independent) has the lowest the value of MAE and RMSE (0.0000002 and 13.6039) and highest R2 and CE (0.9971 and 0.9971). When compared between LM, SCG and RBF, the BFGS model structure 3-7-1 is the better and more accurate to prediction suspended sediment discharge in Jenderam catchment. The performance value in testing process, MAE and RMSE (13.5769 and 17.9011) is smallest, meanwhile R2 and CE (0.9999 and 0.9998) is the highest if it compared with the another BFGS Quasi-Newton model (6-3-1, 9-10-1 and 12-12-1). Based on the performance statistics value, MLRg, LM, SCG, BFGS and RBF suitable and accurately for prediction by modeling the non-linear complex behavior of suspended sediment responses to rainfall, water depth and discharge. The comparison between artificial neural network (ANN) and MLRg, the MLRg Model 2 accurately for to prediction suspended sediment discharge (kg/day) in Jenderan catchment area.
Return period adjustment for runoff coefficients based on analysis in undeveloped Texas watersheds
Dhakal, Nirajan; Fang, Xing; Asquith, William H.; Cleveland, Theodore G.; Thompson, David B.
2013-01-01
The rational method for peak discharge (Qp) estimation was introduced in the 1880s. The runoff coefficient (C) is a key parameter for the rational method that has an implicit meaning of rate proportionality, and the C has been declared a function of the annual return period by various researchers. Rate-based runoff coefficients as a function of the return period, C(T), were determined for 36 undeveloped watersheds in Texas using peak discharge frequency from previously published regional regression equations and rainfall intensity frequency for return periods T of 2, 5, 10, 25, 50, and 100 years. The C(T) values and return period adjustments C(T)/C(T=10 year) determined in this study are most applicable to undeveloped watersheds. The return period adjustments determined for the Texas watersheds in this study and those extracted from prior studies of non-Texas data exceed values from well-known literature such as design manuals and textbooks. Most importantly, the return period adjustments exceed values currently recognized in Texas Department of Transportation design guidance when T>10 years.
Osinga, Rik; Babst, Doris; Bodmer, Elvira S; Link, Bjoern C; Fritsche, Elmar; Hug, Urs
2017-12-01
This work assessed both subjective and objective postoperative parameters after breast reduction surgery and compared between patients and plastic surgeons. After an average postoperative observation period of 6.7 ± 2.7 (2 - 13) years, 159 out of 259 patients (61 %) were examined. The mean age at the time of surgery was 37 ± 14 (15 - 74) years. The postoperative anatomy of the breast and other anthropometric parameters were measured in cm with the patient in an upright position. The visual analogue scale (VAS) values for symmetry, size, shape, type of scar and overall satisfaction both from the patient's and from four plastic surgeons' perspectives were assessed and compared. Patients rated the postoperative result significantly better than surgeons. Good subjective ratings by patients for shape, symmetry and sensitivity correlated with high scores for overall assessment. Shape had the strongest influence on overall satisfaction (regression coefficient 0.357; p < 0.001), followed by symmetry (regression coefficient 0.239; p < 0.001) and sensitivity (regression coefficient 0.109; p = 0.040) of the breast. The better the subjective rating for symmetry by the patient, the smaller the measured difference of the jugulum-mamillary distance between left and right (regression coefficient -0.773; p = 0.002) and the smaller the difference in height of the lowest part of the breast between left and right (regression coefficient -0.465; p = 0.035). There was no significant correlation between age, weight, height, BMI, resected weight of the breast, postoperative breast size or type of scar with overall satisfaction. After breast reduction surgery, long-term outcome is rated significantly better by patients than by plastic surgeons. Good subjective ratings by patients for shape, symmetry and sensitivity correlated with high scores for overall assessment. Shape had the strongest influence on overall satisfaction, followed by symmetry and sensitivity of the breast. Postoperative size of the breast, resection weight, type of scar, age or BMI was not of significant influence. Symmetry was the only assessed subjective parameter of this study that could be objectified by postoperative measurements. Georg Thieme Verlag KG Stuttgart · New York.
NASA Astrophysics Data System (ADS)
Doke, Atul M.; Sadana, Ajit
2006-05-01
A fractal analysis is presented for the binding and dissociation of different heart-related compounds in solution to receptors immobilized on biosensor surfaces. The data analyzed include LCAT (lecithin cholesterol acyl transferase) concentrations in solution to egg-white apoA-I rHDL immobilized on a biosensor chip surface.1 Single- and dual- fractal models were employed to fit the data. Values of the binding and the dissociation rate coefficient(s), affinity values, and the fractal dimensions were obtained from the regression analysis provided by Corel Quattro Pro 8.0 (Corel Corporation Limited).2 The binding rate coefficients are quite sensitive to the degree of heterogeneity on the sensor chip surface. Predictive equations are developed for the binding rate coefficient as a function of the degree of heterogeneity present on the sensor chip surface and on the LCAT concentration in solution, and for the affinity as a function of the ratio of fractal dimensions present in the binding and the dissociation phases. The analysis presented provided physical insights into these analyte-receptor reactions occurring on different biosensor surfaces.
NASA Astrophysics Data System (ADS)
Pradhan, Biswajeet
2010-05-01
This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross application model yields reasonable results which can be used for preliminary landslide hazard mapping.
Assessing risk factors for periodontitis using regression
NASA Astrophysics Data System (ADS)
Lobo Pereira, J. A.; Ferreira, Maria Cristina; Oliveira, Teresa
2013-10-01
Multivariate statistical analysis is indispensable to assess the associations and interactions between different factors and the risk of periodontitis. Among others, regression analysis is a statistical technique widely used in healthcare to investigate and model the relationship between variables. In our work we study the impact of socio-demographic, medical and behavioral factors on periodontal health. Using regression, linear and logistic models, we can assess the relevance, as risk factors for periodontitis disease, of the following independent variables (IVs): Age, Gender, Diabetic Status, Education, Smoking status and Plaque Index. The multiple linear regression analysis model was built to evaluate the influence of IVs on mean Attachment Loss (AL). Thus, the regression coefficients along with respective p-values will be obtained as well as the respective p-values from the significance tests. The classification of a case (individual) adopted in the logistic model was the extent of the destruction of periodontal tissues defined by an Attachment Loss greater than or equal to 4 mm in 25% (AL≥4mm/≥25%) of sites surveyed. The association measures include the Odds Ratios together with the correspondent 95% confidence intervals.
NASA Astrophysics Data System (ADS)
Kiss, I.; Cioată, V. G.; Ratiu, S. A.; Rackov, M.; Penčić, M.
2018-01-01
Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. This article focuses on expressing the multiple linear regression model related to the hardness assurance by the chemical composition of the phosphorous cast irons destined to the brake shoes, having in view that the regression coefficients will illustrate the unrelated contributions of each independent variable towards predicting the dependent variable. In order to settle the multiple correlations between the hardness of the cast-iron brake shoes, and their chemical compositions several regression equations has been proposed. Is searched a mathematical solution which can determine the optimum chemical composition for the hardness desirable values. Starting from the above-mentioned affirmations two new statistical experiments are effectuated related to the values of Phosphorus [P], Manganese [Mn] and Silicon [Si]. Therefore, the regression equations, which describe the mathematical dependency between the above-mentioned elements and the hardness, are determined. As result, several correlation charts will be revealed.
Sun, Lili; Zhou, Liping; Yu, Yu; Lan, Yukun; Li, Zhiliang
2007-01-01
Polychlorinated diphenyl ethers (PCDEs) have received more and more concerns as a group of ubiquitous potential persistent organic pollutants (POPs). By using molecular electronegativity distance vector (MEDV-4), multiple linear regression (MLR) models are developed for sub-cooled liquid vapor pressures (P(L)), n-octanol/water partition coefficients (K(OW)) and sub-cooled liquid water solubilities (S(W,L)) of 209 PCDEs and diphenyl ether. The correlation coefficients (R) and the leave-one-out cross-validation (LOO) correlation coefficients (R(CV)) of all the 6-descriptor models for logP(L), logK(OW) and logS(W,L) are more than 0.98. By using stepwise multiple regression (SMR), the descriptors are selected and the resulting models are 5-descriptor model for logP(L), 4-descriptor model for logK(OW), and 6-descriptor model for logS(W,L), respectively. All these models exhibit excellent estimate capabilities for internal sample set and good predictive capabilities for external samples set. The consistency between observed and estimated/predicted values for logP(L) is the best (R=0.996, R(CV)=0.996), followed by logK(OW) (R=0.992, R(CV)=0.992) and logS(W,L) (R=0.983, R(CV)=0.980). By using MEDV-4 descriptors, the QSPR models can be used for prediction and the model predictions can hence extend the current database of experimental values.
Chelgani, S.C.; Hart, B.; Grady, W.C.; Hower, J.C.
2011-01-01
The relationship between maceral content plus mineral matter and gross calorific value (GCV) for a wide range of West Virginia coal samples (from 6518 to 15330 BTU/lb; 15.16 to 35.66MJ/kg) has been investigated by multivariable regression and adaptive neuro-fuzzy inference system (ANFIS). The stepwise least square mathematical method comparison between liptinite, vitrinite, plus mineral matter as input data sets with measured GCV reported a nonlinear correlation coefficient (R2) of 0.83. Using the same data set the correlation between the predicted GCV from the ANFIS model and the actual GCV reported a R2 value of 0.96. It was determined that the GCV-based prediction methods, as used in this article, can provide a reasonable estimation of GCV. Copyright ?? Taylor & Francis Group, LLC.
Li, Jin-ming; Zheng, Huai-jing; Wang, Lu-nan; Deng, Wei
2003-04-01
To establish a model for one choosing controls with a suitable concentration for internal quality control (IQC) with qualitative ELISA detection, and a consecutive plotting method on Levey-Jennings control chart when reagent kit lot is changed. First, a series of control serum with 0.2, 0.5, 1.0, 2.0 and 5.0ng/ml HBsAg respectively were assessed for within-run and between-run precision according to NCCLs EP5 document. Then, a linear regression equation (y=bx + a) with best correlation coefficient (r > 0.99) was established based on S/CO values of the series of control serum. Finally, one could choose controls with S/CO value calculated from the equation (y = bx + a) minus the product of the S/CO value multiplying three-fold between-run CV to be still more than 1.0 for IQC use. For consecutive plotting on Levey-Jennings control chart when ELISA kit lot was changed, the new lot kits were used to detect the same series of HBsAg control serum as above. Then, a new linear regression equation (y2 = b2x2 + a2) with best correlation coefficient was obtained. The old one (y1 =b1x1 + a1) could be obtained based on the mean values from above precision assessment. The S/CO value of a control serum detected by new lot kit could be changed to that detected by old kit lot based on the factor of y2/y1. Therefore, the plotting on primary Levey-Jennings control chart could be continued. The within-run coefficient of variation CV of the ELISA method for control serum with 0.2, 0.5, 1.0, 2.0 and 5.0ng/ml HBsAg were 11.08%, 9.49%, 9.83%, 9.18% and 7.25%, respectively, and between-run CV were 13.25%, 14.03%, 15.11%, 13.29% and 9.92%. The linear regression equation with best correlation coefficient from a test at random was y = 3.509x + 0.180. The suitable concentration of control serum for IQC could be 0.5ng/ml or 1.0ng/ml. The linear regression equation from the old lot and other two new lots of the ELISA kits were y1 = 3.550(x1) + 0.226, y2 = 3.238(x2) +0.388, and y3 =3.428(x3) + 0.148, respectively. Then, the transferring factors of 0.960 (y2/y1) and 0.908 (y3/y1) were obtained. The results shows that the model established for IQC control serum concentration selecting and for consecutive plotting on control chart when the reagent lot is changed is effective and practical.
Microbial Transformation of Esters of Chlorinated Carboxylic Acids
Paris, D. F.; Wolfe, N. L.; Steen, W. C.
1984-01-01
Two groups of compounds were selected for microbial transformation studies. In the first group were carboxylic acid esters having a fixed aromatic moiety and an increasing length of the alkyl component. Ethyl esters of chlorine-substituted carboxylic acids were in the second group. Microorganisms from environmental waters and a pure culture of Pseudomonas putida U were used. The bacterial populations were monitored by plate counts, and disappearance of the parent compound was followed by gas-liquid chromatography as a function of time. The products of microbial hydrolysis were the respective carboxylic acids. Octanol-water partition coefficients (Kow) for the compounds were measured. These values spanned three orders of magnitude, whereas microbial transformation rate constants (kb) varied only 50-fold. The microbial rate constants of the carboxylic acid esters with a fixed aromatic moiety increased with an increasing length of alkyl substituents. The regression coefficient for the linear relationships between log kb and log Kow was high for group 1 compounds, indicating that these parameters correlated well. The regression coefficient for the linear relationships for group 2 compounds, however, was low, indicating that these parameters correlated poorly. PMID:16346459
42 CFR 433.68 - Permissible health care-related taxes.
Code of Federal Regulations, 2011 CFR
2011-10-01
...-related tax is imposed by a unit of local government, the tax must extend to all items or services or... least squares, the slope (designated as (B) (that is. the value of the x coefficient) of two linear... linear regression, as described in paragraph (e)(2)(i) of this section, for the State's tax program, if...
42 CFR 433.68 - Permissible health care-related taxes.
Code of Federal Regulations, 2013 CFR
2013-10-01
...-related tax is imposed by a unit of local government, the tax must extend to all items or services or... least squares, the slope (designated as (B) (that is. the value of the x coefficient) of two linear... linear regression, as described in paragraph (e)(2)(i) of this section, for the State's tax program, if...
ERIC Educational Resources Information Center
Young, Phillip; Reimer, Don; Young, Karen Holsey
2010-01-01
Background: Studies addressing pay discrimination for females in education have relied on main effect regression models, mostly examining amount (intercept values) rather than rate of pay (slope coefficients). Purpose: The purpose is to determine if organizational characteristics and human capital endowments purported to influence pay are facially…
On the Occurrence of Standardized Regression Coefficients Greater than One.
ERIC Educational Resources Information Center
Deegan, John, Jr.
1978-01-01
It is demonstrated here that standardized regression coefficients greater than one can legitimately occur. Furthermore, the relationship between the occurrence of such coefficients and the extent of multicollinearity present among the set of predictor variables in an equation is examined. Comments on the interpretation of these coefficients are…
The importance of regional models in assessing canine cancer incidences in Switzerland
Leyk, Stefan; Brunsdon, Christopher; Graf, Ramona; Pospischil, Andreas; Fabrikant, Sara Irina
2018-01-01
Fitting canine cancer incidences through a conventional regression model assumes constant statistical relationships across the study area in estimating the model coefficients. However, it is often more realistic to consider that these relationships may vary over space. Such a condition, known as spatial non-stationarity, implies that the model coefficients need to be estimated locally. In these kinds of local models, the geographic scale, or spatial extent, employed for coefficient estimation may also have a pervasive influence. This is because important variations in the local model coefficients across geographic scales may impact the understanding of local relationships. In this study, we fitted canine cancer incidences across Swiss municipal units through multiple regional models. We computed diagnostic summaries across the different regional models, and contrasted them with the diagnostics of the conventional regression model, using value-by-alpha maps and scalograms. The results of this comparative assessment enabled us to identify variations in the goodness-of-fit and coefficient estimates. We detected spatially non-stationary relationships, in particular, for the variables related to biological risk factors. These variations in the model coefficients were more important at small geographic scales, making a case for the need to model canine cancer incidences locally in contrast to more conventional global approaches. However, we contend that prior to undertaking local modeling efforts, a deeper understanding of the effects of geographic scale is needed to better characterize and identify local model relationships. PMID:29652921
The importance of regional models in assessing canine cancer incidences in Switzerland.
Boo, Gianluca; Leyk, Stefan; Brunsdon, Christopher; Graf, Ramona; Pospischil, Andreas; Fabrikant, Sara Irina
2018-01-01
Fitting canine cancer incidences through a conventional regression model assumes constant statistical relationships across the study area in estimating the model coefficients. However, it is often more realistic to consider that these relationships may vary over space. Such a condition, known as spatial non-stationarity, implies that the model coefficients need to be estimated locally. In these kinds of local models, the geographic scale, or spatial extent, employed for coefficient estimation may also have a pervasive influence. This is because important variations in the local model coefficients across geographic scales may impact the understanding of local relationships. In this study, we fitted canine cancer incidences across Swiss municipal units through multiple regional models. We computed diagnostic summaries across the different regional models, and contrasted them with the diagnostics of the conventional regression model, using value-by-alpha maps and scalograms. The results of this comparative assessment enabled us to identify variations in the goodness-of-fit and coefficient estimates. We detected spatially non-stationary relationships, in particular, for the variables related to biological risk factors. These variations in the model coefficients were more important at small geographic scales, making a case for the need to model canine cancer incidences locally in contrast to more conventional global approaches. However, we contend that prior to undertaking local modeling efforts, a deeper understanding of the effects of geographic scale is needed to better characterize and identify local model relationships.
NASA Technical Reports Server (NTRS)
Kalton, G.
1983-01-01
A number of surveys were conducted to study the relationship between the level of aircraft or traffic noise exposure experienced by people living in a particular area and their annoyance with it. These surveys generally employ a clustered sample design which affects the precision of the survey estimates. Regression analysis of annoyance on noise measures and other variables is often an important component of the survey analysis. Formulae are presented for estimating the standard errors of regression coefficients and ratio of regression coefficients that are applicable with a two- or three-stage clustered sample design. Using a simple cost function, they also determine the optimum allocation of the sample across the stages of the sample design for the estimation of a regression coefficient.
The Outlier Detection for Ordinal Data Using Scalling Technique of Regression Coefficients
NASA Astrophysics Data System (ADS)
Adnan, Arisman; Sugiarto, Sigit
2017-06-01
The aims of this study is to detect the outliers by using coefficients of Ordinal Logistic Regression (OLR) for the case of k category responses where the score from 1 (the best) to 8 (the worst). We detect them by using the sum of moduli of the ordinal regression coefficients calculated by jackknife technique. This technique is improved by scalling the regression coefficients to their means. R language has been used on a set of ordinal data from reference distribution. Furthermore, we compare this approach by using studentised residual plots of jackknife technique for ANOVA (Analysis of Variance) and OLR. This study shows that the jackknifing technique along with the proper scaling may lead us to reveal outliers in ordinal regression reasonably well.
Pabari, Ritesh M; Ramtoola, Zebunnissa
2012-07-01
A two factor, three level (3(2)) face centred, central composite design (CCD) was applied to investigate the main and interaction effects of tablet diameter and compression force (CF) on hardness, disintegration time (DT) and porosity of mannitol based orodispersible tablets (ODTs). Tablet diameters of 10, 13 and 15 mm, and CF of 10, 15 and 20 kN were studied. Results of multiple linear regression analysis show that both the tablet diameter and CF influence tablet characteristics. A negative value of regression coefficient for tablet diameter showed an inverse relationship with hardness and DT. A positive value of regression coefficient for CF indicated an increase in hardness and DT with increasing CF as a result of the decrease in tablet porosity. Interestingly, at the larger tablet diameter of 15 mm, while hardness increased and porosity decreased with an increase in CF, the DT was resistant to change. The optimised combination was a tablet of 15 mm diameter compressed at 15 kN showing a rapid DT of 37.7s and high hardness of 71.4N. Using these parameters, ODTs containing ibuprofen showed no significant change in DT (ANOVA; p>0.05) irrespective of the hydrophobicity of the ibuprofen. Copyright © 2012 Elsevier B.V. All rights reserved.
Meteorological adjustment of yearly mean values for air pollutant concentration comparison
NASA Technical Reports Server (NTRS)
Sidik, S. M.; Neustadter, H. E.
1976-01-01
Using multiple linear regression analysis, models which estimate mean concentrations of Total Suspended Particulate (TSP), sulfur dioxide, and nitrogen dioxide as a function of several meteorologic variables, two rough economic indicators, and a simple trend in time are studied. Meteorologic data were obtained and do not include inversion heights. The goodness of fit of the estimated models is partially reflected by the squared coefficient of multiple correlation which indicates that, at the various sampling stations, the models accounted for about 23 to 47 percent of the total variance of the observed TSP concentrations. If the resulting model equations are used in place of simple overall means of the observed concentrations, there is about a 20 percent improvement in either: (1) predicting mean concentrations for specified meteorological conditions; or (2) adjusting successive yearly averages to allow for comparisons devoid of meteorological effects. An application to source identification is presented using regression coefficients of wind velocity predictor variables.
Regression analysis of sparse asynchronous longitudinal data.
Cao, Hongyuan; Zeng, Donglin; Fine, Jason P
2015-09-01
We consider estimation of regression models for sparse asynchronous longitudinal observations, where time-dependent responses and covariates are observed intermittently within subjects. Unlike with synchronous data, where the response and covariates are observed at the same time point, with asynchronous data, the observation times are mismatched. Simple kernel-weighted estimating equations are proposed for generalized linear models with either time invariant or time-dependent coefficients under smoothness assumptions for the covariate processes which are similar to those for synchronous data. For models with either time invariant or time-dependent coefficients, the estimators are consistent and asymptotically normal but converge at slower rates than those achieved with synchronous data. Simulation studies evidence that the methods perform well with realistic sample sizes and may be superior to a naive application of methods for synchronous data based on an ad hoc last value carried forward approach. The practical utility of the methods is illustrated on data from a study on human immunodeficiency virus.
Sensitivity Analysis of the Integrated Medical Model for ISS Programs
NASA Technical Reports Server (NTRS)
Goodenow, D. A.; Myers, J. G.; Arellano, J.; Boley, L.; Garcia, Y.; Saile, L.; Walton, M.; Kerstman, E.; Reyes, D.; Young, M.
2016-01-01
Sensitivity analysis estimates the relative contribution of the uncertainty in input values to the uncertainty of model outputs. Partial Rank Correlation Coefficient (PRCC) and Standardized Rank Regression Coefficient (SRRC) are methods of conducting sensitivity analysis on nonlinear simulation models like the Integrated Medical Model (IMM). The PRCC method estimates the sensitivity using partial correlation of the ranks of the generated input values to each generated output value. The partial part is so named because adjustments are made for the linear effects of all the other input values in the calculation of correlation between a particular input and each output. In SRRC, standardized regression-based coefficients measure the sensitivity of each input, adjusted for all the other inputs, on each output. Because the relative ranking of each of the inputs and outputs is used, as opposed to the values themselves, both methods accommodate the nonlinear relationship of the underlying model. As part of the IMM v4.0 validation study, simulations are available that predict 33 person-missions on ISS and 111 person-missions on STS. These simulated data predictions feed the sensitivity analysis procedures. The inputs to the sensitivity procedures include the number occurrences of each of the one hundred IMM medical conditions generated over the simulations and the associated IMM outputs: total quality time lost (QTL), number of evacuations (EVAC), and number of loss of crew lives (LOCL). The IMM team will report the results of using PRCC and SRRC on IMM v4.0 predictions of the ISS and STS missions created as part of the external validation study. Tornado plots will assist in the visualization of the condition-related input sensitivities to each of the main outcomes. The outcomes of this sensitivity analysis will drive review focus by identifying conditions where changes in uncertainty could drive changes in overall model output uncertainty. These efforts are an integral part of the overall verification, validation, and credibility review of IMM v4.0.
Klaeboe, Ronny
2005-09-01
When Gardermoen replaced Fornebu as the main airport for Oslo, aircraft noise levels increased in recreational areas near Gardermoen and decreased in areas near Fornebu. Krog and Engdahl [J. Acoust. Soc. Am. 116, 323-333 (2004)] estimate that recreationists' annoyance from aircraft noise in these areas changed more than would be anticipated from the actual noise changes. However, the sizes of their estimated "situation" effects are not credible. One possible reason for the anomalous results is that standard regression assumptions become violated when motivational factors are inserted into the regression model. Standardized regression coefficients (beta values) should also not be utilized for comparisons across equations.
Bayesian Estimation of Multivariate Latent Regression Models: Gauss versus Laplace
ERIC Educational Resources Information Center
Culpepper, Steven Andrew; Park, Trevor
2017-01-01
A latent multivariate regression model is developed that employs a generalized asymmetric Laplace (GAL) prior distribution for regression coefficients. The model is designed for high-dimensional applications where an approximate sparsity condition is satisfied, such that many regression coefficients are near zero after accounting for all the model…
Viability estimation of pepper seeds using time-resolved photothermal signal characterization
NASA Astrophysics Data System (ADS)
Kim, Ghiseok; Kim, Geon-Hee; Lohumi, Santosh; Kang, Jum-Soon; Cho, Byoung-Kwan
2014-11-01
We used infrared thermal signal measurement system and photothermal signal and image reconstruction techniques for viability estimation of pepper seeds. Photothermal signals from healthy and aged seeds were measured for seven periods (24, 48, 72, 96, 120, 144, and 168 h) using an infrared camera and analyzed by a regression method. The photothermal signals were regressed using a two-term exponential decay curve with two amplitudes and two time variables (lifetime) as regression coefficients. The regression coefficients of the fitted curve showed significant differences for each seed groups, depending on the aging times. In addition, the viability of a single seed was estimated by imaging of its regression coefficient, which was reconstructed from the measured photothermal signals. The time-resolved photothermal characteristics, along with the regression coefficient images, can be used to discriminate the aged or dead pepper seeds from the healthy seeds.
NASA Astrophysics Data System (ADS)
Ameen, Sheeraz; Taher, Taha; Ahmed, Thamir M.
2018-06-01
Hydrostatics and hydrodynamics forces are generated and applied on the vertical lift tunnel gates due to the influence of a wide range of dam operating conditions. One of the most important forces is the uplift force resulting from the jet flow issuing below the gate. This force is based mainly upon many hydraulic and geometrical parameters. In this work, the uplift force is studied in terms of bottom pressure coefficient. The investigation is made paying particular attention on the effects of various three discharges and three gate lip angles on values of bottom pressure coefficients in addition to four different tunnel longitudinal slopes whose impact has not been studied in many previous works. Hydraulic model is constructed in this work for the sake of measuring all parameters required for estimating the bottom pressure coefficients, which are all examined against gate openings. The results show that the bottom pressure coefficient is related to the said variables, however, its behaviour and values are not necessary regular with variance of studied variables. The values are seen more significantly related to the flow rates and for some extent to the slopes of tunnel. An attempt by using the nonlinear regression of Statistical package of social sciences (SPSS) is made to set equations relating bottom pressure coefficient with gate openings for several angles of gate lips. The obtained equations are shown in good agreement with the selected cases of experimental results. The results are applicable for design purposes for similar geometrical and flow parameters considered in this study.
Pal, Shyamali
2017-12-01
The presence of Macro prolactin is a significant cause of elevated prolactin resulting in misdiagnosis in all automated systems. Poly ethylene glycol (PEG) pretreatment is the preventive process but such process includes the probability of loss of a fraction of bioactive prolactin. Surprisingly, PEG treated EQAS & IQAS samples in Cobas e 411 are found out to be correlating with direct results of at least 3 immunoassay systems and treated and untreated Cobas e 411 results are comparable by a correlation coefficient. Comparison of EQAS, IQAS and patient samples were done to find out the trueness of such correlation factor. Study with patient's results have established the correlation coefficient is valid for very small concentration of prolactin also. EQAS, IQAS and 150 patient samples were treated with PEG and prolactin results of treated and untreated samples obtained from Roche Cobas e 411. 25 patient's results (treated) were compared with direct results in Advia Centaur, Architect I & Access2 systems. Correlation coefficient was obtained from trend line of the treated and untreated results. Two tailed p-value obtained from regression coefficient(r) and sample size. The correlation coefficient is in the range (0.761-0.771). Reverse correlation range is (1.289-1.301). r value of two sets of calculated results were 0.995. Two tailed p- value is zero approving dismissal of null hypothesis. The z-score of EQAS does not always assure authenticity of resultsPEG precipitation is correlated by the factor 0.761 even in very small concentrationsAbbreviationsGFCgel filtration chromatographyPEGpolyethylene glycolEQASexternal quality assurance systemM-PRLmacro prolactinPRLprolactinECLIAelectro-chemiluminescence immunoassayCLIAclinical laboratory improvement amendmentsIQASinternal quality assurance systemrregression coefficient.
NASA Astrophysics Data System (ADS)
Sharudin, R. W.; AbdulBari Ali, S.; Zulkarnain, M.; Shukri, M. A.
2018-05-01
This study reports on the integration of Artificial Neural Network (ANNs) with experimental data in predicting the solubility of carbon dioxide (CO2) blowing agent in SEBS by generating highest possible value for Regression coefficient (R2). Basically, foaming of thermoplastic elastomer with CO2 is highly affected by the CO2 solubility. The ability of ANN in predicting interpolated data of CO2 solubility was investigated by comparing training results via different method of network training. Regards to the final prediction result for CO2 solubility by ANN, the prediction trend (output generate) was corroborated with the experimental results. The obtained result of different method of training showed the trend of output generated by Gradient Descent with Momentum & Adaptive LR (traingdx) required longer training time and required more accurate input to produce better output with final Regression Value of 0.88. However, it goes vice versa with Levenberg-Marquardt (trainlm) technique as it produced better output in quick detention time with final Regression Value of 0.91.
Garabedian, Stephen P.
1986-01-01
A nonlinear, least-squares regression technique for the estimation of ground-water flow model parameters was applied to the regional aquifer underlying the eastern Snake River Plain, Idaho. The technique uses a computer program to simulate two-dimensional, steady-state ground-water flow. Hydrologic data for the 1980 water year were used to calculate recharge rates, boundary fluxes, and spring discharges. Ground-water use was estimated from irrigated land maps and crop consumptive-use figures. These estimates of ground-water withdrawal, recharge rates, and boundary flux, along with leakance, were used as known values in the model calibration of transmissivity. Leakance values were adjusted between regression solutions by comparing model-calculated to measured spring discharges. In other simulations, recharge and leakance also were calibrated as prior-information regression parameters, which limits the variation of these parameters using a normalized standard error of estimate. Results from a best-fit model indicate a wide areal range in transmissivity from about 0.05 to 44 feet squared per second and in leakance from about 2.2x10 -9 to 6.0 x 10 -8 feet per second per foot. Along with parameter values, model statistics also were calculated, including the coefficient of correlation between calculated and observed head (0.996), the standard error of the estimates for head (40 feet), and the parameter coefficients of variation (about 10-40 percent). Additional boundary flux was added in some areas during calibration to achieve proper fit to ground-water flow directions. Model fit improved significantly when areas that violated model assumptions were removed. It also improved slightly when y-direction (northwest-southeast) transmissivity values were larger than x-direction (northeast-southwest) transmissivity values. The model was most sensitive to changes in recharge, and in some areas, to changes in transmissivity, particularly near the spring discharge area from Milner Dam to King Hill.
Functional Linear Model with Zero-value Coefficient Function at Sub-regions.
Zhou, Jianhui; Wang, Nae-Yuh; Wang, Naisyin
2013-01-01
We propose a shrinkage method to estimate the coefficient function in a functional linear regression model when the value of the coefficient function is zero within certain sub-regions. Besides identifying the null region in which the coefficient function is zero, we also aim to perform estimation and inferences for the nonparametrically estimated coefficient function without over-shrinking the values. Our proposal consists of two stages. In stage one, the Dantzig selector is employed to provide initial location of the null region. In stage two, we propose a group SCAD approach to refine the estimated location of the null region and to provide the estimation and inference procedures for the coefficient function. Our considerations have certain advantages in this functional setup. One goal is to reduce the number of parameters employed in the model. With a one-stage procedure, it is needed to use a large number of knots in order to precisely identify the zero-coefficient region; however, the variation and estimation difficulties increase with the number of parameters. Owing to the additional refinement stage, we avoid this necessity and our estimator achieves superior numerical performance in practice. We show that our estimator enjoys the Oracle property; it identifies the null region with probability tending to 1, and it achieves the same asymptotic normality for the estimated coefficient function on the non-null region as the functional linear model estimator when the non-null region is known. Numerically, our refined estimator overcomes the shortcomings of the initial Dantzig estimator which tends to under-estimate the absolute scale of non-zero coefficients. The performance of the proposed method is illustrated in simulation studies. We apply the method in an analysis of data collected by the Johns Hopkins Precursors Study, where the primary interests are in estimating the strength of association between body mass index in midlife and the quality of life in physical functioning at old age, and in identifying the effective age ranges where such associations exist.
Penalized spline estimation for functional coefficient regression models.
Cao, Yanrong; Lin, Haiqun; Wu, Tracy Z; Yu, Yan
2010-04-01
The functional coefficient regression models assume that the regression coefficients vary with some "threshold" variable, providing appreciable flexibility in capturing the underlying dynamics in data and avoiding the so-called "curse of dimensionality" in multivariate nonparametric estimation. We first investigate the estimation, inference, and forecasting for the functional coefficient regression models with dependent observations via penalized splines. The P-spline approach, as a direct ridge regression shrinkage type global smoothing method, is computationally efficient and stable. With established fixed-knot asymptotics, inference is readily available. Exact inference can be obtained for fixed smoothing parameter λ, which is most appealing for finite samples. Our penalized spline approach gives an explicit model expression, which also enables multi-step-ahead forecasting via simulations. Furthermore, we examine different methods of choosing the important smoothing parameter λ: modified multi-fold cross-validation (MCV), generalized cross-validation (GCV), and an extension of empirical bias bandwidth selection (EBBS) to P-splines. In addition, we implement smoothing parameter selection using mixed model framework through restricted maximum likelihood (REML) for P-spline functional coefficient regression models with independent observations. The P-spline approach also easily allows different smoothness for different functional coefficients, which is enabled by assigning different penalty λ accordingly. We demonstrate the proposed approach by both simulation examples and a real data application.
NASA Astrophysics Data System (ADS)
Nishidate, Izumi; Abdul, Wares MD.; Ohtsu, Mizuki; Nakano, Kazuya; Haneishi, Hideaki
2018-02-01
We propose a method to estimate transcutaneous bilirubin, hemoglobin, and melanin based on the diffuse reflectance spectroscopy. In the proposed method, the Monte Carlo simulation-based multiple regression analysis for an absorbance spectrum in the visible wavelength region (460-590 nm) is used to specify the concentrations of bilirubin (Cbil), oxygenated hemoglobin (Coh), deoxygenated hemoglobin (Cdh), and melanin (Cm). Using the absorbance spectrum calculated from the measured diffuse reflectance spectrum as a response variable and the extinction coefficients of bilirubin, oxygenated hemoglobin, deoxygenated hemoglobin, and melanin, as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of bilirubin, oxygenated hemoglobin, deoxygenated hemoglobin, and melanin, are then determined from the regression coefficients using conversion vectors that are numerically deduced in advance by the Monte Carlo simulations for light transport in skin. Total hemoglobin concentration (Cth) and tissue oxygen saturation (StO2) are simply calculated from the oxygenated hemoglobin and deoxygenated hemoglobin. In vivo animal experiments with bile duct ligation in rats demonstrated that the estimated Cbil is increased after ligation of bile duct and reaches to around 20 mg/dl at 72 h after the onset of the ligation, which corresponds to the reference value of Cbil measured by a commercially available transcutaneous bilirubin meter. We also performed in vivo experiments with rats while varying the fraction of inspired oxygen (FiO2). Coh and Cdh decreased and increased, respectively, as FiO2 decreased. Consequently, StO2 was dramatically decreased. The results in this study indicate potential of the method for simultaneous evaluation of multiple chromophores in skin tissue.
Miller, Justin B; Axelrod, Bradley N; Schutte, Christian
2012-01-01
The recent release of the Wechsler Memory Scale Fourth Edition contains many improvements from a theoretical and administration perspective, including demographic corrections using the Advanced Clinical Solutions. Although the administration time has been reduced from previous versions, a shortened version may be desirable in certain situations given practical time limitations in clinical practice. The current study evaluated two- and three-subtest estimations of demographically corrected Immediate and Delayed Memory index scores using both simple arithmetic prorating and regression models. All estimated values were significantly associated with observed index scores. Use of Lin's Concordance Correlation Coefficient as a measure of agreement showed a high degree of precision and virtually zero bias in the models, although the regression models showed a stronger association than prorated models. Regression-based models proved to be more accurate than prorated estimates with less dispersion around observed values, particularly when using three subtest regression models. Overall, the present research shows strong support for estimating demographically corrected index scores on the WMS-IV in clinical practice with an adequate performance using arithmetically prorated models and a stronger performance using regression models to predict index scores.
Hidden Connections between Regression Models of Strain-Gage Balance Calibration Data
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert
2013-01-01
Hidden connections between regression models of wind tunnel strain-gage balance calibration data are investigated. These connections become visible whenever balance calibration data is supplied in its design format and both the Iterative and Non-Iterative Method are used to process the data. First, it is shown how the regression coefficients of the fitted balance loads of a force balance can be approximated by using the corresponding regression coefficients of the fitted strain-gage outputs. Then, data from the manual calibration of the Ames MK40 six-component force balance is chosen to illustrate how estimates of the regression coefficients of the fitted balance loads can be obtained from the regression coefficients of the fitted strain-gage outputs. The study illustrates that load predictions obtained by applying the Iterative or the Non-Iterative Method originate from two related regression solutions of the balance calibration data as long as balance loads are given in the design format of the balance, gage outputs behave highly linear, strict statistical quality metrics are used to assess regression models of the data, and regression model term combinations of the fitted loads and gage outputs can be obtained by a simple variable exchange.
Janik, Leslie J; Forrester, Sean T; Soriano-Disla, José M; Kirby, Jason K; McLaughlin, Michael J; Reimann, Clemens
2015-02-01
The authors' aim was to develop rapid and inexpensive regression models for the prediction of partitioning coefficients (Kd), defined as the ratio of the total or surface-bound metal/metalloid concentration of the solid phase to the total concentration in the solution phase. Values of Kd were measured for boric acid (B[OH]3(0)) and selected added soluble oxoanions: molybdate (MoO4(2-)), antimonate (Sb[OH](6-)), selenate (SeO4(2-)), tellurate (TeO4(2-)) and vanadate (VO4(3-)). Models were developed using approximately 500 spectrally representative soils of the Geochemical Mapping of Agricultural Soils of Europe (GEMAS) program. These calibration soils represented the major properties of the entire 4813 soils of the GEMAS project. Multiple linear regression (MLR) from soil properties, partial least-squares regression (PLSR) using mid-infrared diffuse reflectance Fourier-transformed (DRIFT) spectra, and models using DRIFT spectra plus analytical pH values (DRIFT + pH), were compared with predicted log K(d + 1) values. Apart from selenate (R(2) = 0.43), the DRIFT + pH calibrations resulted in marginally better models to predict log K(d + 1) values (R(2) = 0.62-0.79), compared with those from PSLR-DRIFT (R(2) = 0.61-0.72) and MLR (R(2) = 0.54-0.79). The DRIFT + pH calibrations were applied to the prediction of log K(d + 1) values in the remaining 4313 soils. An example map of predicted log K(d + 1) values for added soluble MoO4(2-) in soils across Europe is presented. The DRIFT + pH PLSR models provided a rapid and inexpensive tool to assess the risk of mobility and potential availability of boric acid and selected oxoanions in European soils. For these models to be used in the prediction of log K(d + 1) values in soils globally, additional research will be needed to determine if soil variability is accounted on the calibration. © 2014 SETAC.
Restoration of acidic mine spoils with sewage sludge: II measurement of solids applied
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stucky, D.J.; Zoeller, A.L.
1980-01-01
Sewage sludge was incorporated in acidic strip mine spoils at rates equivalent to 0, 224, 336, and 448 dry metric tons (dmt)/ha and placed in pots in a greenhouse. Spoil parameters were determined 48 hours after sludge incorporation, Time Planting (P), and five months after orchardgrass (Dactylis glomerata L.) was planted, Time Harvest (H), in the pots. Parameters measured were: pH, organic matter content (OM), cation exchange capacity (CEC), electrical conductivity (EC) and yield. Values for each parameter were significantly different at the two sampling times. Correlation coefficient values were calculated for all parameters versus rates of applied sewage sludgemore » and all parameters versus each other. Multiple regressions were performed, stepwise, for all parameters versus rates of applied sewage sludge. Equations to predict amounts of sewage sludge incorporated in spoils were derived for individual and multiple parameters. Generally, measurements made at Time P achieved the highest correlation coefficient and multiple correlation coefficient values; therefore, the authors concluded data from Time P had the greatest predictability value. The most important value measured to predict rate of applied sewage sludge was pH and some additional accuracy was obtained by including CEC in equation. This experiment indicated that soil properties can be used to estimate amounts of sewage sludge solids required to reclaim acidic mine spoils and to estimate quantities incorporated.« less
A primer for biomedical scientists on how to execute model II linear regression analysis.
Ludbrook, John
2012-04-01
1. There are two very different ways of executing linear regression analysis. One is Model I, when the x-values are fixed by the experimenter. The other is Model II, in which the x-values are free to vary and are subject to error. 2. I have received numerous complaints from biomedical scientists that they have great difficulty in executing Model II linear regression analysis. This may explain the results of a Google Scholar search, which showed that the authors of articles in journals of physiology, pharmacology and biochemistry rarely use Model II regression analysis. 3. I repeat my previous arguments in favour of using least products linear regression analysis for Model II regressions. I review three methods for executing ordinary least products (OLP) and weighted least products (WLP) regression analysis: (i) scientific calculator and/or computer spreadsheet; (ii) specific purpose computer programs; and (iii) general purpose computer programs. 4. Using a scientific calculator and/or computer spreadsheet, it is easy to obtain correct values for OLP slope and intercept, but the corresponding 95% confidence intervals (CI) are inaccurate. 5. Using specific purpose computer programs, the freeware computer program smatr gives the correct OLP regression coefficients and obtains 95% CI by bootstrapping. In addition, smatr can be used to compare the slopes of OLP lines. 6. When using general purpose computer programs, I recommend the commercial programs systat and Statistica for those who regularly undertake linear regression analysis and I give step-by-step instructions in the Supplementary Information as to how to use loss functions. © 2011 The Author. Clinical and Experimental Pharmacology and Physiology. © 2011 Blackwell Publishing Asia Pty Ltd.
Satellite remote sensing of fine particulate air pollutants over Indian mega cities
NASA Astrophysics Data System (ADS)
Sreekanth, V.; Mahesh, B.; Niranjan, K.
2017-11-01
In the backdrop of the need for high spatio-temporal resolution data on PM2.5 mass concentrations for health and epidemiological studies over India, empirical relations between Aerosol Optical Depth (AOD) and PM2.5 mass concentrations are established over five Indian mega cities. These relations are sought to predict the surface PM2.5 mass concentrations from high resolution columnar AOD datasets. Current study utilizes multi-city public domain PM2.5 data (from US Consulate and Embassy's air monitoring program) and MODIS AOD, spanning for almost four years. PM2.5 is found to be positively correlated with AOD. Station-wise linear regression analysis has shown spatially varying regression coefficients. Similar analysis has been repeated by eliminating data from the elevated aerosol prone seasons, which has improved the correlation coefficient. The impact of the day to day variability in the local meteorological conditions on the AOD-PM2.5 relationship has been explored by performing a multiple regression analysis. A cross-validation approach for the multiple regression analysis considering three years of data as training dataset and one-year data as validation dataset yielded an R value of ∼0.63. The study was concluded by discussing the factors which can improve the relationship.
Wrong Signs in Regression Coefficients
NASA Technical Reports Server (NTRS)
McGee, Holly
1999-01-01
When using parametric cost estimation, it is important to note the possibility of the regression coefficients having the wrong sign. A wrong sign is defined as a sign on the regression coefficient opposite to the researcher's intuition and experience. Some possible causes for the wrong sign discussed in this paper are a small range of x's, leverage points, missing variables, multicollinearity, and computational error. Additionally, techniques for determining the cause of the wrong sign are given.
Mutlu, Selime; Kahraman, Kevser; Öztürk, Serpil
2017-02-01
The effects of microwave irradiation on resistant starch (RS) formation and functional properties in high-amylose corn starch, Hylon VII, by applying microwave-storing cycles and drying processes were investigated. The Response Surface Methodology (RSM) was used to optimize the reaction conditions, microwave time (2-4min) and power (20-100%), for RS formation. The starch:water (1:10) mixtures were cooked and autoclaved and then different microwave-storing cycles and drying (oven or freeze drying) processes were applied. The RS contents of the samples increased with increasing microwave-storing cycle. The highest RS (43.4%) was obtained by oven drying after 3 cycles of microwave treatment at 20% power for 2min. The F, p (<0.05) and R 2 values indicated that the selected models were consistent. Linear equations were obtained for oven-dried samples applied by 1 and 3 cycles of microwave with regression coefficients of 0.65 and 0.62, respectively. Quadratic equation was obtained for freeze-dried samples applied by 3 cycles of microwave with a regression coefficient of 0.83. The solubility, water binding capacity (WBC) and RVA viscosity values of the microwave applied samples were higher than those of native Hylon VII. The WBC and viscosity values of the freeze-dried samples were higher than those of the oven-dried ones. Copyright © 2016 Elsevier B.V. All rights reserved.
Genetic parameters for stayability to consecutive calvings in Zebu cattle.
Silva, D O; Santana, M L; Ayres, D R; Menezes, G R O; Silva, L O C; Nobre, P R C; Pereira, R J
2017-12-22
Longer-lived cows tend to be more profitable and the stayability trait is a selection criterion correlated to longevity. An alternative to the traditional approach to evaluate stayability is its definition based on consecutive calvings, whose main advantage is the more accurate evaluation of young bulls. However, no study using this alternative approach has been conducted for Zebu breeds. Therefore, the objective of this study was to compare linear random regression models to fit stayability to consecutive calvings of Guzerá, Nelore and Tabapuã cows and to estimate genetic parameters for this trait in the respective breeds. Data up to the eighth calving were used. The models included the fixed effects of age at first calving and year-season of birth of the cow and the random effects of contemporary group, additive genetic, permanent environmental and residual. Random regressions were modeled by orthogonal Legendre polynomials of order 1 to 4 (2 to 5 coefficients) for contemporary group, additive genetic and permanent environmental effects. Using Deviance Information Criterion as the selection criterion, the model with 4 regression coefficients for each effect was the most adequate for the Nelore and Tabapuã breeds and the model with 5 coefficients is recommended for the Guzerá breed. For Guzerá, heritabilities ranged from 0.05 to 0.08, showing a quadratic trend with a peak between the fourth and sixth calving. For the Nelore and Tabapuã breeds, the estimates ranged from 0.03 to 0.07 and from 0.03 to 0.08, respectively, and increased with increasing calving number. The additive genetic correlations exhibited a similar trend among breeds and were higher for stayability between closer calvings. Even between more distant calvings (second v. eighth), stayability showed a moderate to high genetic correlation, which was 0.77, 0.57 and 0.79 for the Guzerá, Nelore and Tabapuã breeds, respectively. For Guzerá, when the models with 4 or 5 regression coefficients were compared, the rank correlations between predicted breeding values for the intercept were always higher than 0.99, indicating the possibility of practical application of the least parameterized model. In conclusion, the model with 4 random regression coefficients is recommended for the genetic evaluation of stayability to consecutive calvings in Zebu cattle.
NASA Astrophysics Data System (ADS)
Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa
2011-08-01
In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with human skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.
Sun, Yi; Arning, Martin; Bochmann, Frank; Börger, Jutta; Heitmann, Thomas
2018-06-01
The Occupational Safety and Health Monitoring and Assessment Tool (OSH-MAT) is a practical instrument that is currently used in the German woodworking and metalworking industries to monitor safety conditions at workplaces. The 12-item scoring system has three subscales rating technical, organizational, and personnel-related conditions in a company. Each item has a rating value ranging from 1 to 9, with higher values indicating higher standard of safety conditions. The reliability of this instrument was evaluated in a cross-sectional survey among 128 companies and its validity among 30,514 companies. The inter-rater reliability of the instrument was examined independently and simultaneously by two well-trained safety engineers. Agreement between the double ratings was quantified by the intraclass correlation coefficient and absolute agreement of the rating values. The content validity of the OSH-MAT was evaluated by quantifying the association between OSH-MAT values and 5-year average injury rates by Poisson regression analysis adjusted for the size of the companies and industrial sectors. The construct validity of OSH-MAT was examined by principle component factor analysis. Our analysis indicated good to very good inter-rater reliability (intraclass correlation coefficient = 0.64-0.74) of OSH-MAT values with an absolute agreement of between 72% and 81%. Factor analysis identified three component subscales that met exactly the structure theory of this instrument. The Poisson regression analysis demonstrated a statistically significant exposure-response relationship between OSH-MAT values and the 5-year average injury rates. These analyses indicate that OSH-MAT is a valid and reliable instrument that can be used effectively to monitor safety conditions at workplaces.
NASA Astrophysics Data System (ADS)
Kugeiko, M. M.; Lisenko, S. A.
2008-07-01
An easily automated method for determining the real part of the refractive index of human blood erythrocytes in the range 0.3 1.2 μm is proposed. The method is operationally and metrologically reliable and is based on the measurement of the coefficients of light scattering from forward and backward hemisphere by two pairs of angles and on the use of multiple regression equations. An engineering solution for constructing a measurement system according to this method is proposed, which makes it possible to maximally reduce the calibration errors and effects of destabilizing factors.
Misyura, Maksym; Sukhai, Mahadeo A; Kulasignam, Vathany; Zhang, Tong; Kamel-Reid, Suzanne; Stockley, Tracy L
2018-01-01
Aims A standard approach in test evaluation is to compare results of the assay in validation to results from previously validated methods. For quantitative molecular diagnostic assays, comparison of test values is often performed using simple linear regression and the coefficient of determination (R2), using R2 as the primary metric of assay agreement. However, the use of R2 alone does not adequately quantify constant or proportional errors required for optimal test evaluation. More extensive statistical approaches, such as Bland-Altman and expanded interpretation of linear regression methods, can be used to more thoroughly compare data from quantitative molecular assays. Methods We present the application of Bland-Altman and linear regression statistical methods to evaluate quantitative outputs from next-generation sequencing assays (NGS). NGS-derived data sets from assay validation experiments were used to demonstrate the utility of the statistical methods. Results Both Bland-Altman and linear regression were able to detect the presence and magnitude of constant and proportional error in quantitative values of NGS data. Deming linear regression was used in the context of assay comparison studies, while simple linear regression was used to analyse serial dilution data. Bland-Altman statistical approach was also adapted to quantify assay accuracy, including constant and proportional errors, and precision where theoretical and empirical values were known. Conclusions The complementary application of the statistical methods described in this manuscript enables more extensive evaluation of performance characteristics of quantitative molecular assays, prior to implementation in the clinical molecular laboratory. PMID:28747393
NASA Astrophysics Data System (ADS)
Das Bhowmik, R.; Arumugam, S.
2015-12-01
Multivariate downscaling techniques exhibited superiority over univariate regression schemes in terms of preserving cross-correlations between multiple variables- precipitation and temperature - from GCMs. This study focuses on two aspects: (a) develop an analytical solutions on estimating biases in cross-correlations from univariate downscaling approaches and (b) quantify the uncertainty in land-surface states and fluxes due to biases in cross-correlations in downscaled climate forcings. Both these aspects are evaluated using climate forcings available from both historical climate simulations and CMIP5 hindcasts over the entire US. The analytical solution basically relates the univariate regression parameters, co-efficient of determination of regression and the co-variance ratio between GCM and downscaled values. The analytical solutions are compared with the downscaled univariate forcings by choosing the desired p-value (Type-1 error) in preserving the observed cross-correlation. . For quantifying the impacts of biases on cross-correlation on estimating streamflow and groundwater, we corrupt the downscaled climate forcings with different cross-correlation structure.
The DPAC Compensation Model: An Introductory Handbook.
1987-04-01
introductory and advanced economics courses at the US Air Force Academy, he served for four years as an analyst and action officer in the ...introduces new users to the ACOL framework and provides some guidelines for choosing reasonable values for the four long-run parameters required to run the ...regression coefficients for ACOL and the civilian unemployment rate; for pilots, the number of " new " pilot
STX--Fortran-4 program for estimates of tree populations from 3P sample-tree-measurements
L. R. Grosenbaugh
1967-01-01
Describes how to use an improved and greatly expanded version of an earlier computer program (1964) that converts dendrometer measurements of 3P-sample trees to population values in terms of whatever units user desires. Many new options are available, including that of obtaining a product-yield and appraisal report based on regression coefficients supplied by user....
Goudarzi, Reza; Zeraati, Hojjat; Akbari Sari, Ali; Rashidian, Arash; Mohammad, Kazem
2016-02-01
Health-related quality of life (HRQoL) is used as a measure to valuate healthcare interventions and guide policy making. The EuroQol EQ-5D is a widely used generic preference-based instrument to measure Health-related quality of life. The objective of this study was to develop a value set of the EQ-5D health states for an Iranian population. This study is a cross-sectional study of Iranian populations. Our sample from Iranian populations consists out of 869 participants, who were selected for this study using a stratified probability sampling method. The sample was taken from individuals living in the city of Tehran and was stratified by age and gender from July to November 2013. Respondents valued 13 health states using the visual analogue scale (VAS) of the EQ-5D. Several fixed effects regression models were tested to predict the full set of health states. We selected the final model based on the logical consistency of the estimates, the sign and magnitude of the regression coefficients, goodness of fit, and parsimony. We also compared predicted values with a value set from similar studies in the UK and other countries. Our results show that the HRQoL does not vary among socioeconomic groups. Models at the individual level resulted in an additive model with all coefficients being statistically significant, R(2) = 0.55, a value of 0.75 for the best health state (11112), and a value of -0.074 for the worst health state (33333). The value set obtained for the study sample remarkably differs from those elicited in developed countries. This study is the first estimate for the EQ-5D value set based on the VAS in Iran. Given the importance of locally adapted value set the use of this value set can be recommended for future studies in Iran and In the EMRO regions.
Maupin, Molly A.
1999-01-01
Pumped withdrawals compose most of the irrigation-water diversions from the Snake River between Upper Salmon Falls and Swan Falls Dams in southwestern Idaho. Pumps at 32 sites along the reach lift water as high as 745 feet to irrigate croplands on plateaus north and south of the river. The number of pump sites at which withdrawals are being continuously measured has been steadily decreasing, from 32 in 1990 to 7 in 1998. A cost-effective and accurate means of estimating annual irrigation-water withdrawals at pump sites that are no longer continuously measured was needed. Therefore, the U.S. Geological Survey began a study in 1998, as part of its Water-Use Program, to determine power-consumption coeffi- cients (PCCs) for each pump site so that withdrawals could be estimated by using electrical powerconsumption and total head data. PCC values for each pump site were determined by using withdrawal data that were measured by the U.S. Geological Survey during 1990–92 and 1994–95, energy data reported by Idaho Power Company during the same period, and total head data collected at each site during a field inventory in 1998. Individual average annual withdrawals for the 32 pump sites ranged from 1,120 to 44,480 acre-feet; average PCC values ranged from 103 to 1,248 kilowatthours per acre-foot. During the 1998 field season, power demand, total head, and withdrawal at 18 sites were measured to determine 1998 PCC values. Most of the 1998 PCC values were within 10 percent of the 5-year average, which demonstrates that withdrawals for a site that is no longer continuously measured can be calculated with reasonable accuracy by using the PCC value determined from this study and annual power-consumption data. K-factors, coefficients that describe the amount of energy necessary to lift water, were determined for each pump site by using values of PCC and total head and ranged from 1.11 to 1.89 kilowatthours per acre-foot per foot. Statistical methods were used to define the relations among PCC values and selected pumpsite characteristics. Multiple correlation analysis between average PCC values and total head, total horsepower, and total number of pumps revealed the strongest correlation was between average PCC and total head. Linear regression of these two variables resulted in a strong coefficient of determination R2=0 .9 86) and a representative K-factor of 1.463. Pump sites were subdivided into two groups on the basis of total head—0 to 300 feet and greater than 300 feet. Regression of average PCC values for eight pump sites with total head less than 300 feet produced a good correlation of determination (R2=0.870) and a representative K-factor of 1.682. The second group consisted of 10 pump sites with total head greater than 300 feet; regression produced a correlation of R2=0.939 and a representative K-factor of 1.405. Data on pump-site characteristics were successfully used to determine individual PCC and K-factor values. Statistical relations between pumpsite characteristics and PCC values were defined and used to determine regression equations that resulted in good coefficients of determination and representative K-factors. The individual PCC values will be used in the future to calculate irrigation- water withdrawals at sites that are no longer continuously measured. The representative K-factors and regression equations will be used to calculate irrigation-water withdrawals at sites that have not been previously measured and where total head and power consumption are known.
Relationship between photoreceptor outer segment length and visual acuity in diabetic macular edema.
Forooghian, Farzin; Stetson, Paul F; Meyer, Scott A; Chew, Emily Y; Wong, Wai T; Cukras, Catherine; Meyerle, Catherine B; Ferris, Frederick L
2010-01-01
The purpose of this study was to quantify photoreceptor outer segment (PROS) length in 27 consecutive patients (30 eyes) with diabetic macular edema using spectral domain optical coherence tomography and to describe the correlation between PROS length and visual acuity. Three spectral domain-optical coherence tomography scans were performed on all eyes during each session using Cirrus HD-OCT. A prototype algorithm was developed for quantitative assessment of PROS length. Retinal thicknesses and PROS lengths were calculated for 3 parameters: macular grid (6 x 6 mm), central subfield (1 mm), and center foveal point (0.33 mm). Intrasession repeatability was assessed using coefficient of variation and intraclass correlation coefficient. The association between retinal thickness and PROS length with visual acuity was assessed using linear regression and Pearson correlation analyses. The main outcome measures include intrasession repeatability of macular parameters and correlation of these parameters with visual acuity. Mean retinal thickness and PROS length were 298 mum to 381 microm and 30 microm to 32 mum, respectively, for macular parameters assessed in this study. Coefficient of variation values were 0.75% to 4.13% for retinal thickness and 1.97% to 14.01% for PROS length. Intraclass correlation coefficient values were 0.96 to 0.99 and 0.73 to 0.98 for retinal thickness and PROS length, respectively. Slopes from linear regression analyses assessing the association of retinal thickness and visual acuity were not significantly different from 0 (P > 0.20), whereas the slopes of PROS length and visual acuity were significantly different from 0 (P < 0.0005). Correlation coefficients for macular thickness and visual acuity ranged from 0.13 to 0.22, whereas coefficients for PROS length and visual acuity ranged from -0.61 to -0.81. Photoreceptor outer segment length can be quantitatively assessed using Cirrus HD-OCT. Although the intrasession repeatability of PROS measurements was less than that of macular thickness measurements, the stronger correlation of PROS length with visual acuity suggests that the PROS measures may be more directly related to visual function. Photoreceptor outer segment length may be a useful physiologic outcome measure, both clinically and as a direct assessment of treatment effects.
Srivastava, Nishi; Srivastava, Amit; Srivastava, Sharad; Rawat, Ajay Kumar Singh; Khan, Abdul Rahman
2016-03-01
A rapid, sensitive, selective and robust quantitative densitometric high-performance thin-layer chromatographic method was developed and validated for separation and quantification of syringic acid (SYA) and kaempferol (KML) in the hydrolyzed extracts of Bergenia ciliata and Bergenia stracheyi. The separation was performed on silica gel 60F254 high-performance thin-layer chromatography plates using toluene : ethyl acetate : formic acid (5 : 4: 1, v/v/v) as the mobile phase. The quantification of SYA and KML was carried out using a densitometric reflection/absorption mode at 290 nm. A dense spot of SYA and KML appeared on the developed plate at a retention factor value of 0.61 ± 0.02 and 0.70 ± 0.01. A precise and accurate quantification was performed using linear regression analysis by plotting the peak area vs concentration 100-600 ng/band (correlation coefficient: r = 0.997, regression coefficient: R(2) = 0.996) for SYA and 100-600 ng/band (correlation coefficient: r = 0.995, regression coefficient: R(2) = 0.991) for KML. The developed method was validated in terms of accuracy, recovery and inter- and intraday study as per International Conference on Harmonisation guidelines. The limit of detection and limit of quantification of SYA and KML were determined, respectively, as 91.63, 142.26 and 277.67, 431.09 ng. The statistical data analysis showed that the method is reproducible and selective for the estimation of SYA and KML in extracts of B. ciliata and B. stracheyi. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Element enrichment factor calculation using grain-size distribution and functional data regression.
Sierra, C; Ordóñez, C; Saavedra, A; Gallego, J R
2015-01-01
In environmental geochemistry studies it is common practice to normalize element concentrations in order to remove the effect of grain size. Linear regression with respect to a particular grain size or conservative element is a widely used method of normalization. In this paper, the utility of functional linear regression, in which the grain-size curve is the independent variable and the concentration of pollutant the dependent variable, is analyzed and applied to detrital sediment. After implementing functional linear regression and classical linear regression models to normalize and calculate enrichment factors, we concluded that the former regression technique has some advantages over the latter. First, functional linear regression directly considers the grain-size distribution of the samples as the explanatory variable. Second, as the regression coefficients are not constant values but functions depending on the grain size, it is easier to comprehend the relationship between grain size and pollutant concentration. Third, regularization can be introduced into the model in order to establish equilibrium between reliability of the data and smoothness of the solutions. Copyright © 2014 Elsevier Ltd. All rights reserved.
Chaurasia, Ashok; Harel, Ofer
2015-02-10
Tests for regression coefficients such as global, local, and partial F-tests are common in applied research. In the framework of multiple imputation, there are several papers addressing tests for regression coefficients. However, for simultaneous hypothesis testing, the existing methods are computationally intensive because they involve calculation with vectors and (inversion of) matrices. In this paper, we propose a simple method based on the scalar entity, coefficient of determination, to perform (global, local, and partial) F-tests with multiply imputed data. The proposed method is evaluated using simulated data and applied to suicide prevention data. Copyright © 2014 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Rogers, A. M.; Harmsen, S. C.; Herrmann, R. B.; Meremonte, M. E.
1987-04-01
As a first step in the assessment of the earthquake hazard in the southern Great Basin of Nevada-California, this study evaluates the attenuation of peak vertical ground motions using a number of different regression models applied to unfiltered and band-pass-filtered ground motion data. These data are concentrated in the distance range 10-250 km. The regression models include parameters to account for geometric spreading, anelastic attenuation with a power law frequency dependence, source size, and station site effects. We find that the data are most consistent with an essentially frequency-independent Q and a geometric spreading coefficient less than 1.0. Regressions are also performed on vertical component peak amplitudes reexpressed as pseudo-Wood-Anderson peak amplitude estimates (PWA), permitting comparison with earlier work that used Wood-Anderson (WA) data from California. Both of these results show that Q values in this region are high relative to California, having values in the range 700-900 over the frequency band 1-10 Hz. Comparison of ML magnitudes from stations BRK and PAS for earthquakes in the southern Great Basin shows that these two stations report magnitudes with differences that are distance dependent. This bias suggests that the Richter log A0 curve appropriate to California is too steep for earthquakes occurring in southern Nevada, a result implicitly supporting our finding that Q values are higher than those in California. The PWA attenuation functions derived from our data also indicate that local magnitudes reported by California observatories for earthquakes in this region may be overestimated by as much as 0.8 magnitude units in some cases. Both of these results will have an effect on the assessment of the earthquake hazard in this region. The robustness of our regression technique to extract the correct geometric spreading coefficient n and anelastic attenuation Q is tested by applying the technique to simulated data computed with given n and Q values. Using a stochastic modeling technique, we generate suites of seismograms for the distance range 10-200 km and for both WA and short-period vertical component seismometers. Regressions on the peak amplitudes from these records show that our regression model extracts values of n and Q approximately equal to the input values for either low-Q California attenuation or high-Q southern Nevada attenuation. Regressions on stochastically modeled WA and PWA amplitudes also provides a method of evaluating differences in magnitudes from WA and PWA amplitudes due to recording instrument response characteristics alone. These results indicate a difference between MLWA and MLPWA equal to 0.15 magnitude units, which we term the residual instrument correction. In contrast to the peak amplitude results, coda Q determinations using the single scatterer theory indicate that Qc values are dependent on source type and are proportional to ƒp, where p = 0.8 to 1.0. This result suggests that a difference exists between attenuation mechanisms for direct waves and backscattered waves in this region.
Psychomotor development index in children younger than 6 years from Argentine provinces.
Lejarraga, Horacio; Kelmansky, Diana M; Masautis, Alicia; Nunes, Fernando
2018-04-01
To obtain a psychomotor development index (PDI) for each Argentine province. Using a national, probabilistic, and stratified sample of 13 323 male and female children younger than 6 years selected for the National Survey on Nutrition and Health (Encuesta Nacional de Nutrición y Salud, ENNyS 2004), we estimated the PDI per province based on compliance with 10 developmental milestones. The median age at attainment (median age) of each milestone was estimated adjusting a logistic regression. The PDI was estimated as 100* (1 + b), where "b" is the regression coefficient of y= a + b x, where "y" is the median age as per the national reference (x) minus the median age at attainment of a milestone. The theoretical value expected for the PDI was 100. The PDI per province ranged between 72.1 and 106.4. Most provinces showed a negative regression coefficient, which indicated a progressive increase of the delay in the age at attainment of milestones. The correlation coefficient between the PDI per province and infant mortality in 2005was extremely high: -0.85, suggesting that both indicators share similar biological and social determinants. The PDI was negative because the higher the mortality, the lower the PDI. We have now a positive health indicator available in Argentina: the psychomotor development index, which is a low-cost, easy to collect, and reliable tool that may be used in national health statistics. Sociedad Argentina de Pediatría.
Pulmonary Catherization Data Correlate Poorly with Renal Function in Heart Failure.
Masha, Luke; Stone, James; Stone, Danielle; Zhang, Jun; Sheng, Luo
2018-04-10
The mechanisms of renal dysfunction in heart failure are poorly understood. We chose to explore the relationship of cardiac filling pressures and cardiac index (CI) in relation to renal dysfunction in advanced heart failure. To determine the relationship between renal function and cardiac filling pressures using the United Network of Organ Sharing (UNOS) pulmonary artery catherization registry. Patients over the age of 18 years who were listed for single-organ heart transplantation were included. Exclusion criteria included a history of mechanical circulatory support, previous transplantation, any use of renal replacement therapy, prior history of malignancy, and cardiac surgery, amongst others. Correlations between serum creatinine (SCr) and CI, pulmonary capillary wedge pressure (PCWP), pulmonary artery systolic pressure (PASP), and pulmonary artery diastolic pressure (PADP) were assessed by Pearson correlation coefficients and simple linear regression coefficients. Pearson correlation coefficients between SCr and PCWP, PASP, and PADP were near zero with values of 0.1, 0.07, and 0.08, respectively (p < 0.0001). A weak negative correlation coefficient between SCr and CI was found (correlation coefficient, -0.045, p = 0.027). In a subgroup of young patients unlikely to have noncardiac etiologies, no significant correlations between these values were identified. These findings suggest that, as assessed by pulmonary artery catherization, none of the factors - PCWP, PASP, PADP, or CI - play a prominent role in cardiorenal syndromes. © 2018 S. Karger AG, Basel.
The intermediate endpoint effect in logistic and probit regression
MacKinnon, DP; Lockwood, CM; Brown, CH; Wang, W; Hoffman, JM
2010-01-01
Background An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect. PMID:17942466
NASA Astrophysics Data System (ADS)
Mekanik, F.; Imteaz, M. A.; Gato-Trinidad, S.; Elmahdi, A.
2013-10-01
In this study, the application of Artificial Neural Networks (ANN) and Multiple regression analysis (MR) to forecast long-term seasonal spring rainfall in Victoria, Australia was investigated using lagged El Nino Southern Oscillation (ENSO) and Indian Ocean Dipole (IOD) as potential predictors. The use of dual (combined lagged ENSO-IOD) input sets for calibrating and validating ANN and MR Models is proposed to investigate the simultaneous effect of past values of these two major climate modes on long-term spring rainfall prediction. The MR models that did not violate the limits of statistical significance and multicollinearity were selected for future spring rainfall forecast. The ANN was developed in the form of multilayer perceptron using Levenberg-Marquardt algorithm. Both MR and ANN modelling were assessed statistically using mean square error (MSE), mean absolute error (MAE), Pearson correlation (r) and Willmott index of agreement (d). The developed MR and ANN models were tested on out-of-sample test sets; the MR models showed very poor generalisation ability for east Victoria with correlation coefficients of -0.99 to -0.90 compared to ANN with correlation coefficients of 0.42-0.93; ANN models also showed better generalisation ability for central and west Victoria with correlation coefficients of 0.68-0.85 and 0.58-0.97 respectively. The ability of multiple regression models to forecast out-of-sample sets is compatible with ANN for Daylesford in central Victoria and Kaniva in west Victoria (r = 0.92 and 0.67 respectively). The errors of the testing sets for ANN models are generally lower compared to multiple regression models. The statistical analysis suggest the potential of ANN over MR models for rainfall forecasting using large scale climate modes.
Regression analysis of sparse asynchronous longitudinal data
Cao, Hongyuan; Zeng, Donglin; Fine, Jason P.
2015-01-01
Summary We consider estimation of regression models for sparse asynchronous longitudinal observations, where time-dependent responses and covariates are observed intermittently within subjects. Unlike with synchronous data, where the response and covariates are observed at the same time point, with asynchronous data, the observation times are mismatched. Simple kernel-weighted estimating equations are proposed for generalized linear models with either time invariant or time-dependent coefficients under smoothness assumptions for the covariate processes which are similar to those for synchronous data. For models with either time invariant or time-dependent coefficients, the estimators are consistent and asymptotically normal but converge at slower rates than those achieved with synchronous data. Simulation studies evidence that the methods perform well with realistic sample sizes and may be superior to a naive application of methods for synchronous data based on an ad hoc last value carried forward approach. The practical utility of the methods is illustrated on data from a study on human immunodeficiency virus. PMID:26568699
Overcoming multicollinearity in multiple regression using correlation coefficient
NASA Astrophysics Data System (ADS)
Zainodin, H. J.; Yap, S. J.
2013-09-01
Multicollinearity happens when there are high correlations among independent variables. In this case, it would be difficult to distinguish between the contributions of these independent variables to that of the dependent variable as they may compete to explain much of the similar variance. Besides, the problem of multicollinearity also violates the assumption of multiple regression: that there is no collinearity among the possible independent variables. Thus, an alternative approach is introduced in overcoming the multicollinearity problem in achieving a well represented model eventually. This approach is accomplished by removing the multicollinearity source variables on the basis of the correlation coefficient values based on full correlation matrix. Using the full correlation matrix can facilitate the implementation of Excel function in removing the multicollinearity source variables. It is found that this procedure is easier and time-saving especially when dealing with greater number of independent variables in a model and a large number of all possible models. Hence, in this paper detailed insight of the procedure is shown, compared and implemented.
Evaluation of AUC(0-4) predictive methods for cyclosporine in kidney transplant patients.
Aoyama, Takahiko; Matsumoto, Yoshiaki; Shimizu, Makiko; Fukuoka, Masamichi; Kimura, Toshimi; Kokubun, Hideya; Yoshida, Kazunari; Yago, Kazuo
2005-05-01
Cyclosporine (CyA) is the most commonly used immunosuppressive agent in patients who undergo kidney transplantation. Dosage adjustment of CyA is usually based on trough levels. Recently, trough levels have been replacing the area under the concentration-time curve during the first 4 h after CyA administration (AUC(0-4)). The aim of this study was to compare the predictive values obtained using three different methods of AUC(0-4) monitoring. AUC(0-4) was calculated from 0 to 4 h in early and stable renal transplant patients using the trapezoidal rule. The predicted AUC(0-4) was calculated using three different methods: the multiple regression equation reported by Uchida et al.; Bayesian estimation for modified population pharmacokinetic parameters reported by Yoshida et al.; and modified population pharmacokinetic parameters reported by Cremers et al. The predicted AUC(0-4) was assessed on the basis of predictive bias, precision, and correlation coefficient. The predicted AUC(0-4) values obtained using three methods through measurement of three blood samples showed small differences in predictive bias, precision, and correlation coefficient. In the prediction of AUC(0-4) measurement of one blood sample from stable renal transplant patients, the performance of the regression equation reported by Uchida depended on sampling time. On the other hand, the performance of Bayesian estimation with modified pharmacokinetic parameters reported by Yoshida through measurement of one blood sample, which is not dependent on sampling time, showed a small difference in the correlation coefficient. The prediction of AUC(0-4) using a regression equation required accurate sampling time. In this study, the prediction of AUC(0-4) using Bayesian estimation did not require accurate sampling time in the AUC(0-4) monitoring of CyA. Thus Bayesian estimation is assumed to be clinically useful in the dosage adjustment of CyA.
Dental age assessment of young Iranian adults using third molars: A multivariate regression study.
Bagherpour, Ali; Anbiaee, Najmeh; Partovi, Parnia; Golestani, Shayan; Afzalinasab, Shakiba
2012-10-01
In recent years, a noticeable increase in forensic age estimations of living individuals has been observed. Radiologic assessment of the mineralisation stage of third molars is of particular importance, with regard to the relevant age group. To attain a referral database and regression equations for dental age estimation of unaccompanied minors in an Iranian population was the goal of this study. Moreover, determination was made concerning the probability of an individual being over the age of 18 in case of full third molar(s) development. Using the scoring system of Gleiser and Hunt, modified by Köhler, an investigation of a cross-sectional sample of 1274 orthopantomograms of 885 females and 389 males aged between 15 and 22 years was carried out. Using kappa statistics, intra-observer reliability was tested. With Spearman correlation coefficient, correlation between the scores of all four wisdom teeth, was evaluated. We also carried out the Wilcoxon signed-rank test on asymmetry and calculated the regression formulae. A strong intra-observer agreement was displayed by the kappa value. No significant difference (p-value for upper and lower jaws were 0.07 and 0.59, respectively) was discovered by Wilcoxon signed-rank test for left and right asymmetry. The developmental stage of upper right and upper left third molars yielded the greatest correlation coefficient. The probability of an individual being over the age of 18 is 95.6% for males and 100.0% for females in case four fully developed third molars are present. Taking into consideration gender, location and number of wisdom teeth, regression formulae were arrived at. Use of population-specific standards is recommended as a means of improving the accuracy of forensic age estimates based on third molars mineralisation. To obtain more exact regression formulae, wider age range studies are recommended. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Weather adjustment using seemingly unrelated regression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Noll, T.A.
1995-05-01
Seemingly unrelated regression (SUR) is a system estimation technique that accounts for time-contemporaneous correlation between individual equations within a system of equations. SUR is suited to weather adjustment estimations when the estimation is: (1) composed of a system of equations and (2) the system of equations represents either different weather stations, different sales sectors or a combination of different weather stations and different sales sectors. SUR utilizes the cross-equation error values to develop more accurate estimates of the system coefficients than are obtained using ordinary least-squares (OLS) estimation. SUR estimates can be generated using a variety of statistical software packagesmore » including MicroTSP and SAS.« less
Mager, P P; Rothe, H
1990-10-01
Multicollinearity of physicochemical descriptors leads to serious consequences in quantitative structure-activity relationship (QSAR) analysis, such as incorrect estimators and test statistics of regression coefficients of the ordinary least-squares (OLS) model applied usually to QSARs. Beside the diagnosis of the known simple collinearity, principal component regression analysis (PCRA) also allows the diagnosis of various types of multicollinearity. Only if the absolute values of PCRA estimators are order statistics that decrease monotonically, the effects of multicollinearity can be circumvented. Otherwise, obscure phenomena may be observed, such as good data recognition but low predictive model power of a QSAR model.
Sheppard, James P.; Holder, Roger; Nichols, Linda; Bray, Emma; Hobbs, F.D. Richard; Mant, Jonathan; Little, Paul; Williams, Bryan; Greenfield, Sheila; McManus, Richard J.
2014-01-01
Objectives: Identification of people with lower (white-coat effect) or higher (masked effect) blood pressure at home compared to the clinic usually requires ambulatory or home monitoring. This study assessed whether changes in SBP with repeated measurement at a single clinic predict subsequent differences between clinic and home measurements. Methods: This study used an observational cohort design and included 220 individuals aged 35–84 years, receiving treatment for hypertension, but whose SBP was not controlled. The characteristics of change in SBP over six clinic readings were defined as the SBP drop, the slope and the quadratic coefficient using polynomial regression modelling. The predictive abilities of these characteristics for lower or higher home SBP readings were investigated with logistic regression and repeated operating characteristic analysis. Results: The single clinic SBP drop was predictive of the white-coat effect with a sensitivity of 90%, specificity of 50%, positive predictive value of 56% and negative predictive value of 88%. Predictive values for the masked effect and those of the slope and quadratic coefficient were slightly lower, but when the slope and quadratic variables were combined, the sensitivity, specificity, positive and negative predictive values for the masked effect were improved to 91, 48, 24 and 97%, respectively. Conclusion: Characteristics obtainable from multiple SBP measurements in a single clinic in patients with treated hypertension appear to reasonably predict those unlikely to have a large white-coat or masked effect, potentially allowing better targeting of out-of-office monitoring in routine clinical practice. PMID:25144295
Hayashi, K; Yamada, T; Sawa, T
2015-03-01
The return or Poincaré plot is a non-linear analytical approach in a two-dimensional plane, where a timed signal is plotted against itself after a time delay. Its scatter pattern reflects the randomness and variability in the signals. Quantification of a Poincaré plot of the electroencephalogram has potential to determine anaesthesia depth. We quantified the degree of dispersion (i.e. standard deviation, SD) along the diagonal line of the electroencephalogram-Poincaré plot (named as SD1/SD2), and compared SD1/SD2 values with spectral edge frequency 95 (SEF95) and bispectral index values. The regression analysis showed a tight linear regression equation with a coefficient of determination (R(2) ) value of 0.904 (p < 0.0001) between the Poincaré index (SD1/SD2) and SEF95, and a moderate linear regression equation between SD1/SD2 and bispectral index (R(2) = 0.346, p < 0.0001). Quantification of the Poincaré plot tightly correlates with SEF95, reflecting anaesthesia-dependent changes in electroencephalogram oscillation. © 2014 The Association of Anaesthetists of Great Britain and Ireland.
Talpur, M Younis; Kara, Huseyin; Sherazi, S T H; Ayyildiz, H Filiz; Topkafa, Mustafa; Arslan, Fatma Nur; Naz, Saba; Durmaz, Fatih; Sirajuddin
2014-11-01
Single bounce attenuated total reflectance (SB-ATR) Fourier transform infrared (FTIR) spectroscopy in conjunction with chemometrics was used for accurate determination of free fatty acid (FFA), peroxide value (PV), iodine value (IV), conjugated diene (CD) and conjugated triene (CT) of cottonseed oil (CSO) during potato chips frying. Partial least square (PLS), stepwise multiple linear regression (SMLR), principal component regression (PCR) and simple Beer׳s law (SBL) were applied to develop the calibrations for simultaneous evaluation of five stated parameters of cottonseed oil (CSO) during frying of French frozen potato chips at 170°C. Good regression coefficients (R(2)) were achieved for FFA, PV, IV, CD and CT with value of >0.992 by PLS, SMLR, PCR, and SBL. Root mean square error of prediction (RMSEP) was found to be less than 1.95% for all determinations. Result of the study indicated that SB-ATR FTIR in combination with multivariate chemometrics could be used for accurate and simultaneous determination of different parameters during the frying process without using any toxic organic solvent. Copyright © 2014 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Kelly, B.; Chelsky, A.; Bulygina, E.; Roberts, B. J.
2017-12-01
Remote sensing techniques have become valuable tools to researchers, providing the capability to measure and visualize important parameters without the need for time or resource intensive sampling trips. Relationships between dissolved organic carbon (DOC), colored dissolved organic matter (CDOM) and spectral data have been used to remotely sense DOC concentrations in riverine systems, however, this approach has not been applied to the northern Gulf of Mexico (GoM) and needs to be tested to determine how accurate these relationships are in riverine-dominated shelf systems. In April, July, and October 2017 we sampled surface water from 80+ sites over an area of 100,000 km2 along the Louisiana-Texas shelf in the northern GoM. DOC concentrations were measured on filtered water samples using a Shimadzu TOC-VCSH analyzer using standard techniques. Additionally, DOC concentrations were estimated from CDOM absorption coefficients of filtered water samples on a UV-Vis spectrophotometer using a modification of the methods of Fichot and Benner (2011). These values were regressed against Landsat visible band spectral data for those same locations to establish a relationship between the spectral data, CDOM absorption coefficients. This allowed us to spatially map CDOM absorption coefficients in the Gulf of Mexico using the Landsat spectral data in GIS. We then used a multiple linear regressions model to derive DOC concentrations from the CDOM absorption coefficients and applied those to our map. This study provides an evaluation of the viability of scaling up CDOM absorption coefficient and remote-sensing derived estimates of DOC concentrations to the scale of the LA-TX shelf ecosystem.
Bouti, Khalid; Benamor, Jouda; Bourkadi, Jamal Eddine
2017-08-01
Peak Expiratory Flow (PEF) has never been characterised among healthy Moroccan school children. To study the relationship between PEF and anthropometric parameters (sex, age, height and weight) in healthy Moroccan school children, to establish predictive equations of PEF; and to compare flowmetric and spirometric PEF with Forced Expiratory Volume in 1 second (FEV1). This cross-sectional study was conducted between April, 2016 and May, 2016. It involved 222 (122 boys and 100 girls) healthy school children living in Ksar el-Kebir, Morocco. We used mobile equipments for realisation of spirometry and peak expiratory flow measurements. SPSS (Version 22.0) was used to calculate Student's t-test, Pearson's correlation coefficient and linear regression. Significant linear correlation was seen between PEF, age and height in boys and girls. The equation for prediction of flowmetric PEF in boys was calculated as 'F-PEF = -187+ 24.4 Age + 1.61 Height' (p-value<0.001, r=0.86), and for girls as 'F-PEF = -151 + 17Age + 1.59Height' (p-value<0.001, r=0.86). The equation for prediction of spirometric PEF in boys was calculated as 'S-PEF = -199+ 9.8Age + 2.67Height' (p-value<0.05, r=0.77), and for girls as 'S-PEF = -181 + 8.5Age + 2.5Height' (p-value<0.001, r=0.83). The boys had higher values than the girls. The performance of the Mini Wright Peak Flow Meter was lower than that of a spirometer. Our study established PEF predictive equations in Moroccan children. Our results appeared to be reliable, as evident by the high correlation coefficient in this sample. PEF can be an alternative of FEV1 in centers without spirometry.
Novaković, Romana; Geelen, Anouk; Ristić-Medić, Danijela; Nikolić, Marina; Souverein, Olga W; McNulty, Helene; Duffy, Maresa; Hoey, Leane; Dullemeijer, Carla; Renkema, Jacoba M S; Gurinović, Mirjana; Glibetić, Marija; de Groot, Lisette C P G M; Van't Veer, Pieter
2018-06-07
Dietary reference values for folate intake vary widely across Europe. MEDLINE and Embase through November 2016 were searched for data on the association between folate intake and biomarkers (serum/plasma folate, red blood cell [RBC] folate, plasma homocysteine) from observational studies in healthy adults and elderly. The regression coefficient of biomarkers on intake (β) was extracted from each study, and the overall and stratified pooled β and SE (β) were obtained by random effects meta-analysis on a double log scale. These dose-response estimates may be used to derive folate intake reference values. For every doubling in folate intake, the changes in serum/plasma folate, RBC folate and plasma homocysteine were +22, +21, and -16% respectively. The overall pooled regression coefficients were β = 0.29 (95% CI 0.21-0.37) for serum/plasma folate (26 estimates from 17 studies), β = 0.28 (95% CI 0.21-0.36) for RBC (13 estimates from 11 studies), and β = -0.21 (95% CI -0.31 to -0.11) for plasma homocysteine (10 estimates from 6 studies). These estimates along with those from randomized controlled trials can be used for underpinning dietary recommendations for folate in adults and elderly. © 2018 S. Karger AG, Basel.
Global estimation of long-term persistence in annual river runoff
NASA Astrophysics Data System (ADS)
Markonis, Y.; Moustakis, Y.; Nasika, C.; Sychova, P.; Dimitriadis, P.; Hanel, M.; Máca, P.; Papalexiou, S. M.
2018-03-01
Long-term persistence (LTP) of annual river runoff is a topic of ongoing hydrological research, due to its implications to water resources management. Here, we estimate its strength, measured by the Hurst coefficient H, in 696 annual, globally distributed, streamflow records with at least 80 years of data. We use three estimation methods (maximum likelihood estimator, Whittle estimator and least squares variance) resulting in similar mean values of H close to 0.65. Subsequently, we explore potential factors influencing H by two linear (Spearman's rank correlation, multiple linear regression) and two non-linear (self-organizing maps, random forests) techniques. Catchment area is found to be crucial for medium to larger watersheds, while climatic controls, such as aridity index, have higher impact to smaller ones. Our findings indicate that long-term persistence is weaker than found in other studies, suggesting that enhanced LTP is encountered in large-catchment rivers, were the effect of spatial aggregation is more intense. However, we also show that the estimated values of H can be reproduced by a short-term persistence stochastic model such as an auto-regressive AR(1) process. A direct consequence is that some of the most common methods for the estimation of H coefficient, might not be suitable for discriminating short- and long-term persistence even in long observational records.
ERIC Educational Resources Information Center
Yan, Jun; Aseltine, Robert H., Jr.; Harel, Ofer
2013-01-01
Comparing regression coefficients between models when one model is nested within another is of great practical interest when two explanations of a given phenomenon are specified as linear models. The statistical problem is whether the coefficients associated with a given set of covariates change significantly when other covariates are added into…
NASA Astrophysics Data System (ADS)
Dyar, M. D.; Carmosino, M. L.; Breves, E. A.; Ozanne, M. V.; Clegg, S. M.; Wiens, R. C.
2012-04-01
A remote laser-induced breakdown spectrometer (LIBS) designed to simulate the ChemCam instrument on the Mars Science Laboratory Rover Curiosity was used to probe 100 geologic samples at a 9-m standoff distance. ChemCam consists of an integrated remote LIBS instrument that will probe samples up to 7 m from the mast of the rover and a remote micro-imager (RMI) that will record context images. The elemental compositions of 100 igneous and highly-metamorphosed rocks are determined with LIBS using three variations of multivariate analysis, with a goal of improving the analytical accuracy. Two forms of partial least squares (PLS) regression are employed with finely-tuned parameters: PLS-1 regresses a single response variable (elemental concentration) against the observation variables (spectra, or intensity at each of 6144 spectrometer channels), while PLS-2 simultaneously regresses multiple response variables (concentrations of the ten major elements in rocks) against the observation predictor variables, taking advantage of natural correlations between elements. Those results are contrasted with those from the multivariate regression technique of the least absolute shrinkage and selection operator (lasso), which is a penalized shrunken regression method that selects the specific channels for each element that explain the most variance in the concentration of that element. To make this comparison, we use results of cross-validation and of held-out testing, and employ unscaled and uncentered spectral intensity data because all of the input variables are already in the same units. Results demonstrate that the lasso, PLS-1, and PLS-2 all yield comparable results in terms of accuracy for this dataset. However, the interpretability of these methods differs greatly in terms of fundamental understanding of LIBS emissions. PLS techniques generate principal components, linear combinations of intensities at any number of spectrometer channels, which explain as much variance in the response variables as possible while avoiding multicollinearity between principal components. When the selected number of principal components is projected back into the original feature space of the spectra, 6144 correlation coefficients are generated, a small fraction of which are mathematically significant to the regression. In contrast, the lasso models require only a small number (< 24) of non-zero correlation coefficients (β values) to determine the concentration of each of the ten major elements. Causality between the positively-correlated emission lines chosen by the lasso and the elemental concentration was examined. In general, the higher the lasso coefficient (β), the greater the likelihood that the selected line results from an emission of that element. Emission lines with negative β values should arise from elements that are anti-correlated with the element being predicted. For elements except Fe, Al, Ti, and P, the lasso-selected wavelength with the highest β value corresponds to the element being predicted, e.g. 559.8 nm for neutral Ca. However, the specific lines chosen by the lasso with positive β values are not always those from the element being predicted. Other wavelengths and the elements that most strongly correlate with them to predict concentration are obviously related to known geochemical correlations or close overlap of emission lines, while others must result from matrix effects. Use of the lasso technique thus directly informs our understanding of the underlying physical processes that give rise to LIBS emissions by determining which lines can best represent concentration, and which lines from other elements are causing matrix effects.
NASA Astrophysics Data System (ADS)
Wei, C.; Cheng, K. S.
Using meteorological radar and satellite imagery had become an efficient tool for rainfall forecasting However few studies were aimed to predict quantitative rainfall in small watersheds for flood forecasting by using remote sensing data Due to the terrain shelter and ground clutter effect of Central Mountain Ridges the application of meteorological radar data was limited in mountainous areas of central Taiwan This study devises a new scheme to predict rainfall of a small upstream watershed by combing GOES-9 geostationary weather satellite imagery and ground rainfall records which can be applied for local quantitative rainfall forecasting during periods of typhoon and heavy rainfall Imagery of two typhoon events in 2004 and five correspondent ground raingauges records of Chitou Forest Recreational Area which is located in upstream region of Bei-Shi river were analyzed in this study The watershed accounts for 12 7 square kilometers and altitudes ranging from 1000 m to 1800 m Basin-wide Average Rainfall BAR in study area were estimated by block kriging Cloud Top Temperature CTT from satellite imagery and ground hourly rainfall records were medium correlated The regression coefficient ranges from 0 5 to 0 7 and the value decreases as the altitude of the gauge site increases The regression coefficient of CCT and next 2 to 6 hour accumulated BAR decrease as the time scale increases The rainfall forecasting for BAR were analyzed by Kalman Filtering Technique The correlation coefficient and average hourly deviates between estimated and observed value of BAR for
Ghosh, Adarsh; Singh, Tulika; Singla, Veenu; Bagga, Rashmi; Khandelwal, Niranjan
2017-12-01
Apparent diffusion coefficient (ADC) maps are usually generated by builtin software provided by the MRI scanner vendors; however, various open-source postprocessing software packages are available for image manipulation and parametric map generation. The purpose of this study is to establish the reproducibility of absolute ADC values obtained using different postprocessing software programs. DW images with three b values were obtained with a 1.5-T MRI scanner, and the trace images were obtained. ADC maps were automatically generated by the in-line software provided by the vendor during image generation and were also separately generated on postprocessing software. These ADC maps were compared on the basis of ROIs using paired t test, Bland-Altman plot, mountain plot, and Passing-Bablok regression plot. There was a statistically significant difference in the mean ADC values obtained from the different postprocessing software programs when the same baseline trace DW images were used for the ADC map generation. For using ADC values as a quantitative cutoff for histologic characterization of tissues, standardization of the postprocessing algorithm is essential across processing software packages, especially in view of the implementation of vendor-neutral archiving.
Lowery, Michael G; Calfin, Brenda; Yeh, Shu-Jen; Doan, Tao; Shain, Eric; Hanna, Charles; Hohs, Ronald; Kantor, Stan; Lindberg, John; Khalil, Omar S
2006-01-01
We used the effect of temperature on the localized reflectance of human skin to assess the role of noise sources on the correlation between temperature-induced fractional change in optical density of human skin (DeltaOD(T)) and blood glucose concentration [BG]. Two temperature-controlled optical probes at 30 degrees C contacted the skin, one was then cooled by -10 degrees C; the other was heated by +10 degrees C. DeltaOD(T) upon cooling or heating was correlated with capillary [BG] of diabetic volunteers over a period of three days. Calibration models in the first two days were used to predict [BG] in the third day. We examined the conditions where the correlation coefficient (R2) for predicting [BG] in a third day ranked higher than R2 values resulting from fitting permutations of randomized [BG] to the same DeltaOD(T) values. It was possible to establish a four-term linear regression correlation between DeltaOD(T) upon cooling and [BG] with a correlation coefficient higher than that of an established noise threshold in diabetic patients that were mostly females with less than 20 years of diabetes duration. The ability to predict [BG] values with a correlation coefficient above biological and body-interface noise varied between the cases of cooling and heating.
NASA Astrophysics Data System (ADS)
Gowda, Shivalinge; Krishnaveni, S.; Yashoda, T.; Umesh, T. K.; Gowda, Ramakrishna
2004-09-01
Photon mass attenuation coefficients of some thermoluminescent dosimetric (TLD) compounds, such as LiF, CaCO_3, CaSO_4, CaSO_4\\cdot2H_2O, SrSO_4, CdSO_4, BaSO_4, C_4H_6BaO_4 and 3CdSO_4\\cdot8H_2O were determined at 279.2, 320.07, 514.0, 661.6, 1115.5, 1173.2 and 1332.5 keV in a well-collimated narrow beam good geometry set-up using a high resolution, hyper pure germanium detector. The attenuation coefficient data were then used to compute the effective atomic number and the electron density of TLD compounds. The interpolation of total attenuation cross-sections of photons of energy E in elements of atomic number Z was performed using the logarithmic regression analysis of the data measured by the authors and reported earlier. The best-fit coefficients so obtained in the photon energy range of 279.2 to 320.07 keV, 514.0 to 661.6 keV and 1115.5 to 1332.5 keV by a piece-wise interpolation method were then used to find the effective atomic number and electron density of the compounds. These values are found to be in agreement with other available published values.
Gouvinhas, Irene; Machado, Nelson; Carvalho, Teresa; de Almeida, José M M M; Barros, Ana I R N A
2015-01-01
Extra virgin olive oils produced from three cultivars on different maturation stages were characterized using Raman spectroscopy. Chemometric methods (principal component analysis, discriminant analysis, principal component regression and partial least squares regression) applied to Raman spectral data were utilized to evaluate and quantify the statistical differences between cultivars and their ripening process. The models for predicting the peroxide value and free acidity of olive oils showed good calibration and prediction values and presented high coefficients of determination (>0.933). Both the R(2), and the correlation equations between the measured chemical parameters, and the values predicted by each approach are presented; these comprehend both PCR and PLS, used to assess SNV normalized Raman data, as well as first and second derivative of the spectra. This study demonstrates that a combination of Raman spectroscopy with multivariate analysis methods can be useful to predict rapidly olive oil chemical characteristics during the maturation process. Copyright © 2014 Elsevier B.V. All rights reserved.
Determining the response of sea level to atmospheric pressure forcing using TOPEX/POSEIDON data
NASA Technical Reports Server (NTRS)
Fu, Lee-Lueng; Pihos, Greg
1994-01-01
The static response of sea level to the forcing of atmospheric pressure, the so-called inverted barometer (IB) effect, is investigated using TOPEX/POSEIDON data. This response, characterized by the rise and fall of sea level to compensate for the change of atmospheric pressure at a rate of -1 cm/mbar, is not associated with any ocean currents and hence is normally treated as an error to be removed from sea level observation. Linear regression and spectral transfer function analyses are applied to sea level and pressure to examine the validity of the IB effect. In regions outside the tropics, the regression coefficient is found to be consistently close to the theoretical value except for the regions of western boundary currents, where the mesoscale variability interferes with the IB effect. The spectral transfer function shows near IB response at periods of 30 degrees is -0.84 +/- 0.29 cm/mbar (1 standard deviation). The deviation from = 1 cm /mbar is shown to be caused primarily by the effect of wind forcing on sea level, based on multivariate linear regression model involving both pressure and wind forcing. The regression coefficient for pressure resulting from the multivariate analysis is -0.96 +/- 0.32 cm/mbar. In the tropics the multivariate analysis fails because sea level in the tropics is primarily responding to remote wind forcing. However, after removing from the data the wind-forced sea level estimated by a dynamic model of the tropical Pacific, the pressure regression coefficient improves from -1.22 +/- 0.69 cm/mbar to -0.99 +/- 0.46 cm/mbar, clearly revealing an IB response. The result of the study suggests that with a proper removal of the effect of wind forcing the IB effect is valid in most of the open ocean at periods longer than 20 days and spatial scales larger than 500 km.
Misyura, Maksym; Sukhai, Mahadeo A; Kulasignam, Vathany; Zhang, Tong; Kamel-Reid, Suzanne; Stockley, Tracy L
2018-02-01
A standard approach in test evaluation is to compare results of the assay in validation to results from previously validated methods. For quantitative molecular diagnostic assays, comparison of test values is often performed using simple linear regression and the coefficient of determination (R 2 ), using R 2 as the primary metric of assay agreement. However, the use of R 2 alone does not adequately quantify constant or proportional errors required for optimal test evaluation. More extensive statistical approaches, such as Bland-Altman and expanded interpretation of linear regression methods, can be used to more thoroughly compare data from quantitative molecular assays. We present the application of Bland-Altman and linear regression statistical methods to evaluate quantitative outputs from next-generation sequencing assays (NGS). NGS-derived data sets from assay validation experiments were used to demonstrate the utility of the statistical methods. Both Bland-Altman and linear regression were able to detect the presence and magnitude of constant and proportional error in quantitative values of NGS data. Deming linear regression was used in the context of assay comparison studies, while simple linear regression was used to analyse serial dilution data. Bland-Altman statistical approach was also adapted to quantify assay accuracy, including constant and proportional errors, and precision where theoretical and empirical values were known. The complementary application of the statistical methods described in this manuscript enables more extensive evaluation of performance characteristics of quantitative molecular assays, prior to implementation in the clinical molecular laboratory. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Lu, Lee-Jane W.; Nishino, Thomas K.; Khamapirad, Tuenchit; Grady, James J; Leonard, Morton H.; Brunder, Donald G.
2009-01-01
Breast density (the percentage of fibroglandular tissue in the breast) has been suggested to be a useful surrogate marker for breast cancer risk. It is conventionally measured using screen-film mammographic images by a labor intensive histogram segmentation method (HSM). We have adapted and modified the HSM for measuring breast density from raw digital mammograms acquired by full-field digital mammography. Multiple regression model analyses showed that many of the instrument parameters for acquiring the screening mammograms (e.g. breast compression thickness, radiological thickness, radiation dose, compression force, etc) and image pixel intensity statistics of the imaged breasts were strong predictors of the observed threshold values (model R2=0.93) and %density (R2=0.84). The intra-class correlation coefficient of the %-density for duplicate images was estimated to be 0.80, using the regression model-derived threshold values, and 0.94 if estimated directly from the parameter estimates of the %-density prediction regression model. Therefore, with additional research, these mathematical models could be used to compute breast density objectively, automatically bypassing the HSM step, and could greatly facilitate breast cancer research studies. PMID:17671343
Properties of a color-changeable chewing gum used to evaluate masticatory performance.
Hama, Yohei; Kanazawa, Manabu; Minakuchi, Shunsuke; Uchida, Tatsuro; Sasaki, Yoshiyuki
2014-04-01
To clarify the basic properties of a color-changeable chewing gum to determine its applicability to evaluations of masticatory performance under different types of dental status. Ten participants with natural dentition aged 26-30 years chewed gum that changes color during several chewing strokes over five repetitions. Changes in color were assessed using a colorimeter, and then L*, a*, and b* values in the CIELAB color system were quantified. Relationships between chewing progression and color changes were assessed using regression analysis and the reliability of color changes was assessed using intraclass correlation coefficients. We then measured 42 dentate participants (age, 22-31 years) and 47 complete denture wearers (age, 44-90 years) to determine the detectability of masticatory performance under two types of dental status. Regression between the number of chewing strokes and the difference between two colors was non-linear. The intraclass correlation coefficients were highest between 60 and 160 chewing strokes. Dentate and edentulous groups significantly differed (Wilcoxon rank sum test) and values were widely distributed within each group. The color of the chewing gum changed over a wide range, which was sufficient to evaluate the masticatory performance of individuals with natural dentition and those with complete dentures. Changes in the color values of the gum reliably reflected masticatory performance. These findings indicate that the color-changeable chewing gum will be useful for evaluating masticatory performance under any dental status. Copyright © 2014 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses. PMID:22457655
Tools to support interpreting multiple regression in the face of multicollinearity.
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.
Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.
Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H
2016-01-01
Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.
Bao, Jie; Hou, Zhangshuan; Huang, Maoyi; ...
2015-12-04
Here, effective sensitivity analysis approaches are needed to identify important parameters or factors and their uncertainties in complex Earth system models composed of multi-phase multi-component phenomena and multiple biogeophysical-biogeochemical processes. In this study, the impacts of 10 hydrologic parameters in the Community Land Model on simulations of runoff and latent heat flux are evaluated using data from a watershed. Different metrics, including residual statistics, the Nash-Sutcliffe coefficient, and log mean square error, are used as alternative measures of the deviations between the simulated and field observed values. Four sensitivity analysis (SA) approaches, including analysis of variance based on the generalizedmore » linear model, generalized cross validation based on the multivariate adaptive regression splines model, standardized regression coefficients based on a linear regression model, and analysis of variance based on support vector machine, are investigated. Results suggest that these approaches show consistent measurement of the impacts of major hydrologic parameters on response variables, but with differences in the relative contributions, particularly for the secondary parameters. The convergence behaviors of the SA with respect to the number of sampling points are also examined with different combinations of input parameter sets and output response variables and their alternative metrics. This study helps identify the optimal SA approach, provides guidance for the calibration of the Community Land Model parameters to improve the model simulations of land surface fluxes, and approximates the magnitudes to be adjusted in the parameter values during parametric model optimization.« less
Measurement of reaeration coefficients for selected Florida streams
Hampson, P.S.; Coffin, J.E.
1989-01-01
A total of 29 separate reaeration coefficient determinations were performed on 27 subreaches of 12 selected Florida streams between October 1981 and May 1985. Measurements performed prior to June 1984 were made using the peak and area methods with ethylene and propane as the tracer gases. Later measurements utilized the steady-state method with propane as the only tracer gas. The reaeration coefficients ranged from 1.07 to 45.9 days with a mean estimated probable error of +/16.7%. Ten predictive equations (compiled from the literature) were also evaluated using the measured coefficients. The most representative equation was one of the energy dissipation type with a standard error of 60.3%. Seven of the 10 predictive additional equations were modified using the measured coefficients and nonlinear regression techniques. The most accurate of the developed equations was also of the energy dissipation form and had a standard error of 54.9%. For 5 of the 13 subreaches in which both ethylene and propane were used, the ethylene data resulted in substantially larger reaeration coefficient values which were rejected. In these reaches, ethylene concentrations were probably significantly affected by one or more electrophilic addition reactions known to occur in aqueous media. (Author 's abstract)
Mastin, Mark C.; Konrad, Christopher P.; Veilleux, Andrea G.; Tecca, Alison E.
2016-09-20
An investigation into the magnitude and frequency of floods in Washington State computed the annual exceedance probability (AEP) statistics for 648 U.S. Geological Survey unregulated streamgages in and near the borders of Washington using the recorded annual peak flows through water year 2014. This is an updated report from a previous report published in 1998 that used annual peak flows through the water year 1996. New in this report, a regional skew coefficient was developed for the Pacific Northwest region that includes areas in Oregon, Washington, Idaho and western Montana within the Columbia River drainage basin south of the United States-Canada border, the coastal areas of Oregon and western Washington, and watersheds draining into Puget Sound, Washington. The skew coefficient is an important term in the Log Pearson Type III equation used to define the distribution of the log-transformed annual peaks. The Expected Moments Algorithm was used to fit historical and censored peak-flow data to the log Pearson Type III distribution. A Multiple Grubb-Beck test was employed to censor low outliers of annual peak flows to improve on the frequency distribution. This investigation also includes a section on observed trends in annual peak flows that showed significant trends (p-value < 0.05) in 21 of 83 long-term sites, but with small magnitude Kendall tau values suggesting a limited monotonic trend in the time series of annual peaks. Most of the sites with a significant trend in western Washington were positive and all the sites with significant trends (three sites) in eastern Washington were negative.Multivariate regression analysis with measured basin characteristics and the AEP statistics at long-term, unregulated, and un-urbanized (defined as drainage basins with less than 5 percent impervious land cover for this investigation) streamgages within Washington and some in Idaho and Oregon that are near the Washington border was used to develop equations to estimate AEP statistics at ungaged basins. Washington was divided into four regions to improve the accuracy of the regression equations; a set of equations for eight selected AEPs and for each region were constructed. Selected AEP statistics included the annual peak flows that equaled or exceeded 50, 20, 10, 4, 2, 1, 0.5 and 0.2 percent of the time equivalent to peak flows for peaks with a 2-, 5-, 10-, 25-, 50-, 100-, 200-, and 500-year recurrence intervals, respectively. Annual precipitation and drainage area were the significant basin characteristics in the regression equations for all four regression regions in Washington and forest cover was significant for the two regression regions in eastern Washington. Average standard error of prediction for the regional regression equations ranged from 70.19 to 125.72 percent for Regression Regions 1 and 2 on the eastern side of the Cascade Mountains and from 43.22 to 58.04 percent for Regression Regions 3 and 4 on the western side of the Cascade Mountains. The pseudo coefficient of determination (where a value of 100 signifies a perfect regression model) ranged from 68.39 to 90.68 for Regression Regions 1 and 2, and 92.35 to 95.44 for Regions 3 and 4.The calculated AEP statistics for the streamgages and the regional regression equations are expected to be incorporated into StreamStats after the publication of this report. StreamStats is the interactive Web-based map tool created by the U.S. Geological Survey to allow the user to choose a streamgage and obtain published statistics or choose ungaged locations where the program automatically applies the regional regression equations and computes the estimates of the AEP statistics.
Usman, B I; Amin, S M N; Arshad, A; Kamarudin, M S
2016-07-01
Samples of grey eel catfish Plotosus canius were collected from the coastal waters of Port Dickson, Malaysia from January to December, 2012. A total of 341 specimens (172 males and 169 females) were used to estimate the length-weight relationship parameters. Mean population size of females were 0.72 cm taller than the males, however difference was not significant (t-test, P > 0.05). The overall relationship equations between total length (TL) and body weight (BW) were established for males as Log TW = 2.71 Log TL - 1.85 (R2 = 0.95) and for females as Log TW = 2.88 Log TL-2.10 (R2 = 0.95). The estimated relative growth co-efficient (b) values were 2.71 for males and 2.88 for females. It is revealed that growth pattern of the species showed negative allometry. In both males and females, relationship between TL and SL gave highest regression coefficient (0.99). While relationship between TL and EL gave lowest regression coefficient in both males and females (0.87 and 0.81 respectively). The findings from this study contributed first information on morphometric relations of the fish from Malaysian coastal waters and could be useful for sustainable management options of P. canius in Malaysia.
Quantitative prediction of ionization effect on human skin permeability.
Baba, Hiromi; Ueno, Yusuke; Hashida, Mitsuru; Yamashita, Fumiyoshi
2017-04-30
Although skin permeability of an active ingredient can be severely affected by its ionization in a dose solution, most of the existing prediction models cannot predict such impacts. To provide reliable predictors, we curated a novel large dataset of in vitro human skin permeability coefficients for 322 entries comprising chemically diverse permeants whose ionization fractions can be calculated. Subsequently, we generated thousands of computational descriptors, including LogD (octanol-water distribution coefficient at a specific pH), and analyzed the dataset using nonlinear support vector regression (SVR) and Gaussian process regression (GPR) combined with greedy descriptor selection. The SVR model was slightly superior to the GPR model, with externally validated squared correlation coefficient, root mean square error, and mean absolute error values of 0.94, 0.29, and 0.21, respectively. These models indicate that Log D is effective for a comprehensive prediction of ionization effects on skin permeability. In addition, the proposed models satisfied the statistical criteria endorsed in recent model validation studies. These models can evaluate virtually generated compounds at any pH; therefore, they can be used for high-throughput evaluations of numerous active ingredients and optimization of their skin permeability with respect to permeant ionization. Copyright © 2017 Elsevier B.V. All rights reserved.
Prediction of heat capacities of solid inorganic salts from group contributions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mostafa, A.T.M.G.; Eakman, J.M.; Yarbro, S.L.
1997-01-01
A group contribution technique is proposed to predict the coefficients in the heat capacity correlation, C{sub p} = a + bT + c/T{sup 2} + dT{sup 2}, for solid inorganic salts. The results from this work are compared with fits to experimental data from the literature. It is shown to give good predictions for both simple and complex solid inorganic salts. Literature heat capacities for a large number (664) of solid inorganic salts covering a broad range of cations (129), anions (17) and ligands (2) have been used in regressions to obtain group contributions for the parameters in the heatmore » capacity temperature function. A mean error of 3.18% is found when predicted values are compared with literature values for heat capacity at 298{degrees} K. Estimates of the error standard deviation from the regression for each additivity constant are also determined.« less
Mangrove canopy density analysis using Sentinel-2A imagery satellite data
NASA Astrophysics Data System (ADS)
Wachid, M. N.; Hapsara, R. P.; Cahyo, R. D.; Wahyu, G. N.; Syarif, A. M.; Umarhadi, D. A.; Fitriani, A. N.; Ramadhanningrum, D. P.; Widyatmanti, W.
2017-06-01
Teluk Jor has alluvium surface sediment that came from volcanic materials. Sea wave that relatively calm and the closed beach shape support the existence of mangrove forest at Teluk Jor. Sentinel-2A imagery has a good spatial and spectral resolution for mangrove density study. The regression between samples and the NDVI values of Sentinel-2A used to analyze the mangrove canopy density. Mangrove canopy density was identified using field survey with transect method. The regression analysis shows field data and NDVI value has correlation R=0.7739 and coefficient of determination R2=0.5989. The result of the analysis shows area of low density 397,900 m2, moderate density 336,200 m2, the high density has 110,300 m2 and very high density has 500 m2. This research also found that mangrove genus in Teluk Jor consists of Rhizopora, Ceriops, Aegiceras and Sonneratia.
ERIC Educational Resources Information Center
Quinino, Roberto C.; Reis, Edna A.; Bessegato, Lupercio F.
2013-01-01
This article proposes the use of the coefficient of determination as a statistic for hypothesis testing in multiple linear regression based on distributions acquired by beta sampling. (Contains 3 figures.)
Qu, Yanfei; Ma, Yongwen; Wan, Jinquan; Wang, Yan
2018-06-01
The silicon oil-air partition coefficients (K SiO/A ) of hydrophobic compounds are vital parameters for applying silicone oil as non-aqueous-phase liquid in partitioning bioreactors. Due to the limited number of K SiO/A values determined by experiment for hydrophobic compounds, there is an urgent need to model the K SiO/A values for unknown chemicals. In the present study, we developed a universal quantitative structure-activity relationship (QSAR) model using a sequential approach with macro-constitutional and micromolecular descriptors for silicone oil-air partition coefficients (K SiO/A ) of hydrophobic compounds with large structural variance. The geometry optimization and vibrational frequencies of each chemical were calculated using the hybrid density functional theory at the B3LYP/6-311G** level. Several quantum chemical parameters that reflect various intermolecular interactions as well as hydrophobicity were selected to develop QSAR model. The result indicates that a regression model derived from logK SiO/A , the number of non-hydrogen atoms (#nonHatoms) and energy gap of E LUMO and E HOMO (E LUMO -E HOMO ) could explain the partitioning mechanism of hydrophobic compounds between silicone oil and air. The correlation coefficient R 2 of the model is 0.922, and the internal and external validation coefficient, Q 2 LOO and Q 2 ext , are 0.91 and 0.89 respectively, implying that the model has satisfactory goodness-of-fit, robustness, and predictive ability and thus provides a robust predictive tool to estimate the logK SiO/A values for chemicals in application domain. The applicability domain of the model was visualized by the Williams plot.
NASA Astrophysics Data System (ADS)
Setiyorini, Anis; Suprijadi, Jadi; Handoko, Budhi
2017-03-01
Geographically Weighted Regression (GWR) is a regression model that takes into account the spatial heterogeneity effect. In the application of the GWR, inference on regression coefficients is often of interest, as is estimation and prediction of the response variable. Empirical research and studies have demonstrated that local correlation between explanatory variables can lead to estimated regression coefficients in GWR that are strongly correlated, a condition named multicollinearity. It later results on a large standard error on estimated regression coefficients, and, hence, problematic for inference on relationships between variables. Geographically Weighted Lasso (GWL) is a method which capable to deal with spatial heterogeneity and local multicollinearity in spatial data sets. GWL is a further development of GWR method, which adds a LASSO (Least Absolute Shrinkage and Selection Operator) constraint in parameter estimation. In this study, GWL will be applied by using fixed exponential kernel weights matrix to establish a poverty modeling of Java Island, Indonesia. The results of applying the GWL to poverty datasets show that this method stabilizes regression coefficients in the presence of multicollinearity and produces lower prediction and estimation error of the response variable than GWR does.
An improved multiple linear regression and data analysis computer program package
NASA Technical Reports Server (NTRS)
Sidik, S. M.
1972-01-01
NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
A Small and Slim Coaxial Probe for Single Rice Grain Moisture Sensing
You, Kok Yeow; Mun, Hou Kit; You, Li Ling; Salleh, Jamaliah; Abbas, Zulkifly
2013-01-01
A moisture detection of single rice grains using a slim and small open-ended coaxial probe is presented. The coaxial probe is suitable for the nondestructive measurement of moisture values in the rice grains ranging from from 9.5% to 26%. Empirical polynomial models are developed to predict the gravimetric moisture content of rice based on measured reflection coefficients using a vector network analyzer. The relationship between the reflection coefficient and relative permittivity were also created using a regression method and expressed in a polynomial model, whose model coefficients were obtained by fitting the data from Finite Element-based simulation. Besides, the designed single rice grain sample holder and experimental set-up were shown. The measurement of single rice grains in this study is more precise compared to the measurement in conventional bulk rice grains, as the random air gap present in the bulk rice grains is excluded. PMID:23493127
Octanol-air partition coefficients of polybrominated biphenyls.
Hongxia, Zhao; Jingwen, Chen; Xie, Quan; Baocheng, Qu; Xinmiao, Liang
2009-03-01
The octanol-air partition coefficients (K(OA)) for PBB15, PBB26, PBB31, PBB49, PBB103 and PBB153 were determined as a function of temperature using a gas chromatographic retention time technique with 1,1,1-trichloro-2,2-bis (4-chlorophenyl) ethane (p,p'-DDT) as a reference substance. The internal energies of phase change from octanol to air (Delta(OA)U) were calculated for the six compounds and were in the range from 74 to 116 kJ mol(-1). Simple regression equations of log K(OA) versus relative retention times (RRTs) on gas chromatography (GC), and log K(OA) versus molecular connectivity indexes (MCI) were obtained, for which the correlation coefficients (r(2)) were greater than 0.985 at 283.15K and 298.15K. Thus the K(OA) values of the remaining PBBs can be predicted by using their RRTs and MCI according to these relationships.
Rea, A.H.; Tortorelli, R.L.
1997-01-01
This digital report contains two digital-map grids of data that were used to develop peak-flow regression equations in Tortorelli, 1997, 'Techniques for estimating peak-streamflow frequency for unregulated streams and streams regulated by small floodwater retarding structures in Oklahoma,' U.S. Geological Survey Water-Resources Investigations Report 97-4202. One data set is a grid of mean annual precipitation, in inches, based on the period 1961-90, for Oklahoma. The data set was derived from the PRISM (Parameter-elevation Regressions on Independent Slopes Model) mean annual precipitation grid for the United States, developed by Daly, Neilson, and Phillips (1994, 'A statistical-topographic model for mapping climatological precipitation over mountainous terrain:' Journal of Applied Meteorology, v. 33, no. 2, p. 140-158). The second data set is a grid of generalized skew coefficients of logarithms of annual maximum streamflow for Oklahoma streams less than or equal to 2,510 square miles in drainage area. This grid of skew coefficients is taken from figure 11 of Tortorelli and Bergman, 1985, 'Techniques for estimating flood peak discharges for unregulated streams and streams regulated by small floodwater retarding structures in Oklahoma,' U.S. Geological Survey Water-Resources Investigations Report 84-4358. To save disk space, the skew coefficient values have been multiplied by 100 and rounded to integers with two significant digits. The data sets are provided in an ASCII grid format.
NASA Astrophysics Data System (ADS)
Abunama, Taher; Othman, Faridah
2017-06-01
Analysing the fluctuations of wastewater inflow rates in sewage treatment plants (STPs) is essential to guarantee a sufficient treatment of wastewater before discharging it to the environment. The main objectives of this study are to statistically analyze and forecast the wastewater inflow rates into the Bandar Tun Razak STP in Kuala Lumpur, Malaysia. A time series analysis of three years’ weekly influent data (156weeks) has been conducted using the Auto-Regressive Integrated Moving Average (ARIMA) model. Various combinations of ARIMA orders (p, d, q) have been tried to select the most fitted model, which was utilized to forecast the wastewater inflow rates. The linear regression analysis was applied to testify the correlation between the observed and predicted influents. ARIMA (3, 1, 3) model was selected with the highest significance R-square and lowest normalized Bayesian Information Criterion (BIC) value, and accordingly the wastewater inflow rates were forecasted to additional 52weeks. The linear regression analysis between the observed and predicted values of the wastewater inflow rates showed a positive linear correlation with a coefficient of 0.831.
Random effects coefficient of determination for mixed and meta-analysis models
Demidenko, Eugene; Sargent, James; Onega, Tracy
2011-01-01
The key feature of a mixed model is the presence of random effects. We have developed a coefficient, called the random effects coefficient of determination, Rr2, that estimates the proportion of the conditional variance of the dependent variable explained by random effects. This coefficient takes values from 0 to 1 and indicates how strong the random effects are. The difference from the earlier suggested fixed effects coefficient of determination is emphasized. If Rr2 is close to 0, there is weak support for random effects in the model because the reduction of the variance of the dependent variable due to random effects is small; consequently, random effects may be ignored and the model simplifies to standard linear regression. The value of Rr2 apart from 0 indicates the evidence of the variance reduction in support of the mixed model. If random effects coefficient of determination is close to 1 the variance of random effects is very large and random effects turn into free fixed effects—the model can be estimated using the dummy variable approach. We derive explicit formulas for Rr2 in three special cases: the random intercept model, the growth curve model, and meta-analysis model. Theoretical results are illustrated with three mixed model examples: (1) travel time to the nearest cancer center for women with breast cancer in the U.S., (2) cumulative time watching alcohol related scenes in movies among young U.S. teens, as a risk factor for early drinking onset, and (3) the classic example of the meta-analysis model for combination of 13 studies on tuberculosis vaccine. PMID:23750070
Shoeib, Mahiba; Harner, Tom
2002-05-01
Octanol-air partition coefficients (Koa) were measured directly for 19 organochlorine (OC) pesticides over the temperature range of 5 to 35 degrees C. Values of log Koa at 25 degrees C ranged over three orders of magnitude, from 7.4 for hexachlorobenzene to 10.1 for 1,1-dichloro-2,2-bis(p-chlorophenyl) ethane. Measured values were compared to values calculated as KowRT/H (where R is the ideal gas constant [8.314 J mol(-1) K(-1)], T is absolute temperature, and H is Henry's law constant) were, in general, larger. Discrepancies of up to three orders of magnitude were observed, highlighting the need for direct measurements of Koa. Plots of Koa versus inverse absolute temperature exhibited a log-linear correlation. Enthalpies of phase transition between octanol and air (deltaHoa) were determined from the temperature slopes and were in the range of 56 to 105 kJ mol(-1) K(-1). Activity coefficients in octanol (gamma(o)) were determined from Koa and reported supercooled liquid vapor pressures (pL(o)), and these were in the range of 0.3 to 12, indicating near-ideal solution behavior. Differences in Koa values for structural isomers of hexachlorocyclohexane were also explored. A Koa-based model was described for predicting the partitioning of OC pesticides to aerosols and used to calculate particulate fractions at 25 and -10 degrees C. The model also agreed well with experimental results for several OC pesticides that were equilibrated with urban aerosols in the laboratory. A log-log regression of the particle-gas partition coefficient versus Koa had a slope near unity, indicating that octanol is a good surrogate for the aerosol organic matter.
NASA Astrophysics Data System (ADS)
Zhan, Liwei; Li, Chengwei
2017-02-01
A hybrid PSO-SVM-based model is proposed to predict the friction coefficient between aircraft tire and coating. The presented hybrid model combines a support vector machine (SVM) with particle swarm optimization (PSO) technique. SVM has been adopted to solve regression problems successfully. Its regression accuracy is greatly related to optimizing parameters such as the regularization constant C , the parameter gamma γ corresponding to RBF kernel and the epsilon parameter \\varepsilon in the SVM training procedure. However, the friction coefficient which is predicted based on SVM has yet to be explored between aircraft tire and coating. The experiment reveals that drop height and tire rotational speed are the factors affecting friction coefficient. Bearing in mind, the friction coefficient can been predicted using the hybrid PSO-SVM-based model by the measured friction coefficient between aircraft tire and coating. To compare regression accuracy, a grid search (GS) method and a genetic algorithm (GA) are used to optimize the relevant parameters (C , γ and \\varepsilon ), respectively. The regression accuracy could be reflected by the coefficient of determination ({{R}2} ). The result shows that the hybrid PSO-RBF-SVM-based model has better accuracy compared with the GS-RBF-SVM- and GA-RBF-SVM-based models. The agreement of this model (PSO-RBF-SVM) with experiment data confirms its good performance.
Williams-Sether, Tara; Gross, Tara A.
2016-02-09
Seasonal mean daily flow data from 119 U.S. Geological Survey streamflow-gaging stations in North Dakota; the surrounding states of Montana, Minnesota, and South Dakota; and the Canadian provinces of Manitoba and Saskatchewan with 10 or more years of unregulated flow record were used to develop regression equations for flow duration, n-day high flow and n-day low flow using ordinary least-squares and Tobit regression techniques. Regression equations were developed for seasonal flow durations at the 10th, 25th, 50th, 75th, and 90th percent exceedances; the 1-, 7-, and 30-day seasonal mean high flows for the 10-, 25-, and 50-year recurrence intervals; and the 1-, 7-, and 30-day seasonal mean low flows for the 2-, 5-, and 10-year recurrence intervals. Basin and climatic characteristics determined to be significant explanatory variables in one or more regression equations included drainage area, percentage of basin drainage area that drains to isolated lakes and ponds, ruggedness number, stream length, basin compactness ratio, minimum basin elevation, precipitation, slope ratio, stream slope, and soil permeability. The adjusted coefficient of determination for the n-day high-flow regression equations ranged from 55.87 to 94.53 percent. The Chi2 values for the duration regression equations ranged from 13.49 to 117.94, whereas the Chi2 values for the n-day low-flow regression equations ranged from 4.20 to 49.68.
Muradian, Kh K; Utko, N O; Mozzhukhina, T H; Pishel', I M; Litoshenko, O Ia; Bezrukov, V V; Fraĭfel'd, V E
2002-01-01
Correlative and regressive relations between the gaseous exchange, thermoregulation and mitochondrial protein content were analyzed by two- and three-dimensional statistics in mice. It has been shown that the pair wise linear methods of analysis did not reveal any significant correlation between the parameters under exploration. However, it became evident at three-dimensional and non-linear plotting for which the coefficients of multivariable correlation reached and even exceeded 0.7-0.8. The calculations based on partial differentiation of the multivariable regression equations allow to conclude that at certain values of VO2, VCO2 and body temperature negative relations between the systems of gaseous exchange and thermoregulation become dominating.
Zhang, Yun-lin; Qin, Bo-qiang; Ma, Rong-hua; Zhu, Guang-wei; Zhang, Lu; Chen, Wei-min
2005-03-01
Chromophoric dissolved organic matter (CDOM) represents one of the primary light-absorbing species in natural waters and plays a critical in determining the aquatic light field. CDOM shows a featureless absorption spectrum that increases exponentially with decreasing wavelength, which limits the penetration of biologically damaging UV-B radiation (wavelength from 280 to 320 nm) in the water column, thus shielding aquatic organisms. CDOM absorption measurements and their relationship with dissolved organic carbon (DOC), and fluorescence are presented in typical macrophyte and algae lake zone of Lake Taihu based on a field investigation in April in 2004 and lab analysis. Absorption spectral of CDOM was measured from 240 to 800 nm using a Shimadzu UV-2401PC UV-Vis recording spectrophotometer. Fluorescence with an excitation wavelength of 355 nm, an emission wavelength of 450 nm is measured using a Shimadzu 5301 spectrofluorometer. Concentrations of DOC ranged from 6.3 to 17.2 mg/L with an average of 9.08 +/- 2.66 mg/L. CDOM absorption coefficients at 280 nm and 355 nm were in the range of 11.2 - 32.6 m(-1) (average 17.46m(-1) +/- 5.75 m(-1) and 2.4 - 8.3 m(-1) (average 4.17m(-1) +/- 1.47 m(-l)), respectively. The values of the DOC-specific absorption coefficient at 355 nm ranged from 0.31 to 0.64 L x (mg x m)-1. Fluorescence emission at 450 nm, excited at 355 nm, had a mean value of 1.32nm(-1) +/- 0.84 nm(-1). A significant lake zone difference is found in DOC concentration, CDOM absorption coefficient and fluorescence, but not in DOC-specific absorption coefficient and spectral slope coefficient. This regional distribution pattern is in agreement with the location of sources of yellow substance: highest concentrations close to river mouth under the influence of river inflow, lower values in East Lake Taihu. The values of algae lake zone are obvious larger than those of macrophyte lake zone. In Meiliang Bay, CDOM absorption, DOC concentration and fluorescence tend to decreasing from inside to mouth of the Bay. The results show a good correlation between CDOM absorption and DOC coefficients during 280 - 500 nm short wavelength intervals. The R-square coefficient between CDOM absorption and DOC concentration decreases with the increase of wavelength from 280 to 500 nm. The significant linear regression correlations between fluorescence, DOC concentration and absorption coefficients were found at 355 nm. The exponential slope coefficients ranged from 13.0 to 16.4 microm(-1) with a mean value 14.37microm(-1) +/- 0.73microm(-1), 17.3microm(-1) - 20.3microm(-1) with a mean value 19.17microm(-1) +/- 0.84microm(-1) and 12.0microm(-1) - 15.8microm(-1) with a mean value 13.38microm(-1) +/- 0.82microm(-1) over the 280 - 500 nm, 280 - 360 nm and 360 - 440 nm intervals.
Rogers, Paul; Stoner, Julie
2016-01-01
Regression models for correlated binary outcomes are commonly fit using a Generalized Estimating Equations (GEE) methodology. GEE uses the Liang and Zeger sandwich estimator to produce unbiased standard error estimators for regression coefficients in large sample settings even when the covariance structure is misspecified. The sandwich estimator performs optimally in balanced designs when the number of participants is large, and there are few repeated measurements. The sandwich estimator is not without drawbacks; its asymptotic properties do not hold in small sample settings. In these situations, the sandwich estimator is biased downwards, underestimating the variances. In this project, a modified form for the sandwich estimator is proposed to correct this deficiency. The performance of this new sandwich estimator is compared to the traditional Liang and Zeger estimator as well as alternative forms proposed by Morel, Pan and Mancl and DeRouen. The performance of each estimator was assessed with 95% coverage probabilities for the regression coefficient estimators using simulated data under various combinations of sample sizes and outcome prevalence values with an Independence (IND), Autoregressive (AR) and Compound Symmetry (CS) correlation structure. This research is motivated by investigations involving rare-event outcomes in aviation data. PMID:26998504
2014-01-01
This paper examined the efficiency of multivariate linear regression (MLR) and artificial neural network (ANN) models in prediction of two major water quality parameters in a wastewater treatment plant. Biochemical oxygen demand (BOD) and chemical oxygen demand (COD) as well as indirect indicators of organic matters are representative parameters for sewer water quality. Performance of the ANN models was evaluated using coefficient of correlation (r), root mean square error (RMSE) and bias values. The computed values of BOD and COD by model, ANN method and regression analysis were in close agreement with their respective measured values. Results showed that the ANN performance model was better than the MLR model. Comparative indices of the optimized ANN with input values of temperature (T), pH, total suspended solid (TSS) and total suspended (TS) for prediction of BOD was RMSE = 25.1 mg/L, r = 0.83 and for prediction of COD was RMSE = 49.4 mg/L, r = 0.81. It was found that the ANN model could be employed successfully in estimating the BOD and COD in the inlet of wastewater biochemical treatment plants. Moreover, sensitive examination results showed that pH parameter have more effect on BOD and COD predicting to another parameters. Also, both implemented models have predicted BOD better than COD. PMID:24456676
Zhao, Yu Xi; Xie, Ping; Sang, Yan Fang; Wu, Zi Yi
2018-04-01
Hydrological process evaluation is temporal dependent. Hydrological time series including dependence components do not meet the data consistency assumption for hydrological computation. Both of those factors cause great difficulty for water researches. Given the existence of hydrological dependence variability, we proposed a correlationcoefficient-based method for significance evaluation of hydrological dependence based on auto-regression model. By calculating the correlation coefficient between the original series and its dependence component and selecting reasonable thresholds of correlation coefficient, this method divided significance degree of dependence into no variability, weak variability, mid variability, strong variability, and drastic variability. By deducing the relationship between correlation coefficient and auto-correlation coefficient in each order of series, we found that the correlation coefficient was mainly determined by the magnitude of auto-correlation coefficient from the 1 order to p order, which clarified the theoretical basis of this method. With the first-order and second-order auto-regression models as examples, the reasonability of the deduced formula was verified through Monte-Carlo experiments to classify the relationship between correlation coefficient and auto-correlation coefficient. This method was used to analyze three observed hydrological time series. The results indicated the coexistence of stochastic and dependence characteristics in hydrological process.
Comparison of the color of natural teeth measured by a colorimeter and Shade Vision System.
Cho, Byeong-Hoon; Lim, Yong-Kyu; Lee, Yong-Keun
2007-10-01
The objectives were to measure the difference in the color and color parameters of natural teeth measured by a tristimulus colorimeter (CM, used as a reference) and Shade Vision System (SV), and to determine the influence of color parameters on the color difference between the values measured by two instruments. Color of 12 maxillary and mandibular anterior teeth was measured by CM and SV for 47 volunteers (number of teeth=564). Color parameters such as CIE L*, a* and b* values, chroma and hue angle measured by two instruments were compared. Chroma was calculated as C*ab=(a*2 = b*2)1/2, and hue angle was calculated as h degrees =arctan(b*/a*). The influence of color parameters measured by CM on the color difference (DeltaE*(ab)) between the values measured by two instruments was analyzed with multiple regression analysis (alpha=0.01). Mean DeltaE*(ab) value between the values measured by two instruments was 21.7 (+/-3.7), and the mean difference in lightness (CIE L*) and chroma was 16.2 (+/-3.9) and 13.2 (+/-3.0), respectively. Difference in hue angle was high as 132.7 (+/-53.3) degrees . Except for the hue angle, all the color parameters showed significant correlations and the coefficient of determination (r(2)) was in the range of 0.089-0.478. Based on multiple regression analysis, the standardized partial correlation coefficient (beta) of the included predictors for the color difference was -0.710 for CIE L* and -0.300 for C*(ab) (p<0.01). All the color parameters showed significant but weak correlations except for hue angle. When lightness and chroma of teeth were high, color difference between the values measured by two instruments was small. Clinical accuracy of two instruments should be investigated further.
Poulin, Patrick; Hop, Cornelis Eca; Salphati, Laurent; Liederer, Bianca M
2013-04-01
Understanding drug distribution and accumulation in tumors would be informative in the assessment of efficacy in targeted therapy; however, existing methods for predicting tissue drug distribution focus on normal tissues and do not incorporate tumors. The main objective of this study was to describe the relationships between tissue-plasma concentration ratios (Kp ) of normal tissues and those of subcutaneous xenograft tumors under nonsteady-state conditions, and establish regression equations that could potentially be used for the prediction of drug levels in several human tumor xenografts in mouse, based solely on a Kp value determined in a normal tissue (e.g., muscle). A dataset of 17 compounds was collected from the literature and from Genentech. Tissue and plasma concentration data in mouse were obtained following oral gavage or intraperitoneal administration. Linear regression analyses were performed between Kp values in several normal tissues (muscle, lung, liver, or brain) and those in human tumor xenografts (CL6, EBC-1, HT-29, PC3, U-87, MCF-7-neo-Her2, or BT474M1.1). The tissue-plasma ratios in normal tissues reasonably correlated with the tumor-plasma ratios in CL6, EBC-1, HT-29, U-87, BT474M1.1, and MCF-7-neo-Her2 xenografts (r(2) in the range 0.62-1) but not with the PC3 xenograft. In general, muscle and lung exhibited the strongest correlation with tumor xenografts, followed by liver. Regression coefficients from brain were low, except between brain and the glioblastoma U-87 xenograft (r(2) in the range 0.62-0.94). Furthermore, reasonably strong correlations were observed between muscle and lung and between muscle and liver (r(2) in the range 0.67-0.96). The slopes of the regressions differed depending on the class of drug (strong vs. weak base) and type of tissue (brain vs. other tissues and tumors). Overall, this study will contribute to our understanding of tissue-plasma partition coefficients for tumors and facilitate the use of physiologically based pharmacokinetics (PBPK) modeling for chemotherapy in oncology studies. © 2013 Wiley Periodicals, Inc. and the American Pharmacists Association J Pharm Sci 102:1355-1369, 2013. Copyright © 2013 Wiley Periodicals, Inc.
Sood, Neeraj; Ghosh, Arkadipta; Escarce, José J
2009-10-01
To estimate the effect of growth in health care costs that outpaces gross domestic product (GDP) growth ("excess" growth in health care costs) on employment, gross output, and value added to GDP of U.S. industries. We analyzed data from 38 U.S. industries for the period 1987-2005. All data are publicly available from various government agencies. We estimated bivariate and multivariate regressions. To develop the regression models, we assumed that rapid growth in health care costs has a larger effect on economic performance for industries where large percentages of workers receive employer-sponsored health insurance (ESI). We used the estimated regression coefficients to simulate economic outcomes under alternative scenarios of health care cost inflation. Faster growth in health care costs had greater adverse effects on economic outcomes for industries with larger percentages of workers who had ESI. We found that a 10 percent increase in excess growth in health care costs would have resulted in 120,803 fewer jobs, US$28,022 million in lost gross output, and US$14,082 million in lost value added in 2005. These declines represent 0.17 to 0.18 percent of employment, gross output, and value added in 2005. Excess growth in health care costs is adversely affecting the economic performance of U.S. industries.
Motlagh, Mohadeseh Ghanbari; Kafaky, Sasan Babaie; Mataji, Asadollah; Akhavan, Reza
2018-05-21
Hyrcanian forests of North of Iran are of great importance in terms of various economic and environmental aspects. In this study, Spot-6 satellite images and regression models were applied to estimate above-ground biomass in these forests. This research was carried out in six compartments in three climatic (semi-arid to humid) types and two altitude classes. In the first step, ground sampling methods at the compartment level were used to estimate aboveground biomass (Mg/ha). Then, by reviewing the results of other studies, the most appropriate vegetation indices were selected. In this study, three indices of NDVI, RVI, and TVI were calculated. We investigated the relationship between the vegetation indices and aboveground biomass measured at sample-plot level. Based on the results, the relationship between aboveground biomass values and vegetation indices was a linear regression with the highest level of significance for NDVI in all compartments. Since at the compartment level the correlation coefficient between NDVI and aboveground biomass was the highest, NDVI was used for mapping aboveground biomass. According to the results of this study, biomass values were highly different in various climatic and altitudinal classes with the highest biomass value observed in humid climate and high-altitude class.
Gíslason, Magnús; Sigurðsson, Sigurður; Guðnason, Vilmundur; Harris, Tamara; Carraro, Ugo; Gargiulo, Paolo
2018-01-01
Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66–96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges’ Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard. PMID:29513690
Edmunds, Kyle; Gíslason, Magnús; Sigurðsson, Sigurður; Guðnason, Vilmundur; Harris, Tamara; Carraro, Ugo; Gargiulo, Paolo
2018-01-01
Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66-96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges' Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard.
Nattee, Cholwich; Khamsemanan, Nirattaya; Lawtrakul, Luckhana; Toochinda, Pisanu; Hannongbua, Supa
2017-01-01
Malaria is still one of the most serious diseases in tropical regions. This is due in part to the high resistance against available drugs for the inhibition of parasites, Plasmodium, the cause of the disease. New potent compounds with high clinical utility are urgently needed. In this work, we created a novel model using a regression tree to study structure-activity relationships and predict the inhibition constant, K i of three different antimalarial analogues (Trimethoprim, Pyrimethamine, and Cycloguanil) based on their molecular descriptors. To the best of our knowledge, this work is the first attempt to study the structure-activity relationships of all three analogues combined. The most relevant descriptors and appropriate parameters of the regression tree are harvested using extremely randomized trees. These descriptors are water accessible surface area, Log of the aqueous solubility, total hydrophobic van der Waals surface area, and molecular refractivity. Out of all possible combinations of these selected parameters and descriptors, the tree with the strongest coefficient of determination is selected to be our prediction model. Predicted K i values from the proposed model show a strong coefficient of determination, R 2 =0.996, to experimental K i values. From the structure of the regression tree, compounds with high accessible surface area of all hydrophobic atoms (ASA_H) and low aqueous solubility of inhibitors (Log S) generally possess low K i values. Our prediction model can also be utilized as a screening test for new antimalarial drug compounds which may reduce the time and expenses for new drug development. New compounds with high predicted K i should be excluded from further drug development. It is also our inference that a threshold of ASA_H greater than 575.80 and Log S less than or equal to -4.36 is a sufficient condition for a new compound to possess a low K i . Copyright © 2016 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Afolagboye, Lekan Olatayo; Talabi, Abel Ojo; Oyelami, Charles Adebayo
2017-05-01
This study assessed the possibility of using index tests to determine the mechanical properties of crushed aggregates. The aggregates used in this study were derived from major Precambrian basement rocks in Ado-Ekiti, Nigeria. Regression analyses were performed to determine the empirical relations that mechanical properties of the aggregates may have with the point load strength (IS(50)), Schmidt rebound hammer value (SHR) and unconfined compressive strength (UCS) of the rocks. For all the data, strong correlation coefficients were found between IS(50), SHR, UCS, and mechanical properties of the aggregates. The regression analysis conducted on the different rocks separately showed that correlations coefficients obtained between the IS(50), SHR, UCS and mechanical properties of the aggregates were stronger than those of the grouped rocks. The T-test and F-test showed that the derived models were valid. This study has shown that the mechanical properties of the aggregates can be estimated from IS(50), SHR and USC but the influence of rock type on the relationships should be taken into consideration.
SCI model structure determination program (OSR) user's guide. [optimal subset regression
NASA Technical Reports Server (NTRS)
1979-01-01
The computer program, OSR (Optimal Subset Regression) which estimates models for rotorcraft body and rotor force and moment coefficients is described. The technique used is based on the subset regression algorithm. Given time histories of aerodynamic coefficients, aerodynamic variables, and control inputs, the program computes correlation between various time histories. The model structure determination is based on these correlations. Inputs and outputs of the program are given.
On the Asymptotic Relative Efficiency of Planned Missingness Designs.
Rhemtulla, Mijke; Savalei, Victoria; Little, Todd D
2016-03-01
In planned missingness (PM) designs, certain data are set a priori to be missing. PM designs can increase validity and reduce cost; however, little is known about the loss of efficiency that accompanies these designs. The present paper compares PM designs to reduced sample (RN) designs that have the same total number of data points concentrated in fewer participants. In 4 studies, we consider models for both observed and latent variables, designs that do or do not include an "X set" of variables with complete data, and a full range of between- and within-set correlation values. All results are obtained using asymptotic relative efficiency formulas, and thus no data are generated; this novel approach allows us to examine whether PM designs have theoretical advantages over RN designs removing the impact of sampling error. Our primary findings are that (a) in manifest variable regression models, estimates of regression coefficients have much lower relative efficiency in PM designs as compared to RN designs, (b) relative efficiency of factor correlation or latent regression coefficient estimates is maximized when the indicators of each latent variable come from different sets, and (c) the addition of an X set improves efficiency in manifest variable regression models only for the parameters that directly involve the X-set variables, but it substantially improves efficiency of most parameters in latent variable models. We conclude that PM designs can be beneficial when the model of interest is a latent variable model; recommendations are made for how to optimize such a design.
Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree
de los Campos, Gustavo; Naya, Hugo; Gianola, Daniel; Crossa, José; Legarra, Andrés; Manfredi, Eduardo; Weigel, Kent; Cotes, José Miguel
2009-01-01
The availability of genomewide dense markers brings opportunities and challenges to breeding programs. An important question concerns the ways in which dense markers and pedigrees, together with phenotypic records, should be used to arrive at predictions of genetic values for complex traits. If a large number of markers are included in a regression model, marker-specific shrinkage of regression coefficients may be needed. For this reason, the Bayesian least absolute shrinkage and selection operator (LASSO) (BL) appears to be an interesting approach for fitting marker effects in a regression model. This article adapts the BL to arrive at a regression model where markers, pedigrees, and covariates other than markers are considered jointly. Connections between BL and other marker-based regression models are discussed, and the sensitivity of BL with respect to the choice of prior distributions assigned to key parameters is evaluated using simulation. The proposed model was fitted to two data sets from wheat and mouse populations, and evaluated using cross-validation methods. Results indicate that inclusion of markers in the regression further improved the predictive ability of models. An R program that implements the proposed model is freely available. PMID:19293140
Ecotoxicology of phenylphosphonothioates.
Francis, B M; Hansen, L G; Fukuto, T R; Lu, P Y; Metcalf, R L
1980-01-01
The phenylphosphonothioate insecticides EPN and leptophos, and several analogs, were evaluated with respect to their delayed neurotoxic effects in hens and their environmental behavior in a terrestrial-aquatic model ecosystem. Acute toxicity to insects was highly correlated with sigma sigma of the substituted phenyl group (regression coefficient r = -0.91) while acute toxicity to mammals was slightly less well correlated (regression coefficient r = -0.71), and neurotoxicity was poorly correlated with sigma sigma (regression coefficient r = -0.35). Both EPN and leptophos were markedly more persistent and bioaccumulative in the model ecosystem than parathion. Desbromoleptophos, a contaminant and metabolite of leptophos, was seen to be a highly stable and persistent terminal residue of leptophos. PMID:6159210
Yiming, Gulinuer; Zhou, Xianhui; Lv, Wenkui; Peng, Yi; Zhang, Wenhui; Cheng, Xinchun; Li, Yaodong; Xing, Qiang; Zhang, Jianghua; Zhou, Qina; Zhang, Ling; Lu, Yanmei; Wang, Hongli; Tang, Baopeng
2017-01-01
Brachial-ankle pulse wave velocity (baPWV), a direct measure of aortic stiffness, has increasingly become an important assessment for cardiovascular risk. The present study established the reference and normal values of baPWV in a Central Asia population in Xinjiang, China. We recruited participants from a central Asia population in Xinjiang, China. We performed multiple regression analysis to investigate the determinants of baPWV. The median and 10th-90th percentiles were calculated to establish the reference and normal values based on these categories. In total, 5,757 Han participants aged 15-88 years were included in the present study. Spearman correlation analysis showed that age (r = 0.587, p < 0.001) and mean blood pressure (MBP, r = 0.599, p <0.001) were the major factors influencing the values of baPWV in the reference population. Furthermore, in the multiple linear regression analysis, the standardized regression coefficients of age (0.445) and MBP (0.460) were much higher than those of body mass index, triglyceride, and glycemia (-0.054, 0.035, and 0.033, respectively). In the covariance analysis, after adjustment for age and MBP, only diabetes was the significant independent determinant of baPWV (p = 0.009). Thus, participants with diabetes were excluded from the reference value population. The reference values ranged from 14.3 to 25.2 m/s, and the normal values ranged from 13.9 to 21.2 m/s. This is the first study that has established the reference and normal values for baPWV according to age and blood pressure in a Central Asia population.
Griffiths, Robert I; Gleeson, Michelle L; Danese, Mark D; O'Hagan, Anthony
2012-01-01
To assess the accuracy and precision of inverse probability weighted (IPW) least squares regression analysis for censored cost data. By using Surveillance, Epidemiology, and End Results-Medicare, we identified 1500 breast cancer patients who died and had complete cost information within the database. Patients were followed for up to 48 months (partitions) after diagnosis, and their actual total cost was calculated in each partition. We then simulated patterns of administrative and dropout censoring and also added censoring to patients receiving chemotherapy to simulate comparing a newer to older intervention. For each censoring simulation, we performed 1000 IPW regression analyses (bootstrap, sampling with replacement), calculated the average value of each coefficient in each partition, and summed the coefficients for each regression parameter to obtain the cumulative values from 1 to 48 months. The cumulative, 48-month, average cost was $67,796 (95% confidence interval [CI] $58,454-$78,291) with no censoring, $66,313 (95% CI $54,975-$80,074) with administrative censoring, and $66,765 (95% CI $54,510-$81,843) with administrative plus dropout censoring. In multivariate analysis, chemotherapy was associated with increased cost of $25,325 (95% CI $17,549-$32,827) compared with $28,937 (95% CI $20,510-$37,088) with administrative censoring and $29,593 ($20,564-$39,399) with administrative plus dropout censoring. Adding censoring to the chemotherapy group resulted in less accurate IPW estimates. This was ameliorated, however, by applying IPW within treatment groups. IPW is a consistent estimator of population mean costs if the weight is correctly specified. If the censoring distribution depends on some covariates, a model that accommodates this dependency must be correctly specified in IPW to obtain accurate estimates. Copyright © 2012 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Predictive value of age of walking for later motor performance in children with mental retardation.
Kokubun, M; Haishi, K; Okuzumi, H; Hosobuchi, T; Koike, T
1996-12-01
The purpose of the present study was to clarify the predictive value of age of walking for later motor performance in children with mental retardation. While paying due attention to other factors, our investigation focused on the relationship between a subject's age of walking, and his or her subsequent beam-walking performance. The subjects were 85 children with mental retardation with an average age of 13 years and 3 months. Beam-walking performance was measured by a procedure developed by the authors. Five low beams (5 cm) which varied in width (12.5, 10, 7.5, 5 and 2.5 cm) were employed. The performance of subjects was scored from zero to five points according to the width of the beam that they were able to walk without falling off. From the results of multiple regression analysis, three independent variables were found to be significantly related to beam-walking performance. The age of walking was the most basic variable: partial correlation coefficient (PCC) = -45; standardized partial regression coefficient (SPRC) = -0.41. The next variable in importance was walking duration (PCC = 0.38; SPRC = 0.31). The autism variable also contributed significantly (PCC = 0.28; SPRC = 0.22). Therefore, within the age range used in the present study, the age of walking in children with mental retardation was thought to have sufficient predictive value, even when the variables which might have possibly affected their subsequent performance were taken into consideration; the earlier the age of walking, the better the beam-walking performance.
Wheat flour dough Alveograph characteristics predicted by Mixolab regression models.
Codină, Georgiana Gabriela; Mironeasa, Silvia; Mironeasa, Costel; Popa, Ciprian N; Tamba-Berehoiu, Radiana
2012-02-01
In Romania, the Alveograph is the most used device to evaluate the rheological properties of wheat flour dough, but lately the Mixolab device has begun to play an important role in the breadmaking industry. These two instruments are based on different principles but there are some correlations that can be found between the parameters determined by the Mixolab and the rheological properties of wheat dough measured with the Alveograph. Statistical analysis on 80 wheat flour samples using the backward stepwise multiple regression method showed that Mixolab values using the ‘Chopin S’ protocol (40 samples) and ‘Chopin + ’ protocol (40 samples) can be used to elaborate predictive models for estimating the value of the rheological properties of wheat dough: baking strength (W), dough tenacity (P) and extensibility (L). The correlation analysis confirmed significant findings (P < 0.05 and P < 0.01) between the parameters of wheat dough studied by the Mixolab and its rheological properties measured with the Alveograph. A number of six predictive linear equations were obtained. Linear regression models gave multiple regression coefficients with R²(adjusted) > 0.70 for P, R²(adjusted) > 0.70 for W and R²(adjusted) > 0.38 for L, at a 95% confidence interval. Copyright © 2011 Society of Chemical Industry.
Hodgson, Robert; Reason, Timothy; Trueman, David; Wickstead, Rose; Kusel, Jeanette; Jasilek, Adam; Claxton, Lindsay; Taylor, Matthew; Pulikottil-Jacob, Ruth
2017-10-01
The estimation of utility values for the economic evaluation of therapies for wet age-related macular degeneration (AMD) is a particular challenge. Previous economic models in wet AMD have been criticized for failing to capture the bilateral nature of wet AMD by modelling visual acuity (VA) and utility values associated with the better-seeing eye only. Here we present a de novo regression analysis using generalized estimating equations (GEE) applied to a previous dataset of time trade-off (TTO)-derived utility values from a sample of the UK population that wore contact lenses to simulate visual deterioration in wet AMD. This analysis allows utility values to be estimated as a function of VA in both the better-seeing eye (BSE) and worse-seeing eye (WSE). VAs in both the BSE and WSE were found to be statistically significant (p < 0.05) when regressed separately. When included without an interaction term, only the coefficient for VA in the BSE was significant (p = 0.04), but when an interaction term between VA in the BSE and WSE was included, only the constant term (mean TTO utility value) was significant, potentially a result of the collinearity between the VA of the two eyes. The lack of both formal model fit statistics from the GEE approach and theoretical knowledge to support the superiority of one model over another make it difficult to select the best model. Limitations of this analysis arise from the potential influence of collinearity between the VA of both eyes, and the use of contact lenses to reflect VA states to obtain the original dataset. Whilst further research is required to elicit more accurate utility values for wet AMD, this novel regression analysis provides a possible source of utility values to allow future economic models to capture the quality of life impact of changes in VA in both eyes. Novartis Pharmaceuticals UK Limited.
Regression Analysis of Stage Variability for West-Central Florida Lakes
Sacks, Laura A.; Ellison, Donald L.; Swancar, Amy
2008-01-01
The variability in a lake's stage depends upon many factors, including surface-water flows, meteorological conditions, and hydrogeologic characteristics near the lake. An understanding of the factors controlling lake-stage variability for a population of lakes may be helpful to water managers who set regulatory levels for lakes. The goal of this study is to determine whether lake-stage variability can be predicted using multiple linear regression and readily available lake and basin characteristics defined for each lake. Regressions were evaluated for a recent 10-year period (1996-2005) and for a historical 10-year period (1954-63). Ground-water pumping is considered to have affected stage at many of the 98 lakes included in the recent period analysis, and not to have affected stage at the 20 lakes included in the historical period analysis. For the recent period, regression models had coefficients of determination (R2) values ranging from 0.60 to 0.74, and up to five explanatory variables. Standard errors ranged from 21 to 37 percent of the average stage variability. Net leakage was the most important explanatory variable in regressions describing the full range and low range in stage variability for the recent period. The most important explanatory variable in the model predicting the high range in stage variability was the height over median lake stage at which surface-water outflow would occur. Other explanatory variables in final regression models for the recent period included the range in annual rainfall for the period and several variables related to local and regional hydrogeology: (1) ground-water pumping within 1 mile of each lake, (2) the amount of ground-water inflow (by category), (3) the head gradient between the lake and the Upper Floridan aquifer, and (4) the thickness of the intermediate confining unit. Many of the variables in final regression models are related to hydrogeologic characteristics, underscoring the importance of ground-water exchange in controlling the stage of karst lakes in Florida. Regression equations were used to predict lake-stage variability for the recent period for 12 additional lakes, and the median difference between predicted and observed values ranged from 11 to 23 percent. Coefficients of determination for the historical period were considerably lower (maximum R2 of 0.28) than for the recent period. Reasons for these low R2 values are probably related to the small number of lakes (20) with stage data for an equivalent time period that were unaffected by ground-water pumping, the similarity of many of the lake types (large surface-water drainage lakes), and the greater uncertainty in defining historical basin characteristics. The lack of lake-stage data unaffected by ground-water pumping and the poor regression results obtained for that group of lakes limit the ability to predict natural lake-stage variability using this method in west-central Florida.
Analysis of Wind Tunnel Lateral Oscillatory Data of the F-16XL Aircraft
NASA Technical Reports Server (NTRS)
Klein, Vladislav; Murphy, Patrick C.; Szyba, Nathan M.
2004-01-01
Static and dynamic wind tunnel tests were performed on an 18% scale model of the F-16XL aircraft. These tests were performed over a wide range of angles of attack and sideslip with oscillation amplitudes from 5 deg. to 30 deg. and reduced frequencies from 0.073 to 0.269. Harmonic analysis was used to estimate Fourier coefficients and in-phase and out-of-phase components. For frequency dependent data from rolling oscillations, a two-step regression method was used to obtain unsteady models (indicial functions), and derivatives due to sideslip angle, roll rate and yaw rate from in-phase and out-of-phase components. Frequency dependence was found for angles of attack between 20 deg. and 50 deg. Reduced values of coefficient of determination and increased values of fit error were found for angles of attack between 35 deg. and 45 deg. An attempt to estimate model parameters from yaw oscillations failed, probably due to the low number of test cases at different frequencies.
Yu, S; Gao, S; Gan, Y; Zhang, Y; Ruan, X; Wang, Y; Yang, L; Shi, J
2016-04-01
Quantitative structure-property relationship modelling can be a valuable alternative method to replace or reduce experimental testing. In particular, some endpoints such as octanol-water (KOW) and organic carbon-water (KOC) partition coefficients of polychlorinated biphenyls (PCBs) are easier to predict and various models have been already developed. In this paper, two different methods, which are multiple linear regression based on the descriptors generated using Dragon software and hologram quantitative structure-activity relationships, were employed to predict suspended particulate matter (SPM) derived log KOC and generator column, shake flask and slow stirring method derived log KOW values of 209 PCBs. The predictive ability of the derived models was validated using a test set. The performances of all these models were compared with EPI Suite™ software. The results indicated that the proposed models were robust and satisfactory, and could provide feasible and promising tools for the rapid assessment of the SPM derived log KOC and generator column, shake flask and slow stirring method derived log KOW values of PCBs.
Height and Weight Estimation From Anthropometric Measurements Using Machine Learning Regressions
Fernandes, Bruno J. T.; Roque, Alexandre
2018-01-01
Height and weight are measurements explored to tracking nutritional diseases, energy expenditure, clinical conditions, drug dosages, and infusion rates. Many patients are not ambulant or may be unable to communicate, and a sequence of these factors may not allow accurate estimation or measurements; in those cases, it can be estimated approximately by anthropometric means. Different groups have proposed different linear or non-linear equations which coefficients are obtained by using single or multiple linear regressions. In this paper, we present a complete study of the application of different learning models to estimate height and weight from anthropometric measurements: support vector regression, Gaussian process, and artificial neural networks. The predicted values are significantly more accurate than that obtained with conventional linear regressions. In all the cases, the predictions are non-sensitive to ethnicity, and to gender, if more than two anthropometric parameters are analyzed. The learning model analysis creates new opportunities for anthropometric applications in industry, textile technology, security, and health care. PMID:29651366
Hewitt, Angela L.; Popa, Laurentiu S.; Pasalar, Siavash; Hendrix, Claudia M.
2011-01-01
Encoding of movement kinematics in Purkinje cell simple spike discharge has important implications for hypotheses of cerebellar cortical function. Several outstanding questions remain regarding representation of these kinematic signals. It is uncertain whether kinematic encoding occurs in unpredictable, feedback-dependent tasks or kinematic signals are conserved across tasks. Additionally, there is a need to understand the signals encoded in the instantaneous discharge of single cells without averaging across trials or time. To address these questions, this study recorded Purkinje cell firing in monkeys trained to perform a manual random tracking task in addition to circular tracking and center-out reach. Random tracking provides for extensive coverage of kinematic workspaces. Direction and speed errors are significantly greater during random than circular tracking. Cross-correlation analyses comparing hand and target velocity profiles show that hand velocity lags target velocity during random tracking. Correlations between simple spike firing from 120 Purkinje cells and hand position, velocity, and speed were evaluated with linear regression models including a time constant, τ, as a measure of the firing lead/lag relative to the kinematic parameters. Across the population, velocity accounts for the majority of simple spike firing variability (63 ± 30% of Radj2), followed by position (28 ± 24% of Radj2) and speed (11 ± 19% of Radj2). Simple spike firing often leads hand kinematics. Comparison of regression models based on averaged vs. nonaveraged firing and kinematics reveals lower Radj2 values for nonaveraged data; however, regression coefficients and τ values are highly similar. Finally, for most cells, model coefficients generated from random tracking accurately estimate simple spike firing in either circular tracking or center-out reach. These findings imply that the cerebellum controls movement kinematics, consistent with a forward internal model that predicts upcoming limb kinematics. PMID:21795616
The validation of a human force model to predict dynamic forces resulting from multi-joint motions
NASA Technical Reports Server (NTRS)
Pandya, Abhilash K.; Maida, James C.; Aldridge, Ann M.; Hasson, Scott M.; Woolford, Barbara J.
1992-01-01
The development and validation is examined of a dynamic strength model for humans. This model is based on empirical data. The shoulder, elbow, and wrist joints were characterized in terms of maximum isolated torque, or position and velocity, in all rotational planes. This data was reduced by a least squares regression technique into a table of single variable second degree polynomial equations determining torque as a function of position and velocity. The isolated joint torque equations were then used to compute forces resulting from a composite motion, in this case, a ratchet wrench push and pull operation. A comparison of the predicted results of the model with the actual measured values for the composite motion indicates that forces derived from a composite motion of joints (ratcheting) can be predicted from isolated joint measures. Calculated T values comparing model versus measured values for 14 subjects were well within the statistically acceptable limits and regression analysis revealed coefficient of variation between actual and measured to be within 0.72 and 0.80.
NASA Technical Reports Server (NTRS)
Hoepffner, Nicolas; Sathyendranath, Shubha
1993-01-01
The contributions of detrital particles and phytoplankton to total light absorption are retrieved by nonlinear regression on the absorption spectra of total particles from various oceanic regions. The model used explains more than 96% of the variance in the observed particle absorption spectra. The resulting absorption spectra of phytoplankton are then decomposed into several Gaussian bands reflecting absorption by phytoplankton pigments. Such a decomposition, combined with high-performance liquid chromatography data on phytoplankton pigment concentrations, allows the computation of specific absorption coefficients for chlorophylls a, b, and c and carotenoids. The spectral values of these in vivo absorption coefficients are then discussed, considering the effects of secondary pigments which were not measured quantitatively. We show that these coefficients can be used to reconstruct the absorption spectra of phytoplankton at various locations and depths. Discrepancies that do occur at some stations are explained in terms of particle size effect. These coefficients can be used to determine the concentrations of phytoplankton pigments in the water, given the absorption spectrum of total particles.
Random effects coefficient of determination for mixed and meta-analysis models.
Demidenko, Eugene; Sargent, James; Onega, Tracy
2012-01-01
The key feature of a mixed model is the presence of random effects. We have developed a coefficient, called the random effects coefficient of determination, [Formula: see text], that estimates the proportion of the conditional variance of the dependent variable explained by random effects. This coefficient takes values from 0 to 1 and indicates how strong the random effects are. The difference from the earlier suggested fixed effects coefficient of determination is emphasized. If [Formula: see text] is close to 0, there is weak support for random effects in the model because the reduction of the variance of the dependent variable due to random effects is small; consequently, random effects may be ignored and the model simplifies to standard linear regression. The value of [Formula: see text] apart from 0 indicates the evidence of the variance reduction in support of the mixed model. If random effects coefficient of determination is close to 1 the variance of random effects is very large and random effects turn into free fixed effects-the model can be estimated using the dummy variable approach. We derive explicit formulas for [Formula: see text] in three special cases: the random intercept model, the growth curve model, and meta-analysis model. Theoretical results are illustrated with three mixed model examples: (1) travel time to the nearest cancer center for women with breast cancer in the U.S., (2) cumulative time watching alcohol related scenes in movies among young U.S. teens, as a risk factor for early drinking onset, and (3) the classic example of the meta-analysis model for combination of 13 studies on tuberculosis vaccine.
New insights into old methods for identifying causal rare variants.
Wang, Haitian; Huang, Chien-Hsun; Lo, Shaw-Hwa; Zheng, Tian; Hu, Inchi
2011-11-29
The advance of high-throughput next-generation sequencing technology makes possible the analysis of rare variants. However, the investigation of rare variants in unrelated-individuals data sets faces the challenge of low power, and most methods circumvent the difficulty by using various collapsing procedures based on genes, pathways, or gene clusters. We suggest a new way to identify causal rare variants using the F-statistic and sliced inverse regression. The procedure is tested on the data set provided by the Genetic Analysis Workshop 17 (GAW17). After preliminary data reduction, we ranked markers according to their F-statistic values. Top-ranked markers were then subjected to sliced inverse regression, and those with higher absolute coefficients in the most significant sliced inverse regression direction were selected. The procedure yields good false discovery rates for the GAW17 data and thus is a promising method for future study on rare variants.
Forecast model applications of retrieved three dimensional liquid water fields
NASA Technical Reports Server (NTRS)
Raymond, William H.; Olson, William S.
1990-01-01
Forecasts are made for tropical storm Emily using heating rates derived from the SSM/I physical retrievals described in chapters 2 and 3. Average values of the latent heating rates from the convective and stratiform cloud simulations, used in the physical retrieval, are obtained for individual 1.1 km thick vertical layers. Then, the layer-mean latent heating rates are regressed against the slant path-integrated liquid and ice precipitation water contents to determine the best fit two parameter regression coefficients for each layer. The regression formulae and retrieved precipitation water contents are utilized to infer the vertical distribution of heating rates for forecast model applications. In the forecast model, diabatic temperature contributions are calculated and used in a diabatic initialization, or in a diabatic initialization combined with a diabatic forcing procedure. Our forecasts show that the time needed to spin-up precipitation processes in tropical storm Emily is greatly accelerated through the application of the data.
Kamali, M A; Ghorbani, S H; Sharbabak, M Moradi; Zamiri, M J
2007-08-01
1. Genetic parameters were estimated in a base population of a closed experimental strain of fowl. Data were obtained on 21 245 Iranian native hens (breeding centre for Fars province) subject to 8 successive generations of selection. This population had been selected for body weight at 12 weeks of age (BW12) and egg number during the first 12 weeks of the laying period (EN), mean egg weight (EW) at weeks 28, 30 and 32, and age at sexual maturity (ASM). 2. The method of multi-traits restricted maximum likelihood with an animal model was used to estimate genetic parameters. Resulting heritabilities for BW12, EN, EW and ASM were 0.68 +/- 0.02, 0.40 +/- 0.02, 0.64 +/- 0.02 and 0.49 +/- 0.02, respectively. 3. Genetic correlations between BW12 and EN, EW and ASM were 0.11 +/- 0.33, 0.54 +/- 0.21 and -0.12 +/- 0.03, respectively. Genetic correlations between EN and EW and ASM were -0.09 +/- 0.03 and -0.85 +/- 0.01, respectively, while between EW and ASM, it was 0.05 +/- 0.03. 4. The overall predicted genetic gains, after 7 generations of selection, estimated by the regression coefficients of the breeding value on generation number were equal to 22.7, 0.17, 0.04 and -1.38, for BW12, EN, EW and ASM, respectively. 5. A pedigree file of 21 245 female and male birds was used to calculate inbreeding coefficients and their influence on production and reproduction traits. Average inbreeding coefficients for all birds, inbred birds, female birds and male birds were 0.048, 0.673, 0.055 and 0.047%, respectively. Regression coefficients of BW12, ASM, EN and EW on inbreeding coefficient for all birds were equal to 0.51 +/- 0.001, 0.31 +/- 0.003, -0.51 +/- 0.003 and 0.03 +/- 0.001, respectively.
Zhou, Yang; Fu, Xiaping; Ying, Yibin; Fang, Zhenhuan
2015-06-23
A fiber-optic probe system was developed to estimate the optical properties of turbid media based on spatially resolved diffuse reflectance. Because of the limitations in numerical calculation of radiative transfer equation (RTE), diffusion approximation (DA) and Monte Carlo simulations (MC), support vector regression (SVR) was introduced to model the relationship between diffuse reflectance values and optical properties. The SVR models of four collection fibers were trained by phantoms in calibration set with a wide range of optical properties which represented products of different applications, then the optical properties of phantoms in prediction set were predicted after an optimal searching on SVR models. The results indicated that the SVR model was capable of describing the relationship with little deviation in forward validation. The correlation coefficient (R) of reduced scattering coefficient μ'(s) and absorption coefficient μ(a) in the prediction set were 0.9907 and 0.9980, respectively. The root mean square errors of prediction (RMSEP) of μ'(s) and μ(a) in inverse validation were 0.411 cm(-1) and 0.338 cm(-1), respectively. The results indicated that the integrated fiber-optic probe system combined with SVR model were suitable for fast and accurate estimation of optical properties of turbid media based on spatially resolved diffuse reflectance. Copyright © 2015 Elsevier B.V. All rights reserved.
The dynamic correlation between policy uncertainty and stock market returns in China
NASA Astrophysics Data System (ADS)
Yang, Miao; Jiang, Zhi-Qiang
2016-11-01
The dynamic correlation is examined between government's policy uncertainty and Chinese stock market returns in the period from January 1995 to December 2014. We find that the stock market is significantly correlated to policy uncertainty based on the results of the Vector Auto Regression (VAR) and Structural Vector Auto Regression (SVAR) models. In contrast, the results of the Dynamic Conditional Correlation Generalized Multivariate Autoregressive Conditional Heteroscedasticity (DCC-MGARCH) model surprisingly show a low dynamic correlation coefficient between policy uncertainty and market returns, suggesting that the fluctuations of each variable are greatly influenced by their values in the preceding period. Our analysis highlights the understanding of the dynamical relationship between stock market and fiscal and monetary policy.
Wagner, Brian J.; Gorelick, Steven M.
1986-01-01
A simulation nonlinear multiple-regression methodology for estimating parameters that characterize the transport of contaminants is developed and demonstrated. Finite difference contaminant transport simulation is combined with a nonlinear weighted least squares multiple-regression procedure. The technique provides optimal parameter estimates and gives statistics for assessing the reliability of these estimates under certain general assumptions about the distributions of the random measurement errors. Monte Carlo analysis is used to estimate parameter reliability for a hypothetical homogeneous soil column for which concentration data contain large random measurement errors. The value of data collected spatially versus data collected temporally was investigated for estimation of velocity, dispersion coefficient, effective porosity, first-order decay rate, and zero-order production. The use of spatial data gave estimates that were 2–3 times more reliable than estimates based on temporal data for all parameters except velocity. Comparison of estimated linear and nonlinear confidence intervals based upon Monte Carlo analysis showed that the linear approximation is poor for dispersion coefficient and zero-order production coefficient when data are collected over time. In addition, examples demonstrate transport parameter estimation for two real one-dimensional systems. First, the longitudinal dispersivity and effective porosity of an unsaturated soil are estimated using laboratory column data. We compare the reliability of estimates based upon data from individual laboratory experiments versus estimates based upon pooled data from several experiments. Second, the simulation nonlinear regression procedure is extended to include an additional governing equation that describes delayed storage during contaminant transport. The model is applied to analyze the trends, variability, and interrelationship of parameters in a mourtain stream in northern California.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ozaki, Toshiro, E-mail: ganronbun@amail.plala.or.jp; Seki, Hiroshi; Shiina, Makoto
2009-09-15
The purpose of the present study was to elucidate a method for predicting the intrahepatic arteriovenous shunt rate from computed tomography (CT) images and biochemical data, instead of from arterial perfusion scintigraphy, because adverse exacerbated systemic effects may be induced in cases where a high shunt rate exists. CT and arterial perfusion scintigraphy were performed in patients with liver metastases from gastric or colorectal cancer. Biochemical data and tumor marker levels of 33 enrolled patients were measured. The results were statistically verified by multiple regression analysis. The total metastatic hepatic tumor volume (V{sub metastasized}), residual hepatic parenchyma volume (V{sub residual};more » calculated from CT images), and biochemical data were treated as independent variables; the intrahepatic arteriovenous (IHAV) shunt rate (calculated from scintigraphy) was treated as a dependent variable. The IHAV shunt rate was 15.1 {+-} 11.9%. Based on the correlation matrixes, the best correlation coefficient of 0.84 was established between the IHAV shunt rate and V{sub metastasized} (p < 0.01). In the multiple regression analysis with the IHAV shunt rate as the dependent variable, the coefficient of determination (R{sup 2}) was 0.75, which was significant at the 0.1% level with two significant independent variables (V{sub metastasized} and V{sub residual}). The standardized regression coefficients ({beta}) of V{sub metastasized} and V{sub residual} were significant at the 0.1 and 5% levels, respectively. Based on this result, we can obtain a predicted value of IHAV shunt rate (p < 0.001) using CT images. When a high shunt rate was predicted, beneficial and consistent clinical monitoring can be initiated in, for example, hepatic arterial infusion chemotherapy.« less
NASA Astrophysics Data System (ADS)
Cheymol, Anne; de Backer, Hugo; Josefsson, Weine; Stübi, René
2006-08-01
The Aerosol Optical Depths (AODs) retrieved from Brewer Ozone Spectrophotometer measurements with a method previously developed (Cheymol and De Backer, 2003) are now validated by comparisons between AODs from six Brewer spectrophotometers and two CSEM SPM2000 sunphotometers: two Brewer spectrophotometers 016 and 178 at Uccle in Belgium; one Brewer spectrophotometer 128 and one sunphotometer CSEM SPM2000 at Norrköping in Sweden; and three Brewer instruments 040, 072, 156 at Arosa and one CSEM SPM2000 sunphotometer at Davos in Switzerland. The comparison between AODs from Brewer spectrophotometer 128 at 320.1 nm and sunphotometer SPM2000 at 368 nm at Norrköping shows that the AODs obtained from the Brewer measurements with the Langley Plot Method (LPM) are very accurate if the neutral density filter spectral transmittances are well known: with the measured values of these filters, the correlation coefficient, the slope, and the intercept of the regression line are 0.98, 0.85 ± 0.004, and 0.02 ± 0.0014, respectively. The bias observed is mainly owing to the wavelength difference between the two instruments. The comparison between AODs from different Brewer spectrophotometers confirm that AODs will be in very good agreement if they are measured with several Brewer instruments at the same place: At Uccle, the correlation coefficient, slope, and intercept of the regression line are 0.98, 1.02 ± 0.003, and 0.06 ± 0.001, respectively; at Arosa, the comparisons between the AODs from three Brewer spectrophotometers 040, 072, and 156 give a correlation coefficient, a slope, and an intercept of the regression line above 0.94, 0.98 and below 0.04, respectively.
Mohaghegh, Pegah; Yavari, Parvin; Akbari, Mohammad Esmaeil; Abadi, Alireza; Ahmadi, Farzaneh; Shormeij, Zeinab
2015-01-01
Background Not only the expand development of knowledge for reducing risk factors, but also the improvement in early diagnosis and treatment of cancer, and socioeconomic inequalities could affect cancer incidence, diagnosis stage, and mortality. The aim of this study was investigation the relationships between family levels of socioeconomic status and distribution of breast cancer risk factors. Methods This descriptive cross-sectional study has conducted on 526 patients who were suffering from breast cancer, and have registered in Cancer Research Center of Shahid Beheshti University of Medical Sciences from March 2008 to December 2013. A reliable and valid questionnaire about family levels of socioeconomic status has filled by interviewing the patients via phone. For analyzing the data, Multinomial logistic regression, Kendal tau-b correlation coefficient and Contingency Coefficient tests have executed by SPSS19. Results The mean age of the patients was 48.30 (SD=11.41). According to the results of this study, there was a significant relationship between family socioeconomic status and patient's age at diagnosis of breast cancer (p value<0.001). Also, the relationships between socioeconomic status and number of pregnancies, and duration of breast feeding were significant (p value> 0.001). In the multiple logistic regressions, the relationship between excellent socioeconomic status and number of abortions was significant (p value> 0.007). Furthermore, the relationships between moderate and good socioeconomic statuses and smoking were significant (p value=0.05 and p value=0.02, respectively). Conclusion The results have indicated that among those patients having better socioeconomic status, age at cancer diagnosis, number of pregnancies and duration of breast feeding was lower, and then number of abortions was more than the others. According to the results of this study, it was really important to focus on family socioeconomic status as a critical and effective variable on breast cancer risk factors among the Iranian women. PMID:25821572
Mohaghegh, Pegah; Yavari, Parvin; Akbari, Mohammad Esmaeil; Abadi, Alireza; Ahmadi, Farzaneh; Shormeij, Zeinab
2015-01-01
Not only the expand development of knowledge for reducing risk factors, but also the improvement in early diagnosis and treatment of cancer, and socioeconomic inequalities could affect cancer incidence, diagnosis stage, and mortality. The aim of this study was investigation the relationships between family levels of socioeconomic status and distribution of breast cancer risk factors. This descriptive cross-sectional study has conducted on 526 patients who were suffering from breast cancer, and have registered in Cancer Research Center of Shahid Beheshti University of Medical Sciences from March 2008 to December 2013. A reliable and valid questionnaire about family levels of socioeconomic status has filled by interviewing the patients via phone. For analyzing the data, Multinomial logistic regression, Kendal tau-b correlation coefficient and Contingency Coefficient tests have executed by SPSS19. The mean age of the patients was 48.30 (SD=11.41). According to the results of this study, there was a significant relationship between family socioeconomic status and patient's age at diagnosis of breast cancer (p value<0.001). Also, the relationships between socioeconomic status and number of pregnancies, and duration of breast feeding were significant (p value> 0.001). In the multiple logistic regressions, the relationship between excellent socioeconomic status and number of abortions was significant (p value> 0.007). Furthermore, the relationships between moderate and good socioeconomic statuses and smoking were significant (p value=0.05 and p value=0.02, respectively). The results have indicated that among those patients having better socioeconomic status, age at cancer diagnosis, number of pregnancies and duration of breast feeding was lower, and then number of abortions was more than the others. According to the results of this study, it was really important to focus on family socioeconomic status as a critical and effective variable on breast cancer risk factors among the Iranian women.
Confidence Intervals for Squared Semipartial Correlation Coefficients: The Effect of Nonnormality
ERIC Educational Resources Information Center
Algina, James; Keselman, H. J.; Penfield, Randall D.
2010-01-01
The increase in the squared multiple correlation coefficient ([delta]R[superscript 2]) associated with a variable in a regression equation is a commonly used measure of importance in regression analysis. Algina, Keselman, and Penfield found that intervals based on asymptotic principles were typically very inaccurate, even though the sample size…
Estimation of octanol/water partition coefficients using LSER parameters
Luehrs, Dean C.; Hickey, James P.; Godbole, Kalpana A.; Rogers, Tony N.
1998-01-01
The logarithms of octanol/water partition coefficients, logKow, were regressed against the linear solvation energy relationship (LSER) parameters for a training set of 981 diverse organic chemicals. The standard deviation for logKow was 0.49. The regression equation was then used to estimate logKow for a test of 146 chemicals which included pesticides and other diverse polyfunctional compounds. Thus the octanol/water partition coefficient may be estimated by LSER parameters without elaborate software but only moderate accuracy should be expected.
Forecast of future aviation fuels: The model
NASA Technical Reports Server (NTRS)
Ayati, M. B.; Liu, C. Y.; English, J. M.
1981-01-01
A conceptual models of the commercial air transportation industry is developed which can be used to predict trends in economics, demand, and consumption. The methodology is based on digraph theory, which considers the interaction of variables and propagation of changes. Air transportation economics are treated by examination of major variables, their relationships, historic trends, and calculation of regression coefficients. A description of the modeling technique and a compilation of historic airline industry statistics used to determine interaction coefficients are included. Results of model validations show negligible difference between actual and projected values over the twenty-eight year period of 1959 to 1976. A limited application of the method presents forecasts of air tranportation industry demand, growth, revenue, costs, and fuel consumption to 2020 for two scenarios of future economic growth and energy consumption.
Thermal requirements of Dermanyssus gallinae (De Geer, 1778) (Acari: Dermanyssidae).
Tucci, Edna Clara; do Prado, Angelo P; de Araújo, Raquel Pires
2008-01-01
The thermal requirements for development of Dermanyssus gallinae were studied under laboratory conditions at 15, 20, 25, 30 and 35 degrees C, a 12h photoperiod and 60-85% RH. The thermal requirements for D. gallinae were as follows. Preoviposition: base temperature 3.4 degrees C, thermal constant (k) 562.85 degree-hours, determination coefficient (R(2)) 0.59, regression equation: Y= -0.006035 + 0.001777x. Egg: base temperature 10.60 degrees C, thermal constant (k) 689.65 degree-hours, determination coefficient (R(2)) 0.94, regression equation: Y= -0.015367 + 0.001450x. Larva: base temperature 9.82 degrees C, thermal constant (k) 464.91 degree-hours, determination coefficient (R(2)) 0.87, regression equation: Y= -0.021123 + 0.002151x. Protonymph: base temperature 10.17 degrees C, thermal constant (k) 504.49 degree-hours, determination coefficient (R(2)) 0.90, regression equation: Y= -0.020152 + 0.001982x. Deutonymph: base temperature 11.80 degrees C, thermal constant (k) 501.11 degree-hours, determination coefficient (R(2)) 0.99, regression equation: Y= -0.023555 + 0.001996x. The results obtained showed that 15 to 42 generations of Dermanyssus gallinae may occur during the year in the State of São Paulo, as estimated based on isotherm charts. Dermanyssus gallinae may develop continually in the State of São Paulo, with a population decrease in the winter. There were differences between the developmental stages of D. gallinae in relation to thermal requirements.
Srinivas, Nuggehally R; Syed, Muzeeb
2016-01-01
Limited pharmacokinetic sampling strategy may be useful for predicting the area under the curve (AUC) for triptans and may have clinical utility as a prospective tool for prediction. Using appropriate intranasal pharmacokinetic data, a Cmax vs. AUC relationship was established by linear regression models for sumatriptan and zolmitriptan. The predictions of the AUC values were performed using published mean/median Cmax data and appropriate regression lines. The quotient of observed and predicted values rendered fold-difference calculation. The mean absolute error (MAE), mean positive error (MPE), mean negative error (MNE), root mean square error (RMSE), correlation coefficient (r), and the goodness of the AUC fold prediction were used to evaluate the two triptans. Also, data from the mean concentration profiles at time points of 1 hour (sumatriptan) and 3 hours (zolmitriptan) were used for the AUC prediction. The Cmax vs. AUC models displayed excellent correlation for both sumatriptan (r = .9997; P < .001) and zolmitriptan (r = .9999; P < .001). Irrespective of the two triptans, the majority of the predicted AUCs (83%-85%) were within 0.76-1.25-fold difference using the regression model. The prediction of AUC values for sumatriptan or zolmitriptan using the concentration data that reflected the Tmax occurrence were in the proximity of the reported values. In summary, the Cmax vs. AUC models exhibited strong correlations for sumatriptan and zolmitriptan. The usefulness of the prediction of the AUC values was established by a rigorous statistical approach.
Vesicular stomatitis forecasting based on Google Trends
Lu, Yi; Zhou, GuangYa; Chen, Qin
2018-01-01
Background Vesicular stomatitis (VS) is an important viral disease of livestock. The main feature of VS is irregular blisters that occur on the lips, tongue, oral mucosa, hoof crown and nipple. Humans can also be infected with vesicular stomatitis and develop meningitis. This study analyses 2014 American VS outbreaks in order to accurately predict vesicular stomatitis outbreak trends. Methods American VS outbreaks data were collected from OIE. The data for VS keywords were obtained by inputting 24 disease-related keywords into Google Trends. After calculating the Pearson and Spearman correlation coefficients, it was found that there was a relationship between outbreaks and keywords derived from Google Trends. Finally, the predicted model was constructed based on qualitative classification and quantitative regression. Results For the regression model, the Pearson correlation coefficients between the predicted outbreaks and actual outbreaks are 0.953 and 0.948, respectively. For the qualitative classification model, we constructed five classification predictive models and chose the best classification predictive model as the result. The results showed, SN (sensitivity), SP (specificity) and ACC (prediction accuracy) values of the best classification predictive model are 78.52%,72.5% and 77.14%, respectively. Conclusion This study applied Google search data to construct a qualitative classification model and a quantitative regression model. The results show that the method is effective and that these two models obtain more accurate forecast. PMID:29385198
How to predict the sugariness and hardness of melons: A near-infrared hyperspectral imaging method.
Sun, Meijun; Zhang, Dong; Liu, Li; Wang, Zheng
2017-03-01
Hyperspectral imaging (HSI) in the near-infrared (NIR) region (900-1700nm) was used for non-intrusive quality measurements (of sweetness and texture) in melons. First, HSI data from melon samples were acquired to extract the spectral signatures. The corresponding sample sweetness and hardness values were recorded using traditional intrusive methods. Partial least squares regression (PLSR), principal component analysis (PCA), support vector machine (SVM), and artificial neural network (ANN) models were created to predict melon sweetness and hardness values from the hyperspectral data. Experimental results for the three types of melons show that PLSR produces the most accurate results. To reduce the high dimensionality of the hyperspectral data, the weighted regression coefficients of the resulting PLSR models were used to identify the most important wavelengths. On the basis of these wavelengths, each image pixel was used to visualize the sweetness and hardness in all the portions of each sample. Copyright © 2016 Elsevier Ltd. All rights reserved.
Song, Seung Yeob; Lee, Young Koung; Kim, In-Jung
2016-01-01
A high-throughput screening system for Citrus lines were established with higher sugar and acid contents using Fourier transform infrared (FT-IR) spectroscopy in combination with multivariate analysis. FT-IR spectra confirmed typical spectral differences between the frequency regions of 950-1100 cm(-1), 1300-1500 cm(-1), and 1500-1700 cm(-1). Principal component analysis (PCA) and subsequent partial least square-discriminant analysis (PLS-DA) were able to discriminate five Citrus lines into three separate clusters corresponding to their taxonomic relationships. The quantitative predictive modeling of sugar and acid contents from Citrus fruits was established using partial least square regression algorithms from FT-IR spectra. The regression coefficients (R(2)) between predicted values and estimated sugar and acid content values were 0.99. These results demonstrate that by using FT-IR spectra and applying quantitative prediction modeling to Citrus sugar and acid contents, excellent Citrus lines can be early detected with greater accuracy. Copyright © 2015 Elsevier Ltd. All rights reserved.
Structural features that predict real-value fluctuations of globular proteins.
Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke
2012-05-01
It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics (MD) trajectories of nonhomologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real value of residue fluctuations using the support vector regression (SVR). It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in MD trajectories. Moreover, SVR that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson's correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed in predictions by the Gaussian network model (GNM). An advantage of the developed method over the GNMs is that the former predicts the real value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. Copyright © 2012 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Ghosh, S. M.; Behera, M. D.
2017-12-01
Forest aboveground biomass (AGB) is an important factor for preparation of global policy making decisions to tackle the impact of climate change. Several previous studies has concluded that remote sensing methods are more suitable for estimating forest biomass on regional scale. Among all available remote sensing data and methods, Synthetic Aperture Radar (SAR) data in combination with decision tree based machine learning algorithms has shown better promise in estimating higher biomass values. There aren't many studies done for biomass estimation of dense Indian tropical forests with high biomass density. In this study aboveground biomass was estimated for two major tree species, Sal (Shorea robusta) and Teak (Tectona grandis), of Katerniaghat Wildlife Sanctuary, a tropical forest situated in northern India. Biomass was estimated by combining C-band SAR data from Sentinel-1A satellite, vegetation indices produced using Sentinel-2A data and ground inventory plots. Along with SAR backscatter value, SAR texture images were also used as input as earlier studies had found that image texture has a correlation with vegetation biomass. Decision tree based nonlinear machine learning algorithms were used in place of parametric regression models for establishing relationship between fields measured values and remotely sensed parameters. Using random forest model with a combination of vegetation indices with SAR backscatter as predictor variables shows best result for Sal forest, with a coefficient of determination value of 0.71 and a RMSE value of 105.027 t/ha. In teak forest also best result can be found in the same combination but for stochastic gradient boosted model with a coefficient of determination value of 0.6 and a RMSE value of 79.45 t/ha. These results are mostly better than the results of other studies done for similar kind of forests. This study shows that Sentinel series satellite data has exceptional capabilities in estimating dense forest AGB and machine learning algorithms are better means to do so than parametric regression models.
Factor Scores, Structure Coefficients, and Communality Coefficients
ERIC Educational Resources Information Center
Goodwyn, Fara
2012-01-01
This paper presents heuristic explanations of factor scores, structure coefficients, and communality coefficients. Common misconceptions regarding these topics are clarified. In addition, (a) the regression (b) Bartlett, (c) Anderson-Rubin, and (d) Thompson methods for calculating factor scores are reviewed. Syntax necessary to execute all four…
Sood, Neeraj; Ghosh, Arkadipta; Escarce, José J
2009-01-01
Objective To estimate the effect of growth in health care costs that outpaces gross domestic product (GDP) growth (“excess” growth in health care costs) on employment, gross output, and value added to GDP of U.S. industries. Study Setting We analyzed data from 38 U.S. industries for the period 1987–2005. All data are publicly available from various government agencies. Study Design We estimated bivariate and multivariate regressions. To develop the regression models, we assumed that rapid growth in health care costs has a larger effect on economic performance for industries where large percentages of workers receive employer-sponsored health insurance (ESI). We used the estimated regression coefficients to simulate economic outcomes under alternative scenarios of health care cost inflation. Results Faster growth in health care costs had greater adverse effects on economic outcomes for industries with larger percentages of workers who had ESI. We found that a 10 percent increase in excess growth in health care costs would have resulted in 120,803 fewer jobs, US$28,022 million in lost gross output, and US$14,082 million in lost value added in 2005. These declines represent 0.17 to 0.18 percent of employment, gross output, and value added in 2005. Conclusion Excess growth in health care costs is adversely affecting the economic performance of U.S. industries. PMID:19500165
Multiple linear regression analysis
NASA Technical Reports Server (NTRS)
Edwards, T. R.
1980-01-01
Program rapidly selects best-suited set of coefficients. User supplies only vectors of independent and dependent data and specifies confidence level required. Program uses stepwise statistical procedure for relating minimal set of variables to set of observations; final regression contains only most statistically significant coefficients. Program is written in FORTRAN IV for batch execution and has been implemented on NOVA 1200.
Poitout, V; Moatti-Sirat, D; Reach, G
1992-01-01
The feasibility of calibrating a glucose sensor by using a wearable glucose meter for blood glucose determination and moderate variations of blood glucose concentration was assessed. Six miniaturized glucose sensors were implanted in the subcutaneous tissue of conscious dogs, and the parameters used for the in vivo calibration of the sensor (sensitivity coefficient and extrapolated current in the absence of glucose) were determined from values of blood glucose and sensor response obtained during glucose infusion. (1) Venous plasma glucose level and venous total blood glucose level were measured simultaneously on the same sample, using a Beckman analyser and a Glucometer II, respectively. The regression between plasma glucose (x) and whole blood glucose (y) was y = 1.12x-0.08 mM (n = 114 values, r = 0.96, p = 0.0001). The error grid analysis indicated that the use of a Glucometer II for blood glucose determination was appropriate in dogs. (2) The in vivo sensitivity coefficients were 0.57 +/- 0.11 nA mM-1 when determined from plasma glucose, and 0.51 +/- 0.07 nA mM-1 when determined from whole blood glucose (t = 1.53, p = 0.18, n.s.). The background currents were 0.88 +/- 0.57 nA when determined from plasma glucose, and 0.63 +/- 0.77 nA when determined from whole blood glucose (t = 0.82, p = 0.45, n.s.). (3) The regression equation of the estimation of the subcutaneous glucose level obtained from the two methods was y = 1.04x + 0.56 mM (n = 171 values, r = 0.98, p = 0.0001).(ABSTRACT TRUNCATED AT 250 WORDS)
The relationship between body mass index and uric acid: a study on Japanese adult twins.
Tanaka, Kentaro; Ogata, Soshiro; Tanaka, Haruka; Omura, Kayoko; Honda, Chika; Hayakawa, Kazuo
2015-09-01
The present study aimed to investigate the association between body mass index (BMI) and uric acid (UA) using the twin study methodology to adjust for genetic factors. The association between BMI and UA was investigated in a cross-sectional study using data from both monozygotic and dizygotic twins registered at the Osaka University Center for Twin Research and the Osaka University Graduate School of Medicine. From January 2011 to March 2014, 422 individuals participated in the health examination. We measured height, weight, age, BMI, lifestyle habits (Breslow's Health Practice Index), serum UA, and serum creatinine. To investigate the association between UA and BMI with adjustment for the clustering of a twin within a pair, individual-level analyses were performed using generalized linear mixed models (GLMMs). To investigate an association with adjustment for genetic and family environmental factors, twin-pair difference values analyses were performed. In all analysis, BMI was associated with UA in men and women. Using the GLMMs, standardized regression coefficients were 0.194 (95 % confidence interval: 0.016-0.373) in men and 0.186 (95 % confidence interval: 0.071-0.302) in women. Considering twin-pair difference value analyses, standardized regression coefficients were 0.333 (95 % confidence interval: 0.072-0.594) in men and 0.314 (95 % confidence interval: 0.151-0.477) in women. The present study shows that BMI was significantly associated with UA, after adjusting for both genetic and familial environment factors in both men and women.
Li, Xuehua; Zhao, Wenxing; Li, Jing; Jiang, Jingqiu; Chen, Jianji; Chen, Jingwen
2013-08-01
To assess the persistence and fate of volatile organic compounds in the troposphere, the rate constants for the reaction with ozone (kO3) are needed. As kO3 values are only available for hundreds of compounds, and experimental determination of kO3 is costly and time-consuming, it is of importance to develop predictive models on kO3. In this study, a total of 379 logkO3 values at different temperatures were used to develop and validate a model for the prediction of kO3, based on quantum chemical descriptors, Dragon descriptors and structural fragments. Molecular descriptors were screened by stepwise multiple linear regression, and the model was constructed by partial least-squares regression. The cross validation coefficient QCUM(2) of the model is 0.836, and the external validation coefficient Qext(2) is 0.811, indicating that the model has high robustness and good predictive performance. The most significant descriptor explaining logkO3 is the BELm2 descriptor with connectivity information weighted atomic masses. kO3 increases with increasing BELm2, and decreases with increasing ionization potential. The applicability domain of the proposed model was visualized by the Williams plot. The developed model can be used to predict kO3 at different temperatures for a wide range of organic chemicals, including alkenes, cycloalkenes, haloalkenes, alkynes, oxygen-containing compounds, nitrogen-containing compounds (except primary amines) and aromatic compounds. Copyright © 2013 Elsevier Ltd. All rights reserved.
Low-level lead exposure and the IQ of children. A meta-analysis of modern studies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Needleman, H.L.; Gatsonis, C.A.
1990-02-02
We identified 24 modern studies of childhood exposures to lead in relation to IQ. From this population, 12 that employed multiple regression analysis with IQ as the dependent variable and lead as the main effect and that controlled for nonlead covariates were selected for a quantitative, integrated review or meta-analysis. The studies were grouped according to type of tissue analyzed for lead. There were 7 blood and 5 tooth lead studies. Within each group, we obtained joint P values by two different methods and average effect sizes as measured by the partial correlation coefficients. We also investigated the sensitivity ofmore » the results to any single study. The sample sizes ranged from 75 to 724. The sign of the regression coefficient for lead was negative in 11 of 12 studies. The negative partial r's for lead ranged from -.27 to -.003. The power to find an effect was limited, below 0.6 in 7 of 12 studies. The joint P values for the blood lead studies were less than .0001 for both methods of analysis (95% confidence interval for group partial r, -.15 {plus minus} .05), while for the tooth lead studies they were .0005 and .004, respectively (95% confidence interval for group partial r, -.08 {plus minus} .05). The hypothesis that lead impairs children's IQ at low dose is strongly supported by this quantitative review. The effect is robust to the impact of any single study.« less
Specifics of soil temperature under winter oilseed rape canopy
NASA Astrophysics Data System (ADS)
Krčmářová, Jana; Středa, Tomáš; Pokorný, Radovan
2014-09-01
The aim of this study was to evaluate the course of soil temperature under the winter oilseed rape canopy and to determine relationships between soil temperature, air temperature and partly soil moisture. In addition, the aim was to describe the dependence by means of regression equations usable for pests and pathogens prediction, crop development, and yields models. The measurement of soil and near the ground air temperatures was performed at the experimental field Žabiče (South Moravia, the Czech Republic). The course of temperature was determined under or in the winter oilseed rape canopy during spring growth season in the course of four years (2010 - 2012 and 2014). In all years, the standard varieties (Petrol, Sherpa) were grown, in 2014 the semi-dwarf variety PX104 was added. Automatic soil sensors were positioned at three depths (0.05, 0.10 and 0.20 m) under soil surface, air temperature sensors in 0.05 m above soil surfaces. The course of soil temperature differs significantly between standard (Sherpa and Petrol) and semi-dwarf (PX104) varieties. Results of the cross correlation analysis showed, that the best interrelationships between air and soil temperature were achieved in 2 hours delay for the soil temperature in 0.05 m, 4 hour delay for 0.10 m and 7 hour delay for 0.20 m for standard varieties. For semi-dwarf variety, this delay reached 6 hour for the soil temperature in 0.05 m, 7 hour delay for 0.10 m and 11 hour for 0.20 m. After the time correction, the determination coefficient (R2) reached values from 0.67 to 0.95 for 0.05 m, 0.50 to 0.84 for 0.10 m in variety Sherpa during all experimental years. For variety PX104 this coefficient reached values from 0.51 to 0.72 in 0.05 m depth and from 0.39 to 0.67 in 0.10 m depth in the year 2014. The determination coefficient in the 0.20 m depth was lower for both varieties; its values were from 0.15 to 0.65 in variety Sherpa. In variety PX104 the values of R2 from 0.23 to 0.57 were determined. When using multiple regressions with quadratic spacing (modelling of hourly soil temperature based on the hourly near surface air temperature and hourly soil moisture in the 0.10-0.40 m profile), the difference between the measured and modelled soil temperatures in the depth of 0.05 m was -3.92 to 3.99°C. The regression equation paired with alternative agrometeorological instruments enables relatively accurate modelling of soil temperatures (R2 = 0.95).
Kaitaniemi, Pekka
2008-04-09
Allometric equations are widely used in many branches of biological science. The potential information content of the normalization constant b in allometric equations of the form Y = bX(a) has, however, remained largely neglected. To demonstrate the potential for utilizing this information, I generated a large number of artificial datasets that resembled those that are frequently encountered in biological studies, i.e., relatively small samples including measurement error or uncontrolled variation. The value of X was allowed to vary randomly within the limits describing different data ranges, and a was set to a fixed theoretical value. The constant b was set to a range of values describing the effect of a continuous environmental variable. In addition, a normally distributed random error was added to the values of both X and Y. Two different approaches were then used to model the data. The traditional approach estimated both a and b using a regression model, whereas an alternative approach set the exponent a at its theoretical value and only estimated the value of b. Both approaches produced virtually the same model fit with less than 0.3% difference in the coefficient of determination. Only the alternative approach was able to precisely reproduce the effect of the environmental variable, which was largely lost among noise variation when using the traditional approach. The results show how the value of b can be used as a source of valuable biological information if an appropriate regression model is selected.
NASA Astrophysics Data System (ADS)
Eliseev, A. V.; Mokhov, I. I.; Guseva, M. S.
2006-05-01
The ERA40 and NCEP/NCAR data over 1958 1998 were used to estimate the sensitivity of amplitude-phase characteristics (APCs) of the annual cycle (AC) of the surface air temperature (SAT) T s. The results were compared with outputs of the ECHAM4/OPYC3, HadCM3, and INM RAS general circulation models and the IAP RAS climate model of intermediate complexity, which were run with variations in greenhouse gases and sulfate aerosol specified over 1860 2100. The analysis was performed in terms of the linear regression coefficients b of SAT AC APCs on the local annual mean temperature and in terms of the sensitivity characteristic D = br 2, which takes into account not only the linear regression coefficient but also its statistical significance (via the correlation coefficient r). The reanalysis data were used to reveal the features of the tendencies of change in the SAT AC APCs in various regions, including areas near the snow-ice boundary, storm-track ocean regions, large desert areas, and the tropical Pacific. These results agree with earlier observations. The model computations are in fairly good agreement with the reanalysis data in regions of statistically significant variations in SAT AC APCs. The differences between individual models and the reanalysis data can be explained, in particular, in terms of the features of the sea-ice schemes used in the models. Over the land in the middle and high latitudes of the Northern Hemisphere, the absolute values of D for the fall phase time and the interval of exceeding exhibit a positive intermodel correlation with the absolute value of D for the annual-harmonic amplitude. Over the ocean, the models reproducing larger (in modulus) sensitivity parameters of the SAT annual-harmonic amplitude are generally characterized by larger (in modulus) negative sensitivity values of the semiannual-harmonic amplitude T s, 2, especially at latitudes characteristic of the sea-ice boundary. In contrast to the averaged fields of AC APCs and their interannual standard deviations, the sensitivity parameters of the SAT AC APCs on a regional scale vary noticeably for various types of anthropogenic forcing.
Structured penalties for functional linear models-partially empirical eigenvectors for regression.
Randolph, Timothy W; Harezlak, Jaroslaw; Feng, Ziding
2012-01-01
One of the challenges with functional data is incorporating geometric structure, or local correlation, into the analysis. This structure is inherent in the output from an increasing number of biomedical technologies, and a functional linear model is often used to estimate the relationship between the predictor functions and scalar responses. Common approaches to the problem of estimating a coefficient function typically involve two stages: regularization and estimation. Regularization is usually done via dimension reduction, projecting onto a predefined span of basis functions or a reduced set of eigenvectors (principal components). In contrast, we present a unified approach that directly incorporates geometric structure into the estimation process by exploiting the joint eigenproperties of the predictors and a linear penalty operator. In this sense, the components in the regression are 'partially empirical' and the framework is provided by the generalized singular value decomposition (GSVD). The form of the penalized estimation is not new, but the GSVD clarifies the process and informs the choice of penalty by making explicit the joint influence of the penalty and predictors on the bias, variance and performance of the estimated coefficient function. Laboratory spectroscopy data and simulations are used to illustrate the concepts.
Zhong-xiang, Feng; Shi-sheng, Lu; Wei-hua, Zhang; Nan-nan, Zhang
2014-01-01
In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability. PMID:25610454
Feng, Zhong-xiang; Lu, Shi-sheng; Zhang, Wei-hua; Zhang, Nan-nan
2014-01-01
In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability.
Acevedo-Mendoza, Wilmer F; Buitrago Gómez, Diana Paola; Atehortua-Otero, Miguel Ángel; Páez, Miguel Ángel; Jiménez-Rincón, Manuela; Lagos-Grisales, Guillermo J; Rodríguez-Morales, Alfonso J
2017-03-01
Bacterial meningitis is an important cause of infectious neurological morbidity and mortality. Its incidence has decreased with the introduction of vaccination programmes against preventable agents. However, low-income and middle-income countries with poor access to health care still have a significant burden of the disease. Thus, the relationship between the Gini coefficient and H. influenzae and M. tuberculosis meningitis incidence in Colombia, during 2008-2011, was assessed. In this ecological study, the Gini coefficient was obtained from the Colombian Department of Statistics, incidence rates were calculated (cases/1,000,000 pop) and linear regressions were performed using the Gini coefficient, to assess the relationship between the latter and the incidence of meningitis. It was observed that when inequality increases in the Colombian departments, the incidence of meningitis also increases, with a significant association in the models (p<0.01) for both M. tuberculosis (r²=0.2382; p<0.001) and H. influenzae (r²=0.2509; p<0.001). This research suggests that high Gini coefficient values influence the incidence of Mycobacterium tuberculosis and Haemophilus influenzae meningitis, showing that social inequality is critical to disease occurrence. Early detection, supervised treatment, vaccination coverage, access to health care are efficient control strategies.
Testing for gene-environment interaction under exposure misspecification.
Sun, Ryan; Carroll, Raymond J; Christiani, David C; Lin, Xihong
2017-11-09
Complex interplay between genetic and environmental factors characterizes the etiology of many diseases. Modeling gene-environment (GxE) interactions is often challenged by the unknown functional form of the environment term in the true data-generating mechanism. We study the impact of misspecification of the environmental exposure effect on inference for the GxE interaction term in linear and logistic regression models. We first examine the asymptotic bias of the GxE interaction regression coefficient, allowing for confounders as well as arbitrary misspecification of the exposure and confounder effects. For linear regression, we show that under gene-environment independence and some confounder-dependent conditions, when the environment effect is misspecified, the regression coefficient of the GxE interaction can be unbiased. However, inference on the GxE interaction is still often incorrect. In logistic regression, we show that the regression coefficient is generally biased if the genetic factor is associated with the outcome directly or indirectly. Further, we show that the standard robust sandwich variance estimator for the GxE interaction does not perform well in practical GxE studies, and we provide an alternative testing procedure that has better finite sample properties. © 2017, The International Biometric Society.
Nakamura, Kengo; Yasutaka, Tetsuo; Kuwatani, Tatsu; Komai, Takeshi
2017-11-01
In this study, we applied sparse multiple linear regression (SMLR) analysis to clarify the relationships between soil properties and adsorption characteristics for a range of soils across Japan and identify easily-obtained physical and chemical soil properties that could be used to predict K and n values of cadmium, lead and fluorine. A model was first constructed that can easily predict the K and n values from nine soil parameters (pH, cation exchange capacity, specific surface area, total carbon, soil organic matter from loss on ignition and water holding capacity, the ratio of sand, silt and clay). The K and n values of cadmium, lead and fluorine of 17 soil samples were used to verify the SMLR models by the root mean square error values obtained from 512 combinations of soil parameters. The SMLR analysis indicated that fluorine adsorption to soil may be associated with organic matter, whereas cadmium or lead adsorption to soil is more likely to be influenced by soil pH, IL. We found that an accurate K value can be predicted from more than three soil parameters for most soils. Approximately 65% of the predicted values were between 33 and 300% of their measured values for the K value; 76% of the predicted values were within ±30% of their measured values for the n value. Our findings suggest that adsorption properties of lead, cadmium and fluorine to soil can be predicted from the soil physical and chemical properties using the presented models. Copyright © 2017 Elsevier Ltd. All rights reserved.
Rabasco, Paola; Caivano, Rocchina; Simeon, Vittorio; Dinardo, Giuseppina; Lotumolo, Antonella; Gioioso, Matilde; Villonio, Antonio; Iannelli, Giancarlo; D'Antuono, Felice; Zandolino, Alexis; Macarini, Luca; Guglielmi, Giuseppe; Cammarota, Aldo
2017-02-07
To analyze diffusion-weighted imaging (DWI) and the related apparent diffusion coefficient (ADC) in women with breast cancer, correlating these values with the presence at 3 years of distant metastases, and to demonstrate that DWI-Magnetic Resonance Imaging (MRI) and related ADC values may represent a prognostic value in the study of women with breast cancer. Sixty women (aged 45-73 years) affected with breast cancer with a follow-up in 3 years were enrolled. On DWI, we obtained the ADC values, and these were correlated with the clinical condition of patients at 3 years. Moreover, tumour size, lymph node status, and molecular markers, including estrogens receptor, progesterone receptor, Ki-67 index, and human growth factor receptor 2 protein, were correlated with ADC values. This study was approved by the Scientific Committee of our institution. We considered patients with metastasis at 3 years (12 patients - 20%) and without metastasis (48 patients - 80%). The mean ADC value in patients with no metastases at 3 years was 1.06 ± 0.38, while for patients with metastases it was 0.74 ± 0.34 (p = .011). The receiver-operator curve analysis identified a value of 0.75 (<0.75 with risk to develop metastasis) as the best predictive cutoff for ADC values, with the highest sensitivity (81.25%) and higher specificity (66.67%). After regression analysis, ADC value, positivity to estrogen-progestin receptors, and presence of lymph nodes were the only prognostic factors found to be statistically significant. DWI-MRI and related ADC values may represent a prognostic value in women with breast cancer.
ERIC Educational Resources Information Center
Mugrage, Beverly; And Others
Three ridge regression solutions are compared with ordinary least squares regression and with principal components regression using all components. Ridge regression, particularly the Lawless-Wang solution, out-performed ordinary least squares regression and the principal components solution on the criteria of stability of coefficient and closeness…
[Hyperspectral Estimation of Apple Tree Canopy LAI Based on SVM and RF Regression].
Han, Zhao-ying; Zhu, Xi-cun; Fang, Xian-yi; Wang, Zhuo-yuan; Wang, Ling; Zhao, Geng-Xing; Jiang, Yuan-mao
2016-03-01
Leaf area index (LAI) is the dynamic index of crop population size. Hyperspectral technology can be used to estimate apple canopy LAI rapidly and nondestructively. It can be provide a reference for monitoring the tree growing and yield estimation. The Red Fuji apple trees of full bearing fruit are the researching objects. Ninety apple trees canopies spectral reflectance and LAI values were measured by the ASD Fieldspec3 spectrometer and LAI-2200 in thirty orchards in constant two years in Qixia research area of Shandong Province. The optimal vegetation indices were selected by the method of correlation analysis of the original spectral reflectance and vegetation indices. The models of predicting the LAI were built with the multivariate regression analysis method of support vector machine (SVM) and random forest (RF). The new vegetation indices, GNDVI527, ND-VI676, RVI682, FD-NVI656 and GRVI517 and the previous two main vegetation indices, NDVI670 and NDVI705, are in accordance with LAI. In the RF regression model, the calibration set decision coefficient C-R2 of 0.920 and validation set decision coefficient V-R2 of 0.889 are higher than the SVM regression model by 0.045 and 0.033 respectively. The root mean square error of calibration set C-RMSE of 0.249, the root mean square error validation set V-RMSE of 0.236 are lower than that of the SVM regression model by 0.054 and 0.058 respectively. Relative analysis of calibrating error C-RPD and relative analysis of validation set V-RPD reached 3.363 and 2.520, 0.598 and 0.262, respectively, which were higher than the SVM regression model. The measured and predicted the scatterplot trend line slope of the calibration set and validation set C-S and V-S are close to 1. The estimation result of RF regression model is better than that of the SVM. RF regression model can be used to estimate the LAI of red Fuji apple trees in full fruit period.
Validation of the questionnaire on hand function assessment in leprosy.
Ferreira, Telma Leonel; Alvarez, Rosicler Rocha Aiza; Virmond, Marcos da Cunha Lopes
2012-06-01
To validate the psychometric properties of the questionnaire on hand function assessment in leprosy. Study conducted with a convenience sample of 101 consecutive patients in Brasília (Central-Western Brazil), from June 2008 to July 2009. The individuals were adults affected by leprosy, with impairment of the ulnar, median and radial nerves. Interobservers and intraobserver reproducibility was analyzed through successive interviews, and construct validity was analyzed through association between age, clinical form of leprosy, duration of nerve injury, grip and pinch strength measured with a dynamometer, sensibility test performed with Semmes-Weinstein monofilaments and manual ability assessment using the Jebsen test of hand function. Pondered kappa coefficient was calculated and a Bland-Altman plot was constructed to assess the reproducibility of the instrument. For internal consistency, Cronbach's alpha coefficient was utilized. Pearson's correlation coefficient was calculated and a multiple regression model was used. The pondered kappa values for interobservers and intraobserver assessments ranged from 0.86 to 0.97 and from 0.85 to 0.97, respectively. The value of Cronbach's alpha coefficient was 0.967. Pearson's correlation coefficient showed an association (p < 0.001) among duration of nerve injury, grip and pinch strength, cutaneous sensibility and mean score in the Jebsen Test. The mean score of the questionnaire on hand functional assessment in leprosy was associated with operational classification of leprosy, duration of nerve injury, grip strength, cutaneous sensibility and manual ability (p < 0.0001 for the model as a whole). The questionnaire on hand functional assessment in leprosy presents almost perfect interobservers and intraobserver reproducibility, high internal consistency and correlation with operational classification of leprosy, duration of nerve injury, grip strength, cutaneous sensibility in the hands and manual ability.
Park, Su San; Lee, Ju Yul; Cho, Sung-Il
2007-07-01
We investigated the validity of the dipstick method (Mossman Associates Inc. USA) and the expired CO method to distinguish between smokers and nonsmokers. We also elucidated the related factors of the two methods. This study included 244 smokers and 50 ex-smokers, recruited from smoking cessation clinics at 4 local public health centers, who had quit for over 4 weeks. We calculated the sensitivity, specificity and Kappa coefficient of each method for validity. We obtained ROC curve, predictive value and agreement to determine the cutoff of expired air CO method. Finally, we elucidated the related factors and compared their effect powers using the standardized regression coefficient. The dipstick method showed a sensitivity of 92.6%, specificity of 96.0% and Kappa coefficient of 0.79. The best cutoff value to distinguish smokers was 5-6 ppm. At 5 ppm, the expired CO method showed a sensitivity of 94.3%, specificity of 82.0% and Kappa coefficient of 0.73. And at 6 ppm, sensitivity, specificity and Kappa coefficient were 88.5%, 86.0% and 0.64, respectively. Therefore, the dipstick method had higher sensitivity and specificity than the expired CO method. The dipstick and expired CO methods were significantly increased with increasing smoking amount. With longer time since the last smoking, expired CO showed a rapid decrease after 4 hours, whereas the dipstick method showed relatively stable levels for more than 4 hours. The dipstick and expired CO methods were both good indicators for assessing smoking status. However, the former showed higher sensitivity and specificity and stable levels over longer hours after smoking, compared to the expired CO method.
Spatial Autocorrelation Approaches to Testing Residuals from Least Squares Regression.
Chen, Yanguang
2016-01-01
In geo-statistics, the Durbin-Watson test is frequently employed to detect the presence of residual serial correlation from least squares regression analyses. However, the Durbin-Watson statistic is only suitable for ordered time or spatial series. If the variables comprise cross-sectional data coming from spatial random sampling, the test will be ineffectual because the value of Durbin-Watson's statistic depends on the sequence of data points. This paper develops two new statistics for testing serial correlation of residuals from least squares regression based on spatial samples. By analogy with the new form of Moran's index, an autocorrelation coefficient is defined with a standardized residual vector and a normalized spatial weight matrix. Then by analogy with the Durbin-Watson statistic, two types of new serial correlation indices are constructed. As a case study, the two newly presented statistics are applied to a spatial sample of 29 China's regions. These results show that the new spatial autocorrelation models can be used to test the serial correlation of residuals from regression analysis. In practice, the new statistics can make up for the deficiencies of the Durbin-Watson test.
Very-short-term wind power prediction by a hybrid model with single- and multi-step approaches
NASA Astrophysics Data System (ADS)
Mohammed, E.; Wang, S.; Yu, J.
2017-05-01
Very-short-term wind power prediction (VSTWPP) has played an essential role for the operation of electric power systems. This paper aims at improving and applying a hybrid method of VSTWPP based on historical data. The hybrid method is combined by multiple linear regressions and least square (MLR&LS), which is intended for reducing prediction errors. The predicted values are obtained through two sub-processes:1) transform the time-series data of actual wind power into the power ratio, and then predict the power ratio;2) use the predicted power ratio to predict the wind power. Besides, the proposed method can include two prediction approaches: single-step prediction (SSP) and multi-step prediction (MSP). WPP is tested comparatively by auto-regressive moving average (ARMA) model from the predicted values and errors. The validity of the proposed hybrid method is confirmed in terms of error analysis by using probability density function (PDF), mean absolute percent error (MAPE) and means square error (MSE). Meanwhile, comparison of the correlation coefficients between the actual values and the predicted values for different prediction times and window has confirmed that MSP approach by using the hybrid model is the most accurate while comparing to SSP approach and ARMA. The MLR&LS is accurate and promising for solving problems in WPP.
NASA Technical Reports Server (NTRS)
Edwards, T. R. (Inventor)
1985-01-01
Apparatus for doubling the data density rate of an analog to digital converter or doubling the data density storage capacity of a memory deviced is discussed. An interstitial data point midway between adjacent data points in a data stream having an even number of equal interval data points is generated by applying a set of predetermined one-dimensional convolute integer coefficients which can include a set of multiplier coefficients and a normalizer coefficient. Interpolator means apply the coefficients to the data points by weighting equally on each side of the center of the even number of equal interval data points to obtain an interstital point value at the center of the data points. A one-dimensional output data set, which is twice as dense as a one-dimensional equal interval input data set, can be generated where the output data set includes interstitial points interdigitated between adjacent data points in the input data set. The method for generating the set of interstital points is a weighted, nearest-neighbor, non-recursive, moving, smoothing averaging technique, equivalent to applying a polynomial regression calculation to the data set.
Composite marginal quantile regression analysis for longitudinal adolescent body mass index data.
Yang, Chi-Chuan; Chen, Yi-Hau; Chang, Hsing-Yi
2017-09-20
Childhood and adolescenthood overweight or obesity, which may be quantified through the body mass index (BMI), is strongly associated with adult obesity and other health problems. Motivated by the child and adolescent behaviors in long-term evolution (CABLE) study, we are interested in individual, family, and school factors associated with marginal quantiles of longitudinal adolescent BMI values. We propose a new method for composite marginal quantile regression analysis for longitudinal outcome data, which performs marginal quantile regressions at multiple quantile levels simultaneously. The proposed method extends the quantile regression coefficient modeling method introduced by Frumento and Bottai (Biometrics 2016; 72:74-84) to longitudinal data accounting suitably for the correlation structure in longitudinal observations. A goodness-of-fit test for the proposed modeling is also developed. Simulation results show that the proposed method can be much more efficient than the analysis without taking correlation into account and the analysis performing separate quantile regressions at different quantile levels. The application to the longitudinal adolescent BMI data from the CABLE study demonstrates the practical utility of our proposal. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Paseiro-Cerrato, Rafael; Tongchat, Chinawat; Franz, Roland
2016-05-01
This study evaluated the influence of parameters such as temperature and type of low-density polyethylene (LDPE) film on the log Kp/f values of seven model migrants in food simulants. Two different types of LDPE films contaminated by extrusion and immersion were placed in contact with three food simulants including 20% ethanol, 50% ethanol and olive oil under several time-temperature conditions. Results suggest that most log Kp/f values are little affected by these parameters in this study. In addition, the relation between log Kp/f and log Po/w was established for each food simulant and regression lines, as well as correlation coefficients, were calculated. Correlations were compared with data from real foodstuffs. Data presented in this study could be valuable in assigning certain foods to particular food simulants as well as predicting the mass transfer of potential migrants into different types of food or food simulants, avoiding tedious and expensive laboratory analysis. The results could be especially useful for regulatory agencies as well as for the food industry.
Yuan, Jintao; Yu, Shuling; Zhang, Ting; Yuan, Xuejie; Cao, Yunyuan; Yu, Xingchen; Yang, Xuan; Yao, Wu
2016-06-01
Octanol/water (K(OW)) and octanol/air (K(OA)) partition coefficients are two important physicochemical properties of organic substances. In current practice, K(OW) and K(OA) values of some polychlorinated biphenyls (PCBs) are measured using generator column method. Quantitative structure-property relationship (QSPR) models can serve as a valuable alternative method of replacing or reducing experimental steps in the determination of K(OW) and K(OA). In this paper, two different methods, i.e., multiple linear regression based on dragon descriptors and hologram quantitative structure-activity relationship, were used to predict generator-column-derived log K(OW) and log K(OA) values of PCBs. The predictive ability of the developed models was validated using a test set, and the performances of all generated models were compared with those of three previously reported models. All results indicated that the proposed models were robust and satisfactory and can thus be used as alternative models for the rapid assessment of the K(OW) and K(OA) of PCBs. Copyright © 2016 Elsevier Inc. All rights reserved.
Li, Lin; Xu, Shuo; An, Xin; Zhang, Lu-Da
2011-10-01
In near infrared spectral quantitative analysis, the precision of measured samples' chemical values is the theoretical limit of those of quantitative analysis with mathematical models. However, the number of samples that can obtain accurately their chemical values is few. Many models exclude the amount of samples without chemical values, and consider only these samples with chemical values when modeling sample compositions' contents. To address this problem, a semi-supervised LS-SVR (S2 LS-SVR) model is proposed on the basis of LS-SVR, which can utilize samples without chemical values as well as those with chemical values. Similar to the LS-SVR, to train this model is equivalent to solving a linear system. Finally, the samples of flue-cured tobacco were taken as experimental material, and corresponding quantitative analysis models were constructed for four sample compositions' content(total sugar, reducing sugar, total nitrogen and nicotine) with PLS regression, LS-SVR and S2 LS-SVR. For the S2 LS-SVR model, the average relative errors between actual values and predicted ones for the four sample compositions' contents are 6.62%, 7.56%, 6.11% and 8.20%, respectively, and the correlation coefficients are 0.974 1, 0.973 3, 0.923 0 and 0.948 6, respectively. Experimental results show the S2 LS-SVR model outperforms the other two, which verifies the feasibility and efficiency of the S2 LS-SVR model.
DFT study on oxidation of HS(CH2) m SH ( m = 1-8) in oxidative desulfurization
NASA Astrophysics Data System (ADS)
Song, Y. Z.; Song, J. J.; Zhao, T. T.; Chen, C. Y.; He, M.; Du, J.
2016-06-01
Density functional theory was employed for calculation of HS(CH2) m SH ( m = 1-8) and its derivatives at B3LYP method at 6-31++g ( d, p) level. Using eigenvalues of LUMO and HOMO for HS(CH2) m SH, the standard electrode potentials were estimated by a stepwise multiple regression techniques (MLR), and obtained as E° = 1.500 + 7.167 × 10-3 HOMO-0.229 LUMO with high correlation coefficients of 0.973 and F values of 43.973.
NASA Technical Reports Server (NTRS)
Righter, K.; Leeman, W. P.; Hervig, R. L.
2006-01-01
Partitioning of Ni, Co and V between Cr-rich spinels and basaltic melt has been studied experimentally between 1150 and 1325 C, and at controlled oxygen fugacity from the Co-CoO buffer to slightly above the hematite magnetite buffer. These new results, together with new Ni, Co and V analyses of experimental run products from Leeman [Leeman, W.P., 1974. Experimental determination of the partitioning of divalent cations between olivine and basaltic liquid, Pt. II. PhD thesis, Univ. Oregon, 231 - 337.], show that experimentally determined spinel melt partition coefficients (D) are dependent upon temperature (T), oxygen fugacity (fO2) and spinel composition. In particular, partition coefficients determined on doped systems are higher than those in natural (undoped) systems, perhaps due to changing activity coefficients over the composition range defined by the experimental data. Using our new results and published runs (n =85), we obtain a multilinear regression equation that predicts experimental D(V) values as a function of T, fO2, concentration of V in melt and spinel composition. This equation allows prediction of D(V) spinel/melt values for natural mafic liquids at relevant crystallization conditions. Similarly, D(Ni) and D(Co) values can be inferred from our experiments at redox conditions approaching the QFM buffer, temperatures of 1150 to 1250 C and spinel composition (early Cr-bearing and later Ti-magnetite) appropriate for basic magma differentiation. When coupled with major element modelling of liquid lines of descent, these values (D(Ni) sp/melt=10 and D(Co) sp/melt=5) closely reproduce the compositional variation observed in komatiite, mid-ocean ridge basalt (MORB), ocean island basalt (OIB) and basalt to rhyolite suites.
Evaluation of keratoconus progression.
Shajari, Mehdi; Steinwender, Gernot; Herrmann, Kim; Kubiak, Kate Barbara; Pavlovic, Ivana; Plawetzki, Elena; Schmack, Ingo; Kohnen, Thomas
2018-06-01
To define variables for the evaluation of keratoconus progression and to determine cut-off values. In this retrospective cohort study (2010-2016), 265 eyes of 165 patients diagnosed with keratoconus underwent two Scheimpflug measurements (Pentacam) that took place 1 year apart ±3 months. Variables used for keratoconus detection were evaluated for progression and a correlation analysis was performed. By logistic regression analysis, a keratoconus progression index (KPI) was defined. Receiver-operating characteristic curve (ROC) analysis was performed and Youden Index calculated to determine cut-off values. Variables used for keratoconus detection showed a weak correlation with each other (eg, correlation r=0.245 between RPImin and Kmax, p<0.001). Therefore, we used parameters that took several variables into consideration (eg, D-index, index of surface variance, index for height asymmetry, KPI). KPI was defined by logistic regression and consisted of a Pachymin coefficient of -0.78 (p=0.001), a maximum elevation of back surface coefficient of 0.27 and coefficient of corneal curvature at the zone 3 mm away from the thinnest point on the posterior corneal surface of -12.44 (both p<0.001). The two variables with the highest Youden Index in the ROC analysis were D-index and KPI: D-index had a cut-off of 0.4175 (70.6% sensitivity) and Youden Index of 0.606. Cut-off for KPI was -0.78196 (84.7% sensitivity) and a Youden Index of 0.747; both 90% specificity. Keratoconus progression should be defined by evaluating parameters that consider several corneal changes; we suggest D-index and KPI to detect progression. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Preliminary Survey on TRY Forest Traits and Growth Index Relations - New Challenges
NASA Astrophysics Data System (ADS)
Lyubenova, Mariyana; Kattge, Jens; van Bodegom, Peter; Chikalanov, Alexandre; Popova, Silvia; Zlateva, Plamena; Peteva, Simona
2016-04-01
Forest ecosystems provide critical ecosystem goods and services, including food, fodder, water, shelter, nutrient cycling, and cultural and recreational value. Forests also store carbon, provide habitat for a wide range of species and help alleviate land degradation and desertification. Thus they have a potentially significant role to play in climate change adaptation planning through maintaining ecosystem services and providing livelihood options. Therefore the study of forest traits is such an important issue not just for individual countries but for the planet as a whole. We need to know what functional relations between forest traits exactly can express TRY data base and haw it will be significant for the global modeling and IPBES. The study of the biodiversity characteristics at all levels and functional links between them is extremely important for the selection of key indicators for assessing biodiversity and ecosystem services for sustainable natural capital control. By comparing the available information in tree data bases: TRY, ITR (International Tree Ring) and SP-PAM the 42 tree species are selected for the traits analyses. The dependence between location characteristics (latitude, longitude, altitude, annual precipitation, annual temperature and soil type) and forest traits (specific leaf area, leaf weight ratio, wood density and growth index) is studied by by multiply regression analyses (RDA) using the statistical software package Canoco 4.5. The Pearson correlation coefficient (measure of linear correlation), Kendal rank correlation coefficient (non parametric measure of statistical dependence) and Spearman correlation coefficient (monotonic function relationship between two variables) are calculated for each pair of variables (indexes) and species. After analysis of above mentioned correlation coefficients the dimensional linear regression models, multidimensional linear and nonlinear regression models and multidimensional neural networks models are built. The strongest dependence between It and WD was obtained. The research will support the work on: Strategic Plan for Biodiversity 2011-2020, modelling and implementation of ecosystem-based approaches to climate change adaptation and disaster risk reduction. Key words: Specific leaf area (SLA), Leaf weight ratio (LWR), Wood density (WD), Growth index (It)
NASA Astrophysics Data System (ADS)
Cambra-López, María; Winkel, Albert; Mosquera, Julio; Ogink, Nico W. M.; Aarnink, André J. A.
2015-06-01
The objective of this study was to compare co-located real-time light scattering devices and equivalent gravimetric samplers in poultry and pig houses for PM10 mass concentration, and to develop animal-specific calibration factors for light scattering samplers. These results will contribute to evaluate the comparability of different sampling instruments for PM10 concentrations. Paired DustTrak light scattering device (DustTrak aerosol monitor, TSI, U.S.) and PM10 gravimetric cyclone sampler were used for measuring PM10 mass concentrations during 24 h periods (from noon to noon) inside animal houses. Sampling was conducted in 32 animal houses in the Netherlands, including broilers, broiler breeders, layers in floor and in aviary system, turkeys, piglets, growing-finishing pigs in traditional and low emission housing with dry and liquid feed, and sows in individual and group housing. A total of 119 pairs of 24 h measurements (55 for poultry and 64 for pigs) were recorded and analyzed using linear regression analysis. Deviations between samplers were calculated and discussed. In poultry, cyclone sampler and DustTrak data fitted well to a linear regression, with a regression coefficient equal to 0.41, an intercept of 0.16 mg m-3 and a correlation coefficient of 0.91 (excluding turkeys). Results in turkeys showed a regression coefficient equal to 1.1 (P = 0.49), an intercept of 0.06 mg m-3 (P < 0.0001) and a correlation coefficient of 0.98. In pigs, we found a regression coefficient equal to 0.61, an intercept of 0.05 mg m-3 and a correlation coefficient of 0.84. Measured PM10 concentrations using DustTraks were clearly underestimated (approx. by a factor 2) in both poultry and pig housing systems compared with cyclone pre-separators. Absolute, relative, and random deviations increased with concentration. DustTrak light scattering devices should be self-calibrated to investigate PM10 mass concentrations accurately in animal houses. We recommend linear regression equations as animal-specific calibration factors for DustTraks instead of manufacturer calibration factors, especially in heavily dusty environments such as animal houses.
Fellahi, Jean-Luc; Fischer, Marc-Olivier; Rebet, Olivier; Dalbera, Audrey; Massetti, Massimo; Gérard, Jean-Louis; Hanouz, Jean-Luc
2013-04-01
Little is known about changes in near-infrared spectroscopy (NIRS)-derived cerebral (rSO(2)b) and somatic (rSO(2)s) oxygen saturation during a fluid challenge. The authors tested the hypothesis that they could differ from central venous oxygen saturation (ScvO(2)) and from one site to another. A prospective observational study. A teaching university hospital. Fifty consecutive adult patients. Admission to the intensive care unit after cardiac surgery and investigation before and after a fluid challenge. Simultaneous comparative ScvO(2), rSO(2)b, and rSO(2)s data points were collected from a blood-gas analyzer and the EQUANOX monitor (Nonin Medical, Inc, Plymouth, MN). Correlations were determined by linear regression. Multiple stepwise linear regression models were used to assess independent variables associated with changes in ScvO(2), rSO(2)b, and rSO(2)s. A statistically significant relationship was found between absolute values of ScvO(2) and rSO(2)b (r = 0.42, p < 0.001) but not between absolute values of ScvO(2) and rSO(2)s (r = 0.18, p = 0.066). No relationship was found between percent changes in ScvO(2) and rSO(2)b (r = 0.05, p = 0.715) and between percent changes in ScvO(2) and rSO(2)s (r = 0.02, p = 0.886) after the fluid challenge. Cardiac index contributed to the prediction of changes in ScvO(2) (regression coefficient = -4.09, p = 0.006), whereas the mean arterial pressure contributed to the prediction of changes in rSO(2)b (regression coefficient = -0.05, p = 0.027). rSO(2)b and rSO(2)s cannot be used to provide noninvasive estimation of ScvO(2), and trends in rSO(2)b and rSO(2)s cannot be considered as noninvasive surrogates for the trend in ScvO(2) after cardiac surgery. Different independent variables contribute to the prediction of ScvO(2), rSO(2)b, and rSO(2)s. Copyright © 2013 Elsevier Inc. All rights reserved.
Sefiddashti, Sara Emamgholipour; Arab, Mohammad; Ghazanfari, Sadegh; Kazemi, Zhila; Rezaei, Satar; Karyani, Ali Kazemi
2016-07-01
Considering the scarcity of skilled workers in the health sector, the appropriate distribution of human resources in this sector is very important for improving people's health. Having information about the degree of equality in the distribution of health human resources and their time trends is necessary for better planning and efficient use of these resources. The aim of this study was to determine the trend of inequality in the allocation of human resources in the health sector in Tehran between 2007 and 2013. This cross-sectional study was conducted in Tehran Province in Iran. The inequality in the distribution of human resources (specialists, general practitioners, pharmacists, paramedics, dentists, nurses and community health workers (Behvarz)) in 10 cities in Tehran Province was investigated using the Gini coefficient and the dissimilarity index. The time trend of inequality was examined by regression analysis. The required data were collected from the statistical yearbook of the Iran Statistics Center (ISC). The highest value of the Gini coefficient (GC) was related to nurses (GC = 0.291) in 2007. The highest value of the Gini coefficient was related to nurses and Behvarzs in 2008 and 2009, respectively. The distribution of specialists had the highest inequality in 2010 (GC = 0.298), 2011 (GC = 0.300) and 2013 (GC = 0.316). General practitioners had the lowest Gini coefficient for 2007, 2008 and 2012. Nurses for 2009 and Behvarzs for 2010, 2011 and 2013 had the lowest value of Gini coefficient. The dissimilarity indexes for specialists and general practitioners were 26.64 and 8.72 in 2013, respectively. The means of this index for included resources were 31.35, 18.27, 16.91, 22.32, 15.82, 26.74, and 24.33, respectively. The time trend analysis showed that the coefficient of time was positive for all of the human resources, except Behvarzes, and only the coefficient of general practitioners was statistically significant ( p<0.01). Over time, inequalities in the distribution of resources in the health sector have been increasing. By developing the private sector and considering the trend of this sector to operate in the more developed regions, health policy makers should continually evaluate the distribution of human resources, and they should arrange a specific plan for the allocation of human resources in the health sector.
Manaster, Amanda D.; Domanski, Marian M.; Straub, Timothy D.; Boldt, Justin A.
2016-08-18
Acoustic technologies have the potential to be used as a surrogate for measuring suspended-sediment concentration (SSC). This potential was examined in a fine-grained (97-100 percent fines) riverine system in central Illinois by way of installation of an acoustic instrument. Acoustic data were collected continuously over the span of 5.5 years. Acoustic parameters were regressed against SSC data to determine the accuracy of using acoustic technology as a surrogate for measuring SSC in a fine-grained riverine system. The resulting regressions for SSC and sediment acoustic parameters had coefficients of determination ranging from 0.75 to 0.97 for various events and configurations. The overall Nash-Sutcliffe model-fit efficiency was 0.95 for the 132 observed and predicted SSC values determined using the sediment acoustic parameter regressions. The study of using acoustic technologies as a surrogate for measuring SSC in fine-grained riverine systems is ongoing. The results at this site are promising in the realm of surrogate technology.
Martínez Gila, Diego Manuel; Cano Marchal, Pablo; Gómez Ortega, Juan; Gámez García, Javier
2018-03-25
Normally the olive oil quality is assessed by chemical analysis according to international standards. These norms define chemical and organoleptic markers, and depending on the markers, the olive oil can be labelled as lampante, virgin, or extra virgin olive oil (EVOO), the last being an indicator of top quality. The polyphenol content is related to EVOO organoleptic features, and different scientific works have studied the positive influence that these compounds have on human health. The works carried out in this paper are focused on studying relations between the polyphenol content in olive oil samples and its spectral response in the near infrared spectra. In this context, several acquisition parameters have been assessed to optimize the measurement process within the virgin olive oil production process. The best regression model reached a mean error value of 156.14 mg/kg in leave one out cross validation, and the higher regression coefficient was 0.81 through holdout validation.
Automatic reconstruction of surge deposit thicknesses. Applications to some Italian volcanoes
NASA Astrophysics Data System (ADS)
Armienti, P.; Pareschi, M. T.
1987-04-01
The energy cone concept has been adopted to describe some kinds of surge deposits. The energy cone parameters (height and slope) are evaluated through a regression technique which utilizes deposit thicknesses and the correspondent quotes and heights of the energy cone. The regression also allows to evaluate a coefficient of proportionality linking the deposit thickness to the distance between topographic surface and energy line for a given eruption. Moreover, if an accurate topography is available (in this case a reconstruction of a digitalized topography of the Phlegrean Fields and of the Vesuvius), the energy cone parameters, obtained by the backfitted technique, can be used to evaluate the order of magnitude of the deposit volumes. The hazard map for a surge localized at the Solfatara (Phlegraean Fields, Naples) has been computed. The values of the energy cone parameters and the volume have been assumed to be equal to those estimated with the regression technique applied to a past surge eruption in the same area.
Age Estimation of Infants Through Metric Analysis of Developing Anterior Deciduous Teeth.
Viciano, Joan; De Luca, Stefano; Irurita, Javier; Alemán, Inmaculada
2018-01-01
This study provides regression equations for estimation of age of infants from the dimensions of their developing deciduous teeth. The sample comprises 97 individuals of known sex and age (62 boys, 35 girls), aged between 2 days and 1,081 days. The age-estimation equations were obtained for the sexes combined, as well as for each sex separately, thus including "sex" as an independent variable. The values of the correlations and determination coefficients obtained for each regression equation indicate good fits for most of the equations obtained. The "sex" factor was statistically significant when included as an independent variable in seven of the regression equations. However, the "sex" factor provided an advantage for age estimation in only three of the equations, compared to those that did not include "sex" as a factor. These data suggest that the ages of infants can be accurately estimated from measurements of their developing deciduous teeth. © 2017 American Academy of Forensic Sciences.
Cano Marchal, Pablo; Gómez Ortega, Juan; Gámez García, Javier
2018-01-01
Normally the olive oil quality is assessed by chemical analysis according to international standards. These norms define chemical and organoleptic markers, and depending on the markers, the olive oil can be labelled as lampante, virgin, or extra virgin olive oil (EVOO), the last being an indicator of top quality. The polyphenol content is related to EVOO organoleptic features, and different scientific works have studied the positive influence that these compounds have on human health. The works carried out in this paper are focused on studying relations between the polyphenol content in olive oil samples and its spectral response in the near infrared spectra. In this context, several acquisition parameters have been assessed to optimize the measurement process within the virgin olive oil production process. The best regression model reached a mean error value of 156.14 mg/kg in leave one out cross validation, and the higher regression coefficient was 0.81 through holdout validation. PMID:29587403
Poor methodological quality and reporting standards of systematic reviews in burn care management.
Wasiak, Jason; Tyack, Zephanie; Ware, Robert; Goodwin, Nicholas; Faggion, Clovis M
2017-10-01
The methodological and reporting quality of burn-specific systematic reviews has not been established. The aim of this study was to evaluate the methodological quality of systematic reviews in burn care management. Computerised searches were performed in Ovid MEDLINE, Ovid EMBASE and The Cochrane Library through to February 2016 for systematic reviews relevant to burn care using medical subject and free-text terms such as 'burn', 'systematic review' or 'meta-analysis'. Additional studies were identified by hand-searching five discipline-specific journals. Two authors independently screened papers, extracted and evaluated methodological quality using the 11-item A Measurement Tool to Assess Systematic Reviews (AMSTAR) tool and reporting quality using the 27-item Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist. Characteristics of systematic reviews associated with methodological and reporting quality were identified. Descriptive statistics and linear regression identified features associated with improved methodological quality. A total of 60 systematic reviews met the inclusion criteria. Six of the 11 AMSTAR items reporting on 'a priori' design, duplicate study selection, grey literature, included/excluded studies, publication bias and conflict of interest were reported in less than 50% of the systematic reviews. Of the 27 items listed for PRISMA, 13 items reporting on introduction, methods, results and the discussion were addressed in less than 50% of systematic reviews. Multivariable analyses showed that systematic reviews associated with higher methodological or reporting quality incorporated a meta-analysis (AMSTAR regression coefficient 2.1; 95% CI: 1.1, 3.1; PRISMA regression coefficient 6·3; 95% CI: 3·8, 8·7) were published in the Cochrane library (AMSTAR regression coefficient 2·9; 95% CI: 1·6, 4·2; PRISMA regression coefficient 6·1; 95% CI: 3·1, 9·2) and included a randomised control trial (AMSTAR regression coefficient 1·4; 95%CI: 0·4, 2·4; PRISMA regression coefficient 3·4; 95% CI: 0·9, 5·8). The methodological and reporting quality of systematic reviews in burn care requires further improvement with stricter adherence by authors to the PRISMA checklist and AMSTAR tool. © 2016 Medicalhelplines.com Inc and John Wiley & Sons Ltd.
Moyer, Douglas; Bennett, Mark
2007-01-01
The U.S. Geological Survey (USGS), U.S. Environmental Protection Agency (USEPA), Chesapeake Bay Program (CBP), Interstate Commission for the Potomac River Basin (ICPRB), Maryland Department of the Environment (MDE), Virginia Department of Conservation and Recreation (VADCR), and University of Maryland (UMD) are collaborating to improve the resolution of the Chesapeake Bay Regional Watershed Model (CBRWM). This watershed model uses the Hydrologic Simulation Program-Fortran (HSPF) to simulate the fate and transport of nutrients and sediment throughout the Chesapeake Bay watershed and extended areas of Virginia, Maryland, and Delaware. Information from the CBRWM is used by the CBP and other watershed managers to assess the effectiveness of water-quality improvement efforts as well as guide future management activities. A critical step in the improvement of the CBRWM framework was the development of an HSPF function table (FTABLE) for each represented stream channel. The FTABLE is used to relate stage (water depth) in a particular stream channel to associated channel surface area, channel volume, and discharge (streamflow). The primary tool used to generate an FTABLE for each stream channel is the XSECT program, a computer program that requires nine input variables used to represent channel morphology. These input variables are reach length, upstream and downstream elevation, channel bottom width, channel bankfull width, channel bankfull stage, slope of the floodplain, and Manning's roughness coefficient for the channel and floodplain. For the purpose of this study, the nine input variables were grouped into three categories: channel geometry, Manning's roughness coefficient, and channel and floodplain slope. Values of channel geometry for every stream segment represented in CBRWM were obtained by first developing regional regression models that relate basin drainage area to observed values of bankfull width, bankfull depth, and bottom width at each of the 290 USGS streamflow-gaging stations included in the areal extent of the model. These regression models were developed on the basis of data from stations in four physiographic provinces (Appalachian Plateaus, Valley and Ridge, Piedmont, and Coastal Plain) and were used to predict channel geometry for all 738 stream segments in the modeled area from associated basin drainage area. Manning's roughness coefficient for the channel and floodplain was represented in the XSECT program in two forms. First, all available field-estimated values of roughness were compiled for gaging stations in each physiographic province. The median of field-estimated values of channel and floodplain roughness for each physiographic province was applied to all respective stream segments. The second representation of Manning's roughness coefficient was to allow roughness to vary with channel depth. Roughness was estimated at each gaging station for each 1-foot depth interval. Median values of roughness were calculated for each 1-foot depth interval for all stations in each physiographic province. Channel and floodplain slope were determined for every stream segment in CBRWM using the USGS National Elevation Dataset. Function tables were generated by the XSECT program using values of channel geometry, channel and floodplain roughness, and channel and floodplain slope. The FTABLEs for each of the 290 USGS streamflow-gaging stations were evaluated by comparing observed discharge to the XSECT-derived discharge. Function table stream discharge derived using depth-varying roughness was found to be more representative of and statistically indistinguishable from values of observed stream discharge. Additionally, results of regression analysis showed that XSECT-derived discharge accounted for approximately 90 percent of the variability associated with observed discharge in each of the four physiographic provinces. The results of this study indicate that the methodology developed to generate FTABLEs for every s
Equations of prediction for abdominal fat in brown egg-laying hens fed different diets.
Souza, C; Jaimes, J J B; Gewehr, C E
2017-06-01
The objective was to use noninvasive measurements to formulate equations for predicting the abdominal fat weight of laying hens in a noninvasive manner. Hens were fed with different diets; the external body measurements of birds were used as regressors. We used 288 Hy-Line Brown laying hens, distributed in a completely randomized design in a factorial arrangement, submitted for 16 wk to 2 metabolizable energy levels (2,550 and 2,800 kcal/kg) and 3 levels of crude protein in the diet (150, 160, and 170 g/kg), totaling 6 treatments, with 48 hens each. Sixteen hens per treatment of 92 wk age were utilized to evaluate body weight, bird length, tarsus and sternum, greater and lesser diameter of the tarsus, and abdominal fat weight, after slaughter. The equations were obtained by using measures evaluated with regressors through simple and multiple linear regression with the stepwise method of indirect elimination (backward), with P < 0.10 for all variables remaining in the model. The weight of abdominal fat as predicted by the equations and observed values for each bird were subjected to Pearson's correlation analysis. The equations generated by energy levels showed coefficients of determination of 0.50 and 0.74 for 2,800 and 2,550 kcal/kg of metabolizable energy, respectively, with correlation coefficients of 0.71 and 0.84, with a highly significant correlation between the calculated and observed values of abdominal fat. For protein levels of 150, 160, and 170 g/kg in the diet, it was possible to obtain coefficients of determination of 0.75, 0.57, and 0.61, with correlation coefficients of 0.86, 0.75, and 0.78, respectively. Regarding the general equation for predicting abdominal fat weight, the coefficient of determination was 0.62; the correlation coefficient was 0.79. The equations for predicting abdominal fat weight in laying hens, based on external measurements of the birds, showed positive coefficients of determination and correlation coefficients, thus allowing researchers to determine abdominal fat weight in vivo. © 2016 Poultry Science Association Inc.
Solid harmonic wavelet scattering for predictions of molecule properties
NASA Astrophysics Data System (ADS)
Eickenberg, Michael; Exarchakis, Georgios; Hirn, Matthew; Mallat, Stéphane; Thiry, Louis
2018-06-01
We present a machine learning algorithm for the prediction of molecule properties inspired by ideas from density functional theory (DFT). Using Gaussian-type orbital functions, we create surrogate electronic densities of the molecule from which we compute invariant "solid harmonic scattering coefficients" that account for different types of interactions at different scales. Multilinear regressions of various physical properties of molecules are computed from these invariant coefficients. Numerical experiments show that these regressions have near state-of-the-art performance, even with relatively few training examples. Predictions over small sets of scattering coefficients can reach a DFT precision while being interpretable.
NASA Astrophysics Data System (ADS)
Qu, Yonghua; Jiao, Siong; Lin, Xudong
2008-10-01
Hetao Irrigation District located in Inner Mongolia, is one of the three largest irrigated area in China. In the irrigational agriculture region, for the reasons that many efforts have been put on irrigation rather than on drainage, as a result much sedimentary salt that usually is solved in water has been deposited in surface soil. So there has arisen a problem in such irrigation district that soil salinity has become a chief fact which causes land degrading. Remote sensing technology is an efficiency way to map the salinity in regional scale. In the principle of remote sensing, soil spectrum is one of the most important indications which can be used to reflect the status of soil salinity. In the past decades, many efforts have been made to reveal the spectrum characteristics of the salinized soil, such as the traditional statistic regression method. But it also has been found that when the hyper-spectral reflectance data are considered, the traditional regression method can't be treat the large dimension data, because the hyper-spectral data usually have too higher spectral band number. In this paper, a partial least squares regression (PLSR) model was established based on the statistical analysis on the soil salinity and the reflectance of hyper-spectral. Dataset were collect through the field soil samples were collected in the region of Hetao irrigation from the end of July to the beginning of August. The independent validation using data which are not included in the calibration model reveals that the proposed model can predicate the main soil components such as the content of total ions(S%), PH with higher determination coefficients(R2) of 0.728 and 0.715 respectively. And the rate of prediction to deviation(RPD) of the above predicted value are larger than 1.6, which indicates that the calibrated PLSR model can be used as a tool to retrieve soil salinity with accurate results. When the PLSR model's regression coefficients were aggregated according to the wavelength of visual (blue, green, red) and near infrared bands of LandSat Thematic Mapper(TM) sensor, some significant response values were observed, which indicates that the proposed method in this paper can be used to analysis the remotely sensed data from the space-boarded platform.
Jürgens, Julian H W; Schulz, Nadine; Wybranski, Christian; Seidensticker, Max; Streit, Sebastian; Brauner, Jan; Wohlgemuth, Walter A; Deuerling-Zheng, Yu; Ricke, Jens; Dudeck, Oliver
2015-02-01
The objective of this study was to compare the parameter maps of a new flat-panel detector application for time-resolved perfusion imaging in the angiography room (FD-CTP) with computed tomography perfusion (CTP) in an experimental tumor model. Twenty-four VX2 tumors were implanted into the hind legs of 12 rabbits. Three weeks later, FD-CTP (Artis zeego; Siemens) and CTP (SOMATOM Definition AS +; Siemens) were performed. The parameter maps for the FD-CTP were calculated using a prototype software, and those for the CTP were calculated with VPCT-body software on a dedicated syngo MultiModality Workplace. The parameters were compared using Pearson product-moment correlation coefficient and linear regression analysis. The Pearson product-moment correlation coefficient showed good correlation values for both the intratumoral blood volume of 0.848 (P < 0.01) and the blood flow of 0.698 (P < 0.01). The linear regression analysis of the perfusion between FD-CTP and CTP showed for the blood volume a regression equation y = 4.44x + 36.72 (P < 0.01) and for the blood flow y = 0.75x + 14.61 (P < 0.01). This preclinical study provides evidence that FD-CTP allows a time-resolved (dynamic) perfusion imaging of tumors similar to CTP, which provides the basis for clinical applications such as the assessment of tumor response to locoregional therapies directly in the angiography suite.
Qidwai, Tabish; Yadav, Dharmendra K; Khan, Feroz; Dhawan, Sangeeta; Bhakuni, R S
2012-01-01
This work presents the development of quantitative structure activity relationship (QSAR) model to predict the antimalarial activity of artemisinin derivatives. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Screening through QSAR model suggested that compounds A24, A24a, A53, A54, A62 and A64 possess significant antimalarial activity. Linear model is developed by the multiple linear regression method to link structures to their reported antimalarial activity. The correlation in terms of regression coefficient (r(2)) was 0.90 and prediction accuracy of model in terms of cross validation regression coefficient (rCV(2)) was 0.82. This study indicates that chemical properties viz., atom count (all atoms), connectivity index (order 1, standard), ring count (all rings), shape index (basic kappa, order 2), and solvent accessibility surface area are well correlated with antimalarial activity. The docking study showed high binding affinity of predicted active compounds against antimalarial target Plasmepsins (Plm-II). Further studies for oral bioavailability, ADMET and toxicity risk assessment suggest that compound A24, A24a, A53, A54, A62 and A64 exhibits marked antimalarial activity comparable to standard antimalarial drugs. Later one of the predicted active compound A64 was chemically synthesized, structure elucidated by NMR and in vivo tested in multidrug resistant strain of Plasmodium yoelii nigeriensis infected mice. The experimental results obtained agreed well with the predicted values.
Sowande, O S; Oyewale, B F; Iyasere, O S
2010-06-01
The relationships between live weight and eight body measurements of West African Dwarf (WAD) goats were studied using 211 animals under farm condition. The animals were categorized based on age and sex. Data obtained on height at withers (HW), heart girth (HG), body length (BL), head length (HL), and length of hindquarter (LHQ) were fitted into simple linear, allometric, and multiple-regression models to predict live weight from the body measurements according to age group and sex. Results showed that live weight, HG, BL, LHQ, HL, and HW increased with the age of the animals. In multiple-regression model, HG and HL best fit the model for goat kids; HG, HW, and HL for goat aged 13-24 months; while HG, LHQ, HW, and HL best fit the model for goats aged 25-36 months. Coefficients of determination (R(2)) values for linear and allometric models for predicting the live weight of WAD goat increased with age in all the body measurements, with HG being the most satisfactory single measurement in predicting the live weight of WAD goat. Sex had significant influence on the model with R(2) values consistently higher in females except the models for LHQ and HW.
Cuffless and Continuous Blood Pressure Estimation from the Heart Sound Signals
Peng, Rong-Chao; Yan, Wen-Rong; Zhang, Ning-Ling; Lin, Wan-Hua; Zhou, Xiao-Lin; Zhang, Yuan-Ting
2015-01-01
Cardiovascular disease, like hypertension, is one of the top killers of human life and early detection of cardiovascular disease is of great importance. However, traditional medical devices are often bulky and expensive, and unsuitable for home healthcare. In this paper, we proposed an easy and inexpensive technique to estimate continuous blood pressure from the heart sound signals acquired by the microphone of a smartphone. A cold-pressor experiment was performed in 32 healthy subjects, with a smartphone to acquire heart sound signals and with a commercial device to measure continuous blood pressure. The Fourier spectrum of the second heart sound and the blood pressure were regressed using a support vector machine, and the accuracy of the regression was evaluated using 10-fold cross-validation. Statistical analysis showed that the mean correlation coefficients between the predicted values from the regression model and the measured values from the commercial device were 0.707, 0.712, and 0.748 for systolic, diastolic, and mean blood pressure, respectively, and that the mean errors were less than 5 mmHg, with standard deviations less than 8 mmHg. These results suggest that this technique is of potential use for cuffless and continuous blood pressure monitoring and it has promising application in home healthcare services. PMID:26393591
Cuffless and Continuous Blood Pressure Estimation from the Heart Sound Signals.
Peng, Rong-Chao; Yan, Wen-Rong; Zhang, Ning-Ling; Lin, Wan-Hua; Zhou, Xiao-Lin; Zhang, Yuan-Ting
2015-09-17
Cardiovascular disease, like hypertension, is one of the top killers of human life and early detection of cardiovascular disease is of great importance. However, traditional medical devices are often bulky and expensive, and unsuitable for home healthcare. In this paper, we proposed an easy and inexpensive technique to estimate continuous blood pressure from the heart sound signals acquired by the microphone of a smartphone. A cold-pressor experiment was performed in 32 healthy subjects, with a smartphone to acquire heart sound signals and with a commercial device to measure continuous blood pressure. The Fourier spectrum of the second heart sound and the blood pressure were regressed using a support vector machine, and the accuracy of the regression was evaluated using 10-fold cross-validation. Statistical analysis showed that the mean correlation coefficients between the predicted values from the regression model and the measured values from the commercial device were 0.707, 0.712, and 0.748 for systolic, diastolic, and mean blood pressure, respectively, and that the mean errors were less than 5 mmHg, with standard deviations less than 8 mmHg. These results suggest that this technique is of potential use for cuffless and continuous blood pressure monitoring and it has promising application in home healthcare services.
Xiao, Z; Tang, Z; Qiang, J; Wang, S; Qian, W; Zhong, Y; Wang, R; Wang, J; Wu, L; Tang, W; Zhang, Z
2018-01-25
Intravoxel incoherent motion is a promising method for the differentiation of sinonasal lesions. This study aimed to evaluate the value of intravoxel incoherent motion in the differentiation of benign and malignant sinonasal lesions and to compare the diagnostic performance of intravoxel incoherent motion with that of conventional DWI. One hundred thirty-one patients with histologically proved solid sinonasal lesions (56 benign and 75 malignant) who underwent conventional DWI and intravoxel incoherent motion were recruited in this study. The diffusion coefficient ( D ), pseudodiffusion coefficient ( D *), and perfusion fraction ( f ) values derived from intravoxel incoherent motion and ADC values derived from conventional DWI were measured and compared between the 2 groups using the Student t test. Receiver operating characteristic curve analysis, logistic regression analysis, and 10-fold cross-validation were performed to evaluate the diagnostic performance of single-parametric and multiparametric models. The mean ADC and D values were significantly lower in malignant sinonasal lesions than in benign sinonasal lesions (both P < .001). The mean f value was higher in malignant lesions than in benign lesions ( P = .003). Multiparametric models can significantly improve the cross-validated areas under the curve for the differentiation of sinonasal lesions compared with single-parametric models (all corrected P < .05 except the D value). The model of D + f provided a better diagnostic performance than the ADC value (corrected P < .001). Intravoxel incoherent motion appears to be a more effective MR imaging technique than conventional DWI in the differentiation of benign and malignant sinonasal lesions. © 2018 by American Journal of Neuroradiology.
Grijalva-Eternod, Carlos S; Wells, Jonathan C K; Girma, Tsinuel; Kæstel, Pernille; Admassu, Bitiya; Friis, Henrik; Andersen, Gregers S
2015-09-01
A midupper arm circumference (MUAC) <115 mm and weight-for-height z score (WHZ) or weight-for-length z score (WLZ) less than -3, all of which are recommended to identify severe wasting in children, often identify different children. The reasons behind this poor agreement are not well understood. We investigated the association between these 2 anthropometric indexes and body composition to help understand why they identify different children as wasted. We analyzed weight, length, MUAC, fat-mass (FM), and fat-free mass (FFM) data from 2470 measurements from 595 healthy Ethiopian infants obtained at birth and at 1.5, 2.5, 3.5, 4.5, and 6 mo of age. We derived WLZs by using 2006 WHO growth standards. We derived length-adjusted FM and FFM values as unexplained residuals after regressing each FM and FFM against length. We used a correlation analysis to assess associations between length, FFM, and FM (adjusted and nonadjusted for length) and the MUAC and WLZ and a multivariable regression analysis to assess the independent variability of length and length-adjusted FM and FFM with either the MUAC or the WLZ as the outcome. At all ages, length showed consistently strong positive correlations with the MUAC but not with the WLZ. Adjustment for length reduced observed correlation coefficients of FM and FFM with the MUAC but increased those for the WLZ. At all ages, both length-adjusted FM and FFM showed an independent association with the WLZ and MUAC with higher regression coefficients for the WLZ. Conversely, length showed greater regression coefficients for the MUAC. At all ages, the MUAC was shown to be more influenced than was the WLZ by the FM variability relative to the FFM variability. The MUAC and WLZ have different associations with body composition, and length influences these associations differently. Our results suggest that the WLZ is a good marker of tissue masses independent of length. The MUAC acts more as a composite index of poor growth indexing jointly tissue masses and length. This trial was registered at www.controlled-trials.com as ISRCTN46718296. © 2015 American Society for Nutrition.
Fabre, Stéphanie; Clerson, Pierre; Launay, Jean-Marie; Gautier, Jean-François; Vidal-Trecan, Tiphaine; Riveline, Jean-Pierre; Platt, Adam; Abrahamsson, Anna; Miner, Jeffrey N; Hughes, Glen; Richette, Pascal; Bardin, Thomas
2018-05-02
The uric acid (UA) level in patients with gout is a key factor in disease management and is typically measured in the laboratory using plasma samples obtained after venous puncture. This study aimed to assess the reliability of immediate UA measurement with capillary blood samples obtained by fingertip puncture with the HumaSens plus point-of-care meter. UA levels were measured using both the HumaSens plus meter in the clinic and the routine plasma UA method in the biochemistry laboratory of 238 consenting diabetic patients. HumaSens plus capillary and routine plasma UA measurements were compared by linear regression, Bland-Altman plots, intraclass correlation coefficient (ICC), and Lin's concordance coefficient. Values outside the dynamic range of the meter, low (LO) or high (HI), were analyzed separately. The best capillary UA thresholds for detecting hyperuricemia were determined by receiver operating characteristic (ROC) curves. The impact of potential confounding factors (demographic and biological parameters/treatments) was assessed. Capillary and routine plasma UA levels were compared to reference plasma UA measurements by liquid chromatography-mass spectrometry (LC-MS) for a subgroup of 67 patients. In total, 205 patients had capillary and routine plasma UA measurements available. ICC was 0.90 (95% confidence interval (CI) 0.87-0.92), Lin's coefficient was 0.91 (0.88-0.93), and the Bland-Altman plot showed good agreement over all tested values. Overall, 17 patients showed values outside the dynamic range. LO values were concordant with plasma values, but HI values were considered uninterpretable. Capillary UA thresholds of 299 and 340 μmol/l gave the best results for detecting hyperuricemia (corresponding to routine plasma UA thresholds of 300 and 360 μmol/l, respectively). No significant confounding factor was found among those tested, except for hematocrit; however, this had a negligible influence on the assay reliability. When capillary and routine plasma results were discordant, comparison with LC-MS measurements showed that plasma measurements had better concordance: capillary UA, ICC 0.84 (95% CI 0.75-0.90), Lin's coefficient 0.84 (0.77-0.91); plasma UA, ICC 0.96 (0.94-0.98), Lin's coefficient 0.96 (0.94-0.98). UA measurements with the HumaSens plus meter were reasonably comparable with those of the laboratory assay. The meter is easy to use and may be useful in the clinic and in epidemiologic studies.
NASA Astrophysics Data System (ADS)
Bloomfield, J. P.; Allen, D. J.; Griffiths, K. J.
2009-06-01
SummaryLinear regression methods can be used to quantify geological controls on baseflow index (BFI). This is illustrated using an example from the Thames Basin, UK. Two approaches have been adopted. The areal extents of geological classes based on lithostratigraphic and hydrogeological classification schemes have been correlated with BFI for 44 'natural' catchments from the Thames Basin. When regression models are built using lithostratigraphic classes that include a constant term then the model is shown to have some physical meaning and the relative influence of the different geological classes on BFI can be quantified. For example, the regression constants for two such models, 0.64 and 0.69, are consistent with the mean observed BFI (0.65) for the Thames Basin, and the signs and relative magnitudes of the regression coefficients for each of the lithostratigraphic classes are consistent with the hydrogeology of the Basin. In addition, regression coefficients for the lithostratigraphic classes scale linearly with estimates of log 10 hydraulic conductivity for each lithological class. When a regression is built using a hydrogeological classification scheme with no constant term, the model does not have any physical meaning, but it has a relatively high adjusted R2 value and because of the continuous coverage of the hydrogeological classification scheme, the model can be used for predictive purposes. A model calibrated on the 44 'natural' catchments and using four hydrogeological classes (low-permeability surficial deposits, consolidated aquitards, fractured aquifers and intergranular aquifers) is shown to perform as well as a model based on a hydrology of soil types (BFIHOST) scheme in predicting BFI in the Thames Basin. Validation of this model using 110 other 'variably impacted' catchments in the Basin shows that there is a correlation between modelled and observed BFI. Where the observed BFI is significantly higher than modelled BFI the deviations can be explained by an exogenous factor, catchment urban area. It is inferred that this is may be due influences from sewage discharge, mains leakage, and leakage from septic tanks.
White, Sarah A; van den Broek, Nynke R
2004-05-30
Before introducing a new measurement tool it is necessary to evaluate its performance. Several statistical methods have been developed, or used, to evaluate the reliability and validity of a new assessment method in such circumstances. In this paper we review some commonly used methods. Data from a study that was conducted to evaluate the usefulness of a specific measurement tool (the WHO Colour Scale) is then used to illustrate the application of these methods. The WHO Colour Scale was developed under the auspices of the WHO to provide a simple portable and reliable method of detecting anaemia. This Colour Scale is a discrete interval scale, whereas the actual haemoglobin values it is used to estimate are on a continuous interval scale and can be measured accurately using electrical laboratory equipment. The methods we consider are: linear regression, correlation coefficients, paired t-tests plotting differences against mean values and deriving limits of agreement; kappa and weighted kappa statistics, sensitivity and specificity, an intraclass correlation coefficient and the repeatability coefficient. We note that although the definition and properties of each of these methods is well established inappropriate methods continue to be used in medical literature for assessing reliability and validity, as evidenced in the context of the evaluation of the WHO Colour Scale. Copyright 2004 John Wiley & Sons, Ltd.
Galloway, Joel M.
2014-01-01
The Red River of the North (hereafter referred to as “Red River”) Basin is an important hydrologic region where water is a valuable resource for the region’s economy. Continuous water-quality monitors have been operated by the U.S. Geological Survey, in cooperation with the North Dakota Department of Health, Minnesota Pollution Control Agency, City of Fargo, City of Moorhead, City of Grand Forks, and City of East Grand Forks at the Red River at Fargo, North Dakota, from 2003 through 2012 and at Grand Forks, N.Dak., from 2007 through 2012. The purpose of the monitoring was to provide a better understanding of the water-quality dynamics of the Red River and provide a way to track changes in water quality. Regression equations were developed that can be used to estimate concentrations and loads for dissolved solids, sulfate, chloride, nitrate plus nitrite, total phosphorus, and suspended sediment using explanatory variables such as streamflow, specific conductance, and turbidity. Specific conductance was determined to be a significant explanatory variable for estimating dissolved solids concentrations at the Red River at Fargo and Grand Forks. The regression equations provided good relations between dissolved solid concentrations and specific conductance for the Red River at Fargo and at Grand Forks, with adjusted coefficients of determination of 0.99 and 0.98, respectively. Specific conductance, log-transformed streamflow, and a seasonal component were statistically significant explanatory variables for estimating sulfate in the Red River at Fargo and Grand Forks. Regression equations provided good relations between sulfate concentrations and the explanatory variables, with adjusted coefficients of determination of 0.94 and 0.89, respectively. For the Red River at Fargo and Grand Forks, specific conductance, streamflow, and a seasonal component were statistically significant explanatory variables for estimating chloride. For the Red River at Grand Forks, a time component also was a statistically significant explanatory variable for estimating chloride. The regression equations for chloride at the Red River at Fargo provided a fair relation between chloride concentrations and the explanatory variables, with an adjusted coefficient of determination of 0.66 and the equation for the Red River at Grand Forks provided a relatively good relation between chloride concentrations and the explanatory variables, with an adjusted coefficient of determination of 0.77. Turbidity and streamflow were statistically significant explanatory variables for estimating nitrate plus nitrite concentrations at the Red River at Fargo and turbidity was the only statistically significant explanatory variable for estimating nitrate plus nitrite concentrations at Grand Forks. The regression equation for the Red River at Fargo provided a relatively poor relation between nitrate plus nitrite concentrations, turbidity, and streamflow, with an adjusted coefficient of determination of 0.46. The regression equation for the Red River at Grand Forks provided a fair relation between nitrate plus nitrite concentrations and turbidity, with an adjusted coefficient of determination of 0.73. Some of the variability that was not explained by the equations might be attributed to different sources contributing nitrates to the stream at different times. Turbidity, streamflow, and a seasonal component were statistically significant explanatory variables for estimating total phosphorus at the Red River at Fargo and Grand Forks. The regression equation for the Red River at Fargo provided a relatively fair relation between total phosphorus concentrations, turbidity, streamflow, and season, with an adjusted coefficient of determination of 0.74. The regression equation for the Red River at Grand Forks provided a good relation between total phosphorus concentrations, turbidity, streamflow, and season, with an adjusted coefficient of determination of 0.87. For the Red River at Fargo, turbidity and streamflow were statistically significant explanatory variables for estimating suspended-sediment concentrations. For the Red River at Grand Forks, turbidity was the only statistically significant explanatory variable for estimating suspended-sediment concentration. The regression equation at the Red River at Fargo provided a good relation between suspended-sediment concentration, turbidity, and streamflow, with an adjusted coefficient of determination of 0.95. The regression equation for the Red River at Grand Forks provided a good relation between suspended-sediment concentration and turbidity, with an adjusted coefficient of determination of 0.96.
Age-related apparent diffusion coefficient changes in the normal brain.
Watanabe, Memi; Sakai, Osamu; Ozonoff, Al; Kussman, Steven; Jara, Hernán
2013-02-01
To measure the mean diffusional age-related changes of the brain over the full human life span by using diffusion-weighted spin-echo single-shot echo-planar magnetic resonance (MR) imaging and sequential whole-brain apparent diffusion coefficient (ADC) histogram analysis and, secondarily, to build mathematical models of these normal age-related changes throughout human life. After obtaining institutional review board approval, a HIPAA-compliant retrospective search was conducted for brain MR imaging studies performed in 2007 for various clinical indications. Informed consent was waived. The brain data of 414 healthy subjects (189 males and 225 females; mean age, 33.7 years; age range, 2 days to 89.3 years) were obtained with diffusion-weighted spin-echo single-shot echo-planar MR imaging. ADC histograms of the whole brain were generated. ADC peak values, histogram widths, and intracranial volumes were plotted against age, and model parameters were estimated by using nonlinear regression. Four different stages were identified for aging changes in ADC peak values, as characterized by specific mathematical terms: There were age-associated exponential decays for the maturation period and the development period, a constant term for adulthood, and a linear increase for the senescence period. The age dependency of ADC peak value was simulated by using four-term six-coefficient function, including biexponential and linear terms. This model fit the data very closely (R(2) = 0.91). Brain diffusivity as a whole demonstrated age-related changes through four distinct periods of life. These results could contribute to establishing an ADC baseline of the normal brain, covering the full human life span.
Light scattering by dust and anthropogenic aerosol at a remote site in the Negev desert, Israel
NASA Astrophysics Data System (ADS)
Andreae, Tracey W.; Andreae, Meinrat O.; Ichoku, Charles; Maenhaut, Willy; Cafmeyer, Jan; Karnieli, Arnon; Orlovsky, Leah
2002-01-01
We investigated aerosol optical properties, mass concentration, and chemical composition over a 2 year period at a remote site in the Negev desert, Israel (Sde Boker, 30° 51'N, 34° 47'E, 470 m above sea level). Light-scattering measurements were made at three wavelengths (450, 550, and 700 nm), using an integrating nephelometer, and included the separate determination of the backscatter fraction. Aerosol coarse and fine fractions were collected with stacked filter units; mass concentrations were determined by weighing, and the chemical composition by proton-induced X-ray emission and instrumental neutron activation analysis. The total scattering coefficient at 550 nm showed a median of 66.7 Mm-1(mean value 75.2 Mm-1, standard deviation 41.7 Mm-1) typical of moderately polluted continental air masses. Values of 1000 Mm-1and higher were encountered during severe dust storm events. During the study period, 31 such dust events were detected. In addition to high scattering levels, they were characterized by a sharp drop in the Ångström coefficient (i.e., the spectral dispersion of the light scattering) to values near zero. Mass-scattering efficiencies were obtained by a multivariate regression of the scattering coefficients on dust, sulfate, and residual components. An analysis of the contributions of these components to the total scattering observed showed that anthropogenic aerosol accounted for about 70% of scattering. The rest was dominated by the effect of the large dust events mentioned above and of small dust episodes typically occurring during midafternoon.
NASA Astrophysics Data System (ADS)
Gao, Ming; Li, Shiwei
2017-05-01
Based on experimental data of the soybean yield and quality from 30 sampling points, a quantitative structure-activity relationship model (2D-QSAR) was established using the soil quality (elements, pH, organic matter content and cation exchange capacity) as independent variables and soybean yield or quality as the dependent variable, with SPSS software. During the modeling, the full data set (30 and 14 compounds) was divided into a training set (24 and 11 compounds) for model generation and a test set (6 and 3 compounds) for model validation. The R2 values of the resulting models and data were 0.826 and 0.808 for soybean yield and quality, respectively, and all regression coefficients were significant (P < 0.05). The correlation coefficient R2pred of observed values and predicted values of the soybean yield and soybean quality in the test set were 0.961 and 0.956, respectively, indicating that the models had a good predictive ability. Moreover, the Mo, Se, K, N and organic matter contents and the cation exchange capacity of soil had a positive effect on soybean production, and the B, Mo, Se, K and N contents and cation exchange coefficient had a positive effect on soybean quality. The results are instructive for enhancing soils to improve the yield and quality of soybean, and this method can also be used to study other crops or regions, providing a theoretical basis to improving the yield and quality of crops.
Influence of leucite content on slow crack growth of dental porcelains.
Cesar, Paulo F; Soki, Fabiana N; Yoshimura, Humberto N; Gonzaga, Carla C; Styopkin, Victor
2008-08-01
To determine the stress corrosion susceptibility coefficient, n, of seven dental porcelains (A: Ceramco I; B: Ceramco-II; C: Ceramco-III; D: d.Sign; E: Cerabien; F: Vitadur-Alpha; and G: Ultropaline) after aging in air or artificial saliva, and correlate results with leucite content (LC). Bars were fired according to manufacturers' instructions and polished before induction of cracks by a Vickers indenter (19.6N, 20s). Four specimens were stored in air/room temperature, and three in saliva/37 degrees C. Five indentations were made per specimen and crack lengths measured at the following times: approximately 0; 1; 3; 10; 30; 100; 300; 1000 and 3000 h. The stress corrosion coefficient n was calculated by linear regression analysis after plotting crack length as a function of time, considering that the slope of the curve was [2/(3n+2)]. Microstructural analysis was performed to determine LC. LC of the porcelains were 22% (A and B); 6% (C); 15% (D); 0% (E and F); and 13% (G). Except for porcelains A and D, all materials showed a decrease in their n values when stored in artificial saliva. However, the decrease was more pronounced for porcelains B, F, and G. Ranking of materials varied according to storage media (in air, porcelain G showed higher n compared to A, while in saliva both showed similar coefficients). No correlation was found between n values and LC in air or saliva. Storage media influenced the n value obtained for most of the materials. LC did not affect resistance to slow crack growth regardless of the test environment.
Hewitt, Angela L; Popa, Laurentiu S; Pasalar, Siavash; Hendrix, Claudia M; Ebner, Timothy J
2011-11-01
Encoding of movement kinematics in Purkinje cell simple spike discharge has important implications for hypotheses of cerebellar cortical function. Several outstanding questions remain regarding representation of these kinematic signals. It is uncertain whether kinematic encoding occurs in unpredictable, feedback-dependent tasks or kinematic signals are conserved across tasks. Additionally, there is a need to understand the signals encoded in the instantaneous discharge of single cells without averaging across trials or time. To address these questions, this study recorded Purkinje cell firing in monkeys trained to perform a manual random tracking task in addition to circular tracking and center-out reach. Random tracking provides for extensive coverage of kinematic workspaces. Direction and speed errors are significantly greater during random than circular tracking. Cross-correlation analyses comparing hand and target velocity profiles show that hand velocity lags target velocity during random tracking. Correlations between simple spike firing from 120 Purkinje cells and hand position, velocity, and speed were evaluated with linear regression models including a time constant, τ, as a measure of the firing lead/lag relative to the kinematic parameters. Across the population, velocity accounts for the majority of simple spike firing variability (63 ± 30% of R(adj)(2)), followed by position (28 ± 24% of R(adj)(2)) and speed (11 ± 19% of R(adj)(2)). Simple spike firing often leads hand kinematics. Comparison of regression models based on averaged vs. nonaveraged firing and kinematics reveals lower R(adj)(2) values for nonaveraged data; however, regression coefficients and τ values are highly similar. Finally, for most cells, model coefficients generated from random tracking accurately estimate simple spike firing in either circular tracking or center-out reach. These findings imply that the cerebellum controls movement kinematics, consistent with a forward internal model that predicts upcoming limb kinematics.
Gene set analysis using variance component tests.
Huang, Yen-Tsung; Lin, Xihong
2013-06-28
Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.
Suzuki, Hideaki; Tabata, Takahisa; Koizumi, Hiroki; Hohchi, Nobusuke; Takeuchi, Shoko; Kitamura, Takuro; Fujino, Yoshihisa; Ohbuchi, Toyoaki
2014-12-01
This study aimed to create a multiple regression model for predicting hearing outcomes of idiopathic sudden sensorineural hearing loss (ISSNHL). The participants were 205 consecutive patients (205 ears) with ISSNHL (hearing level ≥ 40 dB, interval between onset and treatment ≤ 30 days). They received systemic steroid administration combined with intratympanic steroid injection. Data were examined by simple and multiple regression analyses. Three hearing indices (percentage hearing improvement, hearing gain, and posttreatment hearing level [HLpost]) and 7 prognostic factors (age, days from onset to treatment, initial hearing level, initial hearing level at low frequencies, initial hearing level at high frequencies, presence of vertigo, and contralateral hearing level) were included in the multiple regression analysis as dependent and explanatory variables, respectively. In the simple regression analysis, the percentage hearing improvement, hearing gain, and HLpost showed significant correlation with 2, 5, and 6 of the 7 prognostic factors, respectively. The multiple correlation coefficients were 0.396, 0.503, and 0.714 for the percentage hearing improvement, hearing gain, and HLpost, respectively. Predicted values of HLpost calculated by the multiple regression equation were reliable with 70% probability with a 40-dB-width prediction interval. Prediction of HLpost by the multiple regression model may be useful to estimate the hearing prognosis of ISSNHL. © The Author(s) 2014.
NASA Astrophysics Data System (ADS)
Yoshida, Kenichiro; Nishidate, Izumi; Ojima, Nobutoshi; Iwata, Kayoko
2014-01-01
To quantitatively evaluate skin chromophores over a wide region of curved skin surface, we propose an approach that suppresses the effect of the shading-derived error in the reflectance on the estimation of chromophore concentrations, without sacrificing the accuracy of that estimation. In our method, we use multiple regression analysis, assuming the absorbance spectrum as the response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as the predictor variables. The concentrations of melanin and total hemoglobin are determined from the multiple regression coefficients using compensation formulae (CF) based on the diffuse reflectance spectra derived from a Monte Carlo simulation. To suppress the shading-derived error, we investigated three different combinations of multiple regression coefficients for the CF. In vivo measurements with the forearm skin demonstrated that the proposed approach can reduce the estimation errors that are due to shading-derived errors in the reflectance. With the best combination of multiple regression coefficients, we estimated that the ratio of the error to the chromophore concentrations is about 10%. The proposed method does not require any measurements or assumptions about the shape of the subjects; this is an advantage over other studies related to the reduction of shading-derived errors.
Bedload and Total Load Sediment Transport Equations for Rough Open-Channel Flow
NASA Astrophysics Data System (ADS)
Abrahams, A. D.; Gao, P.
2001-12-01
The total sediment load transported by an open-channel flow may be divided into bedload and suspended load. Bedload transport occurs by saltation at low shear stress and by sheetflow at high shear stress. Dimensional analysis is used to identify the dimensionless variables that control the transport rate of noncohesive sediments over a plane bed, and regression analysis is employed to isolate the significant variables and determine the values of the coefficients. In the general bedload transport equation (i.e. for saltation and sheetflow) the dimensionless bedload transport rate is a function of the dimensionless shear stress, the friction factor, and an efficiency coefficient. For sheetflow the last term approaches 1, so that the bedload transport rate becomes a function of just the dimensionless shear stress and the friction factor. The dimensional analysis indicates that the dimensionless total load transport rate is a function of the dimensionless bedload transport rate and the dimensionless settling velocity of the sediment. Predicted values of the transport rates are graphed against the computed values of these variables for 505 flume experiments reported in the literature. These graphs indicate that the equations developed in this study give good unbiased predictions of both the bedload transport rate and total load transport rate over a wide range of conditions.
NASA Astrophysics Data System (ADS)
Escalante-Aburto, Anayansi; de Dios Figueroa-Cárdenas, Juan; Véles-Medina, José Juan; Ponce-García, Néstor; Hernández-Estrada, Zorba Josué; Rayas-Duarte, Patricia; Simsek, Senay
2017-07-01
Little attention has been given to the influence of non-gluten components on the viscoelastic properties of wheat flour dough, bread making process and their products. The aim of this study was to evaluate by creep tests the viscoelastic properties of tablets manufactured from Osborne solubility fractions (globulins, gliadins, glutenins, albumins and residue), pentosans, flour and bread. Hard and soft wheat cultivars were used to prepare the reconstituted tablets. Sintered tablets (except flour and bread) showed similar values to those obtained from the sum of the regression coefficients of the fractions. Gliadins and albumins accounted for about 54% of the total elasticity. Gliadins contributed with almost half of the total viscosity (45.7%), and showed the highest value for the viscosity coefficient of the viscous element. When the effect of dilution was evaluated, the residue showed the highest instantaneous elastic modulus (788.2 MPa). Retardation times of the first element (λ1 3.5 s) were about 10 times lower than the second element (λ2 39.3 s). The analysis of compliance of data corrected by protein content in flour showed that the residue fraction presented the highest values. An important contribution of non-gluten components (starch, albumins and globulins) on the viscoelastic performance of sintered tablets from Osborne fractions, flour and bread was found.
Correlation and simple linear regression.
Zou, Kelly H; Tuncali, Kemal; Silverman, Stuart G
2003-06-01
In this tutorial article, the concepts of correlation and regression are reviewed and demonstrated. The authors review and compare two correlation coefficients, the Pearson correlation coefficient and the Spearman rho, for measuring linear and nonlinear relationships between two continuous variables. In the case of measuring the linear relationship between a predictor and an outcome variable, simple linear regression analysis is conducted. These statistical concepts are illustrated by using a data set from published literature to assess a computed tomography-guided interventional technique. These statistical methods are important for exploring the relationships between variables and can be applied to many radiologic studies.
Exact Analysis of Squared Cross-Validity Coefficient in Predictive Regression Models
ERIC Educational Resources Information Center
Shieh, Gwowen
2009-01-01
In regression analysis, the notion of population validity is of theoretical interest for describing the usefulness of the underlying regression model, whereas the presumably more important concept of population cross-validity represents the predictive effectiveness for the regression equation in future research. It appears that the inference…
Smith, S. Jerrod; Lewis, Jason M.; Graves, Grant M.
2015-09-28
Generalized-least-squares multiple-linear regression analysis was used to formulate regression relations between peak-streamflow frequency statistics and basin characteristics. Contributing drainage area was the only basin characteristic determined to be statistically significant for all percentage of annual exceedance probabilities and was the only basin characteristic used in regional regression equations for estimating peak-streamflow frequency statistics on unregulated streams in and near the Oklahoma Panhandle. The regression model pseudo-coefficient of determination, converted to percent, for the Oklahoma Panhandle regional regression equations ranged from about 38 to 63 percent. The standard errors of prediction and the standard model errors for the Oklahoma Panhandle regional regression equations ranged from about 84 to 148 percent and from about 76 to 138 percent, respectively. These errors were comparable to those reported for regional peak-streamflow frequency regression equations for the High Plains areas of Texas and Colorado. The root mean square errors for the Oklahoma Panhandle regional regression equations (ranging from 3,170 to 92,000 cubic feet per second) were less than the root mean square errors for the Oklahoma statewide regression equations (ranging from 18,900 to 412,000 cubic feet per second); therefore, the Oklahoma Panhandle regional regression equations produce more accurate peak-streamflow statistic estimates for the irrigated period of record in the Oklahoma Panhandle than do the Oklahoma statewide regression equations. The regression equations developed in this report are applicable to streams that are not substantially affected by regulation, impoundment, or surface-water withdrawals. These regression equations are intended for use for stream sites with contributing drainage areas less than or equal to about 2,060 square miles, the maximum value for the independent variable used in the regression analysis.
Annadurai, Gurusamy; Ling, Lai Yi; Lee, Jiunn-Fwu
2008-02-28
In this work, a four-level Box-Behnken factorial design was employed combining with response surface methodology (RSM) to optimize the medium composition for the degradation of phenol by pseudomonas putida (ATCC 31800). A mathematical model was then developed to show the effect of each medium composition and their interactions on the biodegradation of phenol. Response surface method was using four levels like glucose, yeast extract, ammonium sulfate and sodium chloride, which also enabled the identification of significant effects of interactions for the batch studies. The biodegradation of phenol on Pseudomonas putida (ATCC 31800) was determined to be pH-dependent and the maximum degradation capacity of microorganism at 30 degrees C when the phenol concentration was 0.2 g/L and the pH of the solution was 7.0. Second order polynomial regression model was used for analysis of the experiment. Cubic and quadratic terms were incorporated into the regression model through variable selection procedures. The experimental values are in good agreement with predicted values and the correlation coefficient was found to be 0.9980.
Raman spectroscopy based screening of IgG positive and negative sera for dengue virus infection
NASA Astrophysics Data System (ADS)
Bilal, M.; Saleem, M.; Bial, Maria; Khan, Saranjam; Ullah, Rahat; Ali, Hina; Ahmed, M.; Ikram, Masroor
2017-11-01
A quantitative analysis for the screening of immunoglobulin-G (IgG) positive human sera samples is presented for the dengue virus infection. The regression model was developed using 79 samples while 20 samples were used to test the performance of the model. The R-square (r 2) value of 0.91 was found through a leave-one-sample-out cross validation method, which shows the validity of this model. This model incorporates the molecular changes associated with IgG. Molecular analysis based on regression coefficients revealed that myristic acid, coenzyme-A, alanine, arabinose, arginine, vitamin C, carotene, fumarate, galactosamine, glutamate, lactic acid, stearic acid, tryptophan and vaccenic acid are positively correlated with IgG; while amide III, collagen, proteins, fatty acids, phospholipids and fucose are negatively correlated. For blindly tested samples, an excellent agreement has been found between the model predicted, and the clinical values of IgG. The parameters, which include sensitivity, specificity, accuracy and the area under the receiver operator characteristic curve, are found to be 100%, 83.3%, 95% and 0.99, respectively, which confirms the high quality of the model.
Fadzillah, Nurrulhidayah Ahmad; Man, Yaakob bin Che; Rohman, Abdul; Rosman, Arieff Salleh; Ismail, Amin; Mustafa, Shuhaimi; Khatib, Alfi
2015-01-01
The authentication of food products from the presence of non-allowed components for certain religion like lard is very important. In this study, we used proton Nuclear Magnetic Resonance ((1)H-NMR) spectroscopy for the analysis of butter adulterated with lard by simultaneously quantification of all proton bearing compounds, and consequently all relevant sample classes. Since the spectra obtained were too complex to be analyzed visually by the naked eyes, the classification of spectra was carried out.The multivariate calibration of partial least square (PLS) regression was used for modelling the relationship between actual value of lard and predicted value. The model yielded a highest regression coefficient (R(2)) of 0.998 and the lowest root mean square error calibration (RMSEC) of 0.0091% and root mean square error prediction (RMSEP) of 0.0090, respectively. Cross validation testing evaluates the predictive power of the model. PLS model was shown as good models as the intercept of R(2)Y and Q(2)Y were 0.0853 and -0.309, respectively.
Societal Value of Surgery for Facial Reanimation.
Su, Peiyi; Ishii, Lisa E; Joseph, Andrew; Nellis, Jason; Dey, Jacob; Bater, Kristin; Byrne, Patrick J; Boahene, Kofi D O; Ishii, Masaru
2017-03-01
Patients with facial paralysis are perceived negatively by society in a number of domains. Society's perception of the health utility of varying degrees of facial paralysis and the value society places on reconstructive surgery for facial reanimation need to be quantified. To measure health state utility of varying degrees of facial paralysis, willingness to pay (WTP) for a repair, and the subsequent value of facial reanimation surgery as perceived by society. This prospective observational study conducted in an academic tertiary referral center evaluated a group of 348 casual observers who viewed images of faces with unilateral facial paralysis of 3 severity levels (low, medium, and high) categorized by House-Brackmann grade. Structural equation modeling was performed to understand associations among health utility metrics, WTP, and facial perception domains. Data were collected from July 16 to September 26, 2015. Observer-rated (1) quality of life (QOL) using established health utility metrics (standard gamble, time trade-off, and a visual analog scale) and (2) their WTP for surgical repair. Among the 348 observers (248 women [71.3%]; 100 men [28.7%]; mean [SD] age, 29.3 [11.6] years), mixed-effects linear regression showed that WTP increased nonlinearly with increasing severity of paralysis. Participants were willing to pay $3487 (95% CI, $2362-$4961) to repair low-grade paralysis, $8571 (95% CI, $6401-$11 234) for medium-grade paralysis, and $20 431 (95% CI, $16 273-$25 317) for high-grade paralysis. The dominant factor affecting the participants' WTP was perceived QOL. Modeling showed that perceived QOL decreased with paralysis severity (regression coefficient, -0.004; 95% CI, -0.005 to -0.004; P < .001) and increased with attractiveness (regression coefficient, 0.002; 95% CI, 0.002 to 0.003; P < .001). Mean (SD) health utility scores calculated by the standard gamble metric for low- and high-grade paralysis were 0.98 (0.09) and 0.77 (0.25), respectively. Time trade-off and visual analog scale measures were highly correlated. We calculated mean (SD) WTP per quality-adjusted life-year, which ranged from $10 167 ($14 565) to $17 008 ($38 288) for low- to high-grade paralysis, respectively. Society perceives the repair of facial paralysis to be a high-value intervention. Societal WTP increases and perceived health state utility decreases with increasing House-Brackmann grade. This study demonstrates the usefulness of WTP as an objective measure to inform dimensions of disease severity and signal the value society places on proper facial function. NA.
NASA Astrophysics Data System (ADS)
Wheeler, David C.; Waller, Lance A.
2009-03-01
In this paper, we compare and contrast a Bayesian spatially varying coefficient process (SVCP) model with a geographically weighted regression (GWR) model for the estimation of the potentially spatially varying regression effects of alcohol outlets and illegal drug activity on violent crime in Houston, Texas. In addition, we focus on the inherent coefficient shrinkage properties of the Bayesian SVCP model as a way to address increased coefficient variance that follows from collinearity in GWR models. We outline the advantages of the Bayesian model in terms of reducing inflated coefficient variance, enhanced model flexibility, and more formal measuring of model uncertainty for prediction. We find spatially varying effects for alcohol outlets and drug violations, but the amount of variation depends on the type of model used. For the Bayesian model, this variation is controllable through the amount of prior influence placed on the variance of the coefficients. For example, the spatial pattern of coefficients is similar for the GWR and Bayesian models when a relatively large prior variance is used in the Bayesian model.
Sheehan, Kenneth R.; Strager, Michael P.; Welsh, Stuart A.
2013-01-01
Stream habitat assessments are commonplace in fish management, and often involve nonspatial analysis methods for quantifying or predicting habitat, such as ordinary least squares regression (OLS). Spatial relationships, however, often exist among stream habitat variables. For example, water depth, water velocity, and benthic substrate sizes within streams are often spatially correlated and may exhibit spatial nonstationarity or inconsistency in geographic space. Thus, analysis methods should address spatial relationships within habitat datasets. In this study, OLS and a recently developed method, geographically weighted regression (GWR), were used to model benthic substrate from water depth and water velocity data at two stream sites within the Greater Yellowstone Ecosystem. For data collection, each site was represented by a grid of 0.1 m2 cells, where actual values of water depth, water velocity, and benthic substrate class were measured for each cell. Accuracies of regressed substrate class data by OLS and GWR methods were calculated by comparing maps, parameter estimates, and determination coefficient r 2. For analysis of data from both sites, Akaike’s Information Criterion corrected for sample size indicated the best approximating model for the data resulted from GWR and not from OLS. Adjusted r 2 values also supported GWR as a better approach than OLS for prediction of substrate. This study supports GWR (a spatial analysis approach) over nonspatial OLS methods for prediction of habitat for stream habitat assessments.
Shafizadeh-Moghadam, Hossein; Valavi, Roozbeh; Shahabi, Himan; Chapi, Kamran; Shirzadi, Ataollah
2018-07-01
In this research, eight individual machine learning and statistical models are implemented and compared, and based on their results, seven ensemble models for flood susceptibility assessment are introduced. The individual models included artificial neural networks, classification and regression trees, flexible discriminant analysis, generalized linear model, generalized additive model, boosted regression trees, multivariate adaptive regression splines, and maximum entropy, and the ensemble models were Ensemble Model committee averaging (EMca), Ensemble Model confidence interval Inferior (EMciInf), Ensemble Model confidence interval Superior (EMciSup), Ensemble Model to estimate the coefficient of variation (EMcv), Ensemble Model to estimate the mean (EMmean), Ensemble Model to estimate the median (EMmedian), and Ensemble Model based on weighted mean (EMwmean). The data set covered 201 flood events in the Haraz watershed (Mazandaran province in Iran) and 10,000 randomly selected non-occurrence points. Among the individual models, the Area Under the Receiver Operating Characteristic (AUROC), which showed the highest value, belonged to boosted regression trees (0.975) and the lowest value was recorded for generalized linear model (0.642). On the other hand, the proposed EMmedian resulted in the highest accuracy (0.976) among all models. In spite of the outstanding performance of some models, nevertheless, variability among the prediction of individual models was considerable. Therefore, to reduce uncertainty, creating more generalizable, more stable, and less sensitive models, ensemble forecasting approaches and in particular the EMmedian is recommended for flood susceptibility assessment. Copyright © 2018 Elsevier Ltd. All rights reserved.
Ronco, Nicolás R; Menestrina, Fiorella; Romero, Lílian M; Castells, Cecilia B
2017-06-09
In this paper, we report gas-liquid partition constants for thirty-five volatile organic solutes in the room temperature ionic liquid trihexyl(tetradecyl)phosphonium bromide measured by gas-liquid chromatography using capillary columns. The relative contribution of gas-liquid partition and interfacial adsorption to retention was evaluated through the use of columns with different the phase ratio. Four capillary columns with exactly known phase ratios were constructed and employed to measure the solute retention factors at four temperatures between 313.15 and 343.15K. The partition coefficients were calculated from the slopes of the linear regression between solute retention factors and the reciprocal of phase ratio at a given temperature according to the gas-liquid chromatographic theory. Gas-liquid interfacial adsorption was detected for a few solutes and it has been considered for the calculations of partition coefficient. Reliable solute's infinite dilution activity coefficients can be obtained when retention data are determined by a unique partitioning mechanism. The partial molar excess enthalpies at infinite dilution have been estimated from the dependence of experimental values of solute activity coefficients with the column temperature. A thorough discussion of the uncertainties of the experimental measurements and the main advantages of the use of capillary columns to acquire the aforementioned relevant thermodynamic information was performed. Copyright © 2017 Elsevier B.V. All rights reserved.
[Effects of carbon components of fine particulate matter (PM2.5) on atherogenic index of plasma].
Fan, Jiao; Qin, Xiaolei; Xue, Xiaodan; Han, Bin; Bai, Zhipeng; Tang, Naijun; Zhang, Liwen
2014-01-01
To evaluate associations between carbon constituents of fine particulate matter (PM2.5) and atherogenic index of plasma (AIP). We collected subjects from two communities by a system sampling, and 112 people aged over 60 years old without cardiovascular disease were recruited. The levels of cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), low density lipoprotein cholesterol (LDL-C) of objects, and personal exposure to PM2.5 were measured on December, 2011. Total carbon (TC), organic carbon (OC) and elemental carbon (EC) of PM2.5 were detected and AIP was calculated according to its definition. The value of AIP among the 112 subjects was 0.05 ± 0.26. Personal exposure concentration of PM2.5 and its carbon components (TC,OC and EC) were (164.75 ± 110.67), (53.86 ± 29.65), (44.93 ± 26.37) and (9.49 ± 5.75) µg/m(3), respectively. The Pearson analysis showed the linear relationship between TC,OC,EC and AIP, all significant positive correlations. The correlation coefficients were TC (r = 0.307, P < 0.05),OC (r = 0.287, P < 0.05) and EC (r = 0.252, P < 0.05), respectively. The multiple logistic regression analysis showed that when the AIP risk categories were selected as dependent variable and low risk group as reference group, the regression coefficient of TC,OC and EC was separately 1.03 (95%CI:1.01-1.05), 1.03 (95%CI:1.01-1.05), 1.12 (95%CI:1.02-1.22) in the high risk group; while there was no statistical significance of the regression coefficient and OR in the middle risk group. There was stable associations between the carbon constituents (TC,OC and EC) of fine Particulate Matter (PM2.5) and AIP. The findings suggested that carbon components of PM2.5 should be considered as risk factors of atherogenic.
NASA Astrophysics Data System (ADS)
Sanchez Rivera, Yamil
The purpose of this study is to add to what we know about the affective domain and to create a valid instrument for future studies. The Motivation to Learn Science (MLS) Inventory is based on Krathwohl's Taxonomy of Affective Behaviors (Krathwohl et al., 1964). The results of the Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA) demonstrated that the MLS Inventory is a valid and reliable instrument. Therefore, the MLS Inventory is a uni-dimensional instrument composed of 9 items with convergent validity (no divergence). The instrument had a high Chronbach Alpha value of .898 during the EFA analysis and .919 with the CFA analysis. Factor loadings on the 9 items ranged from .617 to .800. Standardized regression weights ranged from .639 to .835 in the CFA analysis. Various indices (RMSEA = .033; NFI = .987; GFI = .985; CFI = 1.000) demonstrated a good fitness of the proposed model. Hierarchical linear modeling was used to statistical analyze data where students' motivation to learn science scores (level-1) were nested within teachers (level-2). The analysis was geared toward identifying if teachers' use of affective behavior (a level-2 classroom variable) was significantly related with students' MLS scores (level-1 criterion variable). Model testing proceeded in three phases: intercept-only model, means-as-outcome model, and a random-regression coefficient model. The intercept-only model revealed an intra-class correlation coefficient of .224 with an estimated reliability of .726. Therefore, data suggested that only 22.4% of the variance in MLS scores is between-classes and the remaining 77.6% is at the student-level. Due to the significant variance in MLS scores, X2(62.756, p<.0001), teachers' TAB scores were added as a level-2 predictor. The regression coefficient was non-significant (p>.05). Therefore, the teachers' self-reported use of affective behaviors was not a significant predictor of students' motivation to learn science.
Sherrouse, Benson C.; Riegle, Jodi L.; Semmens, Darius J.
2010-01-01
In response to the need for incorporating quantified and spatially explicit measures of social values into ecosystem services assessments, the Rocky Mountain Geographic Science Center, in collaboration with Colorado State University, has developed a geographic information system application, Social Values for Ecosystem Services (SolVES). SolVES can be used to assess, map, and quantify the perceived social values of ecosystem services. SolVES derives a quantitative social values metric, the Value Index, from a combination of spatial and nonspatial responses to public attitude and preference surveys. SolVES also generates landscape metrics, such as average elevation and distance to water, calculated from spatial data layers describing the underlying physical environment. Using kernel density calculations and zonal statistics, SolVES derives and maps the 10-point Value Index and reports landscape metrics associated with each index value for social value types such as aesthetics, biodiversity, and recreation. This can be repeated for various survey subgroups as distinguished by their attitudes and preferences regarding public uses of the forests such as motorized recreation and logging for fuels reduction. The Value Index provides a basis of comparison within and among survey subgroups to consider the effect of social contexts on the valuation of ecosystem services. SolVES includes regression coefficients linking the predicted value (the Value Index) to landscape metrics. These coefficients are used to generate predicted social value maps using value transfer techniques for areas where primary survey data are not available. SolVES was developed, and will continue to be enhanced through future versions, as a public domain tool to enable decision makers and researchers to map the social values of ecosystem services and to facilitate discussions among diverse stakeholders regarding tradeoffs between different ecosystem services in a variety of physical and social contexts.
Comparison of radar backscatter from Antarctic and Arctic sea ice
NASA Technical Reports Server (NTRS)
Hosseinmostafa, R.; Lytle, V.
1992-01-01
Two ship-based step-frequency radars, one at C-band (5.3 GHz) and one at Ku-band (13.9 GHz), measured backscatter from ice in the Weddell Sea. Most of the backscatter data were from first-year (FY) and second-year (SY) ice at the ice stations where the ship was stationary and detailed snow and ice characterizations were performed. The presence of a slush layer at the snow-ice interface masks the distinction between FY and SY ice in the Weddell Sea, whereas in the Arctic the separation is quite distinct. The effect of snow-covered ice on backscattering coefficients (sigma0) from the Weddell Sea region indicates that surface scattering is the dominant factor. Measured sigma0 values were compared with Kirchhoff and regression-analysis models. The Weibull power-density function was used to fit the measured backscattering coefficients at 45 deg.
Wang, Anxin; Li, Zhifang; Yang, Yuling; Chen, Guojuan; Wang, Chunxue; Wu, Yuntao; Ruan, Chunyu; Liu, Yan; Wang, Yilong; Wu, Shouling
2016-01-01
To investigate the relationship between baseline systolic blood pressure (SBP) and visit-to-visit blood pressure variability in a general population. This is a prospective longitudinal cohort study on cardiovascular risk factors and cardiovascular or cerebrovascular events. Study participants attended a face-to-face interview every 2 years. Blood pressure variability was defined using the standard deviation and coefficient of variation of all SBP values at baseline and follow-up visits. The coefficient of variation is the ratio of the standard deviation to the mean SBP. We used multivariate linear regression models to test the relationships between SBP and standard deviation, and between SBP and coefficient of variation. Approximately 43,360 participants (mean age: 48.2±11.5 years) were selected. In multivariate analysis, after adjustment for potential confounders, baseline SBPs <120 mmHg were inversely related to standard deviation (P<0.001) and coefficient of variation (P<0.001). In contrast, baseline SBPs ≥140 mmHg were significantly positively associated with standard deviation (P<0.001) and coefficient of variation (P<0.001). Baseline SBPs of 120-140 mmHg were associated with the lowest standard deviation and coefficient of variation. The associations between baseline SBP and standard deviation, and between SBP and coefficient of variation during follow-ups showed a U curve. Both lower and higher baseline SBPs were associated with increased blood pressure variability. To control blood pressure variability, a good target SBP range for a general population might be 120-139 mmHg.
NASA Astrophysics Data System (ADS)
Kiss, I.; Cioată, V. G.; Alexa, V.; Raţiu, S. A.
2017-05-01
The braking system is one of the most important and complex subsystems of railway vehicles, especially when it comes for safety. Therefore, installing efficient safe brakes on the modern railway vehicles is essential. Nowadays is devoted attention to solving problems connected with using high performance brake materials and its impact on thermal and mechanical loading of railway wheels. The main factor that influences the selection of a friction material for railway applications is the performance criterion, due to the interaction between the brake block and the wheel produce complex thermos-mechanical phenomena. In this work, the investigated subjects are the cast-iron brake shoes, which are still widely used on freight wagons. Therefore, the cast-iron brake shoes - with lamellar graphite and with a high content of phosphorus (0.8-1.1%) - need a special investigation. In order to establish the optimal condition for the cast-iron brake shoes we proposed a mathematical modelling study by using the statistical analysis and multiple regression equations. Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. Multivariate visualization comes to the fore when researchers have difficulties in comprehending many dimensions at one time. Technological data (hardness and chemical composition) obtained from cast-iron brake shoes were used for this purpose. In order to settle the multiple correlation between the hardness of the cast-iron brake shoes, and the chemical compositions elements several model of regression equation types has been proposed. Because a three-dimensional surface with variables on three axes is a common way to illustrate multivariate data, in which the maximum and minimum values are easily highlighted, we plotted graphical representation of the regression equations in order to explain interaction of the variables and locate the optimal level of each variable for maximal response. For the calculation of the regression coefficients, dispersion and correlation coefficients, the software Matlab was used.
NASA Astrophysics Data System (ADS)
Wilson, Barry T.; Knight, Joseph F.; McRoberts, Ronald E.
2018-03-01
Imagery from the Landsat Program has been used frequently as a source of auxiliary data for modeling land cover, as well as a variety of attributes associated with tree cover. With ready access to all scenes in the archive since 2008 due to the USGS Landsat Data Policy, new approaches to deriving such auxiliary data from dense Landsat time series are required. Several methods have previously been developed for use with finer temporal resolution imagery (e.g. AVHRR and MODIS), including image compositing and harmonic regression using Fourier series. The manuscript presents a study, using Minnesota, USA during the years 2009-2013 as the study area and timeframe. The study examined the relative predictive power of land cover models, in particular those related to tree cover, using predictor variables based solely on composite imagery versus those using estimated harmonic regression coefficients. The study used two common non-parametric modeling approaches (i.e. k-nearest neighbors and random forests) for fitting classification and regression models of multiple attributes measured on USFS Forest Inventory and Analysis plots using all available Landsat imagery for the study area and timeframe. The estimated Fourier coefficients developed by harmonic regression of tasseled cap transformation time series data were shown to be correlated with land cover, including tree cover. Regression models using estimated Fourier coefficients as predictor variables showed a two- to threefold increase in explained variance for a small set of continuous response variables, relative to comparable models using monthly image composites. Similarly, the overall accuracies of classification models using the estimated Fourier coefficients were approximately 10-20 percentage points higher than the models using the image composites, with corresponding individual class accuracies between six and 45 percentage points higher.
Kim, Mimi; Kang, Tae Wook; Kim, Young Kon; Kim, Seong Hyun; Kwon, Wooil; Ha, Sang Yun; Ji, Sang A
2016-03-01
To evaluate the correlation between grade of pancreatic neuroendocrine tumours (pNETs) based on the 2010 World Health Organization (WHO) classification and the apparent diffusion coefficient (ADC), and to assess whether the ADC value and WHO classification can predict recurrence-free survival (RFS) after surgery for pNETs. This retrospective study was approved by the Institutional Review Board. The requirement for informed consent was waived. Between March 2009 and November 2014, forty-nine patients who underwent magnetic resonance (MR) imaging with diffusion-weighted image and subsequent surgery for single pNETs were included. Correlations among qualitative MR imaging findings, quantitative ADC values, and WHO classifications were assessed. An ordered logistic regression test was used to control for tumour size as a confounding factor. The association between ADC value (or WHO classification) and RFS was analysed. All tumors (n=49) were classified as low- (n=29, grade 1), intermediate- (n=17, grade 2), and high-grade (n=3, grade 3), respectively. The mean ADC of pNETs was moderately negatively correlated with WHO classification before and after adjustment for tumour size (ρ=-0.64, p<0.001 and ρ=-0.55, p=0.001 respectively). RFS was significantly associated with WHO classification (p=0.007), but not with the ADC value (p=0.569). The ADC value of pNETs is moderately correlated with WHO tumour grade, regardless of tumour size. However, the WHO tumour classification of pNET may be more suitable for predicting RFS than the ADC value. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
SPSS and SAS programs for comparing Pearson correlations and OLS regression coefficients.
Weaver, Bruce; Wuensch, Karl L
2013-09-01
Several procedures that use summary data to test hypotheses about Pearson correlations and ordinary least squares regression coefficients have been described in various books and articles. To our knowledge, however, no single resource describes all of the most common tests. Furthermore, many of these tests have not yet been implemented in popular statistical software packages such as SPSS and SAS. In this article, we describe all of the most common tests and provide SPSS and SAS programs to perform them. When they are applicable, our code also computes 100 × (1 - α)% confidence intervals corresponding to the tests. For testing hypotheses about independent regression coefficients, we demonstrate one method that uses summary data and another that uses raw data (i.e., Potthoff analysis). When the raw data are available, the latter method is preferred, because use of summary data entails some loss of precision due to rounding.
NASA Astrophysics Data System (ADS)
Zhai, Mengting; Chen, Yan; Li, Jing; Zhou, Jun
2017-12-01
The molecular electrongativity distance vector (MEDV-13) was used to describe the molecular structure of benzyl ether diamidine derivatives in this paper, Based on MEDV-13, The three-parameter (M 3, M 15, M 47) QSAR model of insecticidal activity (pIC 50) for 60 benzyl ether diamidine derivatives was constructed by leaps-and-bounds regression (LBR) . The traditional correlation coefficient (R) and the cross-validation correlation coefficient (R CV ) were 0.975 and 0.971, respectively. The robustness of the regression model was validated by Jackknife method, the correlation coefficient R were between 0.971 and 0.983. Meanwhile, the independent variables in the model were tested to be no autocorrelation. The regression results indicate that the model has good robust and predictive capabilities. The research would provide theoretical guidance for the development of new generation of anti African trypanosomiasis drugs with efficiency and low toxicity.
Zheng, Qi; Peng, Limin
2016-01-01
Quantile regression provides a flexible platform for evaluating covariate effects on different segments of the conditional distribution of response. As the effects of covariates may change with quantile level, contemporaneously examining a spectrum of quantiles is expected to have a better capacity to identify variables with either partial or full effects on the response distribution, as compared to focusing on a single quantile. Under this motivation, we study a general adaptively weighted LASSO penalization strategy in the quantile regression setting, where a continuum of quantile index is considered and coefficients are allowed to vary with quantile index. We establish the oracle properties of the resulting estimator of coefficient function. Furthermore, we formally investigate a BIC-type uniform tuning parameter selector and show that it can ensure consistent model selection. Our numerical studies confirm the theoretical findings and illustrate an application of the new variable selection procedure. PMID:28008212
Liu, Ruixin; Zhang, Xiaodong; Zhang, Lu; Gao, Xiaojie; Li, Huiling; Shi, Junhan; Li, Xuelin
2014-06-01
The aim of this study was to predict the bitterness intensity of a drug using an electronic tongue (e-tongue). The model drug of berberine hydrochloride was used to establish a bitterness prediction model (BPM), based on the taste evaluation of bitterness intensity by a taste panel, the data provided by the e-tongue and a genetic algorithm-back-propagation neural network (GA-BP) modeling method. The modeling characteristics of the GA-BP were compared with those of multiple linear regression, partial least square regression and BP methods. The determination coefficient of the BPM was 0.99965±0.00004, the root mean square error of cross-validation was 0.1398±0.0488 and the correlation coefficient of the cross-validation between the true and predicted values was 0.9959±0.0027. The model is superior to the other three models based on these indicators. In conclusion, the model established in this study has a high fitting degree and may be used for the bitterness prediction modeling of berberine hydrochloride of different concentrations. The model also provides a reference for the generation of BPMs of other drugs. Additionally, the algorithm of the study is able to conduct a rapid and accurate quantitative analysis of the data provided by the e-tongue.
LIU, RUIXIN; ZHANG, XIAODONG; ZHANG, LU; GAO, XIAOJIE; LI, HUILING; SHI, JUNHAN; LI, XUELIN
2014-01-01
The aim of this study was to predict the bitterness intensity of a drug using an electronic tongue (e-tongue). The model drug of berberine hydrochloride was used to establish a bitterness prediction model (BPM), based on the taste evaluation of bitterness intensity by a taste panel, the data provided by the e-tongue and a genetic algorithm-back-propagation neural network (GA-BP) modeling method. The modeling characteristics of the GA-BP were compared with those of multiple linear regression, partial least square regression and BP methods. The determination coefficient of the BPM was 0.99965±0.00004, the root mean square error of cross-validation was 0.1398±0.0488 and the correlation coefficient of the cross-validation between the true and predicted values was 0.9959±0.0027. The model is superior to the other three models based on these indicators. In conclusion, the model established in this study has a high fitting degree and may be used for the bitterness prediction modeling of berberine hydrochloride of different concentrations. The model also provides a reference for the generation of BPMs of other drugs. Additionally, the algorithm of the study is able to conduct a rapid and accurate quantitative analysis of the data provided by the e-tongue. PMID:24926369
Yoneoka, Daisuke; Henmi, Masayuki
2017-06-01
Recently, the number of regression models has dramatically increased in several academic fields. However, within the context of meta-analysis, synthesis methods for such models have not been developed in a commensurate trend. One of the difficulties hindering the development is the disparity in sets of covariates among literature models. If the sets of covariates differ across models, interpretation of coefficients will differ, thereby making it difficult to synthesize them. Moreover, previous synthesis methods for regression models, such as multivariate meta-analysis, often have problems because covariance matrix of coefficients (i.e. within-study correlations) or individual patient data are not necessarily available. This study, therefore, proposes a brief explanation regarding a method to synthesize linear regression models under different covariate sets by using a generalized least squares method involving bias correction terms. Especially, we also propose an approach to recover (at most) threecorrelations of covariates, which is required for the calculation of the bias term without individual patient data. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Population heterogeneity in the salience of multiple risk factors for adolescent delinquency.
Lanza, Stephanie T; Cooper, Brittany R; Bray, Bethany C
2014-03-01
To present mixture regression analysis as an alternative to more standard regression analysis for predicting adolescent delinquency. We demonstrate how mixture regression analysis allows for the identification of population subgroups defined by the salience of multiple risk factors. We identified population subgroups (i.e., latent classes) of individuals based on their coefficients in a regression model predicting adolescent delinquency from eight previously established risk indices drawn from the community, school, family, peer, and individual levels. The study included N = 37,763 10th-grade adolescents who participated in the Communities That Care Youth Survey. Standard, zero-inflated, and mixture Poisson and negative binomial regression models were considered. Standard and mixture negative binomial regression models were selected as optimal. The five-class regression model was interpreted based on the class-specific regression coefficients, indicating that risk factors had varying salience across classes of adolescents. Standard regression showed that all risk factors were significantly associated with delinquency. Mixture regression provided more nuanced information, suggesting a unique set of risk factors that were salient for different subgroups of adolescents. Implications for the design of subgroup-specific interventions are discussed. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Retro-regression--another important multivariate regression improvement.
Randić, M
2001-01-01
We review the serious problem associated with instabilities of the coefficients of regression equations, referred to as the MRA (multivariate regression analysis) "nightmare of the first kind". This is manifested when in a stepwise regression a descriptor is included or excluded from a regression. The consequence is an unpredictable change of the coefficients of the descriptors that remain in the regression equation. We follow with consideration of an even more serious problem, referred to as the MRA "nightmare of the second kind", arising when optimal descriptors are selected from a large pool of descriptors. This process typically causes at different steps of the stepwise regression a replacement of several previously used descriptors by new ones. We describe a procedure that resolves these difficulties. The approach is illustrated on boiling points of nonanes which are considered (1) by using an ordered connectivity basis; (2) by using an ordering resulting from application of greedy algorithm; and (3) by using an ordering derived from an exhaustive search for optimal descriptors. A novel variant of multiple regression analysis, called retro-regression (RR), is outlined showing how it resolves the ambiguities associated with both "nightmares" of the first and the second kind of MRA.
Hidaka, Nobuhiro; Murata, Masaharu; Sasahara, Jun; Ishii, Keisuke; Mitsuda, Nobuaki
2015-05-01
Observed/expected lung area to head circumference ratio (o/e LHR) and lung to thorax transverse area ratio (LTR) are the sonographic indicators of postnatal outcome in fetuses with congenital diaphragmatic hernia (CDH), and they are not influenced by gestational age. We aimed to evaluate the relationship between these two parameters in the same subjects with fetal left-sided CDH. Fetuses with left-sided CDH managed between 2005 and 2012 were included. Data of LTR and o/e LHR values measured on the same day prior to 33 weeks' gestation in target fetuses were retrospectively collected. The correlation between the two parameters was estimated using the Spearman's rank-correlation coefficient, and linear regression analysis was used to assess the relationship between them. Data on 61 measurements from 36 CDH fetuses were analyzed to obtain a Spearman's rank-correlation coefficient of 0.74 with the following linear equation: LTR = 0.002 × (o/e LHR) + 0.005. The determination coefficient of this linear equation was sufficiently high at 0.712, and the prediction accuracy obtained with this regression formula was considered satisfactory. A good linear correlation between the LTR and the o/e LHR was obtained, suggesting that we can translate the predictive parameters for each other. This information is expected to be useful to improve our understanding of different investigations focusing on LTR or o/e LHR as a predictor of postnatal outcome in CDH. © 2014 Japanese Teratology Society.
NASA Astrophysics Data System (ADS)
Makama, Ezekiel Kaura; Lim, Hwee San; Abdullah, Khiruddin
2018-01-01
Precipitable water vapor (PWV) is a highly variable, but important greenhouse gas that regulates the radiation budget of the earth. Its variability in time and space makes it difficult to quantify. Knowledge of its vertical distribution, in particular, is crucial for many reasons. In this study, empirical relationships between isobaric layers of PWV over Peninsular Malaysia are examined. Analysis of variance (ANOVA) technique on Advanced Television and Infrared Observation Satellite Operational Vertical Sounder (ATOVS) observations, from 2005 to 2011, has been used to propose a relationship of the form, W=α(WL)β for the middle (MW) and upper (UW) layers PWV. W is either MW or UW with α and β as regression coefficients, which are functions of latitude. Coefficients of determination (R2) and root mean square error (RMSE) of respective values between 0.75-0.86 and 1.65-2.38 mm, across the zones, were obtained for both the MW and UW predictions, with a mean bias (MB) below ±1 mm.The predicted and observed PWV presented a better agreement northerly. Initial predictability test for each model was done on two independent data sets: ATOVS (2012-2015), and radiosonde (2010-2011) at Penang, Kuantan and Sepang stations, with very good outcomes. The results of the tests revealed remarkable performances, when compared with two previously reported models. The inclusion of variable regression coefficients, and the utilization of satellite-derived data, which provide soundings of data-void regions between radiosonde networks, proved to have optimized the results.
Interpretation of commonly used statistical regression models.
Kasza, Jessica; Wolfe, Rory
2014-01-01
A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Hospitals Known for Nursing Excellence Perform Better on Value Based Purchasing Measures
Lasater, Karen B.; Germack, Hayley D.; Small, Dylan S.; McHugh, Matthew D.
2018-01-01
It is well-established that hospitals recognized for good nursing care – Magnet hospitals – are associated with better patient outcomes. Less is known about how Magnet hospitals compare to non-Magnets on quality measures linked to Medicare reimbursement. The purpose of this study was to determine how Magnet hospitals perform compared to matched non-Magnet hospitals on Hospital Value Based Purchasing (VBP) measures. A cross-sectional analysis of three linked data sources was performed. The sample included 3,021 non-federal acute care hospitals participating in the VBP program (323 Magnets; 2,698 non-Magnets). Propensity score matching was used to match Magnet and non-Magnet hospitals with similar hospital characteristics. After matching, linear and logistic regression models were used to examine the relationship between Magnet status and VBP performance. After matching and adjusting for hospital characteristics, Magnet recognition predicted higher scores on Total Performance (Regression Coefficient [RC] = 1.66, p < 0.05), Clinical Processes (RC = 3.85; p < 0.01), and Patient Experience (RC = 6.33; p < 0.001). The relationships between Magnet recognition and the Outcome and Efficiency domains were not statistically significant. Magnet hospitals known for nursing excellence perform better on Hospital VBP measures. As healthcare systems adapt to evolving incentives that reward value, attention to nurses at the front lines may be central to ensuring high-value care for patients. PMID:28558604
A SEMIPARAMETRIC BAYESIAN MODEL FOR CIRCULAR-LINEAR REGRESSION
We present a Bayesian approach to regress a circular variable on a linear predictor. The regression coefficients are assumed to have a nonparametric distribution with a Dirichlet process prior. The semiparametric Bayesian approach gives added flexibility to the model and is usefu...
Flint, A.L.; Childs, S.W.
1991-01-01
The Priestley-Taylor equation, a simplification of the Penman equation, was used to allow calculations of evapotranspiration under conditions where soil water supply limits evapotranspiration. The Priestley-Taylor coefficient, ??, was calculated to incorporate an exponential decrease in evapotranspiration as soil water content decreases. The method is appropriate for use when detailed meteorological measurements are not available. The data required to determine the parameter for the ?? coefficient are net radiation, soil heat flux, average air temperature, and soil water content. These values can be obtained from measurements or models. The dataset used in this report pertains to a partially vegetated clearcut forest site in southwest Oregon with soil depths ranging from 0.48 to 0.70 m and weathered bedrock below that. Evapotranspiration was estimated using the Bowen ratio method, and the calculated Priestley-Taylor coefficient was fitted to these estimates by nonlinear regression. The calculated Priestley-Taylor coefficient (?????) was found to be approximately 0.9 when the soil was near field capacity (0.225 cm3 cm-3). It was not until soil water content was less than 0.14 cm3 cm-3 that soil water supply limited evapotranspiration. The soil reached a final residual water content near 0.05 cm3 cm-3 at the end of the growing season. ?? 1991.
1980-07-01
FUNCTION ( t) CENTERED AT C WITH PERIOD n -nr 0 soTIME t FIGURE 3.4S RECTAPOOLAR PORN )=C FUNCTION g t) CENTERED AT 0 WITH PERIOD n n n 52n tI y I (h...of a typical family in Kabiria (a city in Northern Algeria) over the time period Jan.-Feb. 1975 through Nov.-Dec. 1977. We would like to obtain a...values of y .. .. ... -75- Table 4.2 The Average Bi-Monthly Expenses of a Family in Kabiria and Their Fourier Representation Fourier Coefficients x k
Yao, Xin; Niu, Yandong; Li, Youzhi; Zou, Dongsheng; Ding, Xiaohui; Bian, Hualin
2018-05-09
Bioaccumulation of five heavy metals (Cd, Cu, Mn, Pb, and Zn) in six plant organs (panicle, leaf, stem, root, rhizome, and bud) of the emergent and perennial plant species, Miscanthus sacchariflorus, were investigated to estimate the plant's potential for accumulating heavy metals in the wetlands of Dongting Lake. We found the highest Cd concentrations in the panicles and leaves; while the highest Cu and Mn were observed in the roots, the highest Pb in the panicles, and the highest Zn in the panicles and buds. In contrast, the lowest Cd concentrations were detected in the stem, roots, and buds; the lowest Cu concentrations in the leaves and stems; the lowest Mn concentrations in the panicles, rhizomes, and buds; the lowest Pb concentrations in the stems; and the lowest Zn concentrations in the leaves, stems, and rhizomes. Mean Cu concentration in the plant showed a positive regression coefficient with plot elevation, soil organic matter content, and soil Cu concentration, whereas it showed a negative regression coefficient with soil moisture and electrolyte leakage. Mean Mn concentration showed positive and negative regression coefficients with soil organic matter and soil moisture, respectively. Mean Pb concentration exhibited positive regression coefficient with plot elevation and soil total P concentration, and Zn concentration showed a positive regression coefficient with soil available P and total P concentrations. However, there was no significant regression coefficient between mean Cd concentration in the plant and the investigated environmental parameters. Stems and roots were the main organs involved in heavy metal accumulation from the environment. The mean quantities of heavy metals accumulated in the plant tissues were 2.2 mg Cd, 86.7 mg Cu, 290.3 mg Mn, 15.9 mg Pb, and 307 mg Zn per square meter. In the Dongting Lake wetlands, 0.7 × 10 3 kg Cd, 22.9 × 10 3 kg Cu, 77.5 × 10 3 kg Mn, 3.1 × 10 3 kg Pb, and 95.9 × 10 3 kg Zn per year were accumulated by aboveground organs and removed from the lake through harvesting for paper manufacture.
Prediction of Classroom Reverberation Time using Neural Network
NASA Astrophysics Data System (ADS)
Liyana Zainudin, Fathin; Kadir Mahamad, Abd; Saon, Sharifah; Nizam Yahya, Musli
2018-04-01
In this paper, an alternative method for predicting the reverberation time (RT) using neural network (NN) for classroom was designed and explored. Classroom models were created using Google SketchUp software. The NN applied training dataset from the classroom models with RT values that were computed from ODEON 12.10 software. The NN was conducted separately for 500Hz, 1000Hz, and 2000Hz as absorption coefficient that is one of the prominent input variable is frequency dependent. Mean squared error (MSE) and regression (R) values were obtained to examine the NN efficiency. Overall, the NN shows a good result with MSE < 0.005 and R > 0.9. The NN also managed to achieve a percentage of accuracy of 92.53% for 500Hz, 93.66% for 1000Hz, and 93.18% for 2000Hz and thus displays a good and efficient performance. Nevertheless, the optimum RT value is range between 0.75 – 0.9 seconds.
Modeling of spectral signatures of littoral waters
NASA Astrophysics Data System (ADS)
Haltrin, Vladimir I.
1997-12-01
The spectral values of remotely obtained radiance reflectance coefficient (RRC) are compared with the values of RRC computed from inherent optical properties measured during the shipborne experiment near the West Florida coast. The model calculations are based on the algorithm developed at the Naval Research Laboratory at Stennis Space Center and presented here. The algorithm is based on the radiation transfer theory and uses regression relationships derived from experimental data. Overall comparison of derived and measured RRCs shows that this algorithm is suitable for processing ground truth data for the purposes of remote data calibration. The second part of this work consists of the evaluation of the predictive visibility model (PVM). The simulated three-dimensional values of optical properties are compared with the measured ones. Preliminary results of comparison are encouraging and show that the PVM can qualitatively predict the evolution of inherent optical properties in littoral waters.
Spatial Autocorrelation Approaches to Testing Residuals from Least Squares Regression
Chen, Yanguang
2016-01-01
In geo-statistics, the Durbin-Watson test is frequently employed to detect the presence of residual serial correlation from least squares regression analyses. However, the Durbin-Watson statistic is only suitable for ordered time or spatial series. If the variables comprise cross-sectional data coming from spatial random sampling, the test will be ineffectual because the value of Durbin-Watson’s statistic depends on the sequence of data points. This paper develops two new statistics for testing serial correlation of residuals from least squares regression based on spatial samples. By analogy with the new form of Moran’s index, an autocorrelation coefficient is defined with a standardized residual vector and a normalized spatial weight matrix. Then by analogy with the Durbin-Watson statistic, two types of new serial correlation indices are constructed. As a case study, the two newly presented statistics are applied to a spatial sample of 29 China’s regions. These results show that the new spatial autocorrelation models can be used to test the serial correlation of residuals from regression analysis. In practice, the new statistics can make up for the deficiencies of the Durbin-Watson test. PMID:26800271
NASA Astrophysics Data System (ADS)
Hammud, Hassan H.; Ghannoum, Amer; Masoud, Mamdouh S.
2006-02-01
Sixteen Schiff bases obtained from the condensation of benzaldehyde or salicylaldehyde with various amines (aniline, 4-carboxyaniline, phenylhydrazine, 2,4-dinitrophenylhydrazine, ethylenediamine, hydrazine, o-phenylenediamine and 2,6-pyridinediamine) are studied with UV-vis spectroscopy to observe the effect of solvents, substituents and other structural factors on the spectra. The bands involving different electronic transitions are interpreted. Computerized analysis and multiple regression techniques were applied to calculate the regression and correlation coefficients based on the equation that relates peak position λmax to the solvent parameters that depend on the H-bonding ability, refractive index and dielectric constant of solvents.
An indirect approach to assess the pests on sorghum by remote sensing
NASA Astrophysics Data System (ADS)
Singh, D.; Sao, R.
In today's world of advanced technology various techniques are being used to study ecological parameter and gathering data for agricultural benefits. The major aspects of remote sensing are timely estimates of agriculture crop yield, prediction of pest etc. The damage caused by the pest to crop is well known. Therefore, in this paper, an attempt has to be made to estimate the number of pests on sorghum by remote sensing technique. The studies were made on crop Sorghum (Meethi Sudan) that is a forage variety and the pest observed is a species of grasshopper. The beds of crop sorghum were specially prepared for pests as well as microwave scattering measurements. In first phase of study, dependence of number of pests on sorghum plant parameters (i.e., crop covered moist soil (SM), plant height (PH), leaf area index (LAI), percentage Biomass (BIO), Total chlorophyll (TC)) have been observed by the regression analyses and it was found that pests were more dependent on sorghum chlorophyll than other plant parameters, while climatic conditions were taken as constant. A linear relationship has been obtained between number of pests and TC with quite significant values of coefficient of determination (r^2=0.86). These crop parameters are easily assessable through microwave remote sensing so they can form the basis for prediction of pest remotely. In second phase of study, several observations were carried out for various growth stages of sorghum using bistatic scatterometer for both like polarizations (i.e., HH- and VV-) and different incidence angles at X-band (9.5 GHz). Linear, and multiple regression analysis were carried out to check dependence of scattering coefficient on these crop parameters and it was noticed that scattering coefficient was more dependent on sorghum TC than other plant parameters at X-band. A negative correlation has been obtained between TC and scattering coefficient with quite good values of r^2 (0.82). VV-pol gives better results than HH-pol and incidence angle should be more than 40 degree for both like pols for assessing the sorghum TC at X-band. The TC assessed by the microwave measurements was helpful to estimate the number of pests on sorghum. Combining both phase of study, number of pests was estimated and a quite good agreement (r^2=0.76) was found between observed and estimated pests.
NASA Astrophysics Data System (ADS)
Murakami, Hiroki; Watanabe, Tsuneo; Fukuoka, Daisuke; Terabayashi, Nobuo; Hara, Takeshi; Muramatsu, Chisako; Fujita, Hiroshi
2016-04-01
The word "Locomotive syndrome" has been proposed to describe the state of requiring care by musculoskeletal disorders and its high-risk condition. Reduction of the knee extension strength is cited as one of the risk factors, and the accurate measurement of the strength is needed for the evaluation. The measurement of knee extension strength using a dynamometer is one of the most direct and quantitative methods. This study aims to develop a system for measuring the knee extension strength using the ultrasound images of the rectus femoris muscles obtained with non-invasive ultrasonic diagnostic equipment. First, we extract the muscle area from the ultrasound images and determine the image features, such as the thickness of the muscle. We combine these features and physical features, such as the patient's height, and build a regression model of the knee extension strength from training data. We have developed a system for estimating the knee extension strength by applying the regression model to the features obtained from test data. Using the test data of 168 cases, correlation coefficient value between the measured values and estimated values was 0.82. This result suggests that this system can estimate knee extension strength with high accuracy.
Improved model of the retardance in citric acid coated ferrofluids using stepwise regression
NASA Astrophysics Data System (ADS)
Lin, J. F.; Qiu, X. R.
2017-06-01
Citric acid (CA) coated Fe3O4 ferrofluids (FFs) have been conducted for biomedical application. The magneto-optical retardance of CA coated FFs was measured by a Stokes polarimeter. Optimization and multiple regression of retardance in FFs were executed by Taguchi method and Microsoft Excel previously, and the F value of regression model was large enough. However, the model executed by Excel was not systematic. Instead we adopted the stepwise regression to model the retardance of CA coated FFs. From the results of stepwise regression by MATLAB, the developed model had highly predictable ability owing to F of 2.55897e+7 and correlation coefficient of one. The average absolute error of predicted retardances to measured retardances was just 0.0044%. Using the genetic algorithm (GA) in MATLAB, the optimized parametric combination was determined as [4.709 0.12 39.998 70.006] corresponding to the pH of suspension, molar ratio of CA to Fe3O4, CA volume, and coating temperature. The maximum retardance was found as 31.712°, close to that obtained by evolutionary solver in Excel and a relative error of -0.013%. Above all, the stepwise regression method was successfully used to model the retardance of CA coated FFs, and the maximum global retardance was determined by the use of GA.
Threshold regression to accommodate a censored covariate.
Qian, Jing; Chiou, Sy Han; Maye, Jacqueline E; Atem, Folefac; Johnson, Keith A; Betensky, Rebecca A
2018-06-22
In several common study designs, regression modeling is complicated by the presence of censored covariates. Examples of such covariates include maternal age of onset of dementia that may be right censored in an Alzheimer's amyloid imaging study of healthy subjects, metabolite measurements that are subject to limit of detection censoring in a case-control study of cardiovascular disease, and progressive biomarkers whose baseline values are of interest, but are measured post-baseline in longitudinal neuropsychological studies of Alzheimer's disease. We propose threshold regression approaches for linear regression models with a covariate that is subject to random censoring. Threshold regression methods allow for immediate testing of the significance of the effect of a censored covariate. In addition, they provide for unbiased estimation of the regression coefficient of the censored covariate. We derive the asymptotic properties of the resulting estimators under mild regularity conditions. Simulations demonstrate that the proposed estimators have good finite-sample performance, and often offer improved efficiency over existing methods. We also derive a principled method for selection of the threshold. We illustrate the approach in application to an Alzheimer's disease study that investigated brain amyloid levels in older individuals, as measured through positron emission tomography scans, as a function of maternal age of dementia onset, with adjustment for other covariates. We have developed an R package, censCov, for implementation of our method, available at CRAN. © 2018, The International Biometric Society.
Tokunaga, Makoto; Watanabe, Susumu; Sonoda, Shigeru
2017-09-01
Multiple linear regression analysis is often used to predict the outcome of stroke rehabilitation. However, the predictive accuracy may not be satisfactory. The objective of this study was to elucidate the predictive accuracy of a method of calculating motor Functional Independence Measure (mFIM) at discharge from mFIM effectiveness predicted by multiple regression analysis. The subjects were 505 patients with stroke who were hospitalized in a convalescent rehabilitation hospital. The formula "mFIM at discharge = mFIM effectiveness × (91 points - mFIM at admission) + mFIM at admission" was used. By including the predicted mFIM effectiveness obtained through multiple regression analysis in this formula, we obtained the predicted mFIM at discharge (A). We also used multiple regression analysis to directly predict mFIM at discharge (B). The correlation between the predicted and the measured values of mFIM at discharge was compared between A and B. The correlation coefficients were .916 for A and .878 for B. Calculating mFIM at discharge from mFIM effectiveness predicted by multiple regression analysis had a higher degree of predictive accuracy of mFIM at discharge than that directly predicted. Copyright © 2017 National Stroke Association. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Coskuntuncel, Orkun
2013-01-01
The purpose of this study is two-fold; the first aim being to show the effect of outliers on the widely used least squares regression estimator in social sciences. The second aim is to compare the classical method of least squares with the robust M-estimator using the "determination of coefficient" (R[superscript 2]). For this purpose,…
Procedures for adjusting regional regression models of urban-runoff quality using local data
Hoos, A.B.; Sisolak, J.K.
1993-01-01
Statistical operations termed model-adjustment procedures (MAP?s) can be used to incorporate local data into existing regression models to improve the prediction of urban-runoff quality. Each MAP is a form of regression analysis in which the local data base is used as a calibration data set. Regression coefficients are determined from the local data base, and the resulting `adjusted? regression models can then be used to predict storm-runoff quality at unmonitored sites. The response variable in the regression analyses is the observed load or mean concentration of a constituent in storm runoff for a single storm. The set of explanatory variables used in the regression analyses is different for each MAP, but always includes the predicted value of load or mean concentration from a regional regression model. The four MAP?s examined in this study were: single-factor regression against the regional model prediction, P, (termed MAP-lF-P), regression against P,, (termed MAP-R-P), regression against P, and additional local variables (termed MAP-R-P+nV), and a weighted combination of P, and a local-regression prediction (termed MAP-W). The procedures were tested by means of split-sample analysis, using data from three cities included in the Nationwide Urban Runoff Program: Denver, Colorado; Bellevue, Washington; and Knoxville, Tennessee. The MAP that provided the greatest predictive accuracy for the verification data set differed among the three test data bases and among model types (MAP-W for Denver and Knoxville, MAP-lF-P and MAP-R-P for Bellevue load models, and MAP-R-P+nV for Bellevue concentration models) and, in many cases, was not clearly indicated by the values of standard error of estimate for the calibration data set. A scheme to guide MAP selection, based on exploratory data analysis of the calibration data set, is presented and tested. The MAP?s were tested for sensitivity to the size of a calibration data set. As expected, predictive accuracy of all MAP?s for the verification data set decreased as the calibration data-set size decreased, but predictive accuracy was not as sensitive for the MAP?s as it was for the local regression models.
Multicollinearity and Regression Analysis
NASA Astrophysics Data System (ADS)
Daoud, Jamal I.
2017-12-01
In regression analysis it is obvious to have a correlation between the response and predictor(s), but having correlation among predictors is something undesired. The number of predictors included in the regression model depends on many factors among which, historical data, experience, etc. At the end selection of most important predictors is something objective due to the researcher. Multicollinearity is a phenomena when two or more predictors are correlated, if this happens, the standard error of the coefficients will increase [8]. Increased standard errors means that the coefficients for some or all independent variables may be found to be significantly different from In other words, by overinflating the standard errors, multicollinearity makes some variables statistically insignificant when they should be significant. In this paper we focus on the multicollinearity, reasons and consequences on the reliability of the regression model.
QSAR modeling of flotation collectors using principal components extracted from topological indices.
Natarajan, R; Nirdosh, Inderjit; Basak, Subhash C; Mills, Denise R
2002-01-01
Several topological indices were calculated for substituted-cupferrons that were tested as collectors for the froth flotation of uranium. The principal component analysis (PCA) was used for data reduction. Seven principal components (PC) were found to account for 98.6% of the variance among the computed indices. The principal components thus extracted were used in stepwise regression analyses to construct regression models for the prediction of separation efficiencies (Es) of the collectors. A two-parameter model with a correlation coefficient of 0.889 and a three-parameter model with a correlation coefficient of 0.913 were formed. PCs were found to be better than partition coefficient to form regression equations, and inclusion of an electronic parameter such as Hammett sigma or quantum mechanically derived electronic charges on the chelating atoms did not improve the correlation coefficient significantly. The method was extended to model the separation efficiencies of mercaptobenzothiazoles (MBT) and aminothiophenols (ATP) used in the flotation of lead and zinc ores, respectively. Five principal components were found to explain 99% of the data variability in each series. A three-parameter equation with correlation coefficient of 0.985 and a two-parameter equation with correlation coefficient of 0.926 were obtained for MBT and ATP, respectively. The amenability of separation efficiencies of chelating collectors to QSAR modeling using PCs based on topological indices might lead to the selection of collectors for synthesis and testing from a virtual database.
Ma, Guangcai; Yuan, Quan; Yu, Haiying; Lin, Hongjun; Chen, Jianrong; Hong, Huachang
2017-04-01
The binding of organic chemicals to serum albumin can significantly reduce their unbound concentration in blood and affect their biological reactions. In this study, we developed a new QSAR model for bovine serum albumin (BSA) - water partition coefficients (K BSA/W ) of neutral organic chemicals with large structural variance, logK BSA/W values covering 3.5 orders of magnitude (1.19-4.76). All chemical geometries were optimized by semi-empirical PM6 algorithm. Several quantum chemical parameters that reflect various intermolecular interactions as well as hydrophobicity were selected to develop QSAR model. The result indicates the regression model derived from logK ow , the most positive net atomic charges on an atom, Connolly solvent excluded volume, polarizability, and Abraham acidity could explain the partitioning mechanism of organic chemicals between BSA and water. The simulated external validation and cross validation verifies the developed model has good statistical robustness and predictive ability, thus can be used to estimate the logK BSA/W values for chemicals in application domain, accordingly to provide basic data for the toxicity assessment of the chemicals. Copyright © 2016 Elsevier Inc. All rights reserved.
Paterson, Gord; Liu, Jian; Haffner, G Douglas; Drouillard, Ken G
2010-08-01
This research investigated dose-dependent whole body and fecal elimination of 39 polychlorinated biphenyl (PCB) congeners spanning a range of chemical hydrophobicities (log Kow) by the Japanese koi (Cyprinus carpio). Both whole body (ktot) and fecal (keg) PCB congener elimination rate coefficients were negatively correlated with log Kow and observed to be dose independent. PCB congener ktot values determined for koi were representative of those generated for fish species of similar size and reared at near optimal temperatures. For persistent and metabolized-type PCB congeners, no significant difference was observed between the regressions describing the relationships between ktot and log Kow for these congeners. Individual PCB congener keg coefficient estimates ranged between 1% and 20% of their respective ktot values but averaged only 5% of the magnitude of ktot over a log Kow range of 5.7-7.8. These results verify first-order kinetics of PCB elimination by a fish species and demonstrate that the relative contribution of keg to ktot is negligible, even for highly hydrophobic (log Kow>6.5) compounds. It was concluded that gill elimination is the primary mechanism of elimination for persistent organic pollutants such as PCBs by Japanese koi.
Psychometric properties of the Valued Living Questionnaire Adapted to Dementia Caregiving.
Romero-Moreno, R; Gallego-Alberto, L; Márquez-González, M; Losada, A
2017-09-01
Caring for a relative with dementia is associated with physical and emotional health problems in caregivers. There are no studies analysing the role of personal values in the caregiver stress process. This study aims to analyse the psychometric properties of the Valued Living Questionnaire Adapted to Caregiving (VLQAC), and to explore the relationship between personal values and stressors, coping strategies and caregiver distress. A total of 253 individual interviews with caregivers of relatives with dementia were conducted, and the following variables were assessed: personal values, stressors, cognitive fusion, emotional acceptance, depression, anxiety, and satisfaction with life. An exploratory factor analysis and hierarchical regression analyses were carried out. Two factors were obtained, Commitment to Own Values and Commitment to Family Values which explain 43.42% of variance, with reliability coefficients (Cronbach's alpha) of .76 and .61, respectively. Personal values had a significant effect on emotional distress (depression and anxiety) and satisfaction with life, even when controlling for socio-demographic variables, stressors and coping strategies. Results suggest that the personal values construct of dementia caregivers is two-dimensional. The personal values of the caregivers play an important role in accounting for distress and satisfaction with life in this population.
Parametric regression model for survival data: Weibull regression model as an example
2016-01-01
Weibull regression model is one of the most popular forms of parametric regression model that it provides estimate of baseline hazard function, as well as coefficients for covariates. Because of technical difficulties, Weibull regression model is seldom used in medical literature as compared to the semi-parametric proportional hazard model. To make clinical investigators familiar with Weibull regression model, this article introduces some basic knowledge on Weibull regression model and then illustrates how to fit the model with R software. The SurvRegCensCov package is useful in converting estimated coefficients to clinical relevant statistics such as hazard ratio (HR) and event time ratio (ETR). Model adequacy can be assessed by inspecting Kaplan-Meier curves stratified by categorical variable. The eha package provides an alternative method to model Weibull regression model. The check.dist() function helps to assess goodness-of-fit of the model. Variable selection is based on the importance of a covariate, which can be tested using anova() function. Alternatively, backward elimination starting from a full model is an efficient way for model development. Visualization of Weibull regression model after model development is interesting that it provides another way to report your findings. PMID:28149846
Standardized Regression Coefficients as Indices of Effect Sizes in Meta-Analysis
ERIC Educational Resources Information Center
Kim, Rae Seon
2011-01-01
When conducting a meta-analysis, it is common to find many collected studies that report regression analyses, because multiple regression analysis is widely used in many fields. Meta-analysis uses effect sizes drawn from individual studies as a means of synthesizing a collection of results. However, indices of effect size from regression analyses…
NASA Astrophysics Data System (ADS)
Mahrooghy, Majid; Ashraf, Ahmed B.; Daye, Dania; Mies, Carolyn; Rosen, Mark; Feldman, Michael; Kontos, Despina
2014-03-01
We evaluate the prognostic value of sparse representation-based features by applying the K-SVD algorithm on multiparametric kinetic, textural, and morphologic features in breast dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI). K-SVD is an iterative dimensionality reduction method that optimally reduces the initial feature space by updating the dictionary columns jointly with the sparse representation coefficients. Therefore, by using K-SVD, we not only provide sparse representation of the features and condense the information in a few coefficients but also we reduce the dimensionality. The extracted K-SVD features are evaluated by a machine learning algorithm including a logistic regression classifier for the task of classifying high versus low breast cancer recurrence risk as determined by a validated gene expression assay. The features are evaluated using ROC curve analysis and leave one-out cross validation for different sparse representation and dimensionality reduction numbers. Optimal sparse representation is obtained when the number of dictionary elements is 4 (K=4) and maximum non-zero coefficients is 2 (L=2). We compare K-SVD with ANOVA based feature selection for the same prognostic features. The ROC results show that the AUC of the K-SVD based (K=4, L=2), the ANOVA based, and the original features (i.e., no dimensionality reduction) are 0.78, 0.71. and 0.68, respectively. From the results, it can be inferred that by using sparse representation of the originally extracted multi-parametric, high-dimensional data, we can condense the information on a few coefficients with the highest predictive value. In addition, the dimensionality reduction introduced by K-SVD can prevent models from over-fitting.
Aircraft Anomaly Detection Using Performance Models Trained on Fleet Data
NASA Technical Reports Server (NTRS)
Gorinevsky, Dimitry; Matthews, Bryan L.; Martin, Rodney
2012-01-01
This paper describes an application of data mining technology called Distributed Fleet Monitoring (DFM) to Flight Operational Quality Assurance (FOQA) data collected from a fleet of commercial aircraft. DFM transforms the data into aircraft performance models, flight-to-flight trends, and individual flight anomalies by fitting a multi-level regression model to the data. The model represents aircraft flight performance and takes into account fixed effects: flight-to-flight and vehicle-to-vehicle variability. The regression parameters include aerodynamic coefficients and other aircraft performance parameters that are usually identified by aircraft manufacturers in flight tests. Using DFM, the multi-terabyte FOQA data set with half-million flights was processed in a few hours. The anomalies found include wrong values of competed variables, (e.g., aircraft weight), sensor failures and baises, failures, biases, and trends in flight actuators. These anomalies were missed by the existing airline monitoring of FOQA data exceedances.
NASA Technical Reports Server (NTRS)
Roth, D. J.; Swickard, S. M.; Stang, D. B.; Deguire, M. R.
1991-01-01
A review and statistical analysis of the ultrasonic velocity method for estimating the porosity fraction in polycrystalline materials is presented. Initially, a semiempirical model is developed showing the origin of the linear relationship between ultrasonic velocity and porosity fraction. Then, from a compilation of data produced by many researchers, scatter plots of velocity versus percent porosity data are shown for Al2O3, MgO, porcelain-based ceramics, PZT, SiC, Si3N4, steel, tungsten, UO2,(U0.30Pu0.70)C, and YBa2Cu3O(7-x). Linear regression analysis produces predicted slope, intercept, correlation coefficient, level of significance, and confidence interval statistics for the data. Velocity values predicted from regression analysis of fully-dense materials are in good agreement with those calculated from elastic properties.
Utility of correlation techniques in gravity and magnetic interpretation
NASA Technical Reports Server (NTRS)
Chandler, V. W.; Koski, J. S.; Braice, L. W.; Hinze, W. J.
1977-01-01
Internal correspondence uses Poisson's Theorem in a moving-window linear regression analysis between the anomalous first vertical derivative of gravity and total magnetic field reduced to the pole. The regression parameters provide critical information on source characteristics. The correlation coefficient indicates the strength of the relation between magnetics and gravity. Slope value gives delta j/delta sigma estimates of the anomalous source. The intercept furnishes information on anomaly interference. Cluster analysis consists of the classification of subsets of data into groups of similarity based on correlation of selected characteristics of the anomalies. Model studies are used to illustrate implementation and interpretation procedures of these methods, particularly internal correspondence. Analysis of the results of applying these methods to data from the midcontinent and a transcontinental profile shows they can be useful in identifying crustal provinces, providing information on horizontal and vertical variations of physical properties over province size zones, validating long wavelength anomalies, and isolating geomagnetic field removal problems.
NASA Technical Reports Server (NTRS)
Roth, D. J.; Swickard, S. M.; Stang, D. B.; Deguire, M. R.
1990-01-01
A review and statistical analysis of the ultrasonic velocity method for estimating the porosity fraction in polycrystalline materials is presented. Initially, a semi-empirical model is developed showing the origin of the linear relationship between ultrasonic velocity and porosity fraction. Then, from a compilation of data produced by many researchers, scatter plots of velocity versus percent porosity data are shown for Al2O3, MgO, porcelain-based ceramics, PZT, SiC, Si3N4, steel, tungsten, UO2,(U0.30Pu0.70)C, and YBa2Cu3O(7-x). Linear regression analysis produced predicted slope, intercept, correlation coefficient, level of significance, and confidence interval statistics for the data. Velocity values predicted from regression analysis for fully-dense materials are in good agreement with those calculated from elastic properties.
Bai, Jin-Shun; Cao, Wei-Dong; Xiong, Jing; Zeng, Nao-Hua; Shimizu, Katshyoshi; Rui, Yu-Kui
2013-12-01
In order to explore the feasibility of using the image processing technology to diagnose the nitrogen status and to predict the maize yield, a field experiment with different nitrogen rates with green manure incorporation was conducted. Maize canopy digital images over a range of growth stages were captured by digital camera. Maize nitrogen status and the relationships between image color indices derived by digital camera for maize at different growth stages and maize nitrogen status indicators were analyzed. These digital camera sourced image color indices at different growth stages for maize were also regressed with maize grain yield at maturity. The results showed that the plant nitrogen status for maize was improved by green manure application. The leaf chlorophyll content (SPAD value), aboveground biomass and nitrogen uptake for green manure treatments at different maize growth stages were all higher than that for chemical fertilization treatments. The correlations between spectral indices with plant nitrogen indicators for maize affected by green manure application were weaker than that affected by chemical fertilization. And the correlation coefficients for green manure application were ranged with the maize growth stages changes. The best spectral indices for diagnosis of plant nitrogen status after green manure incorporation were normalized blue value (B/(R+G+B)) at 12-leaf (V12) stage and normalized red value (R/(R+G+B)) at grain-filling (R4) stage individually. The coefficients of determination based on linear regression were 0. 45 and 0. 46 for B/(R+G+B) at V12 stage and R/(R+G+B) at R4 stage respectively, acting as a predictor of maize yield response to nitrogen affected by green manure incorporation. Our findings suggested that digital image technique could be a potential tool for in-season prediction of the nitrogen status and grain yield for maize after green manure incorporation when the suitable growth stages and spectral indices for diagnosis were selected.
Xia, Qing; Liu, Changhong; Liu, Jinxia; Pan, Wenjuan; Lu, Xuzhong; Yang, Jianbo; Chen, Wei; Zheng, Lei
2016-03-30
Rancidity is an important attribute for quality assessment of butter cookies, while traditional methods for rancidity measurement are usually laborious, destructive and prone to operational error. In the present paper, the potential of applying multi-spectral imaging (MSI) technology with 19 wavelengths in the range of 405-970 nm to evaluate the rancidity in butter cookies was investigated. Moisture content, acid value and peroxide value were determined by traditional methods and then related with the spectral information by partial least squares regression (PLSR) and back-propagation artificial neural network (BP-ANN). The optimal models for predicting moisture content, acid value and peroxide value were obtained by PLSR. The correlation coefficient (r) obtained by PLSR models revealed that MSI had a perfect ability to predict moisture content (r = 0.909), acid value (r = 0.944) and peroxide value (r = 0.971). The study demonstrated that the rancidity level of butter cookies can be continuously monitored and evaluated in real-time by the multi-spectral imaging, which is of great significance for developing online food safety monitoring solutions. © 2015 Society of Chemical Industry.
COSMO-SkyMed potentiality to identify crop-specific behavior and monitor phenological parameters
NASA Astrophysics Data System (ADS)
Guarini, Rocchina; Segalini, Federica; Mastronardi, Giovanni; Notarnicola, Claudia; Vuolo, Francesco; Dini, Luigi
2014-10-01
This work aims at investigating the capability of COSMO-SkyMed® (CSK®) constellation of Synthetic Aperture Radar (SAR) system to monitor the Leaf Area Index (LAI) of different crops. The experiment was conducted in the Marchfeld Region, an agricultural Austrian area, and focused on five crop species: sugar beet, soybean, potato, pea and corn. A linear regression analysis was carried out to assess the sensitivity of CSK® backscattering coefficients to crops changes base on LAI values. CSK® backscattering coefficients were averaged at a field scale (<σ°dB>) and were compared to the DEIMOS-1 derived values of estimated LAI. LAI were as well averaged over the corresponding fields (
Rao, Harsha L; Yadav, Ravi K; Begum, Viquar U; Addepalli, Uday K; Senthil, Sirisha; Choudhari, Nikhil S; Garudadri, Chandra S
2015-03-01
To evaluate the effect of typical scan score (TSS), when within the acceptable limits, on the diagnostic performance of retinal nerve fibre layer (RNFL) parameters with the enhanced corneal compensation (ECC) protocol of scanning laser polarimetry (SLP) in glaucoma. In a cross-sectional study, 203 eyes of 160 glaucoma patients and 140 eyes of 104 control subjects underwent RNFL imaging with the ECC protocol of SLP. TSS was used to quantify atypical birefringence pattern (ABP) images. Influence of TSS on the diagnostic ability of SLP parameters was evaluated by receiver operating characteristic (ROC) regression models after adjusting for the effect of disease severity [based on mean deviation (MD)] on standard automated perimetry). Diagnostic abilities of all RNFL parameters of SLP increased when the TSS values were higher. This effect was statistically significant for TSNIT (coefficient: 0.08, p<0.001) and inferior average parameters (coefficient: 0.06, p=0.002) but not for nerve fibre indicator (NFI, coefficient: 0.03, p=0.21). In early glaucoma (MD of -5 dB), predicted area under ROC curve (AUC) for TSNIT average parameter improved from 0.642 at a TSS of 90 to 0.845 at a TSS of 100. In advanced glaucoma (MD of -15 dB), AUC for TSNIT average improved from 0.832 at a TSS of 90 to 0.947 at 100. Diagnostic performances of TSNIT and inferior average RNFL parameters with ECC protocol of SLP were significantly influenced by TSS even when the TSS values were within the acceptable limits. Diagnostic ability of NFI was unaffected by TSS values. © 2014 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
Time-dependent oral absorption models
NASA Technical Reports Server (NTRS)
Higaki, K.; Yamashita, S.; Amidon, G. L.
2001-01-01
The plasma concentration-time profiles following oral administration of drugs are often irregular and cannot be interpreted easily with conventional models based on first- or zero-order absorption kinetics and lag time. Six new models were developed using a time-dependent absorption rate coefficient, ka(t), wherein the time dependency was varied to account for the dynamic processes such as changes in fluid absorption or secretion, in absorption surface area, and in motility with time, in the gastrointestinal tract. In the present study, the plasma concentration profiles of propranolol obtained in human subjects following oral dosing were analyzed using the newly derived models based on mass balance and compared with the conventional models. Nonlinear regression analysis indicated that the conventional compartment model including lag time (CLAG model) could not predict the rapid initial increase in plasma concentration after dosing and the predicted Cmax values were much lower than that observed. On the other hand, all models with the time-dependent absorption rate coefficient, ka(t), were superior to the CLAG model in predicting plasma concentration profiles. Based on Akaike's Information Criterion (AIC), the fluid absorption model without lag time (FA model) exhibited the best overall fit to the data. The two-phase model including lag time, TPLAG model was also found to be a good model judging from the values of sum of squares. This model also described the irregular profiles of plasma concentration with time and frequently predicted Cmax values satisfactorily. A comparison of the absorption rate profiles also suggested that the TPLAG model is better at prediction of irregular absorption kinetics than the FA model. In conclusion, the incorporation of a time-dependent absorption rate coefficient ka(t) allows the prediction of nonlinear absorption characteristics in a more reliable manner.
Tan, Kok Chooi; Lim, Hwee San; Matjafri, Mohd Zubir; Abdullah, Khiruddin
2012-06-01
Atmospheric corrections for multi-temporal optical satellite images are necessary, especially in change detection analyses, such as normalized difference vegetation index (NDVI) rationing. Abrupt change detection analysis using remote-sensing techniques requires radiometric congruity and atmospheric correction to monitor terrestrial surfaces over time. Two atmospheric correction methods were used for this study: relative radiometric normalization and the simplified method for atmospheric correction (SMAC) in the solar spectrum. A multi-temporal data set consisting of two sets of Landsat images from the period between 1991 and 2002 of Penang Island, Malaysia, was used to compare NDVI maps, which were generated using the proposed atmospheric correction methods. Land surface temperature (LST) was retrieved using ATCOR3_T in PCI Geomatica 10.1 image processing software. Linear regression analysis was utilized to analyze the relationship between NDVI and LST. This study reveals that both of the proposed atmospheric correction methods yielded high accuracy through examination of the linear correlation coefficients. To check for the accuracy of the equation obtained through linear regression analysis for every single satellite image, 20 points were randomly chosen. The results showed that the SMAC method yielded a constant value (in terms of error) to predict the NDVI value from linear regression analysis-derived equation. The errors (average) from both proposed atmospheric correction methods were less than 10%.
NASA Astrophysics Data System (ADS)
Chardon, Jérémy; Hingray, Benoit; Favre, Anne-Catherine
2018-01-01
Statistical downscaling models (SDMs) are often used to produce local weather scenarios from large-scale atmospheric information. SDMs include transfer functions which are based on a statistical link identified from observations between local weather and a set of large-scale predictors. As physical processes driving surface weather vary in time, the most relevant predictors and the regression link are likely to vary in time too. This is well known for precipitation for instance and the link is thus often estimated after some seasonal stratification of the data. In this study, we present a two-stage analog/regression model where the regression link is estimated from atmospheric analogs of the current prediction day. Atmospheric analogs are identified from fields of geopotential heights at 1000 and 500 hPa. For the regression stage, two generalized linear models are further used to model the probability of precipitation occurrence and the distribution of non-zero precipitation amounts, respectively. The two-stage model is evaluated for the probabilistic prediction of small-scale precipitation over France. It noticeably improves the skill of the prediction for both precipitation occurrence and amount. As the analog days vary from one prediction day to another, the atmospheric predictors selected in the regression stage and the value of the corresponding regression coefficients can vary from one prediction day to another. The model allows thus for a day-to-day adaptive and tailored downscaling. It can also reveal specific predictors for peculiar and non-frequent weather configurations.
Interpretation of the Coefficients in the Fit y = at + bx + c
ERIC Educational Resources Information Center
Farnsworth, David L.
2006-01-01
The goals of this note are to derive formulas for the coefficients a and b in the least-squares regression plane y = at + bx + c for observations (t[subscript]i,x[subscript]i,y[subscript]i), i = 1, 2, ..., n, and to present meanings for the coefficients a and b. In this note, formulas for the coefficients a and b in the least-squares fit are…
Stochastic inversion of ocean color data using the cross-entropy method.
Salama, Mhd Suhyb; Shen, Fang
2010-01-18
Improving the inversion of ocean color data is an ever continuing effort to increase the accuracy of derived inherent optical properties. In this paper we present a stochastic inversion algorithm to derive inherent optical properties from ocean color, ship and space borne data. The inversion algorithm is based on the cross-entropy method where sets of inherent optical properties are generated and converged to the optimal set using iterative process. The algorithm is validated against four data sets: simulated, noisy simulated in-situ measured and satellite match-up data sets. Statistical analysis of validation results is based on model-II regression using five goodness-of-fit indicators; only R2 and root mean square of error (RMSE) are mentioned hereafter. Accurate values of total absorption coefficient are derived with R2 > 0.91 and RMSE, of log transformed data, less than 0.55. Reliable values of the total backscattering coefficient are also obtained with R2 > 0.7 (after removing outliers) and RMSE < 0.37. The developed algorithm has the ability to derive reliable results from noisy data with R2 above 0.96 for the total absorption and above 0.84 for the backscattering coefficients. The algorithm is self contained and easy to implement and modify to derive the variability of chlorophyll-a absorption that may correspond to different phytoplankton species. It gives consistently accurate results and is therefore worth considering for ocean color global products.
[New method of mixed gas infrared spectrum analysis based on SVM].
Bai, Peng; Xie, Wen-Jun; Liu, Jun-Hua
2007-07-01
A new method of infrared spectrum analysis based on support vector machine (SVM) for mixture gas was proposed. The kernel function in SVM was used to map the seriously overlapping absorption spectrum into high-dimensional space, and after transformation, the high-dimensional data could be processed in the original space, so the regression calibration model was established, then the regression calibration model with was applied to analyze the concentration of component gas. Meanwhile it was proved that the regression calibration model with SVM also could be used for component recognition of mixture gas. The method was applied to the analysis of different data samples. Some factors such as scan interval, range of the wavelength, kernel function and penalty coefficient C that affect the model were discussed. Experimental results show that the component concentration maximal Mean AE is 0.132%, and the component recognition accuracy is higher than 94%. The problems of overlapping absorption spectrum, using the same method for qualitative and quantitative analysis, and limit number of training sample, were solved. The method could be used in other mixture gas infrared spectrum analyses, promising theoretic and application values.
Wang, Shuang; Zhang, Yuchen; Dai, Wenrui; Lauter, Kristin; Kim, Miran; Tang, Yuzhe; Xiong, Hongkai; Jiang, Xiaoqian
2016-01-01
Motivation: Genome-wide association studies (GWAS) have been widely used in discovering the association between genotypes and phenotypes. Human genome data contain valuable but highly sensitive information. Unprotected disclosure of such information might put individual’s privacy at risk. It is important to protect human genome data. Exact logistic regression is a bias-reduction method based on a penalized likelihood to discover rare variants that are associated with disease susceptibility. We propose the HEALER framework to facilitate secure rare variants analysis with a small sample size. Results: We target at the algorithm design aiming at reducing the computational and storage costs to learn a homomorphic exact logistic regression model (i.e. evaluate P-values of coefficients), where the circuit depth is proportional to the logarithmic scale of data size. We evaluate the algorithm performance using rare Kawasaki Disease datasets. Availability and implementation: Download HEALER at http://research.ucsd-dbmi.org/HEALER/ Contact: shw070@ucsd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26446135
[Spectral reflectance characteristics and modeling of typical Takyr Solonetzs water content].
Zhang, Jun-hua; Jia, Ke-li
2015-03-01
Based on the analysis of the spectral reflectance of the typical Takyr Solonetzs soil in Ningxia, the relationship of soil water content and spectral reflectance was determined, and a quantitative model for the prediction of soil water content was constructed. The results showed that soil spectral reflectance decreased with the increasing soil water content when it was below the water holding capacity but increased with the increasing soil water content when it was higher than the water holding capacity. Soil water content presented significantly negative correlation with original reflectance (r), smooth reflectance (R), logarithm of reflectance (IgR), and positive correlation with the reciprocal of R and logarithm of reciprocal [lg (1/R)]. The correlation coefficient of soil water content and R in the whole wavelength was 0.0013, 0.0397 higher than r and lgR, respectively. Average correlation coefficient of soil water content with 1/R and [lg (1/R)] at the wavelength of 950-1000 nm was 0.2350 higher than that of 400-950 nm. The relationships of soil water content with the first derivate differential (R') , the first derivate differential of logarithm (lgR)' and the first derivate differential of logarithm of reciprocal [lg(1/R)]' were unstable. Base on the coefficients of r, lg(1/R), R' and (lgR)', different regression models were established to predict soil water content, and the coefficients of determination were 0.7610, 0.8184, 0.8524 and 0.8255, respectively. The determination coefficient for power function model of R'. reached 0.9447, while the fitting degree between the predicted value based on this model and on-site measured value was 0.8279. The model of R' had the highest fitted accuracy, while that of r had the lowest one. The results could provide a scientific basis for soil water content prediction and field irrigation in the Takyr Solonetzs region.
Malloy, Elizabeth J; Morris, Jeffrey S; Adar, Sara D; Suh, Helen; Gold, Diane R; Coull, Brent A
2010-07-01
Frequently, exposure data are measured over time on a grid of discrete values that collectively define a functional observation. In many applications, researchers are interested in using these measurements as covariates to predict a scalar response in a regression setting, with interest focusing on the most biologically relevant time window of exposure. One example is in panel studies of the health effects of particulate matter (PM), where particle levels are measured over time. In such studies, there are many more values of the functional data than observations in the data set so that regularization of the corresponding functional regression coefficient is necessary for estimation. Additional issues in this setting are the possibility of exposure measurement error and the need to incorporate additional potential confounders, such as meteorological or co-pollutant measures, that themselves may have effects that vary over time. To accommodate all these features, we develop wavelet-based linear mixed distributed lag models that incorporate repeated measures of functional data as covariates into a linear mixed model. A Bayesian approach to model fitting uses wavelet shrinkage to regularize functional coefficients. We show that, as long as the exposure error induces fine-scale variability in the functional exposure profile and the distributed lag function representing the exposure effect varies smoothly in time, the model corrects for the exposure measurement error without further adjustment. Both these conditions are likely to hold in the environmental applications we consider. We examine properties of the method using simulations and apply the method to data from a study examining the association between PM, measured as hourly averages for 1-7 days, and markers of acute systemic inflammation. We use the method to fully control for the effects of confounding by other time-varying predictors, such as temperature and co-pollutants.
Aggarwal, P; Kashyap, B
2017-06-01
Rampant use of fluconazole in Candida infections has led to predominance of less susceptible non-albicans Candida over Candida albicans. The aim of the study was to determine if zone diameters around fluconazole disk can be used to estimate the minimum inhibitory concentration (MIC) for clinical isolates of Candida species and vice versa. Categorical agreement between the Clinical & Laboratory Standards Institute (CLSI) recommended disk diffusion and CLSI broth microdilution method was sought for. Antifungal susceptibility testing by disk diffusion and Broth microdilution was done as per CLSI document M44-S3 and CLSI document M27-S4 for Candida isolates respectively. Regression analysis correlating zone diameters to MIC value was done. Pearson's correlation coefficient was calculated to determine correlation between disk zone diameters and MICs. Candida albicans (33.3%) was clearly outnumbered by other non-albicans species predominantly Candida tropicalis (42.5%) and Candida glabrata (18.4%). Ten percent of the strains were resistant to fluconazole by disk diffusion and 13% by broth microdilution. MIC range for Candida albicans and Candida tropicalis ranged from≤0.25-64μg/ml while that of Candida glabrata ranged from≤0.25-128μg/ml. Categorical agreement between disk diffusion and broth microdilution was 86.8%. Pearson's coefficient of correlation was -0.5975 indicating moderate negative correlation between the two variables. Zone sizes can be used to estimate the MIC values, although with limited accuracy. There should be a constant effort to upgrade the guidelines in view of new clinical data, and laboratories should make an active effort to incorporate them. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Peng, Ying; Yu, Bin; Wang, Peng; Kong, De-Guang; Chen, Bang-Hua; Yang, Xiao-Bing
2017-12-01
Outbreaks of hand-foot-mouth disease (HFMD) have occurred many times and caused serious health burden in China since 2008. Application of modern information technology to prediction and early response can be helpful for efficient HFMD prevention and control. A seasonal auto-regressive integrated moving average (ARIMA) model for time series analysis was designed in this study. Eighty-four-month (from January 2009 to December 2015) retrospective data obtained from the Chinese Information System for Disease Prevention and Control were subjected to ARIMA modeling. The coefficient of determination (R 2 ), normalized Bayesian Information Criterion (BIC) and Q-test P value were used to evaluate the goodness-of-fit of constructed models. Subsequently, the best-fitted ARIMA model was applied to predict the expected incidence of HFMD from January 2016 to December 2016. The best-fitted seasonal ARIMA model was identified as (1,0,1)(0,1,1) 12 , with the largest coefficient of determination (R 2 =0.743) and lowest normalized BIC (BIC=3.645) value. The residuals of the model also showed non-significant autocorrelations (P Box-Ljung (Q) =0.299). The predictions by the optimum ARIMA model adequately captured the pattern in the data and exhibited two peaks of activity over the forecast interval, including a major peak during April to June, and again a light peak for September to November. The ARIMA model proposed in this study can forecast HFMD incidence trend effectively, which could provide useful support for future HFMD prevention and control in the study area. Besides, further observations should be added continually into the modeling data set, and parameters of the models should be adjusted accordingly.
Hoos, A.B.; McMahon, G.
2009-01-01
Understanding how nitrogen transport across the landscape varies with landscape characteristics is important for developing sound nitrogen management policies. We used a spatially referenced regression analysis (SPARROW) to examine landscape characteristics influencing delivery of nitrogen from sources in a watershed to stream channels. Modelled landscape delivery ratio varies widely (by a factor of 4) among watersheds in the southeastern United States - higher in the western part (Tennessee, Alabama, and Mississippi) than in the eastern part, and the average value for the region is lower compared to other parts of the nation. When we model landscape delivery ratio as a continuous function of local-scale landscape characteristics, we estimate a spatial pattern that varies as a function of soil and climate characteristics but exhibits spatial structure in residuals (observed load minus predicted load). The spatial pattern of modelled landscape delivery ratio and the spatial pattern of residuals coincide spatially with Level III ecoregions and also with hydrologic landscape regions. Subsequent incorporation into the model of these frameworks as regional scale variables improves estimation of landscape delivery ratio, evidenced by reduced spatial bias in residuals, and suggests that cross-scale processes affect nitrogen attenuation on the landscape. The model-fitted coefficient values are logically consistent with the hypothesis that broad-scale classifications of hydrologic response help to explain differential rates of nitrogen attenuation, controlling for local-scale landscape characteristics. Negative model coefficients for hydrologic landscape regions where the primary flow path is shallow ground water suggest that a lower fraction of nitrogen mass will be delivered to streams; this relation is reversed for regions where the primary flow path is overland flow.
Hoos, Anne B.; McMahon, Gerard
2009-01-01
Understanding how nitrogen transport across the landscape varies with landscape characteristics is important for developing sound nitrogen management policies. We used a spatially referenced regression analysis (SPARROW) to examine landscape characteristics influencing delivery of nitrogen from sources in a watershed to stream channels. Modelled landscape delivery ratio varies widely (by a factor of 4) among watersheds in the southeastern United States—higher in the western part (Tennessee, Alabama, and Mississippi) than in the eastern part, and the average value for the region is lower compared to other parts of the nation. When we model landscape delivery ratio as a continuous function of local-scale landscape characteristics, we estimate a spatial pattern that varies as a function of soil and climate characteristics but exhibits spatial structure in residuals (observed load minus predicted load). The spatial pattern of modelled landscape delivery ratio and the spatial pattern of residuals coincide spatially with Level III ecoregions and also with hydrologic landscape regions. Subsequent incorporation into the model of these frameworks as regional scale variables improves estimation of landscape delivery ratio, evidenced by reduced spatial bias in residuals, and suggests that cross-scale processes affect nitrogen attenuation on the landscape. The model-fitted coefficient values are logically consistent with the hypothesis that broad-scale classifications of hydrologic response help to explain differential rates of nitrogen attenuation, controlling for local-scale landscape characteristics. Negative model coefficients for hydrologic landscape regions where the primary flow path is shallow ground water suggest that a lower fraction of nitrogen mass will be delivered to streams; this relation is reversed for regions where the primary flow path is overland flow.
Sun, Jing; Chen, Hanjian; Zheng, Jun; Mao, Bin; Zhu, Shengmei; Feng, Jingyi
2017-12-01
Radial artery applanation tonometry (RAAT) has been developed and utilized for continuous arterial pressure monitoring. However, evidence is lacking to clinically verify the RAAT technology and identify appropriate patient groups before routine clinical use. This study aims to evaluate the RAAT technology by comparing systolic blood pressure (SBP), mean blood pressure (MBP) and diastolic blood pressure (DBP) values in patients undergoing colon carcinoma surgery. Blood Pressure (BP) values obtained via RAAT (TL-300, Tensys Medical Inc., San Diego, CA, USA) and conventional arterial catheterization from 30 colon carcinoma surgical patients were collected and compared via Bland-Atman method, linear regression and 4-quadrant plot concordance analysis. For SBPs, MBPs and DBPs, means of the differences (±standard deviation; 95% limits of agreement) were -0.9 (±7.6; -15.7 to 13.9) mmHg, 3.1 (±6.5; -9.6 to 15.8) mmHg and 4.3 (±7.4; -10.3 to 18.8) mmHg, respectively. Linear regression coefficients of determination were 0.8706 for SBPs, 0.8353 for MBPs and 0.6858 for DBPs. Four-quadrant concordance correlation coefficients were 0.8740, 0.8522 and 0.7108 for SBPs, MBPs and DBPs, respectively. A highly selected patient collective undergoing colon carcinoma surgery was studied. BP measurements obtained via the TL-300 had clinically acceptable agreement with that acquired invasively using an arterial catheter. For use in clinical routine, it is necessary to take measures for improvement regarding movement artifacts and dilution of noise. A large sample size of patients under various conditions is also needed to further evaluate the RAAT technology before clinically routine use.
Can Salivary Acetylcholinesterase be a Diagnostic Biomarker for Alzheimer?
Bakhtiari, Sedigheh; Moghadam, Nahid Beladi; Ehsani, Marjan; Mortazavi, Hamed; Sabour, Siamak; Bakhshi, Mahin
2017-01-01
The loss of brain cholinergic activity is a key phenomenon in the biochemistry of Alzheimer's Disease (AD). Due to the specific biosynthesis of Acetylcholinesterase (AChE) of cholinergic neurons, the enzyme has been proposed as a potential biochemical marker of cholinergic activity. AChE is expressed not only in the Central Nervous System (CNS), Peripheral Nervous System (PNS) and muscles, but also on the surface of blood cells and saliva. This study aimed to measure salivary AChE activity in AD and to determine the feasibility of creating a simple laboratory test for diagnosing such patients. In this cross-sectional study, the recorded data were obtained from 15 Alzheimer's patients on memantine therapy and 15 healthy subjects. Unstimulated whole saliva samples were collected from the participants and salivary levels of AChE activity were determined by using the Ellman colorimetric method. The Mann Whitney U test was used to compare the average (median) of AChE activity between AD and controls. In order to adjust for possible confounding factors, partial correlation coefficient and multivariate linear regressions were used. Although the average of AChE activity in the saliva of people with AD was lower compared to the control group, we found no statistically significant differences using Mann Whitney U test (138 in control group vs. 175 in Alzheimer's patients, p value=0.25). Additionally, no significant differences were observed in the activity of this enzyme in both sexes or with increased age or duration of the disease. After adjusting for age and gender, there was no association between AChE activity and AD (regression coefficient β=0.08; p value= 0.67). Saliva AChE activity was not significantly associated with AD. This study might help in introduce a new diagnostic aid for AD or monitor patients with AD.
ERIC Educational Resources Information Center
Waller, Niels; Jones, Jeff
2011-01-01
We describe methods for assessing all possible criteria (i.e., dependent variables) and subsets of criteria for regression models with a fixed set of predictors, x (where x is an n x 1 vector of independent variables). Our methods build upon the geometry of regression coefficients (hereafter called regression weights) in n-dimensional space. For a…
Neither fixed nor random: weighted least squares meta-regression.
Stanley, T D; Doucouliagos, Hristos
2017-03-01
Our study revisits and challenges two core conventional meta-regression estimators: the prevalent use of 'mixed-effects' or random-effects meta-regression analysis and the correction of standard errors that defines fixed-effects meta-regression analysis (FE-MRA). We show how and explain why an unrestricted weighted least squares MRA (WLS-MRA) estimator is superior to conventional random-effects (or mixed-effects) meta-regression when there is publication (or small-sample) bias that is as good as FE-MRA in all cases and better than fixed effects in most practical applications. Simulations and statistical theory show that WLS-MRA provides satisfactory estimates of meta-regression coefficients that are practically equivalent to mixed effects or random effects when there is no publication bias. When there is publication selection bias, WLS-MRA always has smaller bias than mixed effects or random effects. In practical applications, an unrestricted WLS meta-regression is likely to give practically equivalent or superior estimates to fixed-effects, random-effects, and mixed-effects meta-regression approaches. However, random-effects meta-regression remains viable and perhaps somewhat preferable if selection for statistical significance (publication bias) can be ruled out and when random, additive normal heterogeneity is known to directly affect the 'true' regression coefficient. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Bhamidipati, Ravi Kanth; Syed, Muzeeb; Mullangi, Ramesh; Srinivas, Nuggehally
2018-02-01
1. Dalbavancin, a lipoglycopeptide, is approved for treating gram-positive bacterial infections. Area under plasma concentration versus time curve (AUC inf ) of dalbavancin is a key parameter and AUC inf /MIC ratio is a critical pharmacodynamic marker. 2. Using end of intravenous infusion concentration (i.e. C max ) C max versus AUC inf relationship for dalbavancin was established by regression analyses (i.e. linear, log-log, log-linear and power models) using 21 pairs of subject data. 3. The predictions of the AUC inf were performed using published C max data by application of regression equations. The quotient of observed/predicted values rendered fold difference. The mean absolute error (MAE)/root mean square error (RMSE) and correlation coefficient (r) were used in the assessment. 4. MAE and RMSE values for the various models were comparable. The C max versus AUC inf exhibited excellent correlation (r > 0.9488). The internal data evaluation showed narrow confinement (0.84-1.14-fold difference) with a RMSE < 10.3%. The external data evaluation showed that the models predicted AUC inf with a RMSE of 3.02-27.46% with fold difference largely contained within 0.64-1.48. 5. Regardless of the regression models, a single time point strategy of using C max (i.e. end of 30-min infusion) is amenable as a prospective tool for predicting AUC inf of dalbavancin in patients.
Inter-annual and spatial variability of Hamon potential evapotranspiration model coefficients
McCabe, Gregory J.; Hay, Lauren E.; Bock, Andy; Markstrom, Steven L.; Atkinson, R. Dwight
2015-01-01
Monthly calibrated values of the Hamon PET coefficient (C) are determined for 109,951 hydrologic response units (HRUs) across the conterminous United States (U.S.). The calibrated coefficient values are determined by matching calculated mean monthly Hamon PET to mean monthly free-water surface evaporation. For most locations and months the calibrated coefficients are larger than the standard value reported by Hamon. The largest changes in the coefficients were for the late winter/early spring and fall months, whereas the smallest changes were for the summer months. Comparisons of PET computed using the standard value of C and computed using calibrated values of C indicate that for most of the conterminous U.S. PET is underestimated using the standard Hamon PET coefficient, except for the southeastern U.S.
Zhang, Hua; Kurgan, Lukasz
2014-12-01
Knowledge of protein flexibility is vital for deciphering the corresponding functional mechanisms. This knowledge would help, for instance, in improving computational drug design and refinement in homology-based modeling. We propose a new predictor of the residue flexibility, which is expressed by B-factors, from protein chains that use local (in the chain) predicted (or native) relative solvent accessibility (RSA) and custom-derived amino acid (AA) alphabets. Our predictor is implemented as a two-stage linear regression model that uses RSA-based space in a local sequence window in the first stage and a reduced AA pair-based space in the second stage as the inputs. This method is easy to comprehend explicit linear form in both stages. Particle swarm optimization was used to find an optimal reduced AA alphabet to simplify the input space and improve the prediction performance. The average correlation coefficients between the native and predicted B-factors measured on a large benchmark dataset are improved from 0.65 to 0.67 when using the native RSA values and from 0.55 to 0.57 when using the predicted RSA values. Blind tests that were performed on two independent datasets show consistent improvements in the average correlation coefficients by a modest value of 0.02 for both native and predicted RSA-based predictions.
QSAR models for thiophene and imidazopyridine derivatives inhibitors of the Polo-Like Kinase 1.
Comelli, Nieves C; Duchowicz, Pablo R; Castro, Eduardo A
2014-10-01
The inhibitory activity of 103 thiophene and 33 imidazopyridine derivatives against Polo-Like Kinase 1 (PLK1) expressed as pIC50 (-logIC50) was predicted by QSAR modeling. Multivariate linear regression (MLR) was employed to model the relationship between 0D and 3D molecular descriptors and biological activities of molecules using the replacement method (MR) as variable selection tool. The 136 compounds were separated into several training and test sets. Two splitting approaches, distribution of biological data and structural diversity, and the statistical experimental design procedure D-optimal distance were applied to the dataset. The significance of the training set models was confirmed by statistically higher values of the internal leave one out cross-validated coefficient of determination (Q2) and external predictive coefficient of determination for the test set (Rtest2). The model developed from a training set, obtained with the D-optimal distance protocol and using 3D descriptor space along with activity values, separated chemical features that allowed to distinguish high and low pIC50 values reasonably well. Then, we verified that such model was sufficient to reliably and accurately predict the activity of external diverse structures. The model robustness was properly characterized by means of standard procedures and their applicability domain (AD) was analyzed by leverage method. Copyright © 2014 Elsevier B.V. All rights reserved.
Interpreting Bivariate Regression Coefficients: Going beyond the Average
ERIC Educational Resources Information Center
Halcoussis, Dennis; Phillips, G. Michael
2010-01-01
Statistics, econometrics, investment analysis, and data analysis classes often review the calculation of several types of averages, including the arithmetic mean, geometric mean, harmonic mean, and various weighted averages. This note shows how each of these can be computed using a basic regression framework. By recognizing when a regression model…
Precision Efficacy Analysis for Regression.
ERIC Educational Resources Information Center
Brooks, Gordon P.
When multiple linear regression is used to develop a prediction model, sample size must be large enough to ensure stable coefficients. If the derivation sample size is inadequate, the model may not predict well for future subjects. The precision efficacy analysis for regression (PEAR) method uses a cross- validity approach to select sample sizes…
Manafiazar, G; McFadden, T; Goonewardene, L; Okine, E; Basarab, J; Li, P; Wang, Z
2013-01-01
Residual Feed Intake (RFI) is a measure of energy efficiency. Developing an appropriate model to predict expected energy intake while accounting for multifunctional energy requirements of metabolic body weight (MBW), empty body weight (EBW), milk production energy requirements (MPER), and their nonlinear lactation profiles, is the key to successful prediction of RFI in dairy cattle. Individual daily actual energy intake and monthly body weight of 281 first-lactation dairy cows from 1 to 305 d in milk were recorded at the Dairy Research and Technology Centre of the University of Alberta (Edmonton, AB, Canada); individual monthly milk yield and compositions were obtained from the Dairy Herd Improvement Program. Combinations of different orders (1-5) of fixed (F) and random (R) factors were fitted using Legendre polynomial regression to model the nonlinear lactation profiles of MBW, EBW, and MPER over 301 d. The F5R3, F5R3, and F5R2 (subscripts indicate the order fitted) models were selected, based on the combination of the log-likelihood ratio test and the Bayesian information criterion, as the best prediction equations for MBW, EBW, and MPER, respectively. The selected models were used to predict daily individual values for these traits. To consider the body reserve changes, the differences of predicted EBW between 2 consecutive days were considered as the EBW change between these days. The smoothed total 301-d actual energy intake was then linearly regressed on the total 301-d predicted traits of MBW, EBW change, and MPER to obtain the first-lactation RFI (coefficient of determination=0.68). The mean of predicted daily average lactation RFI was 0 and ranged from -6.58 to 8.64 Mcal of NE(L)/d. Fifty-one percent of the animals had an RFI value below the mean (efficient) and 49% of them had an RFI value above the mean (inefficient). These results indicate that the first-lactation RFI can be predicted from its component traits with a reasonable coefficient of determination. The predicted RFI could be used in the dairy breeding program to increase profitability by selecting animals that are genetically superior in energy efficiency based on RFI, or through routinely measured traits, which are genetically correlated with RFI. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
40 CFR 53.34 - Test procedure for methods for PM10 and Class I methods for PM2.5.
Code of Federal Regulations, 2011 CFR
2011-07-01
... linear regression parameters (slope, intercept, and correlation coefficient) describing the relationship... correlation coefficient. (2) To pass the test for comparability, the slope, intercept, and correlation...
Su, Liyun; Zhao, Yanyong; Yan, Tianshun; Li, Fenglan
2012-01-01
Multivariate local polynomial fitting is applied to the multivariate linear heteroscedastic regression model. Firstly, the local polynomial fitting is applied to estimate heteroscedastic function, then the coefficients of regression model are obtained by using generalized least squares method. One noteworthy feature of our approach is that we avoid the testing for heteroscedasticity by improving the traditional two-stage method. Due to non-parametric technique of local polynomial estimation, it is unnecessary to know the form of heteroscedastic function. Therefore, we can improve the estimation precision, when the heteroscedastic function is unknown. Furthermore, we verify that the regression coefficients is asymptotic normal based on numerical simulations and normal Q-Q plots of residuals. Finally, the simulation results and the local polynomial estimation of real data indicate that our approach is surely effective in finite-sample situations.
NASA Technical Reports Server (NTRS)
Stolzer, Alan J.; Halford, Carl
2007-01-01
In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.
NASA Astrophysics Data System (ADS)
Mitra, Ashis; Majumdar, Prabal Kumar; Bannerjee, Debamalya
2013-03-01
This paper presents a comparative analysis of two modeling methodologies for the prediction of air permeability of plain woven handloom cotton fabrics. Four basic fabric constructional parameters namely ends per inch, picks per inch, warp count and weft count have been used as inputs for artificial neural network (ANN) and regression models. Out of the four regression models tried, interaction model showed very good prediction performance with a meager mean absolute error of 2.017 %. However, ANN models demonstrated superiority over the regression models both in terms of correlation coefficient and mean absolute error. The ANN model with 10 nodes in the single hidden layer showed very good correlation coefficient of 0.982 and 0.929 and mean absolute error of only 0.923 and 2.043 % for training and testing data respectively.
Correlation and prediction of dynamic human isolated joint strength from lean body mass
NASA Technical Reports Server (NTRS)
Pandya, Abhilash K.; Hasson, Scott M.; Aldridge, Ann M.; Maida, James C.; Woolford, Barbara J.
1992-01-01
A relationship between a person's lean body mass and the amount of maximum torque that can be produced with each isolated joint of the upper extremity was investigated. The maximum dynamic isolated joint torque (upper extremity) on 14 subjects was collected using a dynamometer multi-joint testing unit. These data were reduced to a table of coefficients of second degree polynomials, computed using a least squares regression method. All the coefficients were then organized into look-up tables, a compact and convenient storage/retrieval mechanism for the data set. Data from each joint, direction and velocity, were normalized with respect to that joint's average and merged into files (one for each curve for a particular joint). Regression was performed on each one of these files to derive a table of normalized population curve coefficients for each joint axis, direction, and velocity. In addition, a regression table which included all upper extremity joints was built which related average torque to lean body mass for an individual. These two tables are the basis of the regression model which allows the prediction of dynamic isolated joint torques from an individual's lean body mass.
Korany, Mohamed A; Gazy, Azza A; Khamis, Essam F; Ragab, Marwa A A; Kamal, Miranda F
2018-06-01
This study outlines two robust regression approaches, namely least median of squares (LMS) and iteratively re-weighted least squares (IRLS) to investigate their application in instrument analysis of nutraceuticals (that is, fluorescence quenching of merbromin reagent upon lipoic acid addition). These robust regression methods were used to calculate calibration data from the fluorescence quenching reaction (∆F and F-ratio) under ideal or non-ideal linearity conditions. For each condition, data were treated using three regression fittings: Ordinary Least Squares (OLS), LMS and IRLS. Assessment of linearity, limits of detection (LOD) and quantitation (LOQ), accuracy and precision were carefully studied for each condition. LMS and IRLS regression line fittings showed significant improvement in correlation coefficients and all regression parameters for both methods and both conditions. In the ideal linearity condition, the intercept and slope changed insignificantly, but a dramatic change was observed for the non-ideal condition and linearity intercept. Under both linearity conditions, LOD and LOQ values after the robust regression line fitting of data were lower than those obtained before data treatment. The results obtained after statistical treatment indicated that the linearity ranges for drug determination could be expanded to lower limits of quantitation by enhancing the regression equation parameters after data treatment. Analysis results for lipoic acid in capsules, using both fluorimetric methods, treated by parametric OLS and after treatment by robust LMS and IRLS were compared for both linearity conditions. Copyright © 2018 John Wiley & Sons, Ltd.
Li, Zhenghua; Cheng, Fansheng; Xia, Zhining
2011-01-01
The chemical structures of 114 polycyclic aromatic sulfur heterocycles (PASHs) have been studied by molecular electronegativity-distance vector (MEDV). The linear relationships between gas chromatographic retention index and the MEDV have been established by a multiple linear regression (MLR) model. The results of variable selection by stepwise multiple regression (SMR) and the powerful predictive abilities of the optimization model appraised by leave-one-out cross-validation showed that the optimization model with the correlation coefficient (R) of 0.994 7 and the cross-validated correlation coefficient (Rcv) of 0.994 0 possessed the best statistical quality. Furthermore, when the 114 PASHs compounds were divided into calibration and test sets in the ratio of 2:1, the statistical analysis showed our models possesses almost equal statistical quality, the very similar regression coefficients and the good robustness. The quantitative structure-retention relationship (QSRR) model established may provide a convenient and powerful method for predicting the gas chromatographic retention of PASHs.
Cerebrospinal fluid norepinephrine and cognition in subjects across the adult age span
Wang, Lucy Y.; Murphy, Richard R.; Hanscom, Brett; Li, Ge; Millard, Steven P.; Petrie, Eric C.; Galasko, Douglas R.; Sikkema, Carl; Raskind, Murray A.; Wilkinson, Charles W.; Peskind, Elaine R.
2013-01-01
Adequate central nervous system noradrenergic activity enhances cognition, but excessive noradrenergic activity may have adverse effects on cognition. Previous studies have also demonstrated that noradrenergic activity is higher in older than younger adults. We aimed to determine relationships between cerebrospinal fluid (CSF) norepinephrine (NE) concentration and cognitive performance by using data from a CSF bank that includes samples from 258 cognitively normal participants aged 21–100 years. After adjusting for age, gender, education, and ethnicity, higher CSF NE levels (units of 100 pg/mL) are associated with poorer performance on tests of attention, processing speed, and executive function (Trail Making A: regression coefficient 1.5, standard error [SE] 0.77, p = 0.046; Trail Making B: regression coefficient 5.0, SE 2.2, p = 0.024; Stroop Word-Color Interference task: regression coefficient 6.1, SE 2.0, p = 0.003). Findings are consistent with the earlier literature relating excess noradrenergic activity with cognitive impairment. PMID:23639207
Cerebrospinal fluid norepinephrine and cognition in subjects across the adult age span.
Wang, Lucy Y; Murphy, Richard R; Hanscom, Brett; Li, Ge; Millard, Steven P; Petrie, Eric C; Galasko, Douglas R; Sikkema, Carl; Raskind, Murray A; Wilkinson, Charles W; Peskind, Elaine R
2013-10-01
Adequate central nervous system noradrenergic activity enhances cognition, but excessive noradrenergic activity may have adverse effects on cognition. Previous studies have also demonstrated that noradrenergic activity is higher in older than younger adults. We aimed to determine relationships between cerebrospinal fluid (CSF) norepinephrine (NE) concentration and cognitive performance by using data from a CSF bank that includes samples from 258 cognitively normal participants aged 21-100 years. After adjusting for age, gender, education, and ethnicity, higher CSF NE levels (units of 100 pg/mL) are associated with poorer performance on tests of attention, processing speed, and executive function (Trail Making A: regression coefficient 1.5, standard error [SE] 0.77, p = 0.046; Trail Making B: regression coefficient 5.0, SE 2.2, p = 0.024; Stroop Word-Color Interference task: regression coefficient 6.1, SE 2.0, p = 0.003). Findings are consistent with the earlier literature relating excess noradrenergic activity with cognitive impairment. Published by Elsevier Inc.
Visentin, G; McDermott, A; McParland, S; Berry, D P; Kenny, O A; Brodkorb, A; Fenelon, M A; De Marchi, M
2015-09-01
Rapid, cost-effective monitoring of milk technological traits is a significant challenge for dairy industries specialized in cheese manufacturing. The objective of the present study was to investigate the ability of mid-infrared spectroscopy to predict rennet coagulation time, curd-firming time, curd firmness at 30 and 60min after rennet addition, heat coagulation time, casein micelle size, and pH in cow milk samples, and to quantify associations between these milk technological traits and conventional milk quality traits. Samples (n=713) were collected from 605 cows from multiple herds; the samples represented multiple breeds, stages of lactation, parities, and milking times. Reference analyses were undertaken in accordance with standardized methods, and mid-infrared spectra in the range of 900 to 5,000cm(-1) were available for all samples. Prediction models were developed using partial least squares regression, and prediction accuracy was based on both cross and external validation. The proportion of variance explained by the prediction models in external validation was greatest for pH (71%), followed by rennet coagulation time (55%) and milk heat coagulation time (46%). Models to predict curd firmness 60min from rennet addition and casein micelle size, however, were poor, explaining only 25 and 13%, respectively, of the total variance in each trait within external validation. On average, all prediction models tended to be unbiased. The linear regression coefficient of the reference value on the predicted value varied from 0.17 (casein micelle size regression model) to 0.83 (pH regression model) but all differed from 1. The ratio performance deviation of 1.07 (casein micelle size prediction model) to 1.79 (pH prediction model) for all prediction models in the external validation was <2, suggesting that none of the prediction models could be used for analytical purposes. With the exception of casein micelle size and curd firmness at 60min after rennet addition, the developed prediction models may be useful as a screening method, because the concordance correlation coefficient ranged from 0.63 (heat coagulation time prediction model) to 0.84 (pH prediction model) in the external validation. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Willingness-to-pay for schistosomiasis-related health outcomes in Kenya.
Kirigia, J M; Sambo, L G; Kainyu, L H
2000-01-01
Cost-benefit analysis (CBA) provides a framework for identifying, quantifying, and valuing in monetary terms all the important costs and consequences to society of competing disease interventions. Thus, CBA requires that impacts of schistosomiasis interventions on beneficiaries'health be valued in monetary terms Economic theory requires the use of the willingness to pay (WTP) approach in valuation of changes in health as a result of intervention. It is the only approach which is consistent with the potential Pareto improvement principle, and hence, consistent with CBA. The present study developed a health outcome measure and tested its operational feasibility. Contingent valuation for certain return to normal health from various health states, and for remaining in one's current health state were elicited through direct interview of randomly selected rice farmers, teachers, and health personnel in Kenya. The WTP to avoid risk of advancing to the next more severe state, seemed to be higher than WTP for a return to normal health. Generally, there was a significant difference between the average WTP values of farmers, teachers and health personnel populations. The gender and occupation variable coefficients were positive and highly significant in all regressions. The coefficients of the other explanatory variables were generally not statistically significant, indicating that medical expenses, anxiety cost, loss of earnings, and loss of work time, implied in various health states descriptions did not have significant effect on respondents expressed WTP values. The latter finding shows that there is need for more research to identify the other (besides gender and occupation) determinants of expressed WTP values in Africa. This study has demonstrated that it is possible to elicit coherent WTP values from economically under-developed countries. Further empirical work is clearly needed to at least address the validity and reliability of the contingent valuation approach and its measurements in Africa.
Riley, D G; Gill, C A; Boldt, C R; Funkhouser, R R; Herring, A D; Riggs, P K; Sawyer, J E; Lunt, D K; Sanders, J O
2016-04-01
cattle often have the reputation for a poor or dangerous temperament. Identification of genomic regions that associate with temperament of such cattle may be useful for genetic improvement strategies. The objectives of this study were to evaluate subjective temperament scores (1 to 9; higher scores indicated more unfavorable temperament) for aggressiveness, nervousness, flightiness, gregariousness, and overall temperament of one-half steers in feedlot conditions at 1 yr of age and compare those scores of those steers when evaluated approximately 1 mo postweaning, and conduct whole genome association analyses using SNP markers and the temperament traits of those steers at 1 yr of age and for temperament traits of all calves at weaning. Contemporary groups ( < 0.001) were steers born in the same year and season, and fed in the same feedlot pen. Aggressiveness of steers at 1 yr of age was not associated with aggressiveness at weaning (linear regression coefficient did not differ from 0; = 0.96), but regressions of all other yearling scores of steers on the scores at weaning were positive (coefficients ranged from 0.26 ± 0.04 to 0.32 ± 0.04; < 0.001). Estimates of Pearson correlation coefficients (using unadjusted values and residual values) of the different traits measured at 1 yr of age were large ( > 0.63; < 0.008) except for aggressiveness with nervousness, flightiness, or gregariousness, which did not differ from 0 ( > 0.1). Five SNP on BTA 1, 24, and 29 had suggestive associations (0.17 < [adjusted for FDR] < 0.24) with aggressiveness, nervousness, or flightiness at evaluation postweaning and 13 SNP on 11 chromosomes had suggestive associations (0.07 < [adjusted for FDR] < 0.24) with aggressiveness, nervousness, flightiness, or overall temperament score of steers at 1 yr of age. Genes close to these loci with roles in neural systems of various organisms included synaptotagmin 4 (BTA 24), FAT atypical cadhedrin 3 (BTA 29), tubulin tyrosine ligase-like 1 (BTA 5), spermatogenesis associated 17 (BTA 16), stanniocalcin 2 (BTA 20), and GABA receptor γ 3 (BTA 21).
Huang, Shi; MacKinnon, David P.; Perrino, Tatiana; Gallo, Carlos; Cruden, Gracelyn; Brown, C Hendricks
2016-01-01
Mediation analysis often requires larger sample sizes than main effect analysis to achieve the same statistical power. Combining results across similar trials may be the only practical option for increasing statistical power for mediation analysis in some situations. In this paper, we propose a method to estimate: 1) marginal means for mediation path a, the relation of the independent variable to the mediator; 2) marginal means for path b, the relation of the mediator to the outcome, across multiple trials; and 3) the between-trial level variance-covariance matrix based on a bivariate normal distribution. We present the statistical theory and an R computer program to combine regression coefficients from multiple trials to estimate a combined mediated effect and confidence interval under a random effects model. Values of coefficients a and b, along with their standard errors from each trial are the input for the method. This marginal likelihood based approach with Monte Carlo confidence intervals provides more accurate inference than the standard meta-analytic approach. We discuss computational issues, apply the method to two real-data examples and make recommendations for the use of the method in different settings. PMID:28239330
Wang, S; Martinez-Lage, M; Sakai, Y; Chawla, S; Kim, S G; Alonso-Basanta, M; Lustig, R A; Brem, S; Mohan, S; Wolf, R L; Desai, A; Poptani, H
2016-01-01
Early assessment of treatment response is critical in patients with glioblastomas. A combination of DTI and DSC perfusion imaging parameters was evaluated to distinguish glioblastomas with true progression from mixed response and pseudoprogression. Forty-one patients with glioblastomas exhibiting enhancing lesions within 6 months after completion of chemoradiation therapy were retrospectively studied. All patients underwent surgery after MR imaging and were histologically classified as having true progression (>75% tumor), mixed response (25%-75% tumor), or pseudoprogression (<25% tumor). Mean diffusivity, fractional anisotropy, linear anisotropy coefficient, planar anisotropy coefficient, spheric anisotropy coefficient, and maximum relative cerebral blood volume values were measured from the enhancing tissue. A multivariate logistic regression analysis was used to determine the best model for classification of true progression from mixed response or pseudoprogression. Significantly elevated maximum relative cerebral blood volume, fractional anisotropy, linear anisotropy coefficient, and planar anisotropy coefficient and decreased spheric anisotropy coefficient were observed in true progression compared with pseudoprogression (P < .05). There were also significant differences in maximum relative cerebral blood volume, fractional anisotropy, planar anisotropy coefficient, and spheric anisotropy coefficient measurements between mixed response and true progression groups. The best model to distinguish true progression from non-true progression (pseudoprogression and mixed) consisted of fractional anisotropy, linear anisotropy coefficient, and maximum relative cerebral blood volume, resulting in an area under the curve of 0.905. This model also differentiated true progression from mixed response with an area under the curve of 0.901. A combination of fractional anisotropy and maximum relative cerebral blood volume differentiated pseudoprogression from nonpseudoprogression (true progression and mixed) with an area under the curve of 0.807. DTI and DSC perfusion imaging can improve accuracy in assessing treatment response and may aid in individualized treatment of patients with glioblastomas. © 2016 by American Journal of Neuroradiology.
Shen, Jin-Jing; Gong, Xing-Chu; Pan, Jian-Yang; Qu, Hai-Bin
2017-03-01
Design space approach was applied in this study to optimize the lime milk precipitation process of Lonicera Japonica (Jinyinhua) aqueous extract. The evaluation indices for this process were total organic acid purity and amounts of 6 organic acids obtained from per unit mass of medicinal materials. Four critical process parameters (CPPs) including drop speed of lime milk, pH value after adding lime milk, settling time and settling temperature were identified by using the weighted standardized partial regression coefficient method. Quantitative models between process evaluation indices and CPPs were established by a stepwise regression analysis. A design space was calculated by a Monte-Carlo simulation method, and then verified. The verification test results showed that the operation within the design space can guarantee the stability of the lime milk precipitation process. The recommended normal operation space is as follows: drop speed of lime milk of 1.00-1.25 mL•min⁻¹, pH value of 11.5-11.7, settling time of 1.0-1.2 h, and settling temperature of 10-20 ℃.. Copyright© by the Chinese Pharmaceutical Association.
Wen, L; Bowen, C R; Hartman, G L
2017-10-01
Dispersal of urediniospores by wind is the primary means of spread for Phakopsora pachyrhizi, the cause of soybean rust. Our research focused on the short-distance movement of urediniospores from within the soybean canopy and up to 61 m from field-grown rust-infected soybean plants. Environmental variables were used to develop and compare models including the least absolute shrinkage and selection operator regression, zero-inflated Poisson/regular Poisson regression, random forest, and neural network to describe deposition of urediniospores collected in passive and active traps. All four models identified distance of trap from source, humidity, temperature, wind direction, and wind speed as the five most important variables influencing short-distance movement of urediniospores. The random forest model provided the best predictions, explaining 76.1 and 86.8% of the total variation in the passive- and active-trap datasets, respectively. The prediction accuracy based on the correlation coefficient (r) between predicted values and the true values were 0.83 (P < 0.0001) and 0.94 (P < 0.0001) for the passive and active trap datasets, respectively. Overall, multiple machine learning techniques identified the most important variables to make the most accurate predictions of movement of P. pachyrhizi urediniospores short-distance.
[Gaussian process regression and its application in near-infrared spectroscopy analysis].
Feng, Ai-Ming; Fang, Li-Min; Lin, Min
2011-06-01
Gaussian process (GP) is applied in the present paper as a chemometric method to explore the complicated relationship between the near infrared (NIR) spectra and ingredients. After the outliers were detected by Monte Carlo cross validation (MCCV) method and removed from dataset, different preprocessing methods, such as multiplicative scatter correction (MSC), smoothing and derivate, were tried for the best performance of the models. Furthermore, uninformative variable elimination (UVE) was introduced as a variable selection technique and the characteristic wavelengths obtained were further employed as input for modeling. A public dataset with 80 NIR spectra of corn was introduced as an example for evaluating the new algorithm. The optimal models for oil, starch and protein were obtained by the GP regression method. The performance of the final models were evaluated according to the root mean square error of calibration (RMSEC), root mean square error of cross-validation (RMSECV), root mean square error of prediction (RMSEP) and correlation coefficient (r). The models give good calibration ability with r values above 0.99 and the prediction ability is also satisfactory with r values higher than 0.96. The overall results demonstrate that GP algorithm is an effective chemometric method and is promising for the NIR analysis.
Noncontact analysis of the fiber weight per unit area in prepreg by near-infrared spectroscopy.
Jiang, B; Huang, Y D
2008-05-26
The fiber weight per unit area in prepreg is an important factor to ensure the quality of the composite products. Near-infrared spectroscopy (NIRS) technology together with a noncontact reflectance sources has been applied for quality analysis of the fiber weight per unit area. The range of the unit area fiber weight was 13.39-14.14mgcm(-2). The regression method was employed by partial least squares (PLS) and principal components regression (PCR). The calibration model was developed by 55 samples to determine the fiber weight per unit area in prepreg. The determination coefficient (R(2)), root mean square error of calibration (RMSEC) and root mean square error of prediction (RMSEP) were 0.82, 0.092, 0.099, respectively. The predicted values of the fiber weight per unit area in prepreg measured by NIRS technology were comparable to the values obtained by the reference method. For this technology, the noncontact reflectance sources focused directly on the sample with neither previous treatment nor manipulation. The results of the paired t-test revealed that there was no significant difference between the NIR method and the reference method. Besides, the prepreg could be analyzed one time within 20s without sample destruction.
Köhler, A; King, R; Bahls, M; Groß, S; Steveling, A; Gärtner, S; Schipf, S; Gläser, S; Völzke, H; Felix, S B; Markus, M R P; Dörr, M
2018-01-18
Peak oxygen uptake (VO2peak) is commonly indexed by total body weight (TBW) to determine cardiopulmonary fitness (CPF). This approach may lead to misinterpretation, particularly in obese subjects. We investigated the normalization of VO2peak by different body composition markers. We analyzed combined data of 3848 subjects (1914 women; 49.7%), aged 20-90, from two independent cohorts of the population-based Study of Health in Pomerania (SHIP-2 and SHIP-TREND). VO2peak was assessed by cardiopulmonary exercise testing. Body cell mass (BCM), fat-free mass (FFM), and fat mass (FM) were determined by bioelectrical impedance analysis. The suitability of the different markers as a normalization variable was evaluated by taking into account correlation coefficients (r) and intercept (α-coefficient) values from linear regression models. A combination of high r and low α values was considered as preferable for normalization purposes. BCM was the best normalization variable for VO2peak (r = .72; P ≤ .001; α-coefficient = 63.3 mL/min; 95% confidence interval [CI]: 3.48-123) followed by FFM (r = .63; P ≤ .001; α-coefficient = 19.6 mL/min; 95% CI: -57.9-97.0). On the other hand, a much weaker correlation and a markedly higher intercept were found for TBW (r = .42; P ≤ .001; α-coefficient = 579 mL/min; 95% CI: 483 to 675). Likewise, FM was also identified as a poor normalization variable (r = .10; P ≤ .001; α-coefficient = 2133; 95% CI: 2074-2191). Sex-stratified analyses confirmed the above order for the different normalization variables. Our results suggest that BCM, followed by FFM, might be the most appropriate marker for the normalization of VO2peak when comparing CPF between subjects with different body shape. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Determining Sample Size for Accurate Estimation of the Squared Multiple Correlation Coefficient.
ERIC Educational Resources Information Center
Algina, James; Olejnik, Stephen
2000-01-01
Discusses determining sample size for estimation of the squared multiple correlation coefficient and presents regression equations that permit determination of the sample size for estimating this parameter for up to 20 predictor variables. (SLD)
Prevalence and Consequences of the Proximal Junctional Kyphosis After Spinal Deformity Surgery
Yan, Chunda; Li, Yong; Yu, Zhange
2016-01-01
Abstract The aim of this study was to estimate the prevalence and patient outcomes of proximal junctional kyphosis (PJK) in pediatric patients and adolescents who received surgical interventions for the treatment of a spinal deformity. Literature was searched in electronic databases, and studies were selected by following précised eligibility criteria. Percent prevalence values of the PJK in individual studies were pooled to achieve a weighted effect size under the random effects model. Subgroup and meta-regression analyses were performed to appraise the factors affecting PJK prevalence. Twenty-six studies (2024 patients) were included in this meta-analysis. Average age of the patients was 13.8 ± 2.75 years of which 32 ± 20 % were males. Average follow-up was 51.6 ± 38.8 (range 17 ± 13 to 218 ± 60) months. Overall, the percent prevalence of PJK (95% confidence interval) was 11.02 (10.5, 11.5) %; P < 0.00001 which was inversely associated with age (meta-regression coefficient: –1.607 [–2.86, –0.36]; 0.014). Revision surgery rate in the patients with PJK was 10%. The prevalence of PJK was positively associated with the proximal junctional angle at last follow-up (coefficient: 2.248; P = 0.012) and the change in the proximal junctional angle from surgery to last follow-up (coefficient: 2.139; P = 0.014) but not with preoperative proximal junctional angle. The prevalence of PJK in the children and adolescent patients is 11%. About 10% of those affected require revision surgery. PMID:27196453
NASA Astrophysics Data System (ADS)
Wang, Daojun; Gong, Jianhua; Ma, Ainai; Li, Wenhang; Wang, Xijun
2005-10-01
There are generally two kinds of approaches to studying geomorphic features in terms of the quantification level and difference of major considerations. One is the earlier qualitative characterization, and the other is the 2-dimension measurement that includes section pattern and projection pattern. With the development of geo-information technology, especially the 3-D geo-visualization and virtual geographic environments (VGE), 3-dimension measurement and dynamic interactive between users and geo-data/geo-graphics can be developed to understand geomorphic features deeply, and to benefit to the effective applications of such features for geographic projects like dam construction. Storage-elevation curve is very useful for site selection of projects and flood dispatching in water conservancy region, but it is just a tool querying one value from the other one. In fact, storage-elevation curve can represent comprehensively the geomorphic features including vertical section, cross section of the stream and the landform nearby. In this paper, we use quadratic regression equation shaped like y = ax2 + bx + c and the DEM data of Hong-Shi-Mao watershed, Zi Chang County, ShaanXi Province, China to find out the relationship between the coefficients of the equation and the geomorphic features based on VGE platform. It's exciting that the coefficient "a" appear to be correlative strongly with the stream scale, and the coefficient "b" may give an index to the valley shape. In the end, we use a sub-basin named Hao-Jia-Gou of the watershed as an application. The result of correlative research about quadratic regression equation and geomorphic features can save computing and improve the efficiency in silt dam systems planning.
Societal Value of Surgery for Facial Reanimation
Su, Peiyi; Ishii, Lisa E.; Joseph, Andrew; Nellis, Jason; Dey, Jacob; Bater, Kristin; Byrne, Patrick J.; Boahene, Kofi D. O.; Ishii, Masaru
2017-01-01
IMPORTANCE Patients with facial paralysis are perceived negatively by society in a number of domains. Society’s perception of the health utility of varying degrees of facial paralysis and the value society places on reconstructive surgery for facial reanimation need to be quantified. OBJECTIVE To measure health state utility of varying degrees of facial paralysis, willingness to pay (WTP) for a repair, and the subsequent value of facial reanimation surgery as perceived by society. DESIGN, SETTING, AND PARTICIPANTS This prospective observational study conducted in an academic tertiary referral center evaluated a group of 348 casual observers who viewed images of faces with unilateral facial paralysis of 3 severity levels (low, medium, and high) categorized by House-Brackmann grade. Structural equation modeling was performed to understand associations among health utility metrics, WTP, and facial perception domains. Data were collected from July 16 to September 26, 2015. MAIN OUTCOMES AND MEASURES Observer-rated (1) quality of life (QOL) using established health utility metrics (standard gamble, time trade-off, and a visual analog scale) and (2) their WTP for surgical repair. RESULTS Among the 348 observers (248 women [71.3%]; 100 men [28.7%]; mean [SD] age, 29.3 [11.6] years), mixed-effects linear regression showed that WTP increased nonlinearly with increasing severity of paralysis. Participants were willing to pay $3487 (95% CI, $2362–$4961) to repair low-grade paralysis, $8571 (95% CI, $6401–$11 234) for medium-grade paralysis, and $20 431 (95% CI, $16 273–$25 317) for high-grade paralysis. The dominant factor affecting the participants’ WTP was perceived QOL. Modeling showed that perceived QOL decreased with paralysis severity (regression coefficient, −0.004; 95% CI, −0.005 to −0.004; P < .001) and increased with attractiveness (regression coefficient, 0.002; 95% CI, 0.002 to 0.003; P < .001). Mean (SD) health utility scores calculated by the standard gamble metric for low- and high-grade paralysis were 0.98 (0.09) and 0.77 (0.25), respectively. Time trade-off and visual analog scale measures were highly correlated. We calculated mean (SD) WTP per quality-adjusted life-year, which ranged from $10 167 ($14 565) to $17 008 ($38 288) for low- to high-grade paralysis, respectively. CONCLUSIONS AND RELEVANCE Society perceives the repair of facial paralysis to be a high-value intervention. Societal WTP increases and perceived health state utility decreases with increasing House-Brackmann grade. This study demonstrates the usefulness of WTP as an objective measure to inform dimensions of disease severity and signal the value society places on proper facial function. LEVEL OF EVIDENCE NA. PMID:27892977
Campbell, J Elliott; Moen, Jeremie C; Ney, Richard A; Schnoor, Jerald L
2008-03-01
Estimates of forest soil organic carbon (SOC) have applications in carbon science, soil quality studies, carbon sequestration technologies, and carbon trading. Forest SOC has been modeled using a regression coefficient methodology that applies mean SOC densities (mass/area) to broad forest regions. A higher resolution model is based on an approach that employs a geographic information system (GIS) with soil databases and satellite-derived landcover images. Despite this advancement, the regression approach remains the basis of current state and federal level greenhouse gas inventories. Both approaches are analyzed in detail for Wisconsin forest soils from 1983 to 2001, applying rigorous error-fixing algorithms to soil databases. Resulting SOC stock estimates are 20% larger when determined using the GIS method rather than the regression approach. Average annual rates of increase in SOC stocks are 3.6 and 1.0 million metric tons of carbon per year for the GIS and regression approaches respectively.
Radon-222 concentrations in ground water and soil gas on Indian reservations in Wisconsin
DeWild, John F.; Krohelski, James T.
1995-01-01
For sites with wells finished in the sand and gravel aquifer, the coefficient of determination (R2) of the regression of concentration of radon-222 in ground water as a function of well depth is 0.003 and the significance level is 0.32, which indicates that there is not a statistically significant relation between radon-222 concentrations in ground water and well depth. The coefficient of determination of the regression of radon-222 in ground water and soil gas is 0.19 and the root mean square error of the regression line is 271 picocuries per liter. Even though the significance level (0.036) indicates a statistical relation, the root mean square error of the regression is so large that the regression equation would not give reliable predictions. Because of an inadequate number of samples, similar statistical analyses could not be performed for sites with wells finished in the crystalline and sedimentary bedrock aquifers.
ERIC Educational Resources Information Center
Kane, Michael T.; Mroch, Andrew A.
2010-01-01
In evaluating the relationship between two measures across different groups (i.e., in evaluating "differential validity") it is necessary to examine differences in correlation coefficients and in regression lines. Ordinary least squares (OLS) regression is the standard method for fitting lines to data, but its criterion for optimal fit…
Incremental Net Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, Michael
2005-01-01
A regular problem in regression analysis is estimating the comparative importance of the predictors in the model. This work considers the 'net effects', or shares of the predictors in the coefficient of the multiple determination, which is a widely used characteristic of the quality of a regression model. Estimation of the net effects can be a…
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins
Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.
2016-01-01
Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins
NASA Astrophysics Data System (ADS)
Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.
2016-12-01
Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Strain Gauge Balance Uncertainty Analysis at NASA Langley: A Technical Review
NASA Technical Reports Server (NTRS)
Tripp, John S.
1999-01-01
This paper describes a method to determine the uncertainties of measured forces and moments from multi-component force balances used in wind tunnel tests. A multivariate regression technique is first employed to estimate the uncertainties of the six balance sensitivities and 156 interaction coefficients derived from established balance calibration procedures. These uncertainties are then employed to calculate the uncertainties of force-moment values computed from observed balance output readings obtained during tests. Confidence and prediction intervals are obtained for each computed force and moment as functions of the actual measurands. Techniques are discussed for separate estimation of balance bias and precision uncertainties.
Determination of organic compounds in water using ultraviolet LED
NASA Astrophysics Data System (ADS)
Kim, Chihoon; Ji, Taeksoo; Eom, Joo Beom
2018-04-01
This paper describes a method of detecting organic compounds in water using an ultraviolet LED (280 nm) spectroscopy system and a photodetector. The LED spectroscopy system showed a high correlation between the concentration of the prepared potassium hydrogen phthalate and that calculated by multiple linear regression, indicating an adjusted coefficient of determination ranging from 0.953-0.993. In addition, a comparison between the performance of the spectroscopy system and the total organic carbon analyzer indicated that the difference in concentration was small. Based on the close correlation between the spectroscopy and photodetector absorbance values, organic measurement with a photodetector could be configured for monitoring.
Effect of plasma spraying modes on material properties of internal combustion engine cylinder liners
NASA Astrophysics Data System (ADS)
Timokhova, O. M.; Burmistrova, O. N.; Sirina, E. A.; Timokhov, R. S.
2018-03-01
The paper analyses different methods of remanufacturing worn-out machine parts in order to get the best performance characteristics. One of the most promising of them is a plasma spraying method. The mathematical models presented in the paper are intended to anticipate the results of plasma spraying, its effect on the properties of the material of internal combustion engine cylinder liners under repair. The experimental data and research results have been computer processed with Statistica 10.0 software package. The pare correlation coefficient values (R) and F-statistic criterion are given to confirm the statistical properties and adequacy of obtained regression equations.
Galindo-Romero, Marta; Lippert, Tristan; Gavrilov, Alexander
2015-12-01
This paper presents an empirical linear equation to predict peak pressure level of anthropogenic impulsive signals based on its correlation with the sound exposure level. The regression coefficients are shown to be weakly dependent on the environmental characteristics but governed by the source type and parameters. The equation can be applied to values of the sound exposure level predicted with a numerical model, which provides a significant improvement in the prediction of the peak pressure level. Part I presents the analysis for airgun arrays signals, and Part II considers the application of the empirical equation to offshore impact piling noise.
Climate patterns as predictors of amphibians species richness and indicators of potential stress
Battaglin, W.; Hay, L.; McCabe, G.; Nanjappa, P.; Gallant, Alisa L.
2005-01-01
Amphibians occupy a range of habitats throughout the world, but species richness is greatest in regions with moist, warm climates. We modeled the statistical relations of anuran and urodele species richness with mean annual climate for the conterminous United States, and compared the strength of these relations at national and regional levels. Model variables were calculated for county and subcounty mapping units, and included 40-year (1960-1999) annual mean and mean annual climate statistics, mapping unit average elevation, mapping unit land area, and estimates of anuran and urodele species richness. Climate data were derived from more than 7,500 first-order and cooperative meteorological stations and were interpolated to the mapping units using multiple linear regression models. Anuran and urodele species richness were calculated from the United States Geological Survey's Amphibian Research and Monitoring Initiative (ARMI) National Atlas for Amphibian Distributions. The national multivariate linear regression (MLR) model of anuran species richness had an adjusted coefficient of determination (R2) value of 0.64 and the national MLR model for urodele species richness had an R2 value of 0.45. Stratifying the United States by coarse-resolution ecological regions provided models for anUrans that ranged in R2 values from 0.15 to 0.78. Regional models for urodeles had R2 values. ranging from 0.27 to 0.74. In general, regional models for anurans were more strongly influenced by temperature variables, whereas precipitation variables had a larger influence on urodele models.
Osteoporosis prediction from the mandible using cone-beam computed tomography
Al Haffar, Iyad; Khattab, Razan
2014-01-01
Purpose This study aimed to evaluate the use of dental cone-beam computed tomography (CBCT) in the diagnosis of osteoporosis among menopausal and postmenopausal women by using only a CBCT viewer program. Materials and Methods Thirty-eight menopausal and postmenopausal women who underwent dual-energy X-ray absorptiometry (DXA) examination for hip and lumbar vertebrae were scanned using CBCT (field of view: 13 cm×15 cm; voxel size: 0.25 mm). Slices from the body of the mandible as well as the ramus were selected and some CBCT-derived variables, such as radiographic density (RD) as gray values, were calculated as gray values. Pearson's correlation, one-way analysis of variance (ANOVA), and accuracy (sensitivity and specificity) evaluation based on linear and logistic regression were performed to choose the variable that best correlated with the lumbar and femoral neck T-scores. Results RD of the whole bone area of the mandible was the variable that best correlated with and predicted both the femoral neck and the lumbar vertebrae T-scores; further, Pearson's correlation coefficients were 0.5/0.6 (p value=0.037/0.009). The sensitivity, specificity, and accuracy based on the logistic regression were 50%, 88.9%, and 78.4%, respectively, for the femoral neck, and 46.2%, 91.3%, and 75%, respectively, for the lumbar vertebrae. Conclusion Lumbar vertebrae and femoral neck osteoporosis can be predicted with high accuracy from the RD value of the body of the mandible by using a CBCT viewer program. PMID:25473633
Lombard, Pamela J.; Hodgkins, Glenn A.
2015-01-01
Regression equations to estimate peak streamflows with 1- to 500-year recurrence intervals (annual exceedance probabilities from 99 to 0.2 percent, respectively) were developed for small, ungaged streams in Maine. Equations presented here are the best available equations for estimating peak flows at ungaged basins in Maine with drainage areas from 0.3 to 12 square miles (mi2). Previously developed equations continue to be the best available equations for estimating peak flows for basin areas greater than 12 mi2. New equations presented here are based on streamflow records at 40 U.S. Geological Survey streamgages with a minimum of 10 years of recorded peak flows between 1963 and 2012. Ordinary least-squares regression techniques were used to determine the best explanatory variables for the regression equations. Traditional map-based explanatory variables were compared to variables requiring field measurements. Two field-based variables—culvert rust lines and bankfull channel widths—either were not commonly found or did not explain enough of the variability in the peak flows to warrant inclusion in the equations. The best explanatory variables were drainage area and percent basin wetlands; values for these variables were determined with a geographic information system. Generalized least-squares regression was used with these two variables to determine the equation coefficients and estimates of accuracy for the final equations.
Maas, Iris L; Nolte, Sandra; Walter, Otto B; Berger, Thomas; Hautzinger, Martin; Hohagen, Fritz; Lutz, Wolfgang; Meyer, Björn; Schröder, Johanna; Späth, Christina; Klein, Jan Philipp; Moritz, Steffen; Rose, Matthias
2017-02-01
To compare treatment effect estimates obtained from a regression discontinuity (RD) design with results from an actual randomized controlled trial (RCT). Data from an RCT (EVIDENT), which studied the effect of an Internet intervention on depressive symptoms measured with the Patient Health Questionnaire (PHQ-9), were used to perform an RD analysis, in which treatment allocation was determined by a cutoff value at baseline (PHQ-9 = 10). A linear regression model was fitted to the data, selecting participants above the cutoff who had received the intervention (n = 317) and control participants below the cutoff (n = 187). Outcome was PHQ-9 sum score 12 weeks after baseline. Robustness of the effect estimate was studied; the estimate was compared with the RCT treatment effect. The final regression model showed a regression coefficient of -2.29 [95% confidence interval (CI): -3.72 to -.85] compared with a treatment effect found in the RCT of -1.57 (95% CI: -2.07 to -1.07). Although the estimates obtained from two designs are not equal, their confidence intervals overlap, suggesting that an RD design can be a valid alternative for RCTs. This finding is particularly important for situations where an RCT may not be feasible or ethical as is often the case in clinical research settings. Copyright © 2016 Elsevier Inc. All rights reserved.
Lim, Kok Bin; Ho, Henry; Foo, Keong Tatt; Wong, Michael Yuet Chen; Fook-Chong, Stephanie
2006-12-01
The aims of this study were to define the relationship between intravesical prostatic protrusion (IPP), prostate-specific antigen (PSA) and prostate volume (PV) and to determine which one of them is the best predictor of bladder outlet obstruction (BOO) due to benign prostatic enlargement. A prospective study of 114 male patients older than 50 years examined between November 2001 and 2002 was performed. They were evaluated with digital rectal examination, International Prostate Symptoms Score, PSA, uroflowmetry, postvoid residual urine measurement, IPP and PV using transabdominal ultrasound scan. Statistical analysis included scatter plot with Spearman's correlation coefficients and nominal logistic regression Prostate volume, IPP and PSA showed parallel correlation. Although all three indices had good correlation with BOO index, IPP was the best. The Spearman rho correlation coefficients were 0.314, 0.408 and 0.507 for PV, PSA and IPP, respectively. Using receiver-operator characteristic curves, the areas under the curve for PV, PSA and IPP were 0.637, 0.703 and 0.772, respectively. The positive predictive values of PV, PSA and IPP were 65%, 68% and 72%, respectively. Using a nominal regression model, IPP remained the most significant independent index to determine BOO. All three non-invasive indices correlate with one another. The study showed that IPP is a better predictor for BOO than PSA or PV.
Machine Learning Estimation of Atom Condensed Fukui Functions.
Zhang, Qingyou; Zheng, Fangfang; Zhao, Tanfeng; Qu, Xiaohui; Aires-de-Sousa, João
2016-02-01
To enable the fast estimation of atom condensed Fukui functions, machine learning algorithms were trained with databases of DFT pre-calculated values for ca. 23,000 atoms in organic molecules. The problem was approached as the ranking of atom types with the Bradley-Terry (BT) model, and as the regression of the Fukui function. Random Forests (RF) were trained to predict the condensed Fukui function, to rank atoms in a molecule, and to classify atoms as high/low Fukui function. Atomic descriptors were based on counts of atom types in spheres around the kernel atom. The BT coefficients assigned to atom types enabled the identification (93-94 % accuracy) of the atom with the highest Fukui function in pairs of atoms in the same molecule with differences ≥0.1. In whole molecules, the atom with the top Fukui function could be recognized in ca. 50 % of the cases and, on the average, about 3 of the top 4 atoms could be recognized in a shortlist of 4. Regression RF yielded predictions for test sets with R(2) =0.68-0.69, improving the ability of BT coefficients to rank atoms in a molecule. Atom classification (as high/low Fukui function) was obtained with RF with sensitivity of 55-61 % and specificity of 94-95 %. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Astrophysics Data System (ADS)
Yadav, Manish; Singh, Nitin Kumar
2017-12-01
A comparison of the linear and non-linear regression method in selecting the optimum isotherm among three most commonly used adsorption isotherms (Langmuir, Freundlich, and Redlich-Peterson) was made to the experimental data of fluoride (F) sorption onto Bio-F at a solution temperature of 30 ± 1 °C. The coefficient of correlation (r2) was used to select the best theoretical isotherm among the investigated ones. A total of four Langmuir linear equations were discussed and out of which linear form of most popular Langmuir-1 and Langmuir-2 showed the higher coefficient of determination (0.976 and 0.989) as compared to other Langmuir linear equations. Freundlich and Redlich-Peterson isotherms showed a better fit to the experimental data in linear least-square method, while in non-linear method Redlich-Peterson isotherm equations showed the best fit to the tested data set. The present study showed that the non-linear method could be a better way to obtain the isotherm parameters and represent the most suitable isotherm. Redlich-Peterson isotherm was found to be the best representative (r2 = 0.999) for this sorption system. It is also observed that the values of β are not close to unity, which means the isotherms are approaching the Freundlich but not the Langmuir isotherm.
Saad, Karen Ruggeri; Colombo, Alexandra S; João, Silvia M Amado
2009-01-01
The purpose of this study was to investigate the reliability and validity of photogrammetry in measuring the lateral spinal inclination angles. Forty subjects (32 female and 8 males) with a mean age of 23.4 +/- 11.2 years had their scoliosis evaluated by radiographs of their trunk, determined by the Cobb angle method, and by photogrammetry. The statistical methods used included Cronbach alpha, Pearson/Spearman correlation coefficients, and regression analyses. The Cronbach alpha values showed that the photogrammetric measures showed high internal consistency, which indicated that the sample was bias free. The radiograph method showed to be more precise with intrarater reliabilities of 0.936, 0.975, and 0.945 for the thoracic, lumbar, and thoracolumbar curves, respectively, and interrater reliabilities of 0.942 and 0.879 for the angular measures of the thoracic and thoracolumbar segments, respectively. The regression analyses revealed a high determination coefficient although limited to the adjusted linear model between the radiographic and photographic measures. It was found that with more severe scoliosis, the lateral curve measures obtained with the photogrammetry were for the thoracic and lumbar regions (R = 0.619 and 0.551). The photogrammetric measures were found to be reproducible in this study and could be used as supplementary information to decrease the number of radiographs necessary for the monitoring of scoliosis.
Lu, Hsueh-Kuan; Chen, Yu-Yawn; Yeh, Chinagwen; Chuang, Chih-Lin; Chiang, Li-Ming; Lai, Chung-Liang; Casebolt, Kevin M; Huang, Ai-Chun; Lin, Wen-Long; Hsieh, Kuen-Chang
2017-08-22
The aim of this study was to evaluate leg-to-leg bioelectrical impedance analysis (LBIA) using a four-contact electrode system for measuring abdominal visceral fat area (VFA). The present study recruited 381 (240 male and 141 female) Chinese participants to compare VFA measurements estimated by a standing LBIA system (VFALBIA) with computerized tomography (CT) scanned at the L4-L5 vertebrae (VFA CT ). The total mean body mass index (BMI) was 24.7 ± 4.2 kg/m 2 . Correlation analysis, regression analysis, Bland-Altman plot, and paired sample t-tests were used to analyze the accuracy of the VFA LBIA . For the total subjects, the regression line was VFA LBIA = 0.698 VFA CT + 29.521, (correlation coefficient (r) = 0.789, standard estimate of error (SEE) = 24.470 cm 2 , p < 0.001), Lin's correlation coefficient (CCC) was 0.785; and the limit of agreement (LOA; mean difference ±2 standard deviation) ranged from -43.950 to 67.951 cm 2 , LOA% (given as a percentage of mean value measured by the CT) was 48.2%. VFA LBIA and VFA CT showed significant difference (p < 0.001). Collectively, the current study indicates that LBIA has limited potential to accurately estimate visceral fat in a clinical setting.
Lv, Shao-Wa; Liu, Dong; Hu, Pan-Pan; Ye, Xu-Yan; Xiao, Hong-Bin; Kuang, Hai-Xue
2010-03-01
To optimize the process of extracting effective constituents from Aralia elata by response surface methodology. The independent variables were ethanol concentration, reflux time and solvent fold, the dependent variable was extraction rate of total saponins in Aralia elata. Linear or no-linear mathematic models were used to estimate the relationship between independent and dependent variables. Response surface methodology was used to optimize the process of extraction. The prediction was carried out through comparing the observed and predicted values. Regression coefficient of binomial fitting complex model was as high as 0.9617, the optimum conditions of extraction process were 70% ethanol, 2.5 hours for reflux, 20-fold solvent and 3 times for extraction. The bias between observed and predicted values was -2.41%. It shows the optimum model is highly predictive.
[Sincerity of effort: isokinetic evaluation of knee extension].
Colombo, R; Demaiti, G; Sartorio, F; Orlandini, D; Vercelli, S; Ferriero, G
2008-01-01
The aim of this study was to find a reliable method to evaluate the sincerity of the muscular maximal effort performed in a dynamometric isokinetic test of knee flexion-extension. The coefficient of variation of the peak torque (CV) and 3 new indices were analysed: (1) the average coefficient of variation calculated on the complete peak torque curve (CVM); (2) the slope of the regression line in an endurance test (PRR); (3) the correlation coefficient of the peak torques in the same endurance test (CCR). Twenty healthy subjects underwent assessment in two different trials, maximal (MX) and 50% submaximal (SMX), with 20 minutes of rest between trials. Each trial consisted of 4 tests, each of 3 repetitions, at angular speed of 30, 180, 30, and 180 degrees/s, respectively, and 1 test of 15 repetitions at 240 degrees/s. Our findings confirmed the ability of CV to detect a high percentage of sincere efforts: at 30 degrees/s Sensibility (Sns)=100% and Specificity (Spc)=70%; at 180 degrees/s Sns=75%, Spc=95%. The 3 new indices here proposed showed high characteristics of Sns and Spc, generally better than those of CV. CVM showed at 180 degrees/s Sns=90% and Spc=100%, while at 30 degrees/s Sns=90%, Spc=75%. PRR was the best index identifying all the efforts, except one (Sns=100%, Spc=95%). The CCR coefficient showed Sns and Spc values both of 90%.
Ramírez-Vélez, Robinson; Correa-Bautista, Jorge Enrique; González-Ruíz, Katherine; Vivas, Andrés; Triana-Reina, Héctor Reynaldo; Martínez-Torres, Javier; Prieto-Benavides, Daniel Humberto; Carrillo, Hugo Alejandro; Ramos-Sepúlveda, Jeison Alexander; Villa-González, Emilio; García-Hermoso, Antonio
2017-01-17
Recently, a body adiposity index (BAI = (hip circumference)/((height)(1.5)) -18 ) was developed and validated in adult populations. The aim of this study was to evaluate the performance of BAI in estimating percentage body fat (BF%) in a sample of Colombian collegiate young adults. The participants were comprised of 903 volunteers (52% females, mean age = 21.4 years ± 3.3). We used the Lin's concordance correlation coefficient, linear regression, Bland-Altman's agreement analysis, concordance correlation coefficient ( ρc ) and the coefficient of determination ( R ²) between BAI, and BF%; by bioelectrical impedance analysis (BIA)). The correlation between the two methods of estimating BF% was R ² = 0.384, p < 0.001. A paired-sample t -test showed a difference between the methods (BIA BF% = 16.2 ± 3.1, BAI BF% = 30.0 ± 5.4%; p < 0.001). For BIA, bias value was 6.0 ± 6.2 BF% (95% confidence interval (CI) = -6.0 to 18.2), indicating that the BAI method overestimated BF% relative to the reference method. Lin's concordance correlation coefficient was poor ( ρc = 0.014, 95% CI = -0.124 to 0.135; p = 0.414). In Colombian college students, there was poor agreement between BAI- and BIA-based estimates of BF%, and so BAI is not accurate in people with low or high body fat percentage levels.
Guo, Ying; Little, Roderick J; McConnell, Daniel S
2012-01-01
Covariate measurement error is common in epidemiologic studies. Current methods for correcting measurement error with information from external calibration samples are insufficient to provide valid adjusted inferences. We consider the problem of estimating the regression of an outcome Y on covariates X and Z, where Y and Z are observed, X is unobserved, but a variable W that measures X with error is observed. Information about measurement error is provided in an external calibration sample where data on X and W (but not Y and Z) are recorded. We describe a method that uses summary statistics from the calibration sample to create multiple imputations of the missing values of X in the regression sample, so that the regression coefficients of Y on X and Z and associated standard errors can be estimated using simple multiple imputation combining rules, yielding valid statistical inferences under the assumption of a multivariate normal distribution. The proposed method is shown by simulation to provide better inferences than existing methods, namely the naive method, classical calibration, and regression calibration, particularly for correction for bias and achieving nominal confidence levels. We also illustrate our method with an example using linear regression to examine the relation between serum reproductive hormone concentrations and bone mineral density loss in midlife women in the Michigan Bone Health and Metabolism Study. Existing methods fail to adjust appropriately for bias due to measurement error in the regression setting, particularly when measurement error is substantial. The proposed method corrects this deficiency.
Cox regression analysis with missing covariates via nonparametric multiple imputation.
Hsu, Chiu-Hsieh; Yu, Mandi
2018-01-01
We consider the situation of estimating Cox regression in which some covariates are subject to missing, and there exists additional information (including observed event time, censoring indicator and fully observed covariates) which may be predictive of the missing covariates. We propose to use two working regression models: one for predicting the missing covariates and the other for predicting the missing probabilities. For each missing covariate observation, these two working models are used to define a nearest neighbor imputing set. This set is then used to non-parametrically impute covariate values for the missing observation. Upon the completion of imputation, Cox regression is performed on the multiply imputed datasets to estimate the regression coefficients. In a simulation study, we compare the nonparametric multiple imputation approach with the augmented inverse probability weighted (AIPW) method, which directly incorporates the two working models into estimation of Cox regression, and the predictive mean matching imputation (PMM) method. We show that all approaches can reduce bias due to non-ignorable missing mechanism. The proposed nonparametric imputation method is robust to mis-specification of either one of the two working models and robust to mis-specification of the link function of the two working models. In contrast, the PMM method is sensitive to misspecification of the covariates included in imputation. The AIPW method is sensitive to the selection probability. We apply the approaches to a breast cancer dataset from Surveillance, Epidemiology and End Results (SEER) Program.
Liu, Quan; Ma, Li; Fan, Shou-Zen; Abbod, Maysam F; Shieh, Jiann-Shing
2018-01-01
Estimating the depth of anaesthesia (DoA) in operations has always been a challenging issue due to the underlying complexity of the brain mechanisms. Electroencephalogram (EEG) signals are undoubtedly the most widely used signals for measuring DoA. In this paper, a novel EEG-based index is proposed to evaluate DoA for 24 patients receiving general anaesthesia with different levels of unconsciousness. Sample Entropy (SampEn) algorithm was utilised in order to acquire the chaotic features of the signals. After calculating the SampEn from the EEG signals, Random Forest was utilised for developing learning regression models with Bispectral index (BIS) as the target. Correlation coefficient, mean absolute error, and area under the curve (AUC) were used to verify the perioperative performance of the proposed method. Validation comparisons with typical nonstationary signal analysis methods (i.e., recurrence analysis and permutation entropy) and regression methods (i.e., neural network and support vector machine) were conducted. To further verify the accuracy and validity of the proposed methodology, the data is divided into four unconsciousness-level groups on the basis of BIS levels. Subsequently, analysis of variance (ANOVA) was applied to the corresponding index (i.e., regression output). Results indicate that the correlation coefficient improved to 0.72 ± 0.09 after filtering and to 0.90 ± 0.05 after regression from the initial values of 0.51 ± 0.17. Similarly, the final mean absolute error dramatically declined to 5.22 ± 2.12. In addition, the ultimate AUC increased to 0.98 ± 0.02, and the ANOVA analysis indicates that each of the four groups of different anaesthetic levels demonstrated significant difference from the nearest levels. Furthermore, the Random Forest output was extensively linear in relation to BIS, thus with better DoA prediction accuracy. In conclusion, the proposed method provides a concrete basis for monitoring patients' anaesthetic level during surgeries.
Peixoto, Alberto Borges; Caldas, Taciana Mara Rodrigues da Cunha; Martins, Wellington P; Da Silva Costa, Fabricio; Araujo Júnior, Edward
2016-10-01
To establish reference values for the amniotic fluid index (AFI) measurement between 26w0d and 41w6d of gestation in a Brazilian population. We performed a cross-sectional study with 1984 low-risk singleton pregnant women between 26w0d and 41w6d of gestation. AFI was measured according to the technique proposed by Phelan et al. Maternal abdomen was divided into four quadrants using the umbilicus and linea nigra as landmarks. Single vertical pocket in each quadrant was measured and the AFI was generated by the sum of these four values without umbilical cord or fetal parts. All ultrasound exams were performed by only two experienced examiners. AFI was expressed as median, interquartile range, mean and ranges in each gestational age (GA) interval. Polynomial regressions were performed to obtain the best fit with adjustment by the determination coefficient (R(2)). Mean of AFI ranged from 14.0 ± 4.1 cm (range, 9.7-14.0) at 26w0d to 8.3 ± 4.7 cm (range, 1.9-16.5) at 41w6d, respectively. The best polynomial regression fit curve was a first-degree: AFI = 16.29-0.125*GA (R(2) = 0.01). According the scatterplot, AFI values practically did not vary with advancing GA. Reference values for the AFI measurement between 26w0d and 41w6d of gestation in a low-risk Brazilian population were established.
Li, Ji; Gray, B.R.; Bates, D.M.
2008-01-01
Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.
Interquantile Shrinkage in Regression Models
Jiang, Liewen; Wang, Huixia Judy; Bondell, Howard D.
2012-01-01
Conventional analysis using quantile regression typically focuses on fitting the regression model at different quantiles separately. However, in situations where the quantile coefficients share some common feature, joint modeling of multiple quantiles to accommodate the commonality often leads to more efficient estimation. One example of common features is that a predictor may have a constant effect over one region of quantile levels but varying effects in other regions. To automatically perform estimation and detection of the interquantile commonality, we develop two penalization methods. When the quantile slope coefficients indeed do not change across quantile levels, the proposed methods will shrink the slopes towards constant and thus improve the estimation efficiency. We establish the oracle properties of the two proposed penalization methods. Through numerical investigations, we demonstrate that the proposed methods lead to estimations with competitive or higher efficiency than the standard quantile regression estimation in finite samples. Supplemental materials for the article are available online. PMID:24363546
Salturk, Cuneyt; Karakurt, Zuhal; Takir, Huriye Berk; Balci, Merih; Kargin, Feyza; Mocin, Ozlem Yazıcıoglu; Gungor, Gokay; Ozmen, Ipek; Oztas, Selahattin; Yalcinsoy, Murat; Evin, Ruya; Ozturk, Murat; Adiguzel, Nalan
2015-01-01
The objective of this study was to compare the change in 6-minute walking distance (6MWD) in 1 year as an indicator of exercise capacity among patients undergoing home non-invasive mechanical ventilation (NIMV) due to chronic hypercapnic respiratory failure (CHRF) caused by different etiologies. This retrospective cohort study was conducted in a tertiary pulmonary disease hospital in patients who had completed 1-year follow-up under home NIMV because of CHRF with different etiologies (ie, chronic obstructive pulmonary disease [COPD], obesity hypoventilation syndrome [OHS], kyphoscoliosis [KS], and diffuse parenchymal lung disease [DPLD]), between January 2011 and January 2012. The results of arterial blood gas (ABG) analyses and spirometry, and 6MWD measurements with 12-month interval were recorded from the patient files, in addition to demographics, comorbidities, and body mass indices. The groups were compared in terms of 6MWD via analysis of variance (ANOVA) and multiple linear regression (MLR) analysis (independent variables: analysis age, sex, baseline 6MWD, baseline forced expiratory volume in 1 second, and baseline partial carbon dioxide pressure, in reference to COPD group). A total of 105 patients with a mean age (± standard deviation) of 61±12 years of whom 37 had COPD, 34 had OHS, 20 had KS, and 14 had DPLD were included in statistical analysis. There were no significant differences between groups in the baseline and delta values of ABG and spirometry findings. Both univariate ANOVA and MLR showed that the OHS group had the lowest baseline 6MWD and the highest decrease in 1 year (linear regression coefficient -24.48; 95% CI -48.74 to -0.21, P=0.048); while the KS group had the best baseline values and the biggest improvement under home NIMV (linear regression coefficient 26.94; 95% CI -3.79 to 57.66, P=0.085). The 6MWD measurements revealed improvement in exercise capacity test in CHRF patients receiving home NIMV treatment on long-term depends on etiological diagnoses.
Latin hypercube approach to estimate uncertainty in ground water vulnerability
Gurdak, J.J.; McCray, J.E.; Thyne, G.; Qi, S.L.
2007-01-01
A methodology is proposed to quantify prediction uncertainty associated with ground water vulnerability models that were developed through an approach that coupled multivariate logistic regression with a geographic information system (GIS). This method uses Latin hypercube sampling (LHS) to illustrate the propagation of input error and estimate uncertainty associated with the logistic regression predictions of ground water vulnerability. Central to the proposed method is the assumption that prediction uncertainty in ground water vulnerability models is a function of input error propagation from uncertainty in the estimated logistic regression model coefficients (model error) and the values of explanatory variables represented in the GIS (data error). Input probability distributions that represent both model and data error sources of uncertainty were simultaneously sampled using a Latin hypercube approach with logistic regression calculations of probability of elevated nonpoint source contaminants in ground water. The resulting probability distribution represents the prediction intervals and associated uncertainty of the ground water vulnerability predictions. The method is illustrated through a ground water vulnerability assessment of the High Plains regional aquifer. Results of the LHS simulations reveal significant prediction uncertainties that vary spatially across the regional aquifer. Additionally, the proposed method enables a spatial deconstruction of the prediction uncertainty that can lead to improved prediction of ground water vulnerability. ?? 2007 National Ground Water Association.
Li, Kai Way; Chen, Chin Jung
2004-11-01
Tread groove design is very common in footwear. However, coefficient of friction (COF) measurements between the footwear material and floor using a slipmeter were commonly performed using flat footwear pads. Such measurements might underestimate the actual slip resistance of the footwear pad. This research investigates the effects of the tread groove width on the measured COF using four footwear materials, three floors, and four liquid-contamination conditions using a Brungraber Mark II slipmeter. The analysis of variance results indicated that the footwear material, floor, contamination conditions, and groove width were all significant (p < 0.0001) factors affecting the measured COF. The hypothesis that wider tread grooves result in higher COF values was true with some exceptions especially on oil contaminated floors. A regression model, with an R2 of 0.91, was established to describe and predict the relationship between the COF and the tread groove width under footwear material/floor/contamination conditions.
Tongue thickness relates to nutritional status in the elderly.
Tamura, Fumiyo; Kikutani, Takeshi; Tohara, Takashi; Yoshida, Mitsuyoshi; Yaegaki, Ken
2012-12-01
Many elderly people under long-term care suffer from malnutrition caused by dysphagia, frequently leading to sarcopenia. Our hypothesis is that sarcopenia may compromise oral function, resulting in dysphagia. The objectives of this study were to evaluate sarcopenia of the lingual muscles by measuring the tongue thickness, and elucidate its relationship with nutritional status. We examined 104 elderly subjects (mean age = 80.3 ± 7.9 years). Anthropometric data, such as triceps skinfold thickness and midarm muscle area (AMA), were obtained. The tongue thickness of the central part was determined using ultrasonography. Measurement was performed twice and the mean value was obtained. The relationship between tongue thickness and nutritional status was analyzed by Pearson's correlation coefficient and Spearman's rank correlation coefficient. AMA and age were identified by multiple-regression analysis as factors influencing tongue thickness. The results of this study suggest that malnutrition may induce sarcopenia not only in the skeletal muscles but also in the tongue.
Correlation of track irregularities and vehicle responses based on measured data
NASA Astrophysics Data System (ADS)
Karis, Tomas; Berg, Mats; Stichel, Sebastian; Li, Martin; Thomas, Dirk; Dirks, Babette
2018-06-01
Track geometry quality and dynamic vehicle response are closely related, but do not always correspond with each other in terms of maximum values and standard deviations. This can often be seen to give poor results in analyses with correlation coefficients or regression analysis. Measured data from both the EU project DynoTRAIN and the Swedish Green Train (Gröna Tåget) research programme is used in this paper to evaluate track-vehicle response for three vehicles. A single degree of freedom model is used as an inspiration to divide track-vehicle interaction into three parts, which are analysed in terms of correlation. One part, the vertical axle box acceleration divided by vehicle speed squared (?) and the second spatial derivative of the vertical track irregularities (?), is shown to be the weak link with lower correlation coefficients than the other parts. Future efforts should therefore be directed towards investigating the relation between axle box accelerations and track irregularity second derivatives.
Analysis of influential factors on haze pollution in China
NASA Astrophysics Data System (ADS)
Liu, Xiao-Hong; Jiang, Keshen
2018-05-01
This study tests the hypothesis of Environmental Kuznets Curve (EKC) between PM10 concentrations and economic growth and analyzes the influential factors of PM10 concentrations from the economic perspective by using the panel data on the PM10 concentrations of 30 provinces from 2003 to 2015 in China. Results of the regression estimation from the fully modified OLS (FMOLS) method show that a relationship characterized by an inverted U-shaped curve is observed between PM10 concentrations and gross domestic product (GDP) per capita and that an EKC exists in China’s haze pollution problem. PM10 concentrations have the most sensitive response to GDP. The elastic coefficients of the possession of civilian vehicles, urbanization and trade openness are positive values. More importantly, the elastic coefficient of the tertiary industry proportion is less than 0. Increase in the proportion of tertiary industry can effectively alleviate China’s problem on haze pollution. Lastly, relevant countermeasures and suggestions are presented.
Modeling rainfall-runoff process using soft computing techniques
NASA Astrophysics Data System (ADS)
Kisi, Ozgur; Shiri, Jalal; Tombul, Mustafa
2013-02-01
Rainfall-runoff process was modeled for a small catchment in Turkey, using 4 years (1987-1991) of measurements of independent variables of rainfall and runoff values. The models used in the study were Artificial Neural Networks (ANNs), Adaptive Neuro-Fuzzy Inference System (ANFIS) and Gene Expression Programming (GEP) which are Artificial Intelligence (AI) approaches. The applied models were trained and tested using various combinations of the independent variables. The goodness of fit for the model was evaluated in terms of the coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), coefficient of efficiency (CE) and scatter index (SI). A comparison was also made between these models and traditional Multi Linear Regression (MLR) model. The study provides evidence that GEP (with RMSE=17.82 l/s, MAE=6.61 l/s, CE=0.72 and R2=0.978) is capable of modeling rainfall-runoff process and is a viable alternative to other applied artificial intelligence and MLR time-series methods.
Optimization of tribological behaviour on Al- coconut shell ash composite at elevated temperature
NASA Astrophysics Data System (ADS)
Siva Sankara Raju, R.; Panigrahi, M. K.; Ganguly, R. I.; Srinivasa Rao, G.
2018-02-01
In this study, determine the tribological behaviour of composite at elevated temperature i.e. 50 - 150 °C. The aluminium matrix composite (AMC) are prepared with compo casting route by volume of reinforcement of coconut shell ash (CSA) such as 5, 10 and 15%. Mechanical properties of composite has enhances with increasing volume of CSA. This study details to optimization of wear behaviour of composite at elevated temperatures. The influencing parameters such as temperature, sliding velocity and sliding distance are considered. The outcome response is wear rate (mm3/m) and coefficient of friction. The experiments are designed based on Taguchi [L9] array. All the experiments are considered as constant load of 10N. Analysis of variance (ANOVA) revealed that temperature is highest influencing factor followed by sliding velocity and sliding distance. Similarly, sliding velocity is most influencing factor followed by temperature and distance on coefficient of friction (COF). Finally, corroborates analytical and regression equation values by confirmation test.
[Live birth distribution by time and place from 1981 to 1998 in Japan].
Matsushima, Noriko; Morita, Noriko; Ogata, Nozomi; Saeki, Keigo; Matsuda, Ryozo; Kurumatani, Norio
2003-01-01
To investigate the diurnal rhythm of live births labored spontaneously, and the effects of obstetric intervention on birth time distributions. The data of live births tabulated by time (one-hour intervals), date and birthplace throughout Japan between 1981 and 1998 were obtained with permission from the former Ministry of Health and Welfare. Together with an investigation of hourly birth numbers by place in each year, an annual transition of hourly birth rates in medical institutions and the diurnal rhythm of birth numbers in maternity homes and at home were analyzed using regression analysis. In every calendar year studied the hourly live birth numbers at hospitals showed a single-peak distribution pattern with maximum values at 13:00-15:00. The annual transition of hourly birth rates showed a 10% (birth numbers base) decrease in the 11:00-13:00 period in 1998 as compared with that in 1981, while there was a corresponding increase of 8% in the 13:00-15:00 period. Hourly birth numbers at clinics showed a double-peak distribution pattern with maximum values during the 11:00-12:00 and 14:00-15:00 periods in early 1980, while a single-peak distribution with a maximum value during the 13:00-15:00 period appeared in 1989 and has remained thereafter. Hourly birth rates (birth numbers base) increased by over 6% in the 13:00-15:00 and 17:00-20:00 periods over the past 18 years, while they decreased by 10% in the 9:00-13:00 period. The results at maternity homes were clearly different from those at hospitals and clinics. The live birth numbers totaled for the 18 years showed a double-phase distribution with a maximum value in the 6:00-7:00 period and a minimum value in the 19:00-20:00 period. The best-fit regression model for the obtained data was a sine curve with a maximum value at 6:00 (coefficient of determination 0.97). Hourly distributions of live births at home also fitted best to a since curve with the maximum value again at 6:00 (coefficient of determination 0.95). The results suggested that the timing of spontaneous live births follows a circadian rhythm and that obstetric intervention affects time distributions of live births by shifting over 10% of births during the night and early morning to a working day service time (9:00-17:00).
Remote sensing of PM2.5 from ground-based optical measurements
NASA Astrophysics Data System (ADS)
Li, S.; Joseph, E.; Min, Q.
2014-12-01
Remote sensing of particulate matter concentration with aerodynamic diameter smaller than 2.5 um(PM2.5) by using ground-based optical measurements of aerosols is investigated based on 6 years of hourly average measurements of aerosol optical properties, PM2.5, ceilometer backscatter coefficients and meteorological factors from Howard University Beltsville Campus facility (HUBC). The accuracy of quantitative retrieval of PM2.5 using aerosol optical depth (AOD) is limited due to changes in aerosol size distribution and vertical distribution. In this study, ceilometer backscatter coefficients are used to provide vertical information of aerosol. It is found that the PM2.5-AOD ratio can vary largely for different aerosol vertical distributions. The ratio is also sensitive to mode parameters of bimodal lognormal aerosol size distribution when the geometric mean radius for the fine mode is small. Using two Angstrom exponents calculated at three wavelengths of 415, 500, 860nm are found better representing aerosol size distributions than only using one Angstrom exponent. A regression model is proposed to assess the impacts of different factors on the retrieval of PM2.5. Compared to a simple linear regression model, the new model combining AOD and ceilometer backscatter can prominently improve the fitting of PM2.5. The contribution of further introducing Angstrom coefficients is apparent. Using combined measurements of AOD, ceilometer backscatter, Angstrom coefficients and meteorological parameters in the regression model can get a correlation coefficient of 0.79 between fitted and expected PM2.5.
Darrah, Johanna; Bartlett, Doreen; Maguire, Thomas O; Avison, William R; Lacaze-Masmonteil, Thierry
2014-01-01
Aim To compare the original normative data of the Alberta Infant Motor Scale (AIMS) (n=2202) collected 20 years ago with a contemporary sample of Canadian infants. Method This was a cross-sectional cohort study of 650 Canadian infants (338 males, 312 females; mean age 30.9wks [SD 15.5], range 2wks–18mo) assessed once on the AIMS. Assessments were stratified by age, and infants proportionally represented the ethnic diversity of Canada. Logistic regression was used to place AIMS items on an age scale representing the age at which 50% of the infants passed an item on the contemporary data set and the original data set. Forty-three items met the criterion for stable regression results in both data sets. Results The correlation coefficient between the age locations of items on the original and contemporary data sets was 0.99. The mean age difference between item locations was 0.7 weeks. Age values from the original data set when converted to the contemporary scale differed by less than 1 week. Interpretation The sequence and age at emergence of AIMS items has remained similar over 20 years and current normative values remain valid. Concern that the ‘back to sleep’ campaign has influenced the age at emergence of gross motor abilities is not supported. PMID:24684556
Li, Qi-Quan; Wang, Chang-Quan; Zhang, Wen-Jiang; Yu, Yong; Li, Bing; Yang, Juan; Bai, Gen-Chuan; Cai, Yan
2013-02-01
In this study, a radial basis function neural network model combined with ordinary kriging (RBFNN_OK) was adopted to predict the spatial distribution of soil nutrients (organic matter and total N) in a typical hilly region of Sichuan Basin, Southwest China, and the performance of this method was compared with that of ordinary kriging (OK) and regression kriging (RK). All the three methods produced the similar soil nutrient maps. However, as compared with those obtained by multiple linear regression model, the correlation coefficients between the measured values and the predicted values of soil organic matter and total N obtained by neural network model increased by 12. 3% and 16. 5% , respectively, suggesting that neural network model could more accurately capture the complicated relationships between soil nutrients and quantitative environmental factors. The error analyses of the prediction values of 469 validation points indicated that the mean absolute error (MAE) , mean relative error (MRE), and root mean squared error (RMSE) of RBFNN_OK were 6.9%, 7.4%, and 5. 1% (for soil organic matter), and 4.9%, 6.1% , and 4.6% (for soil total N) smaller than those of OK (P<0.01), and 2.4%, 2.6% , and 1.8% (for soil organic matter), and 2.1%, 2.8%, and 2.2% (for soil total N) smaller than those of RK, respectively (P<0.05).
Zhu, Xiaoyan; Song, Changchun; Swarzenski, Christopher M.; Guo, Yuedong; Zhang, Xinhow; Wang, Jiaoyue
2015-01-01
Northern peatlands contain a considerable share of the terrestrial carbon pool, which will be affected by future climatic variability. Using the static chamber technique, we investigated ecosystem respiration and soil respiration over two growing seasons (2012 and 2013) in a Carex lasiocarpa-dominated peatland in the Sanjiang Plain in China. We synchronously monitored the environmental factors controlling CO2 fluxes. Ecosystem respiration during these two growing seasons ranged from 33.3 to 506.7 mg CO2–C m−2 h−1. Through step-wise regression, variations in soil temperature at 10 cm depth alone explained 73.7% of the observed variance in log10(ER). The mean Q10 values ranged from 2.1 to 2.9 depending on the choice of depth where soil temperature was measured. The Q10 value at the 10 cm depth (2.9) appears to be a good representation for herbaceous peatland in the Sanjiang Plain when applying field-estimation based Q10values to current terrestrial ecosystem models due to the most optimized regression coefficient (63.2%). Soil respiration amounted to 57% of ecosystem respiration and played a major role in peatland carbon balance in our study. Emphasis on ecosystem respiration from temperate peatlands in the Sanjiang Plain will improve our basic understanding of carbon exchange between peatland ecosystem and the atmosphere.
Bootstrap Methods: A Very Leisurely Look.
ERIC Educational Resources Information Center
Hinkle, Dennis E.; Winstead, Wayland H.
The Bootstrap method, a computer-intensive statistical method of estimation, is illustrated using a simple and efficient Statistical Analysis System (SAS) routine. The utility of the method for generating unknown parameters, including standard errors for simple statistics, regression coefficients, discriminant function coefficients, and factor…
Sunder Raman, Ramya; Kumar, Samresh
2016-04-15
PM2.5 mass and its optical properties were measured over an ecologically sensitive zone in Central India between January and December, 2012. Meteorological parameters including temperature, relative humidity, wind speed, wind direction, and barometric pressure were also monitored. During the study period, the PM2.5 (fine PM) concentration ranged between 3.2μgm(-3) and 193.9μgm(-3) with a median concentration of 31.4μgm(-3). The attenuation coefficients, βATN at 370nm, 550nm, and 880nm had median values of 104.5Mm(-1), 79.2Mm(-1), and 59.8Mm(-1), respectively. Further, the dry scattering coefficient, βSCAT at 550nm had a median value of 17.1Mm(-1) while the absorption coefficient βABS at 550nm had a median value of 61.2Mm(-1). The relationship between fine PM mass and attenuation coefficients showed pronounced seasonality. Scattering, absorption, and attenuation coefficient at different wavelengths were all well correlated with fine PM mass only during the post-monsoon season (October, November, and December). The highest correlation (r(2)=0.81) was between fine PM mass and βSCAT at 550nm during post-monsoon season. During this season, the mass scattering efficiency (σSCAT) was 1.44m(2)g(-1). Thus, monitoring optical properties all year round, as a surrogate for fine PM mass was found unsuitable for the study location. In order to assess the relationships between fine PM mass and its optical properties and meteorological parameters, multiple linear regression (MLR) models were fitted for each season, with fine PM mass as the dependent variable. Such a model fitted for the post-monsoon season explained over 88% of the variability in fine PM mass. However, the MLR models were able to explain only 31 and 32% of the variability in fine PM during pre-monsoon (March, April, and May) and monsoon (June, July, August, and September) seasons, respectively. During the winter (January and February) season, the MLR model explained 54% of the PM2.5 variability. Copyright © 2016 Elsevier B.V. All rights reserved.
A New Test of Linear Hypotheses in OLS Regression under Heteroscedasticity of Unknown Form
ERIC Educational Resources Information Center
Cai, Li; Hayes, Andrew F.
2008-01-01
When the errors in an ordinary least squares (OLS) regression model are heteroscedastic, hypothesis tests involving the regression coefficients can have Type I error rates that are far from the nominal significance level. Asymptotically, this problem can be rectified with the use of a heteroscedasticity-consistent covariance matrix (HCCM)…
Molnos, Sophie; Baumbach, Clemens; Wahl, Simone; Müller-Nurasyid, Martina; Strauch, Konstantin; Wang-Sattler, Rui; Waldenberger, Melanie; Meitinger, Thomas; Adamski, Jerzy; Kastenmüller, Gabi; Suhre, Karsten; Peters, Annette; Grallert, Harald; Theis, Fabian J; Gieger, Christian
2017-09-29
Genome-wide association studies allow us to understand the genetics of complex diseases. Human metabolism provides information about the disease-causing mechanisms, so it is usual to investigate the associations between genetic variants and metabolite levels. However, only considering genetic variants and their effects on one trait ignores the possible interplay between different "omics" layers. Existing tools only consider single-nucleotide polymorphism (SNP)-SNP interactions, and no practical tool is available for large-scale investigations of the interactions between pairs of arbitrary quantitative variables. We developed an R package called pulver to compute p-values for the interaction term in a very large number of linear regression models. Comparisons based on simulated data showed that pulver is much faster than the existing tools. This is achieved by using the correlation coefficient to test the null-hypothesis, which avoids the costly computation of inversions. Additional tricks are a rearrangement of the order, when iterating through the different "omics" layers, and implementing this algorithm in the fast programming language C++. Furthermore, we applied our algorithm to data from the German KORA study to investigate a real-world problem involving the interplay among DNA methylation, genetic variants, and metabolite levels. The pulver package is a convenient and rapid tool for screening huge numbers of linear regression models for significant interaction terms in arbitrary pairs of quantitative variables. pulver is written in R and C++, and can be downloaded freely from CRAN at https://cran.r-project.org/web/packages/pulver/ .
Optimization of Fenton's oxidation of herbicide dicamba in water using response surface methodology
NASA Astrophysics Data System (ADS)
Sangami, Sanjeev; Manu, Basavaraju
2017-12-01
In this study Fenton's oxidation of dicamba in aqueous medium was investigated by using the response surface methodology. The influence of H2O2/COD ( A), H2O2/Fe2+ ( B), pH ( C) and reaction time ( D) as independent variables were studied on two responses (COD and dicamba removal efficiency). The dosage of H2O2 (5.35-17.4 mM) and Fe2+ (0.09-2.13 mM) were varied and optimum percentage removal of dicamba of 84.01% with H2O2 and Fe2+ dosage of 11.38 and 0.33 mM respectively. The whole oxidation process was monitored by high performance liquid chromatography (HPLC) along with liquid chromatography/mass spectrometry (LC/MS). It was found that 82% of dicamba was mineralized to oxalic acid, chloride ion, CO2 and H2O, which was confirmed with COD removal of 81.53%. The regression analysis was performed, in which standard deviation (<4%), coefficient of variation (<8), F value (Fisher's Test) (>2.74), coefficient of correlation ( R 2 = R_{adj}2) and adequate precision (>12) were in good agreement with model values. Finally, the treatment process was validated by performing the additional experiments.
Belief in complementary and alternative medicine is related to age and paranormal beliefs in adults.
Van den Bulck, Jan; Custers, Kathleen
2010-04-01
The use of complementary and alternative medicine (CAM) is widespread, even among people who use conventional medicine. Positive beliefs about CAM are common among physicians and medical students. Little is known about the beliefs regarding CAM among the general public. Among science students, belief in CAM was predicted by belief in the paranormal. In a cross-sectional study, 712 randomly selected adults (>18 years old) responded to the CAM Health Belief Questionnaire (CHBQ) and a paranormal beliefs scale. CAM beliefs were very prevalent in this sample of adult Flemish men and women. Zero-order correlations indicated that belief in CAM was associated with age (r = 0.173 P < 0.001) level of education (r = -0.079 P = 0.039) social desirability (r = -0.119 P = 0.002) and paranormal belief (r = 0.365 P < 0.001). In a multivariate model, two variables predicted CAM beliefs. Support for CAM increased with age (regression coefficient: 0.01; 95% confidence interval (CI): 0.006 to 0.014), but the strongest relationship existed between support for CAM and beliefs in the paranormal. Paranormal beliefs accounted for 14% of the variance of the CAM beliefs (regression coefficient: 0.376; 95%: CI 0.30-0.44). The level of education (regression coefficient: 0.06; 95% CI: -0.014-0.129) and social desirability (regression coefficient: -0.023; 95% CI: -0.048-0.026) did not make a significant contribution to the explained variance (<0.1%, P = 0.867). Support of CAM was very prevalent in this Flemish adult population. CAM beliefs were strongly associated with paranormal beliefs.
Kowalkowska, Joanna; Slowinska, Malgorzata A.; Slowinski, Dariusz; Dlugosz, Anna; Niedzwiedzka, Ewa; Wadolowska, Lidia
2013-01-01
The food frequency questionnaire (FFQ) and the food record (FR) are among the most common methods used in dietary research. It is important to know that is it possible to use both methods simultaneously in dietary assessment and prepare a single, comprehensive interpretation. The aim of this study was to compare the energy and nutritional value of diets, determined by the FFQ and by the three-day food records of young women. The study involved 84 female students aged 21–26 years (mean of 22.2 ± 0.8 years). Completing the FFQ was preceded by obtaining unweighted food records covering three consecutive days. Energy and nutritional value of diets was assessed for both methods (FFQ-crude, FR-crude). Data obtained for FFQ-crude were adjusted with beta-coefficient equaling 0.5915 (FFQ-adjusted) and regression analysis (FFQ-regressive). The FFQ-adjusted was calculated as FR-crude/FFQ-crude ratio of mean daily energy intake. FFQ-regressive was calculated for energy and each nutrient separately using regression equation, including FFQ-crude and FR-crude as covariates. For FR-crude and FFQ-crude the energy value of diets was standardized to 2000 kcal (FR-standardized, FFQ-standardized). Methods of statistical comparison included a dependent samples t-test, a chi-square test, and the Bland-Altman method. The mean energy intake in FFQ-crude was significantly higher than FR-crude (2740.5 kcal vs. 1621.0 kcal, respectively). For FR-standardized and FFQ-standardized, significance differences were found in the mean intake of 18 out of 31 nutrients, for FR-crude and FFQ-adjusted in 13 out of 31 nutrients and FR-crude and FFQ-regressive in 11 out of 31 nutrients. The Bland-Altman method showed an overestimation of energy and nutrient intake by FFQ-crude in comparison to FR-crude, e.g., total protein was overestimated by 34.7 g/day (95% Confidence Interval, CI: −29.6, 99.0 g/day) and fat by 48.6 g/day (95% CI: −36.4, 133.6 g/day). After regressive transformation of FFQ, the absolute difference between FFQ-regressive and FR-crude equaled 0.0 g/day and 95% CI were much better (e.g., for total protein 95% CI: −32.7, 32.7 g/day, for fat 95% CI: −49.6, 49.6 g/day). In conclusion, differences in nutritional value of diets resulted from overestimating energy intake by the FFQ in comparison to the three-day unweighted food records. Adjustment of energy and nutrient intake applied for the FFQ using various methods, particularly regression equations, significantly improved the agreement between results obtained by both methods and dietary assessment. To obtain the most accurate results in future studies using this FFQ, energy and nutrient intake should be adjusted by the regression equations presented in this paper. PMID:23877089
Mass attenuation coefficient of chromium and manganese compounds around absorption edge.
Sharanabasappa; Kaginelli, S B; Kerur, B R; Anilkumar, S; Hanumaiah, B
2009-01-01
The total mass attenuation coefficient for Potassium dichromate, Potassium chromate and Manganese acetate compounds are measured at different photon energies 5.895, 6.404, 6.490, 7.058, 8.041 and 14.390 keV using Fe-55, Co-57 and 241Am source with Copper target, radioactive sources. The photon intensity is analyzed using a high resolution HPGe detector system coupled to MCA under good geometrical arrangement. The obtained values of mass attenuation coefficient values are compared with theoretical values. This study suggests that measured mass attenuation coefficient values at and near absorption edges differ from the theoretical value by about 5-28%.
Measuring the value of air quality: application of the spatial hedonic model.
Kim, Seung Gyu; Cho, Seong-Hoon; Lambert, Dayton M; Roberts, Roland K
2010-03-01
This study applies a hedonic model to assess the economic benefits of air quality improvement following the 1990 Clean Air Act Amendment at the county level in the lower 48 United States. An instrumental variable approach that combines geographically weighted regression and spatial autoregression methods (GWR-SEM) is adopted to simultaneously account for spatial heterogeneity and spatial autocorrelation. SEM mitigates spatial dependency while GWR addresses spatial heterogeneity by allowing response coefficients to vary across observations. Positive amenity values of improved air quality are found in four major clusters: (1) in East Kentucky and most of Georgia around the Southern Appalachian area; (2) in a few counties in Illinois; (3) on the border of Oklahoma and Kansas, on the border of Kansas and Nebraska, and in east Texas; and (4) in a few counties in Montana. Clusters of significant positive amenity values may exist because of a combination of intense air pollution and consumer awareness of diminishing air quality.
NASA Astrophysics Data System (ADS)
Samadi; Wajizah, S.; Munawar, A. A.
2018-02-01
Feed plays an important factor in animal production. The purpose of this study is to apply NIRS method in determining feed values. NIRS spectra data were acquired for feed samples in wavelength range of 1000 - 2500 nm with 32 scans and 0.2 nm wavelength. Spectral data were corrected by de-trending (DT) and standard normal variate (SNV) methods. Prediction of in vitro dry matter digestibility (IVDMD) and in vitro organic matter digestibility (IVOMD) were established as model by using principal component regression (PCR) and validated using leave one out cross validation (LOOCV). Prediction performance was quantified using coefficient correlation (r) and residual predictive deviation (RPD) index. The results showed that IVDMD and IVOMD can be predicted by using SNV spectra data with r and RPD index: 0.93 and 2.78 for IVDMD ; 0.90 and 2.35 for IVOMD respectively. In conclusion, NIRS technique appears feasible to predict animal feed nutritive values.
Common y-intercept and single compound regressions of gas-particle partitioning data vs 1/T
NASA Astrophysics Data System (ADS)
Pankow, James F.
Confidence intervals are placed around the log Kp vs 1/ T correlation equations obtained using simple linear regressions (SLR) with the gas-particle partitioning data set of Yamasaki et al. [(1982) Env. Sci. Technol.16, 189-194]. The compounds and groups of compounds studied include the polycylic aromatic hydrocarbons phenanthrene + anthracene, me-phenanthrene + me-anthracene, fluoranthene, pyrene, benzo[ a]fluorene + benzo[ b]fluorene, chrysene + benz[ a]anthracene + triphenylene, benzo[ b]fluoranthene + benzo[ k]fluoranthene, and benzo[ a]pyrene + benzo[ e]pyrene (note: me = methyl). For any given compound, at equilibrium, the partition coefficient Kp equals ( F/ TSP)/ A where F is the particulate-matter associated concentration (ng m -3), A is the gas-phase concentration (ng m -3), and TSP is the concentration of particulate matter (μg m -3). At temperatures more than 10°C from the mean sampling temperature of 17°C, the confidence intervals are quite wide. Since theory predicts that similar compounds sorbing on the same particulate matter should possess very similar y-intercepts, the data set was also fitted using a special common y-intercept regression (CYIR). For most of the compounds, the CYIR equations fell inside of the SLR 95% confidence intervals. The CYIR y-intercept value is -18.48, and is reasonably close to the type of value that can be predicted for PAH compounds. The set of CYIR regression equations is probably more reliable than the set of SLR equations. For example, the CYIR-derived desorption enthalpies are much more highly correlated with vaporization enthalpies than are the SLR-derived desorption enthalpies. It is recommended that the CYIR approach be considered whenever analysing temperature-dependent gas-particle partitioning data.
2011-01-01
Background Several regression models have been proposed for estimation of isometric joint torque using surface electromyography (SEMG) signals. Common issues related to torque estimation models are degradation of model accuracy with passage of time, electrode displacement, and alteration of limb posture. This work compares the performance of the most commonly used regression models under these circumstances, in order to assist researchers with identifying the most appropriate model for a specific biomedical application. Methods Eleven healthy volunteers participated in this study. A custom-built rig, equipped with a torque sensor, was used to measure isometric torque as each volunteer flexed and extended his wrist. SEMG signals from eight forearm muscles, in addition to wrist joint torque data were gathered during the experiment. Additional data were gathered one hour and twenty-four hours following the completion of the first data gathering session, for the purpose of evaluating the effects of passage of time and electrode displacement on accuracy of models. Acquired SEMG signals were filtered, rectified, normalized and then fed to models for training. Results It was shown that mean adjusted coefficient of determination (Ra2) values decrease between 20%-35% for different models after one hour while altering arm posture decreased mean Ra2 values between 64% to 74% for different models. Conclusions Model estimation accuracy drops significantly with passage of time, electrode displacement, and alteration of limb posture. Therefore model retraining is crucial for preserving estimation accuracy. Data resampling can significantly reduce model training time without losing estimation accuracy. Among the models compared, ordinary least squares linear regression model (OLS) was shown to have high isometric torque estimation accuracy combined with very short training times. PMID:21943179
The conventional tuning fork as a quantitative tool for vibration threshold.
Alanazy, Mohammed H; Alfurayh, Nuha A; Almweisheer, Shaza N; Aljafen, Bandar N; Muayqil, Taim
2018-01-01
This study was undertaken to describe a method for quantifying vibration when using a conventional tuning fork (CTF) in comparison to a Rydel-Seiffer tuning fork (RSTF) and to provide reference values. Vibration thresholds at index finger and big toe were obtained in 281 participants. Spearman's correlations were performed. Age, weight, and height were analyzed for their covariate effects on vibration threshold. Reference values at the fifth percentile were obtained by quantile regression. The correlation coefficients between CTF and RSTF values at finger/toe were 0.59/0.64 (P = 0.001 for both). Among covariates, only age had a significant effect on vibration threshold. Reference values for CTF at finger/toe for the age groups 20-39 and 40-60 years were 7.4/4.9 and 5.8/4.6 s, respectively. Reference values for RSTF at finger/toe for the age groups 20-39 and 40-60 years were 6.9/5.5 and 6.2/4.7, respectively. CTF provides quantitative values that are as good as those provided by RSTF. Age-stratified reference data are provided. Muscle Nerve 57: 49-53, 2018. © 2017 Wiley Periodicals, Inc.
Characteristics of low-slope streams that affect O2 transfer rates
Parker, Gene W.; Desimone, Leslie A.
1991-01-01
Multiple-regression techniques were used to derive the reaeration coefficients estimating equation for low sloped streams: K2 = 3.83 MBAS-0.41 SL0.20 H-0.76, where K2 is the reaeration coefficient in base e units per day; MBAS is the methylene blue active substances concentration in milligrams per liter; SL is the water-surface slope in foot per foot; and H is the mean-flow depth in feet. Fourteen hydraulic, physical, and water-quality characteristics were regressed against 29 measured-reaeration coefficients for low-sloped (water surface slopes less than 0.002 foot per foot) streams in Massachusetts and New York. Reaeration coefficients measured from May 1985 to October 1988 ranged from 0.2 to 11.0 base e units per day for 29 low-sloped tracer studies. Concentration of methylene blue active substances is significant because it is thought to be an indicator of concentration of surfactants which could change the surface tension at the air-water interface.
Moderation analysis using a two-level regression model.
Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott
2014-10-01
Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.
Tascon, Marcos; Romero, Lílian M; Acquaviva, Agustín; Keunchkarian, Sonia; Castells, Cecilia
2013-06-14
This study focused on an investigation into the experimental quantities inherent in the determination of partition coefficients from gas-liquid chromatographic measurements through the use of capillary columns. We prepared several squalane - (2,6,10,15,19,23-hexamethyltetracosane) - containing columns with very precisely known phase ratios and determined solute retention and hold-up times at 30, 40, 50 and 60°C. We calculated infinite dilution partition coefficients from the slopes of the linear regression of retention factors as a function of the reciprocal of the phase ratio by means of fundamental chromatographic equations. In order to minimize gas-solid and liquid-solid interface contributions to retention, the surface of the capillary inner wall was pretreated to guarantee a uniform coat of stationary phase. The validity of the proposed approach was first tested by estimating the partition coefficients of n-alkanes between n-pentane and n-nonane, for which compounds data from the literature were available. Then partition coefficients of sixteen aliphatic alcohols in squalane were determined at those four temperatures. We deliberately chose these highly challenging systems: alcohols in the reference paraffinic stationary phase. These solutes exhibited adsorption in the gas-liquid interface that contributed to retention. The corresponding adsorption constant values were estimated. We fully discuss here the uncertainties associated with each experimental measurement and how these fundamental determinations can be performed precisely by circumventing the main drawbacks. The proposed strategy is reliable and much simpler than the classical chromatographic method employing packed columns. Copyright © 2013 Elsevier B.V. All rights reserved.
Ndiath, Mansour M; Cisse, Badara; Ndiaye, Jean Louis; Gomis, Jules F; Bathiery, Ousmane; Dia, Anta Tal; Gaye, Oumar; Faye, Babacar
2015-11-18
In Senegal, considerable efforts have been made to reduce malaria morbidity and mortality during the last decade. This resulted in a marked decrease of malaria cases. With the decline of malaria cases, transmission has become sparse in most Senegalese health districts. This study investigated malaria hotspots in Keur Soce sites by using geographically-weighted regression. Because of the occurrence of hotspots, spatial modelling of malaria cases could have a considerable effect in disease surveillance. This study explored and analysed the spatial relationships between malaria occurrence and socio-economic and environmental factors in small communities in Keur Soce, Senegal, using 6 months passive surveillance. Geographically-weighted regression was used to explore the spatial variability of relationships between malaria incidence or persistence and the selected socio-economic, and human predictors. A model comparison of between ordinary least square and geographically-weighted regression was also explored. Vector dataset (spatial) of the study area by village levels and statistical data (non-spatial) on malaria confirmed cases, socio-economic status (bed net use), population data (size of the household) and environmental factors (temperature, rain fall) were used in this exploratory analysis. ArcMap 10.2 and Stata 11 were used to perform malaria hotspots analysis. From Jun to December, a total of 408 confirmed malaria cases were notified. The explanatory variables-household size, housing materials, sleeping rooms, sheep and distance to breeding site returned significant t values of -0.25, 2.3, 4.39, 1.25 and 2.36, respectively. The OLS global model revealed that it explained about 70 % (adjusted R(2) = 0.70) of the variation in malaria occurrence with AIC = 756.23. The geographically-weighted regression of malaria hotspots resulted in coefficient intercept ranging from 1.89 to 6.22 with a median of 3.5. Large positive values are distributed mainly in the southeast of the district where hotspots are more accurate while low values are mainly found in the centre and in the north. Geographically-weighted regression and OLS showed important risks factors of malaria hotspots in Keur Soce. The outputs of such models can be a useful tool to understand occurrence of malaria hotspots in Senegal. An understanding of geographical variation and determination of the core areas of the disease may provide an explanation regarding possible proximal and distal contributors to malaria elimination in Senegal.
Measurement of effective air diffusion coefficients for trichloroethene in undisturbed soil cores.
Bartelt-Hunt, Shannon L; Smith, James A
2002-06-01
In this study, we measure effective diffusion coefficients for trichloroethene in undisturbed soil samples taken from Picatinny Arsenal, New Jersey. The measured effective diffusion coefficients ranged from 0.0053 to 0.0609 cm2/s over a range of air-filled porosity of 0.23-0.49. The experimental data were compared to several previously published relations that predict diffusion coefficients as a function of air-filled porosity and porosity. A multiple linear regression analysis was developed to determine if a modification of the exponents in Millington's [Science 130 (1959) 100] relation would better fit the experimental data. The literature relations appeared to generally underpredict the effective diffusion coefficient for the soil cores studied in this work. Inclusion of a particle-size distribution parameter, d10, did not significantly improve the fit of the linear regression equation. The effective diffusion coefficient and porosity data were used to recalculate estimates of diffusive flux through the subsurface made in a previous study performed at the field site. It was determined that the method of calculation used in the previous study resulted in an underprediction of diffusive flux from the subsurface. We conclude that although Millington's [Science 130 (1959) 100] relation works well to predict effective diffusion coefficients in homogeneous soils with relatively uniform particle-size distributions, it may be inaccurate for many natural soils with heterogeneous structure and/or non-uniform particle-size distributions.
Smooth Scalar-on-Image Regression via Spatial Bayesian Variable Selection
Goldsmith, Jeff; Huang, Lei; Crainiceanu, Ciprian M.
2013-01-01
We develop scalar-on-image regression models when images are registered multidimensional manifolds. We propose a fast and scalable Bayes inferential procedure to estimate the image coefficient. The central idea is the combination of an Ising prior distribution, which controls a latent binary indicator map, and an intrinsic Gaussian Markov random field, which controls the smoothness of the nonzero coefficients. The model is fit using a single-site Gibbs sampler, which allows fitting within minutes for hundreds of subjects with predictor images containing thousands of locations. The code is simple and is provided in less than one page in the Appendix. We apply this method to a neuroimaging study where cognitive outcomes are regressed on measures of white matter microstructure at every voxel of the corpus callosum for hundreds of subjects. PMID:24729670
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paduano, L.; Sartorio, R.; Vitagliano, V.
Diffusion coefficients in the ternary system {alpha}-cyclodextrin (at one concentration)-L-phenylalanine (at four concentrations)-water have been measured by using the Gouy interferometric technique. The effect of the inclusion equilibrium on the cross-term diffusion coefficients was observed. The measured diffusion coefficients in the ternary systems were used to calculate values of the binding constants. These values are in good agreement with the value obtained from calorimetric studies.
Evaluation of pelleted aspen foliage as a ruminant feedstuff
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bas, F.J.; Ehle, F.R.; Goodrich, R.D.
1985-01-01
Growth and digestion trials were made to determine the nutritive value of pelleted aspen (Populus tremuloides) foliage as a dietary ingredient for sheep. Lambs offered diets without or with 25, 50 and 75% aspen leaves, with lucerne as the other dietary ingredient, ate less and gained less weight as the proportion of aspen leaves in the diet increased (P less than 0.05). Digestibility coefficients for DM, organic matter, crude protein, gross energy, neutral detergent fibre, acid detergent fibre, hemicellulose and cellulose decreased linearly (P less than 0.01) as the percentage of aspen foliage eaten increased. Calculated digestibility of individual aspenmore » leaf components gave values as low as 16.6 and 13% for crude protein and cellulose, respectively. Coefficients of determination for the linear regressions indicated no associative effects between lucerne and aspen leaves. Due to the depressed value for crude protein digestibility, the amount of acid detergent fibre-insoluble nitrogen was estimated. Over 50% of the total N in aspen foliage was bound to the acid detergent fibre fraction, reflecting the presence of heat-damaged protein, tannin-protein complexes that are unavailable for digestion or both. After adjustment for unavailable N, the crude protein digestibility of aspen foliage was 61.5%. Balances of 10 minerals were estimated during the digestion trial. Negative mineral balances for the 75% aspen leaf diet suggest that the lambs were in a nutrient deficient condition when fed on this diet. 29 references.« less
A surface complexation model of YREE sorption on Ulva lactuca in 0.05-5.0 M NaCl solutions
NASA Astrophysics Data System (ADS)
Zoll, Alison M.; Schijf, Johan
2012-11-01
We present distribution coefficients, log iKS, for the sorption of yttrium and the rare earth elements (YREEs) on BCR-279, a dehydrated tissue homogenate of a marine macroalga, Ulva lactuca, resembling materials featured in chemical engineering studies aimed at designing renewable biosorbents. Sorption experiments were conducted in NaCl solutions of different ionic strength (0.05, 0.5, and 5.0 M) at T = 25 °C over the pH range 2.7-8.5. Distribution coefficients based on separation of the dissolved and particulate phase by conventional filtration (<0.22 μm) were corrected for the effect of colloid-bound YREEs (>3 kDa) using an existing pH-dependent model. Colloid-corrected values were renormalized to free-cation concentrations by accounting for YREE hydrolysis and chloride complexation. At each ionic strength, the pH dependence of the renormalized values is accurately described with a non-electrostatic surface complexation model (SCM) that incorporates YREE binding to three monoprotic functional groups, previously characterized by alkalimetric titration, as well as binding of YREE-hydroxide complexes (MOH2+) to the least acidic one (pKa ∼ 9.5). In non-linear regressions of the distribution coefficients as a function of pH, each pKa was fixed at its reported value, while stability constants of the four YREE surface complexes were used as adjustable parameters. Data for a single fresh U. lactuca specimen in 0.5 M NaCl show generally the same pH-dependent behavior but a lower degree of sorption and were excluded from the regressions. Good linear free-energy relations (LFERs) between stability constants of the YREE-acetate and YREE-hydroxide solution complex and surface complexes with the first and third functional group, respectively, support their prior tentative identifications as carboxyl and phenol. A similar confirmation for the second group is precluded by insufficient knowledge of the stability of YREE-phosphate complexes and a perceived lack of YREE binding in 0.05 M NaCl; this issue awaits further study. The results indicate that SCMs can be successfully applied to sorbents as daunting as marine organic matter. Despite remnant challenges, for instance resolving the contributions of individual groups to the aggregate sorption signal, our approach helps formalize seaweed’s avowed promise as an ideal biomonitor or biofilter of metal pollution in environments ranging from freshwaters to brines by uncovering what chemical mechanisms underlie its pronounced affinity for YREEs and other surface-reactive elements.
Hilario, Eric C; Stern, Alan; Wang, Charlie H; Vargas, Yenny W; Morgan, Charles J; Swartz, Trevor E; Patapoff, Thomas W
2017-01-01
Concentration determination is an important method of protein characterization required in the development of protein therapeutics. There are many known methods for determining the concentration of a protein solution, but the easiest to implement in a manufacturing setting is absorption spectroscopy in the ultraviolet region. For typical proteins composed of the standard amino acids, absorption at wavelengths near 280 nm is due to the three amino acid chromophores tryptophan, tyrosine, and phenylalanine in addition to a contribution from disulfide bonds. According to the Beer-Lambert law, absorbance is proportional to concentration and path length, with the proportionality constant being the extinction coefficient. Typically the extinction coefficient of proteins is experimentally determined by measuring a solution absorbance then experimentally determining the concentration, a measurement with some inherent variability depending on the method used. In this study, extinction coefficients were calculated based on the measured absorbance of model compounds of the four amino acid chromophores. These calculated values for an unfolded protein were then compared with an experimental concentration determination based on enzymatic digestion of proteins. The experimentally determined extinction coefficient for the native proteins was consistently found to be 1.05 times the calculated value for the unfolded proteins for a wide range of proteins with good accuracy and precision under well-controlled experimental conditions. The value of 1.05 times the calculated value was termed the predicted extinction coefficient. Statistical analysis shows that the differences between predicted and experimentally determined coefficients are scattered randomly, indicating no systematic bias between the values among the proteins measured. The predicted extinction coefficient was found to be accurate and not subject to the inherent variability of experimental methods. We propose the use of a predicted extinction coefficient for determining the protein concentration of therapeutic proteins starting from early development through the lifecycle of the product. LAY ABSTRACT: Knowing the concentration of a protein in a pharmaceutical solution is important to the drug's development and posology. There are many ways to determine the concentration, but the easiest one to use in a testing lab employs absorption spectroscopy. Absorbance of ultraviolet light by a protein solution is proportional to its concentration and path length; the proportionality constant is the extinction coefficient. The extinction coefficient of a protein therapeutic is usually determined experimentally during early product development and has some inherent method variability. In this study, extinction coefficients of several proteins were calculated based on the measured absorbance of model compounds. These calculated values for an unfolded protein were then compared with experimental concentration determinations based on enzymatic digestion of the proteins. The experimentally determined extinction coefficient for the native protein was 1.05 times the calculated value for the unfolded protein with good accuracy and precision under controlled experimental conditions, so the value of 1.05 times the calculated coefficient was called the predicted extinction coefficient. Comparison of predicted and measured extinction coefficients indicated that the predicted value was very close to the experimentally determined values for the proteins. The predicted extinction coefficient was accurate and removed the variability inherent in experimental methods. © PDA, Inc. 2017.
Mallick, P K; Thirumaran, S M K; Pourouchottamane, R; Rajapandi, S; Venkataramanan, R; Nagarajan, G; Murali, G; Rajendiran, A S
2016-03-01
The study was conducted at Southern Regional Research Center, ICAR-Central Sheep and Wool Research Institute (CSWRI), Mannavanur, Kodaikanal, Tamil Nadu to estimate genetic trends for birth weight (BWT), weaning weight (3WT), 6 months weight (6WT), and greasy fleece weight (GFY) in a Bharat Merino (BM) flock, where selection was practiced for 6WT and GFY. The data for this study represents a total of 1652 BM lambs; progeny of 144 sires spread over 15 years starting from 2000 to 2014, obtained from the BM flock of ICAR-SRRC (CSWRI), Mannavanur, Kodaikanal, Tamil Nadu, India. The genetic trends were calculated by regression of average predicted breeding values using software WOMBAT for the traits BWT, 3WT, 6WT and GFY versus the animal's birth year. The least square means were 3.28±0.02 kg, 19.08±0.23 kg, 25.00±0.35 kg and 2.13±0.07 kg for BWT, 3WT, 6WT and GFY, respectively. Genetic trends were positive and highly significant (p<0.01) for BWT, while the values for 3WT, 6WT and GFY though positive, were not significant. The estimates of genetic trends in BWT, 3WT, 6WT and GFY were 5 g, 0.8 g, 7 g and 0.3 g/year gain and the fit of the regression shows 55%, 22%, 42% and 12% coefficient of determination with the regressed value, respectively. In this study, estimated mean predicted breeding value (kg) in BWT and 3WT, 6WT and GFY were 0.067, 0.008, 0.036 and -0.003, respectively. Estimates of genetic trends indicated that there was a positive genetic improvement in all studied traits and selection would be effective for the improvement of body weight traits and GFY of BM sheep.
Estimating maize production in Kenya using NDVI: Some statistical considerations
Lewis, J.E.; Rowland, James; Nadeau , A.
1998-01-01
A regression model approach using a normalized difference vegetation index (NDVI) has the potential for estimating crop production in East Africa. However, before production estimation can become a reality, the underlying model assumptions and statistical nature of the sample data (NDVI and crop production) must be examined rigorously. Annual maize production statistics from 1982-90 for 36 agricultural districts within Kenya were used as the dependent variable; median area NDVI (independent variable) values from each agricultural district and year were extracted from the annual maximum NDVI data set. The input data and the statistical association of NDVI with maize production for Kenya were tested systematically for the following items: (1) homogeneity of the data when pooling the sample, (2) gross data errors and influence points, (3) serial (time) correlation, (4) spatial autocorrelation and (5) stability of the regression coefficients. The results of using a simple regression model with NDVI as the only independent variable are encouraging (r 0.75, p 0.05) and illustrate that NDVI can be a responsive indicator of maize production, especially in areas of high NDVI spatial variability, which coincide with areas of production variability in Kenya.
Stamey, Timothy C.
1998-01-01
Simple and reliable methods for estimating hourly streamflow are needed for the calibration and verification of a Chattahoochee River basin model between Buford Dam and Franklin, Ga. The river basin model is being developed by Georgia Department of Natural Resources, Environmental Protection Division, as part of their Chattahoochee River Modeling Project. Concurrent streamflow data collected at 19 continuous-record, and 31 partial-record streamflow stations, were used in ordinary least-squares linear regression analyses to define estimating equations, and in verifying drainage-area prorations. The resulting regression or drainage-area ratio estimating equations were used to compute hourly streamflow at the partial-record stations. The coefficients of determination (r-squared values) for the regression estimating equations ranged from 0.90 to 0.99. Observed and estimated hourly and daily streamflow data were computed for May 1, 1995, through October 31, 1995. Comparisons of observed and estimated daily streamflow data for 12 continuous-record tributary stations, that had available streamflow data for all or part of the period from May 1, 1995, to October 31, 1995, indicate that the mean error of estimate for the daily streamflow was about 25 percent.
Asquith, William H.; Cleveland, Theodore G.; Roussel, Meghan C.
2011-01-01
Estimates of peak and time of peak streamflow for small watersheds (less than about 640 acres) in a suburban to urban, low-slope setting are needed for drainage design that is cost-effective and risk-mitigated. During 2007-10, the U.S. Geological Survey (USGS), in cooperation with the Harris County Flood Control District and the Texas Department of Transportation, developed a method to estimate peak and time of peak streamflow from excess rainfall for 10- to 640-acre watersheds in the Houston, Texas, metropolitan area. To develop the method, 24 watersheds in the study area with drainage areas less than about 3.5 square miles (2,240 acres) and with concomitant rainfall and runoff data were selected. The method is based on conjunctive analysis of rainfall and runoff data in the context of the unit hydrograph method and the rational method. For the unit hydrograph analysis, a gamma distribution model of unit hydrograph shape (a gamma unit hydrograph) was chosen and parameters estimated through matching of modeled peak and time of peak streamflow to observed values on a storm-by-storm basis. Watershed mean or watershed-specific values of peak and time to peak ("time to peak" is a parameter of the gamma unit hydrograph and is distinct from "time of peak") of the gamma unit hydrograph were computed. Two regression equations to estimate peak and time to peak of the gamma unit hydrograph that are based on watershed characteristics of drainage area and basin-development factor (BDF) were developed. For the rational method analysis, a lag time (time-R), volumetric runoff coefficient, and runoff coefficient were computed on a storm-by-storm basis. Watershed-specific values of these three metrics were computed. A regression equation to estimate time-R based on drainage area and BDF was developed. Overall arithmetic means of volumetric runoff coefficient (0.41 dimensionless) and runoff coefficient (0.25 dimensionless) for the 24 watersheds were used to express the rational method in terms of excess rainfall (the excess rational method). Both the unit hydrograph method and excess rational method are shown to provide similar estimates of peak and time of peak streamflow. The results from the two methods can be combined by using arithmetic means. A nomograph is provided that shows the respective relations between the arithmetic-mean peak and time of peak streamflow to drainage areas ranging from 10 to 640 acres. The nomograph also shows the respective relations for selected BDF ranging from undeveloped to fully developed conditions. The nomograph represents the peak streamflow for 1 inch of excess rainfall based on drainage area and BDF; the peak streamflow for design storms from the nomograph can be multiplied by the excess rainfall to estimate peak streamflow. Time of peak streamflow is readily obtained from the nomograph. Therefore, given excess rainfall values derived from watershed-loss models, which are beyond the scope of this report, the nomograph represents a method for estimating peak and time of peak streamflow for applicable watersheds in the Houston metropolitan area. Lastly, analysis of the relative influence of BDF on peak streamflow is provided, and the results indicate a 0:04log10 cubic feet per second change of peak streamflow per positive unit of change in BDF. This relative change can be used to adjust peak streamflow from the method or other hydrologic methods for a given BDF to other BDF values; example computations are provided.
The Geometry of Enhancement in Multiple Regression
ERIC Educational Resources Information Center
Waller, Niels G.
2011-01-01
In linear multiple regression, "enhancement" is said to occur when R[superscript 2] = b[prime]r greater than r[prime]r, where b is a p x 1 vector of standardized regression coefficients and r is a p x 1 vector of correlations between a criterion y and a set of standardized regressors, x. When p = 1 then b [is congruent to] r and…
ERIC Educational Resources Information Center
Tong, Fuhui
2006-01-01
Background: An extensive body of researches has favored the use of regression over other parametric analyses that are based on OVA. In case of noteworthy regression results, researchers tend to explore magnitude of beta weights for the respective predictors. Purpose: The purpose of this paper is to examine both beta weights and structure…
Estimation Methods for Non-Homogeneous Regression - Minimum CRPS vs Maximum Likelihood
NASA Astrophysics Data System (ADS)
Gebetsberger, Manuel; Messner, Jakob W.; Mayr, Georg J.; Zeileis, Achim
2017-04-01
Non-homogeneous regression models are widely used to statistically post-process numerical weather prediction models. Such regression models correct for errors in mean and variance and are capable to forecast a full probability distribution. In order to estimate the corresponding regression coefficients, CRPS minimization is performed in many meteorological post-processing studies since the last decade. In contrast to maximum likelihood estimation, CRPS minimization is claimed to yield more calibrated forecasts. Theoretically, both scoring rules used as an optimization score should be able to locate a similar and unknown optimum. Discrepancies might result from a wrong distributional assumption of the observed quantity. To address this theoretical concept, this study compares maximum likelihood and minimum CRPS estimation for different distributional assumptions. First, a synthetic case study shows that, for an appropriate distributional assumption, both estimation methods yield to similar regression coefficients. The log-likelihood estimator is slightly more efficient. A real world case study for surface temperature forecasts at different sites in Europe confirms these results but shows that surface temperature does not always follow the classical assumption of a Gaussian distribution. KEYWORDS: ensemble post-processing, maximum likelihood estimation, CRPS minimization, probabilistic temperature forecasting, distributional regression models
Guenole, Nigel; Brown, Anna
2014-01-01
We report a Monte Carlo study examining the effects of two strategies for handling measurement non-invariance – modeling and ignoring non-invariant items – on structural regression coefficients between latent variables measured with item response theory models for categorical indicators. These strategies were examined across four levels and three types of non-invariance – non-invariant loadings, non-invariant thresholds, and combined non-invariance on loadings and thresholds – in simple, partial, mediated and moderated regression models where the non-invariant latent variable occupied predictor, mediator, and criterion positions in the structural regression models. When non-invariance is ignored in the latent predictor, the focal group regression parameters are biased in the opposite direction to the difference in loadings and thresholds relative to the referent group (i.e., lower loadings and thresholds for the focal group lead to overestimated regression parameters). With criterion non-invariance, the focal group regression parameters are biased in the same direction as the difference in loadings and thresholds relative to the referent group. While unacceptable levels of parameter bias were confined to the focal group, bias occurred at considerably lower levels of ignored non-invariance than was previously recognized in referent and focal groups. PMID:25278911
Zhou, Nan; Chu, Chen; Dou, Xin; Chen, Weibo; He, Jian; Yan, Jing; Zhou, Zhengyang; Yang, Xiaofeng
2018-02-08
Radiation-induced parotid damage is a common complication in patients with nasopharyngeal carcinoma (NPC) treated with radiotherapy to head and neck region, which severely reduce the life quality of those patients. The aim of this study was to early evaluate the changes of irradiated parotid glands with T2 mapping and mDIXON Quant imaging. Forty-one patients with NPC underwent conventional magnetic resonance imaging for nasopharynx and neck, and T2 mapping and mDIXON Quant imaging for bilateral parotid glands within 2 weeks before radiotherapy (pre-RT), 5 weeks after the beginning of radiotherapy (mid-RT), and 4 weeks after radiotherapy (post-RT). Parotid volume, T2 values, fat fraction (FF) values, and mean radiation dose were recorded and analyzed. From pre-RT to mid-RT, parotid volume decreased (atrophy rate, 27.0 ± 11.5%), while parotid T2 and FF values increased (change rate, 6.0 ± 6.2% for T2 value and 9.1 ± 9.9% for FF value) significantly. From mid-RT to post-RT, parotid T2 value continuously increased (change rate, 4.6 ± 7.7%), but parotid FF value decreased (change rate, - 9.9 ± 18.2%) significantly. Change rate of parotid T2 value significantly correlated with parotid atrophy rate from pre-RT to post-RT (r = 0.313, P = 0.027). Multiple linear regression analysis showed that parotid T2 value (standardized coefficient [SC] = - 0.259, P = 0.001) and FF value (SC = - 0.320, P = 0.014) negatively correlated with parotid volume, while parotid T2 value positively correlated with MR scan time point (SC = 0.476, P = 0.001) significantly. Parotid T2 and FF values showed excellent reproducibility (intraclass correlation coefficient, 0.935-0.992). T2 mapping and mDIXON Quant imaging is useful for noninvasive evaluation of radiation-induced parotid damage.
Optimization of Fish Protection System to Increase Technosphere Safety
NASA Astrophysics Data System (ADS)
Khetsuriani, E. D.; Fesenko, L. N.; Larin, D. S.
2017-11-01
The article is concerned with field study data. Drawing upon prior information and considering structural features of fish protection devices, we decided to conduct experimental research while changing three parameters: process pressure PCT, stream velocity Vp and washer nozzle inclination angle αc. The variability intervals of examined factors are shown in the Table 1. The conicity angle was assumed as a constant one. The box design B3 was chosen as a baseline being close to D-optimal designs in its statistical characteristics. The number of device rotations and its fish fry protection efficiency were accepted as the output functions of optimization. The numerical values of regression coefficients of quadratic equations describing the behavior of optimization functions Y1 and Y2 and their formulaic errors were calculated upon the test results in accordance with the planning matrix. The adequacy or inadequacy of the obtained quadratic regression model is judged via checking the condition whether Fexp ≤ Ftheor.
Developing a Model for Forecasting Road Traffic Accident (RTA) Fatalities in Yemen
NASA Astrophysics Data System (ADS)
Karim, Fareed M. A.; Abdo Saleh, Ali; Taijoobux, Aref; Ševrović, Marko
2017-12-01
The aim of this paper is to develop a model for forecasting RTA fatalities in Yemen. The yearly fatalities was modeled as the dependent variable, while the number of independent variables included the population, number of vehicles, GNP, GDP and Real GDP per capita. It was determined that all these variables are highly correlated with the correlation coefficient (r ≈ 0.9); in order to avoid multicollinearity in the model, a single variable with the highest r value was selected (real GDP per capita). A simple regression model was developed; the model was very good (R2=0.916); however, the residuals were serially correlated. The Prais-Winsten procedure was used to overcome this violation of the regression assumption. The data for a 20-year period from 1991-2010 were analyzed to build the model; the model was validated by using data for the years 2011-2013; the historical fit for the period 1991 - 2011 was very good. Also, the validation for 2011-2013 proved accurate.