Survival analysis of cervical cancer using stratified Cox regression
NASA Astrophysics Data System (ADS)
Purnami, S. W.; Inayati, K. D.; Sari, N. W. Wulan; Chosuvivatwong, V.; Sriplung, H.
2016-04-01
Cervical cancer is one of the most common causes of cancer death among women worldwide, including in Indonesia. Most cervical cancer patients come to the hospital already at an advanced stage. As a result, treatment becomes more difficult and the risk of death can increase. One parameter that can be used to assess the success of treatment is the probability of survival. This study examines the survival of cervical cancer patients at Dr. Soetomo Hospital using stratified Cox regression based on six factors: age, stage, treatment initiation, companion disease, complication, and anemia. A stratified Cox model is used because one independent variable, stage, does not satisfy the proportional hazards assumption. The results of the stratified Cox model show that complication is a significant factor influencing the survival probability of cervical cancer patients. The estimated hazard ratio is 7.35, meaning that a cervical cancer patient with a complication has a hazard of death 7.35 times that of a patient without a complication. The adjusted survival curves show that stage IV has the lowest probability of survival.
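A minimal sketch of the idea behind a stratified Cox model, assuming a single covariate and the Breslow form of the partial likelihood: each stratum keeps its own risk sets (and hence its own baseline hazard), while the coefficient is shared across strata. The function names and the toy data are hypothetical and are not taken from the study.

```python
import math

def cox_partial_loglik(beta, times, events, x):
    """Breslow-type partial log-likelihood for one covariate in one stratum."""
    ll = 0.0
    for i, (t_i, d_i) in enumerate(zip(times, events)):
        if d_i:  # only observed events contribute a term
            # risk set: subjects in this stratum still under observation at t_i
            risk = sum(math.exp(beta * x[j])
                       for j, t_j in enumerate(times) if t_j >= t_i)
            ll += beta * x[i] - math.log(risk)
    return ll

def stratified_partial_loglik(beta, strata):
    """Sum the per-stratum log-likelihoods; beta is common to all strata."""
    return sum(cox_partial_loglik(beta, t, d, x) for t, d, x in strata)

# Hypothetical toy data: (times, event indicators, covariate) per stratum.
toy_strata = [
    ([1, 2, 3], [1, 1, 0], [1, 0, 1]),
    ([2, 4], [1, 1], [0, 1]),
]
loglik_at_zero = stratified_partial_loglik(0.0, toy_strata)

# A hazard ratio is exp(beta); the reported 7.35 corresponds to beta = ln(7.35).
beta_complication = math.log(7.35)
```

Because risk sets never cross strata, the variable that violates proportional hazards (here, stage) is controlled for without receiving a coefficient of its own.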
Factors Associated with Methadone Treatment Duration: A Cox Regression Analysis
Peng, Ching-Yi; Chao, En; Lee, Tony Szu-Hsien
2015-01-01
This study examined retention rates and associated predictors of methadone maintenance treatment (MMT) duration among 128 newly admitted patients in Taiwan. A semi-structured questionnaire was used to obtain demographic and drug use history. Daily records of methadone taken and test results for HIV, HCV, and morphine toxicology were taken from a computerized medical registry. Cox regression analyses were performed to examine factors associated with MMT duration. MMT retention rates were 80.5%, 68.8%, 53.9%, and 41.4% for 3, 6, 12, and 18 months, respectively. Excluding 38 patients incarcerated during the study period, retention rates were 81.1%, 73.3%, 61.1%, and 48.9% for 3, 6, 12, and 18 months, respectively. No participant seroconverted to HIV and 1 died during the 18-month follow-up. Results showed that being female, imprisonment, a longer distance from house to clinic, having a lower methadone dose after 30 days, being HCV positive, and being in the New Taipei city program predicted early patient dropout. The findings suggest favorable MMT outcomes of HIV seroincidence and mortality. Results indicate that the need to minimize travel distance and to provide programs that meet women’s requirements justify expansion of MMT clinics in Taiwan. PMID:25875531
Vatcheva, KP; Lee, M; McCormick, JB; Rahbar, MH
2016-01-01
Objective To demonstrate the adverse impact of ignoring statistical interactions in regression models used in epidemiologic studies. Study design and setting Based on different scenarios that involved known values for the coefficient of the interaction term in Cox regression models, we generated 1000 samples of size 600 each. The simulated samples and a real-life data set from the Cameron County Hispanic Cohort were used to evaluate the effect of ignoring statistical interactions in these models. Results Compared to correctly specified Cox regression models with interaction terms, misspecified models without interaction terms resulted in up to 8.95-fold bias in estimated regression coefficients. In contrast, when data were generated from a perfectly additive Cox proportional hazards regression model, including the interaction between the two covariates resulted in only 2% estimated bias in the main-effect regression coefficient estimates and did not alter the main finding of no significant interactions. Conclusions When the effects are synergistic, the failure to account for an interaction effect could lead to bias and misinterpretation of the results, and in some instances to incorrect policy decisions. Best practices in regression analysis must include identification of interactions, including for analysis of data from epidemiologic studies.
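To illustrate why omitting an interaction term can mislead, the sketch below assumes a Cox model with two binary covariates and log-hazard b1*x1 + b2*x2 + b12*x1*x2. Under that assumption the hazard ratio for x1 depends on the level of x2, which a single main-effect coefficient cannot express. The coefficient values are hypothetical and are not those used in the paper's simulations.

```python
import math

def hazard_ratio_x1(b1, b12, x2):
    """Hazard ratio for x1 = 1 versus x1 = 0 at a given level of x2,
    under log-hazard = b1*x1 + b2*x2 + b12*x1*x2 (b2 cancels in the ratio)."""
    return math.exp(b1 + b12 * x2)

# Hypothetical coefficients chosen only for illustration.
b1, b12 = 0.5, 0.7
hr_when_x2_is_0 = hazard_ratio_x1(b1, b12, 0)  # exp(0.5)
hr_when_x2_is_1 = hazard_ratio_x1(b1, b12, 1)  # exp(1.2)
```

A model without the product term is forced to report one hazard ratio for x1 across both levels of x2.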
Dehesh, Tania; Zare, Najaf; Ayatollahi, Seyyed Mohammad Taghi
2015-01-01
Background. The univariate meta-analysis (UM) procedure, as a technique that provides a single overall result, has become increasingly popular. Neglecting the existence of other concomitant covariates in the models leads to loss of treatment efficiency. Our aim was to propose four new approximation approaches for the covariance matrix of the coefficients, which is not readily available for the multivariate generalized least square (MGLS) method as a multivariate meta-analysis approach. Methods. We evaluated the efficiency of four new approaches, including zero correlation (ZC), common correlation (CC), estimated correlation (EC), and multivariate multilevel correlation (MMC), on the estimation bias, mean square error (MSE), and 95% probability coverage of the confidence interval (CI) in the synthesis of Cox proportional hazard model coefficients in a simulation study. Result. Comparing the results of the simulation study on the MSE, bias, and CI of the estimated coefficients indicated that the MMC approach was the most accurate procedure compared to the EC, CC, and ZC procedures. The precision ranking of the four approaches according to all above settings was MMC ≥ EC ≥ CC ≥ ZC. Conclusion. This study highlights advantages of MGLS meta-analysis over the UM approach. The results suggested the use of the MMC procedure to overcome the lack of information for having a complete covariance matrix of the coefficients. PMID:26413142
Simultaneous confidence bands for Cox regression from semiparametric random censorship.
Mondal, Shoubhik; Subramanian, Sundarraman
2016-01-01
Cox regression is combined with semiparametric random censorship models to construct simultaneous confidence bands (SCBs) for subject-specific survival curves. Simulation results are presented to compare the performance of the proposed SCBs with the SCBs that are based only on standard Cox. The new SCBs provide correct empirical coverage and are more informative. The proposed SCBs are illustrated with two real examples. An extension to handle missing censoring indicators is also outlined. PMID:25691289
Partial least squares Cox regression for genome-wide data.
Nygård, Ståle; Borgan, Ornulf; Lingjaerde, Ole Christian; Størvold, Hege Leite
2008-06-01
Most methods for survival prediction from high-dimensional genomic data combine the Cox proportional hazards model with some technique of dimension reduction, such as partial least squares regression (PLS). Applying PLS to the Cox model is not entirely straightforward, and multiple approaches have been proposed. The method of Park et al. (Bioinformatics 18(Suppl. 1):S120-S127, 2002) uses a reformulation of the Cox likelihood to a Poisson type likelihood, thereby enabling estimation by iteratively reweighted partial least squares for generalized linear models. We propose a modification of the method of Park et al. (2002) such that estimates of the baseline hazard and the gene effects are obtained in separate steps. The resulting method has several advantages over the method of Park et al. (2002) and other existing Cox PLS approaches, as it allows for estimation of survival probabilities for new patients, enables a less memory-demanding estimation procedure, and allows for incorporation of lower-dimensional non-genomic variables like disease grade and tumor thickness. We also propose to combine our Cox PLS method with an initial gene selection step in which genes are ordered by their Cox score and only the highest-ranking k% of the genes are retained, obtaining a so-called supervised partial least squares regression method. In simulations, both the unsupervised and the supervised version outperform other Cox PLS methods. PMID:18188699
Iuliano, Antonella; Occhipinti, Annalisa; Angelini, Claudia; De Feis, Italia; Lió, Pietro
2016-01-01
International initiatives such as the Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC) are collecting multiple datasets at different genome-scales with the aim of identifying novel cancer biomarkers and predicting survival of patients. To analyze such data, several statistical methods have been applied, among them Cox regression models. Although these models provide a good statistical framework to analyze omic data, there is still a lack of studies that illustrate advantages and drawbacks in integrating biological information and selecting groups of biomarkers. In fact, classical Cox regression algorithms focus on the selection of a single biomarker, without taking into account the strong correlation between genes. Even though network-based Cox regression algorithms overcome such drawbacks, such network-based approaches are less widely used within the life science community. In this article, we aim to provide a clear methodological framework on the use of such approaches in order to turn cancer research results into clinical applications. Therefore, we first discuss the rationale and the practical usage of three recently proposed network-based Cox regression algorithms (i.e., Net-Cox, AdaLnet, and fastcox). Then, we show how to combine existing biological knowledge and available data with such algorithms to identify networks of cancer biomarkers and to estimate survival of patients. Finally, we describe in detail a new permutation-based approach to better validate the significance of the selection in terms of cancer gene signatures and pathway/networks identification. We illustrate the proposed methodology by means of both simulations and real case studies. Overall, the aim of our work is two-fold. Firstly, to show how network-based Cox regression models can be used to integrate biological knowledge (e.g., multi-omics data) for the analysis of survival data. Secondly, to provide a clear methodological and computational approach for
Diagnostic Measures for the Cox Regression Model with Missing Covariates
Zhu, Hongtu; Ibrahim, Joseph G.; Chen, Ming-Hui
2015-01-01
Summary This paper investigates diagnostic measures for assessing the influence of observations and model misspecification in the presence of missing covariate data for the Cox regression model. Our diagnostics include case-deletion measures, conditional martingale residuals, and score residuals. The Q-distance is proposed to examine the effects of deleting individual observations on the estimates of finite-dimensional and infinite-dimensional parameters. Conditional martingale residuals are used to construct goodness of fit statistics for testing possible misspecification of the model assumptions. A resampling method is developed to approximate the p-values of the goodness of fit statistics. Simulation studies are conducted to evaluate our methods, and a real data set is analyzed to illustrate their use. PMID:26903666
Cox Regression Models with Functional Covariates for Survival Data
Gellar, Jonathan E.; Colantuoni, Elizabeth; Needham, Dale M.; Crainiceanu, Ciprian M.
2015-01-01
We extend the Cox proportional hazards model to cases when the exposure is a densely sampled functional process, measured at baseline. The fundamental idea is to combine penalized signal regression with methods developed for mixed effects proportional hazards models. The model is fit by maximizing the penalized partial likelihood, with smoothing parameters estimated by a likelihood-based criterion such as AIC or EPIC. The model may be extended to allow for multiple functional predictors, time varying coefficients, and missing or unequally-spaced data. Methods were inspired by and applied to a study of the association between time to death after hospital discharge and daily measures of disease severity collected in the intensive care unit, among survivors of acute respiratory distress syndrome. PMID:26441487
ERIC Educational Resources Information Center
Chen, Chau-Kuang
2005-01-01
Logistic and Cox regression methods are practical tools used to model the relationships between certain student learning outcomes and their relevant explanatory variables. The logistic regression model fits an S-shaped curve into a binary outcome with data points of zero and one. The Cox regression model allows investigators to study the duration…
Xu, Haoming; Moni, Mohammad Ali; Liò, Pietro
2015-12-01
In cancer genomics, gene expression levels provide important molecular signatures for all types of cancer, and this could be very useful for predicting the survival of cancer patients. However, the main challenge of gene expression data analysis is high dimensionality: microarray data are characterised by a small number of samples and a large number of genes. To overcome this problem, a variety of penalised Cox proportional hazard models have been proposed. We introduce a novel network regularised Cox proportional hazard model and a novel multiplex network model to measure the disease comorbidities and to predict survival of the cancer patient. Our methods are applied to analyse seven microarray cancer gene expression datasets: breast cancer, ovarian cancer, lung cancer, liver cancer, renal cancer and osteosarcoma. Firstly, we applied a principal component analysis to reduce the dimensionality of the original gene expression data. Secondly, we applied a network regularised Cox regression model to the reduced gene expression datasets. By using a normalised mutual information method and a multiplex network model, we predict the comorbidities for liver cancer based on the integration of a diverse set of omics and clinical data, and we find the diseasome associations (disease-gene associations) among different cancers based on the identified common significant genes. Finally, we evaluated the precision of the approach with respect to the accuracy of survival prediction using ROC curves. We report that colon cancer, liver cancer and renal cancer share the CXCL5 gene, and breast cancer, ovarian cancer and renal cancer share the CCND2 gene. Our methods are useful for predicting patient survival and disease comorbidities more accurately and are helpful for improving the care of patients with comorbidity. Software in Matlab and R is available on our GitHub page: https://github.com/ssnhcom/NetworkRegularisedCox.git. PMID:26611766
Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q
2016-05-10
Conditional and unconditional logistic regression analyses are commonly used in case-control studies, whereas the Cox proportional hazards model is often used in survival data analysis. Most of the literature refers only to main-effect models; however, generalized linear models differ from general linear models, and interaction comprises multiplicative interaction and additive interaction. The former has only statistical significance, whereas the latter has biological significance. In this paper, macros were written in SAS 9.4 to calculate the contrast ratio, the attributable proportion due to interaction and the synergy index alongside the interaction terms of logistic and Cox regression, and Wald, delta and profile likelihood confidence intervals were used to evaluate additive interaction, as a reference for big data analysis in clinical epidemiology and for the analysis of genetic multiplicative and additive interactions. PMID:27188374
Mortality Prediction in ICUs Using A Novel Time-Slicing Cox Regression Method
Wang, Yuan; Chen, Wenlin; Heard, Kevin; Kollef, Marin H.; Bailey, Thomas C.; Cui, Zhicheng; He, Yujie; Lu, Chenyang; Chen, Yixin
2015-01-01
Over the last few decades, machine learning and data mining have been increasingly used for clinical prediction in ICUs. However, there is still a huge gap in making full use of the time-series data generated from ICUs. Aiming at filling this gap, we propose a novel approach entitled Time Slicing Cox regression (TS-Cox), which extends the classical Cox regression into a classification method on multi-dimensional time-series. Unlike traditional classifiers such as logistic regression and support vector machines, our model not only incorporates the discriminative features derived from the time-series, but also naturally exploits the temporal orders of these features based on a Cox-like function. Empirical evaluation on MIMIC-II database demonstrates the efficacy of the TS-Cox model. Our TS-Cox model outperforms all other baseline models by a good margin in terms of AUC_PR, sensitivity and PPV, which indicates that TS-Cox may be a promising tool for mortality prediction in ICUs. PMID:26958269
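For readers unfamiliar with two of the metrics named above, the following sketch computes sensitivity (recall) and positive predictive value (PPV, precision) from binary labels and predictions. It assumes labels coded 1 for death and 0 for survival; the function name and the toy vectors are hypothetical and unrelated to the MIMIC-II evaluation.

```python
def sensitivity_and_ppv(y_true, y_pred):
    """Return (sensitivity, PPV) for binary labels coded 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    if tp + fn == 0 or tp + fp == 0:
        raise ValueError("metric undefined: no positives or no positive predictions")
    return tp / (tp + fn), tp / (tp + fp)

# Hypothetical toy example: 4 true deaths, 3 predicted deaths, 2 of them correct.
sens, ppv = sensitivity_and_ppv([1, 1, 1, 0, 0, 1], [1, 1, 0, 1, 0, 0])
```

AUC_PR summarizes the trade-off between these two quantities as the classification threshold varies.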
Sneeringer, Stacy
2010-04-01
While a recent paper by Cox in this journal uses as its motivating factor the benefits of quantitative risk assessment, its content is entirely devoted to critiquing Sneeringer's article in the American Journal of Agricultural Economics. Cox's two main critiques of Sneeringer are fundamentally flawed and misrepresent the original article. Cox posits that Sneeringer did A and B, and then argues why A and B are incorrect. However, Sneeringer in fact did C and D; thus critiques of A and B are not applicable to Sneeringer's analysis. PMID:20345577
Box–Cox Transformation and Random Regression Models for Fecal egg Count Data
da Silva, Marcos Vinícius Gualberto Barbosa; Van Tassell, Curtis P.; Sonstegard, Tad S.; Cobuci, Jaime Araujo; Gasbarre, Louis C.
2012-01-01
Accurate genetic evaluation of livestock is based on appropriate modeling of phenotypic measurements. In ruminants, fecal egg count (FEC) is commonly used to measure resistance to nematodes. FEC values are not normally distributed and logarithmic transformations have been used in an effort to achieve normality before analysis. However, the transformed data are often still not normally distributed, especially when data are extremely skewed. A series of repeated FEC measurements may provide information about the population dynamics of a group or individual. A total of 6375 FEC measures were obtained for 410 animals between 1992 and 2003 from the Beltsville Agricultural Research Center Angus herd. Original data were transformed using an extension of the Box–Cox transformation to approach normality and to estimate (co)variance components. We also proposed using random regression models (RRM) for genetic and non-genetic studies of FEC. Phenotypes were analyzed using RRM and restricted maximum likelihood. Within the different orders of Legendre polynomials used, those with more parameters (order 4) adjusted FEC data best. Results indicated that the transformation of FEC data utilizing the Box–Cox transformation family was effective in reducing the skewness and kurtosis, and dramatically increased estimates of heritability, and measurements of FEC obtained in the period between 12 and 26 weeks in a 26-week experimental challenge period are genetically correlated. PMID:22303406
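As a point of reference, the standard one-parameter Box–Cox transformation can be sketched as below. The abstract refers to an extension of this family whose exact form is not given here, so the `shift` argument (one common way to handle zero counts) is an assumption for illustration rather than the authors' method.

```python
import math

def box_cox(y, lam, shift=0.0):
    """Standard Box-Cox transform of y + shift.

    lam == 0 gives log(y + shift); otherwise ((y + shift)**lam - 1) / lam.
    `shift` is a hypothetical offset for data containing zeros.
    """
    z = y + shift
    if z <= 0:
        raise ValueError("Box-Cox requires a strictly positive argument")
    if lam == 0:
        return math.log(z)
    return (z ** lam - 1.0) / lam

# lam = 1 is a plain shift by -1; lam = 0.5 is a rescaled square root.
transformed = [box_cox(v, 0.5) for v in (1, 4, 9)]
```

In practice the parameter lam is chosen from the data, for example by maximizing a likelihood, so that the transformed values are closer to normal.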
Pathway-gene identification for pancreatic cancer survival via doubly regularized Cox regression
2014-01-01
Background Recent global genomic analyses identified 69 gene sets and 12 core signaling pathways genetically altered in pancreatic cancer, which is a highly malignant disease. A comprehensive understanding of the genetic signatures and signaling pathways that are directly correlated to pancreatic cancer survival will help cancer researchers to develop effective multi-gene targeted, personalized therapies for the pancreatic cancer patients at different stages. A previous work that applied a LASSO penalized regression method, which only considered individual genetic effects, identified 12 genes associated with pancreatic cancer survival. Results In this work, we integrate pathway information into pancreatic cancer survival analysis. We introduce and apply a doubly regularized Cox regression model to identify both genes and signaling pathways related to pancreatic cancer survival. Conclusions Four signaling pathways, including Ion transport, immune phagocytosis, TGFβ (spermatogenesis), regulation of DNA-dependent transcription pathways, and 15 genes within the four pathways are identified and verified to be directly correlated to pancreatic cancer survival. Our findings can help cancer researchers design new strategies for the early detection and diagnosis of pancreatic cancer. PMID:24565114
Modern Regression Discontinuity Analysis
ERIC Educational Resources Information Center
Bloom, Howard S.
2012-01-01
This article provides a detailed discussion of the theory and practice of modern regression discontinuity (RD) analysis for estimating the effects of interventions or treatments. Part 1 briefly chronicles the history of RD analysis and summarizes its past applications. Part 2 explains how in theory an RD analysis can identify an average effect of…
Tosteson, Tor D.; Morden, Nancy E.; Stukel, Therese A.; O'Malley, A. James
2014-01-01
The estimation of treatment effects is one of the primary goals of statistics in medicine. Estimation based on observational studies is subject to confounding. Statistical methods for controlling bias due to confounding include regression adjustment, propensity scores and inverse probability weighted estimators. These methods require that all confounders are recorded in the data. The method of instrumental variables (IVs) can eliminate bias in observational studies even in the absence of information on confounders. We propose a method for integrating IVs within the framework of Cox's proportional hazards model and demonstrate the conditions under which it recovers the causal effect of treatment. The methodology is based on the approximate orthogonality of an instrument with unobserved confounders among those at risk. We derive an estimator as the solution to an estimating equation that resembles the score equation of the partial likelihood in much the same way as the traditional IV estimator resembles the normal equations. To justify this IV estimator for a Cox model we perform simulations to evaluate its operating characteristics. Finally, we apply the estimator to an observational study of the effect of coronary catheterization on survival. PMID:25506259
Multiple linear regression analysis
NASA Technical Reports Server (NTRS)
Edwards, T. R.
1980-01-01
Program rapidly selects best-suited set of coefficients. User supplies only vectors of independent and dependent data and specifies confidence level required. Program uses stepwise statistical procedure for relating minimal set of variables to set of observations; final regression contains only most statistically significant coefficients. Program is written in FORTRAN IV for batch execution and has been implemented on NOVA 1200.
Lee, Eunjee; Zhu, Hongtu; Kong, Dehan; Wang, Yalin; Giovanello, Kelly Sullivan; Ibrahim, Joseph G
2015-01-01
The aim of this paper is to develop a Bayesian functional linear Cox regression model (BFLCRM) with both functional and scalar covariates. This new development is motivated by establishing the likelihood of conversion to Alzheimer’s disease (AD) in 346 patients with mild cognitive impairment (MCI) enrolled in the Alzheimer’s Disease Neuroimaging Initiative 1 (ADNI-1) and the early markers of conversion. These 346 MCI patients were followed over 48 months, with 161 MCI participants progressing to AD at 48 months. The functional linear Cox regression model was used to establish that functional covariates including hippocampus surface morphology and scalar covariates including brain MRI volumes, cognitive performance (ADAS-Cog), and APOE status can accurately predict time to onset of AD. Posterior computation proceeds via an efficient Markov chain Monte Carlo algorithm. A simulation study is performed to evaluate the finite sample performance of BFLCRM. PMID:26900412
NASA Technical Reports Server (NTRS)
Kattan, Michael W.; Hess, Kenneth R.; Kattan, Michael W.
1998-01-01
New computationally intensive tools for medical survival analyses include recursive partitioning (also called CART) and artificial neural networks. A challenge that remains is to better understand the behavior of these techniques in an effort to know when they will be effective tools. Theoretically they may overcome limitations of the traditional multivariable survival technique, the Cox proportional hazards regression model. Experiments were designed to test whether the new tools would, in practice, overcome these limitations. Two datasets in which theory suggests CART and the neural network should outperform the Cox model were selected. The first was a published leukemia dataset manipulated to have a strong interaction that CART should detect. The second was a published cirrhosis dataset with pronounced nonlinear effects that a neural network should fit. Repeated sampling of 50 training and testing subsets was applied to each technique. The concordance index C was calculated as a measure of predictive accuracy by each technique on the testing dataset. In the interaction dataset, CART outperformed Cox (P less than 0.05) with a C improvement of 0.1 (95% CI, 0.08 to 0.12). In the nonlinear dataset, the neural network outperformed the Cox model (P less than 0.05), but by a very slight amount (0.015). As predicted by theory, CART and the neural network were able to overcome limitations of the Cox model. Experiments like these are important to increase our understanding of when one of these new techniques will outperform the standard Cox model. Further research is necessary to predict which technique will do best a priori and to assess the magnitude of superiority.
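The concordance index C used above can be sketched as follows, assuming the usual Harrell-type definition for right-censored data: among comparable pairs (the subject with the shorter time had an observed event), count how often the model assigns the higher risk score to the subject who failed earlier, with ties in score counted as one half. The function name and the toy data are hypothetical.

```python
def concordance_index(times, events, risk_scores):
    """Harrell-type C for right-censored data (higher score = higher risk)."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the earlier failure
        for j in range(n):
            if times[i] < times[j]:  # i failed before j was last observed
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable

# Hypothetical toy data: the third subject is censored at time 3.
toy_times = [1, 2, 3, 4]
toy_events = [1, 1, 0, 1]
c_perfect = concordance_index(toy_times, toy_events, [4, 3, 2, 1])
c_random = concordance_index(toy_times, toy_events, [1, 1, 1, 1])
```

A value of 0.5 corresponds to uninformative predictions and 1.0 to perfect ranking, which puts the reported improvements of 0.1 and 0.015 in context.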
Analysis of the correlation between P53 and Cox-2 expression and prognosis in esophageal cancer
CHEN, JUN; WU, FANG; PEI, HONG-LEI; GU, WEN-DONG; NING, ZHONG-HUA; SHAO, YING-JIE; HUANG, JIN
2015-01-01
The present study aimed to explore the importance of P53 and Cox-2 protein expression in esophageal cancer and assess their influence on prognosis. The expression of P53 and Cox-2 was assessed in esophageal cancer samples from 195 patients subjected to radical surgery at Changzhou First People's Hospital (Changzhou, China) between May 2010 and December 2011. Expression of P53 and Cox-2 proteins were detected in 60.5% (118/195) and 69.7% (136/195) of the samples, respectively, and were co-expressed in 43.1% (84/195) of the samples. A correlation was identified between P53 expression and overall survival (OS) (P=0.0351) as well as disease-free survival (DFS) (P=0.0307). In addition, the co-expression of P53 and Cox-2 also correlated with OS (P=0.0040) and DFS (P=0.0042). P53 expression (P=0.023), TNM staging (P<0.001) and P53/Cox-2 co-expression (P=0.009) were identified as independent factors affecting OS in patients with esophageal cancer via a Cox multivariate regression model analysis. A similar analysis also identified P53 expression (P=0.020), TNM staging (P<0.001) and P53/Cox-2 co-expression (P=0.008) as independent prognostic factors influencing DFS in these patients. Binary logistic regression analysis demonstrated a correlation between P53 expression (P=0.012), TNM staging (P<0.001), tumor differentiation level (P=0.023) and P53/Cox-2 co-expression (P=0.021), and local recurrence or distant esophageal cancer metastasis. The results of the present study indicate that P53 and Cox-2 proteins may act synergistically in the development of esophageal cancer, and the assessment of P53/Cox-2 co-expression status in esophageal cancer biopsies may become an important diagnostic criterion to evaluate the prognosis of patients with esophageal cancer. PMID:26622818
Precision Efficacy Analysis for Regression.
ERIC Educational Resources Information Center
Brooks, Gordon P.
When multiple linear regression is used to develop a prediction model, sample size must be large enough to ensure stable coefficients. If the derivation sample size is inadequate, the model may not predict well for future subjects. The precision efficacy analysis for regression (PEAR) method uses a cross-validity approach to select sample sizes…
Devarajan, Karthik; Ebrahimi, Nader
2010-01-01
The assumption of proportional hazards (PH) fundamental to the Cox PH model sometimes may not hold in practice. In this paper, we propose a generalization of the Cox PH model in terms of the cumulative hazard function taking a form similar to the Cox PH model, with the extension that the baseline cumulative hazard function is raised to a power function. Our model allows for interaction between covariates and the baseline hazard and it also includes, for the two sample problem, the case of two Weibull distributions and two extreme value distributions differing in both scale and shape parameters. The partial likelihood approach cannot be applied here to estimate the model parameters. We use the full likelihood approach via a cubic B-spline approximation for the baseline hazard to estimate the model parameters. A semi-automatic procedure for knot selection based on Akaike’s Information Criterion is developed. We illustrate the applicability of our approach using real-life data. PMID:21076652
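One way to write a model of the kind described, offered as a sketch under assumed notation rather than as the authors' exact formulation: let \Lambda_0(t) be the baseline cumulative hazard, x the covariate vector, and \beta and \gamma two coefficient vectors (all symbols are assumptions introduced here).

```latex
\begin{align}
  \text{Cox PH:}\quad
    & \Lambda(t \mid x) = \Lambda_0(t)\, \exp(\beta^{\top} x), \\
  \text{generalization:}\quad
    & \Lambda(t \mid x) = \bigl[\Lambda_0(t)\bigr]^{\exp(\gamma^{\top} x)}\, \exp(\beta^{\top} x).
\end{align}
```

Setting \gamma = 0 recovers the Cox PH model. With \Lambda_0(t) = t, the second line gives \Lambda(t \mid x) = t^{\exp(\gamma^{\top} x)} \exp(\beta^{\top} x), a Weibull cumulative hazard whose shape and scale both depend on x, which is consistent with the two-sample Weibull case mentioned in the abstract.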
Covariate analysis of survival data: a small-sample study of Cox's model
Johnson, M.E.; Tolley, H.D.; Bryson, M.C.; Goldman, A.S.
1982-09-01
Cox's proportional-hazards model is frequently used to adjust for covariate effects in survival-data analysis. The small-sample performances of the maximum partial likelihood estimators of the regression parameters in a two-covariate hazard function model are evaluated with respect to bias, variance, and power in hypothesis tests. Previous Monte Carlo work on the two-sample problem is reviewed.
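For reference, the estimators studied here maximize the Cox partial likelihood. In a sketch with assumed notation (x_i = (x_{i1}, x_{i2}) the two covariates, \delta_i the event indicator, and R(t_i) the set of subjects at risk at time t_i, with no tied event times), the model and the partial likelihood are:

```latex
\begin{align}
  h(t \mid x_i) &= h_0(t)\, \exp(\beta_1 x_{i1} + \beta_2 x_{i2}), \\
  L(\beta_1, \beta_2) &= \prod_{i:\, \delta_i = 1}
    \frac{\exp(\beta_1 x_{i1} + \beta_2 x_{i2})}
         {\sum_{j \in R(t_i)} \exp(\beta_1 x_{j1} + \beta_2 x_{j2})}.
\end{align}
```

The baseline hazard h_0(t) cancels from each factor, so the coefficients can be estimated without specifying it; small-sample bias and variance then refer to the distribution of the maximizer of L over repeated samples.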
Including network knowledge into Cox regression models for biomarker signature discovery.
Fröhlich, Holger
2014-03-01
Discovery of prognostic and diagnostic biomarker gene signatures for diseases, such as cancer, is seen as a major step toward a better personalized medicine. During the last decade various methods have been proposed for that purpose. However, one important obstacle for making gene signatures a standard tool in clinical diagnosis is the typical low reproducibility of these signatures combined with the difficulty to achieve a clear biological interpretation. For that purpose in the last years there has been a growing interest in approaches that try to integrate information from molecular interaction networks. Most of these methods focus on classification problems, that is learn a model from data that discriminates patients into distinct clinical groups. Far less has been published on approaches that predict a patient's event risk. In this paper, we investigate eight methods that integrate network information into multivariable Cox proportional hazard models for risk prediction in breast cancer. We compare the prediction performance of our tested algorithms via cross-validation as well as across different datasets. In addition, we highlight the stability and interpretability of obtained gene signatures. In conclusion, we find GeneRank-based filtering to be a simple, computationally cheap and highly predictive technique to integrate network information into event time prediction models. Signatures derived via this method are highly reproducible. PMID:24430933
Regression analysis of networked data
Zhou, Yan; Song, Peter X.-K.
2016-01-01
This paper concerns regression methodology for assessing relationships between multi-dimensional response variables and covariates that are correlated within a network. To address analytical challenges associated with the integration of network topology into the regression analysis, we propose a hybrid quadratic inference method that uses both prior and data-driven correlations among network nodes. A Godambe information-based tuning strategy is developed to allocate weights between the prior and data-driven network structures, so the estimator is efficient. The proposed method is conceptually simple and computationally fast, and has appealing large-sample properties. It is evaluated by simulation, and its application is illustrated using neuroimaging data from an association study of the effects of iron deficiency on auditory recognition memory in infants. PMID:27279658
Analysis of COX2 mutants reveals cytochrome oxidase subassemblies in yeast
2005-01-01
Cytochrome oxidase catalyses the reduction of oxygen to water. The mitochondrial enzyme contains up to 13 subunits, 11 in yeast, of which three, Cox1p, Cox2p and Cox3p, are mitochondrially encoded. The assembly pathway of this complex is still poorly understood. Its study in yeast has been so far impeded by the rapid turnover of unassembled subunits of the enzyme. In the present study, immunoblot analysis of blue native gels of yeast wild-type and Cox2p mutants revealed five cytochrome oxidase complexes or subcomplexes: a, b, c, d and f; a is likely to be the fully assembled enzyme; b lacks Cox6ap; d contains Cox7p and/or Cox7ap; f represents unassembled Cox1p; and c, observed only in the Cox2p mutants, contains Cox1p, Cox3p, Cox5p and Cox6p and lacks the other subunits. The identification of these novel cytochrome oxidase subcomplexes should encourage the reexamination of other yeast mutants. PMID:15921494
Gene-Based Association Analysis for Censored Traits Via Fixed Effect Functional Regressions.
Fan, Ruzong; Wang, Yifan; Yan, Qi; Ding, Ying; Weeks, Daniel E; Lu, Zhaohui; Ren, Haobo; Cook, Richard J; Xiong, Momiao; Swaroop, Anand; Chew, Emily Y; Chen, Wei
2016-02-01
Genetic studies of survival outcomes have been proposed and conducted recently, but statistical methods for identifying genetic variants that affect disease progression are rarely developed. Motivated by our ongoing real studies, here we develop Cox proportional hazard models using functional regression (FR) to perform gene-based association analysis of survival traits while adjusting for covariates. The proposed Cox models are fixed effect models where the genetic effects of multiple genetic variants are assumed to be fixed. We introduce likelihood ratio test (LRT) statistics to test for associations between the survival traits and multiple genetic variants in a genetic region. Extensive simulation studies demonstrate that the proposed Cox FR LRT statistics have well-controlled type I error rates. To evaluate power, we compare the Cox FR LRT with the previously developed burden test (BT) in a Cox model and sequence kernel association test (SKAT), which is based on mixed effect Cox models. The Cox FR LRT statistics have higher power than, or similar power to, the Cox SKAT LRT except when 50%/50% of causal variants had negative/positive effects and all causal variants are rare. In addition, the Cox FR LRT statistics have higher power than the Cox BT LRT. The models and related test statistics can be useful in whole genome and whole exome association studies. An age-related macular degeneration dataset was analyzed as an example. PMID:26782979
Regression Analysis by Example. 5th Edition
ERIC Educational Resources Information Center
Chatterjee, Samprit; Hadi, Ali S.
2012-01-01
Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. "Regression Analysis by Example, Fifth Edition" has been expanded and thoroughly…
Dewi, Lestari
2016-01-01
Introduction: The enzyme cyclooxygenase (COX) is an enzyme that catalyzes the formation of one of the mediators of inflammation, the prostaglandins. Inhibition of COX allegedly can improve inflammation-induced pathological conditions. Aim: The purpose of the present study was to evaluate the potential of Sargassum sp. components, Fucoidan and alginate, as COX inhibitors. Material and methods: The study was conducted by means of a computational (in silico) method. It was performed in two main stages, the docking between COX-1 and COX-2 with Fucoidan, alginate and aspirin (for comparison) and the analysis of the amount of interactions formed and the residues directly involved in the process of interaction. Results: Our results showed that both Fucoidan and alginate had an excellent potential as inhibitors of COX-1 and COX-2. Fucoidan had a better potential as an inhibitor of COX than alginate. COX inhibition was expected to provide a more favorable effect on inflammation-related pathological conditions. Conclusion: The active compounds Fucoidan and alginate derived from Sargassum sp. were suspected to possess a good potential as inhibitors of COX-1 and COX-2. PMID:27594740
Ternès, Nils; Rotolo, Federico; Michiels, Stefan
2016-07-10
Correct selection of prognostic biomarkers among multiple candidates is becoming increasingly challenging as the dimensionality of biological data becomes higher. Therefore, minimizing the false discovery rate (FDR) is of primary importance, while a low false negative rate (FNR) is a complementary measure. The lasso is a popular selection method in Cox regression, but its results depend heavily on the penalty parameter λ. Usually, λ is chosen using maximum cross-validated log-likelihood (max-cvl). However, this method often has a very high FDR. We review methods for a more conservative choice of λ. We propose an empirical extension of the cvl by adding a penalization term, which trades off between the goodness-of-fit and the parsimony of the model, leading to the selection of fewer biomarkers and, as we show, to the reduction of the FDR without a large increase in FNR. We conducted a simulation study considering null and moderately sparse alternative scenarios and compared our approach with the standard lasso and 10 other competitors: Akaike information criterion (AIC), corrected AIC, Bayesian information criterion (BIC), extended BIC, Hannan and Quinn information criterion (HQIC), risk information criterion (RIC), one-standard-error rule, adaptive lasso, stability selection, and percentile lasso. Our extension achieved the best compromise across all the scenarios between a reduction of the FDR and a limited increase in the FNR, followed by the AIC, the RIC, and the adaptive lasso, which performed well in some settings. We illustrate the methods using gene expression data of 523 breast cancer patients. In conclusion, we propose to apply our extension to the lasso whenever a stringent FDR with a limited FNR is targeted. Copyright © 2016 John Wiley & Sons, Ltd. PMID:26970107
Khosravi, Bahareh; Pourahmad, Saeedeh; Bahreini, Amin; Nikeghbalian, Saman; Mehrdad, Goli
2015-01-01
Background: Transplantation is the only treatment for patients with liver failure. Since the therapy imposes high expenses on patients and the community, identifying the factors that affect the survival of such patients after transplantation is valuable. Objectives: The current study attempted to model the survival of patients (two years old and above) after liver transplantation using neural network and Cox Proportional Hazards (Cox PH) regression models. The event is defined as death due to complications of liver transplantation. Patients and Methods: In a historical cohort study, the clinical findings of 1168 patients who underwent liver transplant surgery (from March 2008 to March 2013) at Shiraz Namazee Hospital Organ Transplantation Center, Shiraz, Southern Iran, were used. To model the one- to five-year survival of such patients, a Cox PH regression model and a three-layer feed-forward artificial neural network (ANN) were applied to the data separately, and their prediction accuracy was compared using the area under the receiver operating characteristic (ROC) curve. Furthermore, the Kaplan-Meier method was used to estimate the survival probabilities in different years. Results: The estimated one- to five-year survival probabilities for the patients were 91%, 89%, 85%, 84%, and 83%, respectively. The areas under the ROC curve were 86.4% and 80.7% for the ANN and Cox PH models, respectively. In addition, the prediction accuracy of the ANN and Cox PH methods was the same, 92.73%. Conclusions: The present study found more accurate results for the ANN method than for the Cox PH model in analyzing the survival of patients with liver transplantation. Furthermore, the order of effective factors in patients’ survival after transplantation was clinically more acceptable. An advantage of this study was the large dataset with little missing data, which makes the results more reliable. PMID:26500682
Regression analysis of cytopathological data
Whittemore, A.S.; McLarty, J.W.; Fortson, N.; Anderson, K.
1982-12-01
Epithelial cells from the human body are frequently labelled according to one of several ordered levels of abnormality, ranging from normal to malignant. The label of the most abnormal cell in a specimen determines the score for the specimen. This paper presents a model for the regression of specimen scores against continuous and discrete variables, as in host exposure to carcinogens. Application to data and tests for adequacy of model fit are illustrated using sputum specimens obtained from a cohort of former asbestos workers.
Lee, Paul H.
2016-01-01
Healthy adults are advised to perform at least 150 min of moderate-intensity physical activity weekly, but this advice is based on studies using self-reports of questionable validity. This study examined the dose-response relationship of accelerometer-measured physical activity and sedentary behaviors with all-cause mortality using segmented Cox regression to empirically determine the break-points of the dose-response relationship. Data from 7006 adult participants aged 18 or above in the National Health and Nutrition Examination Survey waves 2003–2004 and 2005–2006 were included in the analysis and linked with death certificate data using a probabilistic matching approach in the National Death Index through December 31, 2011. Physical activity and sedentary behavior were measured using an ActiGraph model 7164 accelerometer over the right hip for 7 consecutive days. Minutes with accelerometer counts <100, 1952–5724, and ≥5725 were classified as sedentary, moderate-intensity physical activity, and vigorous-intensity physical activity, respectively. Segmented Cox regression was used to estimate the hazard ratio (HR) of time spent in sedentary behaviors, moderate-intensity physical activity, and vigorous-intensity physical activity and all-cause mortality, adjusted for demographic characteristics, health behaviors, and health conditions. Data were analyzed in 2016. During 47,119 person-years of follow-up, 608 deaths occurred. Each additional hour per day of sedentary behaviors was associated with an HR of 1.15 (95% CI 1.01, 1.31) among participants who spent at least 10.9 h per day on sedentary behaviors, and each additional minute per day spent on moderate-intensity physical activity was associated with an HR of 0.94 (95% CI 0.91, 0.96) among participants with daily moderate-intensity physical activity ≤14.1 min. Associations of moderate physical activity and sedentary behaviors with all-cause mortality were independent of each other. To conclude, evidence from
Regression Analysis and the Sociological Imagination
ERIC Educational Resources Information Center
De Maio, Fernando
2014-01-01
Regression analysis is an important aspect of most introductory statistics courses in sociology but is often presented in contexts divorced from the central concerns that bring students into the discipline. Consequently, we present five lesson ideas that emerge from a regression analysis of income inequality and mortality in the USA and Canada.
Regression Analysis: Legal Applications in Institutional Research
ERIC Educational Resources Information Center
Frizell, Julie A.; Shippen, Benjamin S., Jr.; Luna, Andrew L.
2008-01-01
This article reviews multiple regression analysis, describes how its results should be interpreted, and instructs institutional researchers on how to conduct such analyses using an example focused on faculty pay equity between men and women. The use of multiple regression analysis will be presented as a method with which to compare salaries of…
2014-01-01
Background Large-scale public health interventions with rapid scale-up are increasingly being implemented worldwide. Such implementation allows for a large target population to be reached in a short period of time. But when the time comes to investigate the effectiveness of these interventions, the rapid scale-up creates several methodological challenges, such as the lack of baseline data and the absence of control groups. One example of such an intervention is Avahan, the India HIV/AIDS initiative of the Bill & Melinda Gates Foundation. One question of interest is the effect of Avahan on condom use by female sex workers with their clients. By retrospectively reconstructing condom use and sex work history from survey data, it is possible to estimate how condom use rates evolve over time. However formal inference about how this rate changes at a given point in calendar time remains challenging. Methods We propose a new statistical procedure based on a mixture of binomial regression and Cox regression. We compare this new method to an existing approach based on generalized estimating equations through simulations and application to Indian data. Results Both methods are unbiased, but the proposed method is more powerful than the existing method, especially when initial condom use is high. When applied to the Indian data, the new method mostly agrees with the existing method, but seems to have corrected some implausible results of the latter in a few districts. We also show how the new method can be used to analyze the data of all districts combined. Conclusions The use of both methods can be recommended for exploratory data analysis. However for formal statistical inference, the new method has better power. PMID:24397563
Box-Cox Mixed Logit Model for Travel Behavior Analysis
NASA Astrophysics Data System (ADS)
Orro, Alfonso; Novales, Margarita; Benitez, Francisco G.
2010-09-01
To represent the behavior of travelers when they are deciding how they are going to get to their destination, discrete choice models, based on the random utility theory, have become one of the most widely used tools. The field in which these models were developed was halfway between econometrics and transport engineering, although the latter now constitutes one of their principal areas of application. In the transport field, they have mainly been applied to mode choice, but also to the selection of destination, route, and other important decisions such as vehicle ownership. In usual practice, the most frequently employed discrete choice models implement a fixed coefficient utility function that is linear in the parameters. The principal aim of this paper is to present the viability of specifying utility functions with random coefficients that are nonlinear in the parameters, in applications of discrete choice models to transport. Nonlinear specifications in the parameters were present in discrete choice theory at its outset, although they have seldom been used in practice until recently. The specification of random coefficients, however, began with the probit and the hedonic models in the 1970s, and, after a period of apparently little practical interest, has burgeoned into a field of intense activity in recent years with the new generation of mixed logit models. In this communication, we present a Box-Cox mixed logit model developed by the authors. It includes the estimation of the Box-Cox exponents in addition to the parameters of the random coefficients distribution. The probability of choosing an alternative is an integral that is calculated by simulation. The estimation of the model is carried out by maximizing the simulated log-likelihood of a sample of observed individual choices between alternatives. The differences between the predictions yielded by models that are inconsistent with real behavior have been studied with simulation experiments.
Meta-analysis of cyclooxygenase-2 (COX-2) 765G>C polymorphism and Alzheimer's disease.
Su, Jianhua; Wen, Shihong; Zhu, Jinlong; Liu, Ruiping; Yang, Jinsong
2016-09-01
The cyclooxygenase-2 (COX-2) 765G>C polymorphism has been extensively investigated for association with Alzheimer's disease (AD). However, results of different studies have been inconsistent. The aim of the present meta-analysis was to evaluate the association between the 765G>C polymorphism of the COX-2 gene and susceptibility to AD. We searched all related subjects in PubMed, Embase, SinoMed, and China Knowledge Resource Integrated Database and identified seven studies that reported a relationship between the COX-2 765G>C polymorphism and AD. A total of 1260 cases and 1112 controls were included in the seven studies. Our data suggest that the COX-2 765G>C polymorphism may decrease the risk of AD in five genetic models. As a result, this meta-analysis suggests the 765G>C polymorphism of the COX-2 gene may be a protective factor for AD. As our sample size was limited, large-scale, well-designed studies are necessary to validate the association between the COX-2 765G>C polymorphism and AD. PMID:27443496
Using Regression Analysis: A Guided Tour.
ERIC Educational Resources Information Center
Shelton, Fred Ames
1987-01-01
Discusses the use and interpretation of multiple regression analysis with computer programs and presents a flow chart of the process. A general explanation of the flow chart is provided, followed by an example showing the development of a linear equation which could be used in estimating manufacturing overhead cost. (Author/LRW)
Commonality Analysis for the Regression Case.
ERIC Educational Resources Information Center
Murthy, Kavita
Commonality analysis is a procedure for decomposing the coefficient of determination (R superscript 2) in multiple regression analyses into the percent of variance in the dependent variable associated with each independent variable uniquely, and the proportion of explained variance associated with the common effects of predictors in various…
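For the simplest case of two predictors, the decomposition described above reduces to three differences of R² values. The following is a minimal sketch under that two-predictor assumption; the function and key names are hypothetical and chosen only for illustration.

```python
def commonality_two_predictors(r2_x1, r2_x2, r2_both):
    """Split the full-model R^2 into unique and common parts (two predictors).

    r2_x1, r2_x2: R^2 of the models using only x1 or only x2.
    r2_both:      R^2 of the model using both predictors.
    """
    return {
        # Variance explained by x1 beyond what x2 already explains.
        "unique_x1": r2_both - r2_x2,
        # Variance explained by x2 beyond what x1 already explains.
        "unique_x2": r2_both - r2_x1,
        # Variance that either predictor could account for.
        "common": r2_x1 + r2_x2 - r2_both,
    }
```

By construction the three components add up to `r2_both`. The common component can be negative when suppression effects are present.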
Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G.; Shah, Arvind K.; Lin, Jianxin
2013-01-01
In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data (IPD) in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the Deviance Information Criterion (DIC) is used to select the best transformation model. Since the model is quite complex, a novel Monte Carlo Markov chain (MCMC) sampling scheme is developed to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol lowering drugs where the goal is to jointly model the three dimensional response consisting of Low Density Lipoprotein Cholesterol (LDL-C), High Density Lipoprotein Cholesterol (HDL-C), and Triglycerides (TG) (LDL-C, HDL-C, TG). Since the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately: however, a multivariate approach would be more appropriate since these variables are correlated with each other. A detailed analysis of these data is carried out using the proposed methodology. PMID:23580436
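For reference, the standard one-parameter Box-Cox transformation that such models apply to each response can be sketched as follows. This shows only the transformation itself, not the authors' Bayesian model or MCMC scheme; the function name is hypothetical, and in the setting of the abstract each response would have its own exponent `lam`.

```python
import math

def box_cox(y, lam):
    """Standard Box-Cox transform of a positive value y with exponent lam."""
    if y <= 0:
        raise ValueError("Box-Cox requires y > 0")
    if lam == 0:
        # The lam -> 0 limit of (y**lam - 1) / lam is log(y).
        return math.log(y)
    return (y ** lam - 1.0) / lam
```

With `lam = 1` the transform is a simple shift, and with `lam = 0` it is the natural logarithm, which is often enough to reduce right skew in positive-valued measurements.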
Robust Mediation Analysis Based on Median Regression
Yuan, Ying; MacKinnon, David P.
2014-01-01
Mediation analysis has many applications in psychology and the social sciences. The most prevalent methods typically assume that the error distribution is normal and homoscedastic. However, this assumption may rarely be met in practice, which can affect the validity of the mediation analysis. To address this problem, we propose robust mediation analysis based on median regression. Our approach is robust to various departures from the assumption of homoscedasticity and normality, including heavy-tailed, skewed, contaminated, and heteroscedastic distributions. Simulation studies show that under these circumstances, the proposed method is more efficient and powerful than standard mediation analysis. We further extend the proposed robust method to multilevel mediation analysis, and demonstrate through simulation studies that the new approach outperforms the standard multilevel mediation analysis. We illustrate the proposed method using data from a program designed to increase reemployment and enhance mental health of job seekers. PMID:24079925
Wong, May C M; Lam, K F; Lo, Edward C M
2006-02-15
In some controlled clinical trials in dental research, multiple failure time data from the same patient are frequently observed, resulting in clustered multiple failure times. Moreover, the treatments are often delivered by more than one operator, and thus the multiple failure times are clustered according to a multilevel structure when the operator effects are assumed to be random. In practice, it is often too expensive or even impossible to monitor the study subjects continuously, so they are examined periodically at regular pre-scheduled visits. Hence, discrete or grouped clustered failure time data are collected. The aim of this paper is to illustrate the use of the Monte Carlo Markov chain (MCMC) approach and non-informative prior in a Bayesian framework to mimic the maximum likelihood (ML) estimation in a frequentist approach in multilevel modelling of clustered grouped survival data. A three-level model with additive variance components model for the random effects is considered in this paper. Both the grouped proportional hazards model and the dynamic logistic regression model are used. The approximate intra-cluster correlation of the log failure times can be estimated when the grouped proportional hazards model is used. The statistical package WinBUGS is adopted to estimate the parameter of interest based on the MCMC method. The models and method are applied to a data set obtained from a prospective clinical study on a cohort of Chinese school children in whom atraumatic restorative treatment (ART) restorations were placed on permanent teeth with carious lesions. Altogether 284 ART restorations were placed by five dentists and the clinical status of the ART restorations was evaluated annually for 6 years after placement; thus clustered grouped failure times of the restorations were recorded. Results based on the grouped proportional hazards model revealed that clustering effect among the log failure times of the different restorations from the same child was
A method for nonlinear exponential regression analysis
NASA Technical Reports Server (NTRS)
Junkin, B. G.
1971-01-01
A computer-oriented technique is presented for performing a nonlinear exponential regression analysis on decay-type experimental data. The technique involves the least squares procedure wherein the nonlinear problem is linearized by expansion in a Taylor series. A linear curve fitting procedure for determining the initial nominal estimates for the unknown exponential model parameters is included as an integral part of the technique. A correction matrix was derived and then applied to the nominal estimate to produce an improved set of model parameters. The solution cycle is repeated until some predetermined criterion is satisfied.
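The procedure described above resembles a Gauss-Newton iteration. As a minimal sketch, assuming the simplest decay model y = a·exp(b·t) with positive observations (the report's actual model and convergence criterion are not given here, so both are assumptions), it could look like this:

```python
import math

def fit_exponential(t, y, tol=1e-10, max_iter=50):
    """Fit y ~ a * exp(b * t) by repeated linearization (Gauss-Newton)."""
    n = len(t)
    # Initial nominal estimates: straight-line fit of log(y) on t (needs y > 0).
    ly = [math.log(v) for v in y]
    t_mean, ly_mean = sum(t) / n, sum(ly) / n
    sxx = sum((ti - t_mean) ** 2 for ti in t)
    sxy = sum((ti - t_mean) * (li - ly_mean) for ti, li in zip(t, ly))
    b = sxy / sxx
    a = math.exp(ly_mean - b * t_mean)

    for _ in range(max_iter):
        # First-order Taylor expansion of the model around the current (a, b).
        ja = [math.exp(b * ti) for ti in t]            # d(model)/da
        jb = [a * ti * math.exp(b * ti) for ti in t]   # d(model)/db
        r = [yi - a * e for yi, e in zip(y, ja)]       # current residuals
        # Normal equations (J^T J) d = J^T r, solved directly for the 2x2 case.
        s_aa = sum(u * u for u in ja)
        s_ab = sum(u * v for u, v in zip(ja, jb))
        s_bb = sum(v * v for v in jb)
        g_a = sum(u * ri for u, ri in zip(ja, r))
        g_b = sum(v * ri for v, ri in zip(jb, r))
        det = s_aa * s_bb - s_ab * s_ab
        da = (g_a * s_bb - g_b * s_ab) / det
        db = (s_aa * g_b - s_ab * g_a) / det
        a, b = a + da, b + db                          # apply the correction
        if abs(da) + abs(db) < tol:                    # assumed stopping rule
            break
    return a, b
```

The log-linear fit supplies the starting values, and each pass applies a correction to them until the correction becomes negligible, mirroring the solution cycle in the abstract.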
Sibling dilution hypothesis: a regression surface analysis.
Marjoribanks, K
2001-08-01
This study examined relationships between sibship size (the number of children in a family), birth order, and measures of academic performance, academic self-concept, and educational aspirations at different levels of family educational resources. As part of a national longitudinal study of Australian secondary school students data were collected from 2,530 boys and 2,450 girls in Years 9 and 10. Regression surfaces were constructed from models that included terms to account for linear, interaction, and curvilinear associations among the variables. Analysis suggests the general propositions (a) family educational resources have significant associations with children's school-related outcomes at different levels of sibling variables, the relationships for girls being curvilinear, and (b) sibling variables continue to have small significant associations with affective and cognitive outcomes, after taking into account variations in family educational resources. That is, the investigation provides only partial support for the sibling dilution hypothesis. PMID:11729548
Technological Forecasting with a Multiple Regression Analysis Approach.
ERIC Educational Resources Information Center
Luftig, Jeffrey T.; Norton, Willis P.
1981-01-01
This article examines simple and multiple regression analysis as forecasting tools, and details the process by which multiple regression analysis may be used to increase the accuracy of the technology forecast. (CT)
ERIC Educational Resources Information Center
Williams, John D.; Lindem, Alfred C.
Four computer programs using the general purpose multiple linear regression program have been developed. Setwise regression analysis is a stepwise procedure for sets of variables; there will be as many steps as there are sets. Covarmlt allows a solution to the analysis of covariance design with multiple covariates. A third program has three…
A rotor optimization using regression analysis
NASA Technical Reports Server (NTRS)
Giansante, N.
1984-01-01
The design and development of helicopter rotors is subject to the many design variables and their interactions that affect rotor operation. Until recently, selection of rotor design variables to achieve specified rotor operational qualities has been a costly, time consuming, repetitive task. For the past several years, Kaman Aerospace Corporation has successfully applied multiple linear regression analysis, coupled with optimization and sensitivity procedures, in the analytical design of rotor systems. It is concluded that approximating equations can be developed rapidly for a multiplicity of objective and constraint functions and optimizations can be performed in a rapid and cost effective manner; the number and/or range of design variables can be increased by expanding the data base and developing approximating functions to reflect the expanded design space; the order of the approximating equations can be expanded easily to improve correlation between analyzer results and the approximating equations; gradients of the approximating equations can be calculated easily and these gradients are smooth functions reducing the risk of numerical problems in the optimization; the use of approximating functions allows the problem to be started easily and rapidly from various initial designs to enhance the probability of finding a global optimum; and the approximating equations are independent of the analysis or optimization codes used.
Using Dominance Analysis to Determine Predictor Importance in Logistic Regression
ERIC Educational Resources Information Center
Azen, Razia; Traxel, Nicole
2009-01-01
This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R[superscript 2] analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A…
Sliced Inverse Regression for Time Series Analysis
NASA Astrophysics Data System (ADS)
Chen, Li-Sue
1995-11-01
In this thesis, general nonlinear models for time series data are considered. A basic form is $x_{t} = f(\beta_{1}^{T}X_{t-1}, \beta_{2}^{T}X_{t-1}, \ldots, \beta_{k}^{T}X_{t-1}, \varepsilon_{t})$, where $x_{t}$ is an observed time series, $X_{t}$ is the first $d$ time lag vector, $(x_{t}, x_{t-1}, \ldots, x_{t-d-1})$, $f$ is an unknown function, the $\beta_{i}$'s are unknown vectors, and the $\varepsilon_{t}$'s are independently distributed. Special cases include AR and TAR models. We investigate the feasibility of applying SIR/PHD (Li 1990, 1991) (the sliced inverse regression and principal Hessian methods) in estimating the $\beta_{i}$'s. PCA (principal component analysis) is brought in to check one critical condition for SIR/PHD. Through simulation and a study on 3 well-known data sets of Canadian lynx, U.S. unemployment rate and sunspot numbers, we demonstrate how SIR/PHD can effectively retrieve the interesting low-dimension structures for time series data.
Choi, In-Wook; Kim, Hwang-Yong; Quan, Juan-Hua; Ryu, Jae-Gee; Sun, Rubing; Lee, Young-Ha
2015-01-01
Fascioliasis, a food-borne trematode zoonosis, is a disease primarily in cattle and sheep and occasionally in humans. Water dropwort (Oenanthe javanica), an aquatic perennial herb, is a common second intermediate host of Fasciola, and the fresh stems and leaves are widely used as a seasoning in the Korean diet. However, no information regarding Fasciola species contamination in water dropwort is available. Here, we collected 500 samples of water dropwort in 3 areas in Korea during February and March 2015, and the water dropwort contamination of Fasciola species was monitored by DNA sequencing analysis of the Fasciola hepatica and Fasciola gigantica specific mitochondrial cytochrome c oxidase subunit 1 (cox1) and nuclear ribosomal internal transcribed spacer 2 (ITS-2). Among the 500 samples assessed, F. hepatica cox1 and ITS-2 markers were detected in 2 samples, and F. hepatica contamination was confirmed by sequencing analysis. The nucleotide sequences of cox1 PCR products from the 2 F. hepatica-contaminated samples were 96.5% identical to the F. hepatica cox1 sequences in GenBank, whereas F. gigantica cox1 sequences were 46.8% similar to the sequence detected from the cox1 positive samples. However, F. gigantica cox1 and ITS-2 markers were not detected by PCR in the 500 samples of water dropwort. Collectively, in this survey of the water dropwort contamination with Fasciola species, a very low prevalence of F. hepatica contamination was detected in the samples. PMID:26537044
Giganti, Mark J.; Luz, Paula M.; Caro-Vega, Yanink; Cesar, Carina; Padgett, Denis; Koenig, Serena; Echevarria, Juan; McGowan, Catherine C.; Shepherd, Bryan E.
2015-01-01
Many studies of HIV/AIDS aggregate data from multiple cohorts to improve power and generalizability. There are several analysis approaches to account for cross-cohort heterogeneity; we assessed how different approaches can impact results from an HIV/AIDS study investigating predictors of mortality. Using data from 13,658 HIV-infected patients starting antiretroviral therapy from seven Latin American and Caribbean cohorts, we illustrate the assumptions of seven readily implementable approaches to account for across cohort heterogeneity with Cox proportional hazards models, and we compare hazard ratio estimates across approaches. As a sensitivity analysis, we modify cohort membership to generate specific heterogeneity conditions. Hazard ratio estimates varied slightly between the seven analysis approaches, but differences were not clinically meaningful. Adjusted hazard ratio estimates for the association between AIDS at treatment initiation and death varied from 2.00 to 2.20 across approaches that accounted for heterogeneity; the adjusted hazard ratio was estimated as 1.73 in analyses that ignored across cohort heterogeneity. In sensitivity analyses with more extreme heterogeneity, we noted a slightly greater distinction between approaches. Despite substantial heterogeneity between cohorts, the impact of the specific approach to account for heterogeneity was minimal in our case study. Our results suggest that it is important to account for across cohort heterogeneity in analyses, but that the specific technique for addressing heterogeneity may be less important. Because of their flexibility in accounting for cohort heterogeneity, we prefer stratification or meta-analysis methods, but we encourage investigators to consider their specific study conditions and objectives. PMID:25647087
Giganti, Mark J; Luz, Paula M; Caro-Vega, Yanink; Cesar, Carina; Padgett, Denis; Koenig, Serena; Echevarria, Juan; McGowan, Catherine C; Shepherd, Bryan E
2015-05-01
Many studies of HIV/AIDS aggregate data from multiple cohorts to improve power and generalizability. There are several analysis approaches to account for cross-cohort heterogeneity; we assessed how different approaches can impact results from an HIV/AIDS study investigating predictors of mortality. Using data from 13,658 HIV-infected patients starting antiretroviral therapy from seven Latin American and Caribbean cohorts, we illustrate the assumptions of seven readily implementable approaches to account for across cohort heterogeneity with Cox proportional hazards models, and we compare hazard ratio estimates across approaches. As a sensitivity analysis, we modify cohort membership to generate specific heterogeneity conditions. Hazard ratio estimates varied slightly between the seven analysis approaches, but differences were not clinically meaningful. Adjusted hazard ratio estimates for the association between AIDS at treatment initiation and death varied from 2.00 to 2.20 across approaches that accounted for heterogeneity; the adjusted hazard ratio was estimated as 1.73 in analyses that ignored across cohort heterogeneity. In sensitivity analyses with more extreme heterogeneity, we noted a slightly greater distinction between approaches. Despite substantial heterogeneity between cohorts, the impact of the specific approach to account for heterogeneity was minimal in our case study. Our results suggest that it is important to account for across cohort heterogeneity in analyses, but that the specific technique for addressing heterogeneity may be less important. Because of their flexibility in accounting for cohort heterogeneity, we prefer stratification or meta-analysis methods, but we encourage investigators to consider their specific study conditions and objectives. PMID:25647087
Topics in route-regression analysis
Geissler, P.H.; Sauer, J.R.
1990-01-01
The route-regression method has been used in recent years to analyze data from roadside surveys. With this method, a population trend is estimated for each route in a region, then regional trends are estimated as a weighted mean of the individual route trends. This method can accurately incorporate data that is unbalanced by changes in years surveyed and observer differences. We suggest that route-regression methodology is most efficient in the estimation of long-term (>5 year) trends, and tends to provide conservative results for low-density species.
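The two-step idea in the abstract above (a trend per route, then a weighted mean across routes) can be sketched as follows. This is a minimal illustration and not the authors' estimator: the log-linear fit, the 0.5 offset added to counts, and the function names `route_trend` and `regional_trend` are all assumptions made for this example, and the weights are supplied by the caller because the abstract does not specify them.

```python
import math

def route_trend(years, counts):
    """Least-squares slope of log(count + 0.5) on year for a single route.

    The log transform and the 0.5 offset are assumptions for illustration.
    """
    logs = [math.log(c + 0.5) for c in counts]
    n = len(years)
    year_mean = sum(years) / n
    log_mean = sum(logs) / n
    sxx = sum((y - year_mean) ** 2 for y in years)
    sxy = sum((y - year_mean) * (l - log_mean) for y, l in zip(years, logs))
    return sxy / sxx

def regional_trend(routes, weights):
    """Weighted mean of the individual route trends.

    routes  -- list of (years, counts) pairs, one per route
    weights -- one non-negative weight per route (hypothetical choice)
    """
    slopes = [route_trend(years, counts) for years, counts in routes]
    return sum(w * s for w, s in zip(weights, slopes)) / sum(weights)
```

With equal weights the regional trend is the plain average of the route slopes; unequal weights let better-surveyed routes count for more.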
Crager, Michael R.; Tang, Gong
2015-01-01
We propose a method for assessing an individual patient’s risk of a future clinical event using clinical trial or cohort data and Cox proportional hazards regression, combining the information from several studies using meta-analysis techniques. The method combines patient-specific estimates of the log cumulative hazard across studies, weighting by the relative precision of the estimates, using either fixed- or random-effects meta-analysis calculations. Risk assessment can be done for any future patient using a few key summary statistics determined once and for all from each study. Generalizations of the method to logistic regression and linear models are immediate. We evaluate the methods using simulation studies and illustrate their application using real data. PMID:26664111
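The fixed-effects version of the combination step described above can be sketched with inverse-variance weights. This is a hedged illustration only: the function names are hypothetical, the per-study estimates and variances are assumed to be already available, and the random-effects variant mentioned in the abstract is not shown.

```python
import math

def pool_log_cumulative_hazard(estimates, variances):
    """Inverse-variance (fixed-effects) pooling of per-study estimates.

    estimates -- patient-specific log cumulative hazard from each study
    variances -- variance of each estimate (precision = 1 / variance)
    Returns the pooled estimate and its variance.
    """
    weights = [1.0 / v for v in variances]
    total = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total
    return pooled, 1.0 / total

def risk_from_log_cumulative_hazard(log_cum_hazard):
    """Event probability: 1 - S(t), with S(t) = exp(-H(t)) and H = exp(log H)."""
    return 1.0 - math.exp(-math.exp(log_cum_hazard))
```

Pooling on the log cumulative hazard scale and transforming back at the end keeps the resulting risk between 0 and 1.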
Joint regression analysis for discrete longitudinal data.
Madsen, L; Fang, Y
2011-09-01
We introduce an approximation to the Gaussian copula likelihood of Song, Li, and Yuan (2009, Biometrics 65, 60-68) used to estimate regression parameters from correlated discrete or mixed bivariate or trivariate outcomes. Our approximation allows estimation of parameters from response vectors of length much larger than three, and is asymptotically equivalent to the Gaussian copula likelihood. We estimate regression parameters from the toenail infection data of De Backer et al. (1996, British Journal of Dermatology 134, 16-17), which consist of binary response vectors of length seven or less from 294 subjects. Although maximizing the Gaussian copula likelihood yields estimators that are asymptotically more efficient than generalized estimating equation (GEE) estimators, our simulation study illustrates that for finite samples, GEE estimators can actually be as much as 20% more efficient. PMID:21039391
Xiao, Zengming; Wu, Hao; Wu, Yang
2013-01-01
Background: Numerous studies examining the relationship between Cyclooxygenase-2 (COX-2) immunoexpression and clinical outcome in osteosarcoma patients have yielded inconclusive results. Methods: We accordingly conducted a meta-analysis of 9 studies (442 patients) that evaluated the correlation between COX-2 immunoexpression and clinical prognosis (death). Pooled odds ratios (OR) and risk ratios (RR) with 95% confidence intervals (95% CI) were calculated using the random-effects or fixed-effects model. Results: Meta-analysis showed no significant association between COX-2 positivity and age, gender, tumor location, histology, stage, metastasis or 90% necrosis. Conversely, COX-2 immunoexpression was associated with overall survival rate (RR=2.12; 95% CI: 1.10–3.74; P=0.009) and disease-free survival rate (RR=1.63; 95% CI: 1.17–2.28; P=0.004) at 2 years. Sensitivity analysis performed by omitting low quality studies showed that the pooled results were stable. Conclusions: COX-2 positivity was associated with a lower 2-year overall survival rate and disease-free survival rate. COX-2 expression change is an independent prognostic factor in patients with osteosarcoma. PMID:24358237
Strategies for Detecting Outliers in Regression Analysis: An Introductory Primer.
ERIC Educational Resources Information Center
Evans, Victoria P.
Outliers are extreme data points that have the potential to influence statistical analyses. Outlier identification is important to researchers using regression analysis because outliers can influence the model used to such an extent that they seriously distort the conclusions drawn from the data. The effects of outliers on regression analysis are…
Molecular docking analysis of known flavonoids as duel COX-2 inhibitors in the context of cancer.
Dash, Raju; Uddin, Mir Muhammad Nasir; Hosen, S M Zahid; Rahim, Zahed Bin; Dinar, Abu Mansur; Kabir, Mohammad Shah Hafez; Sultan, Ramiz Ahmed; Islam, Ashekul; Hossain, Md Kamrul
2015-01-01
Cyclooxygenase-2 (COX-2) catalyzes the synthesis of prostaglandin E2 and is associated with tumor growth, infiltration, and metastasis in preclinical experiments. Known inhibitors of COX-2 exhibit toxicity. Therefore, it is of interest to screen natural compounds such as flavonoids against COX-2. Molecular docking of 12 known flavonoids against COX-2 was performed using FlexX and ArgusLab. All compounds showed a favourable binding energy of >-10 kJ/mol in FlexX and >-8 kcal/mol in ArgusLab. However, these data require in vitro and in vivo verification for further consideration. PMID:26770028
Molecular docking analysis of known flavonoids as duel COX-2 inhibitors in the context of cancer
Dash, Raju; Uddin, Mir Muhammad Nasir; Hosen, S.M. Zahid; Rahim, Zahed Bin; Dinar, Abu Mansur; Kabir, Mohammad Shah Hafez; Sultan, Ramiz Ahmed; Islam, Ashekul; Hossain, Md Kamrul
2015-01-01
Cyclooxygenase-2 (COX-2) catalyzes the synthesis of prostaglandin E2 and is associated with tumor growth, infiltration, and metastasis in preclinical experiments. Known inhibitors of COX-2 exhibit toxicity. Therefore, it is of interest to screen natural compounds such as flavonoids against COX-2. Molecular docking of 12 known flavonoids against COX-2 was performed using FlexX and ArgusLab. All compounds showed a favourable binding energy of >-10 kJ/mol in FlexX and >-8 kcal/mol in ArgusLab. However, these data require in vitro and in vivo verification for further consideration. PMID:26770028
ERIC Educational Resources Information Center
Hecht, Jeffrey B.
The analysis of regression residuals and detection of outliers are discussed, with emphasis on determining how deviant an individual data point must be to be considered an outlier and the impact that multiple suspected outlier data points have on the process of outlier determination and treatment. Only bivariate (one dependent and one independent)…
Takagi, Daisuke; Ikeda, Ken'ichi; Kawachi, Ichiro
2012-11-01
Crime is an important determinant of public health outcomes, including quality of life, mental well-being, and health behavior. A body of research has documented the association between community social capital and crime victimization. The association between social capital and crime victimization has been examined at multiple levels of spatial aggregation, ranging from entire countries, to states, metropolitan areas, counties, and neighborhoods. In multilevel analysis, the spatial boundaries at level 2 are most often drawn from administrative boundaries (e.g., Census tracts in the U.S.). One problem with adopting administrative definitions of neighborhoods is that it ignores spatial spillover. We conducted a study of social capital and crime victimization in one ward of Tokyo city, using a spatial Durbin model with an inverse-distance weighting matrix that assigned each respondent a unique level of "exposure" to social capital based on all other residents' perceptions. The study is based on a postal questionnaire sent to 20-69 years old residents of Arakawa Ward, Tokyo. The response rate was 43.7%. We examined the contextual influence of generalized trust, perceptions of reciprocity, two types of social network variables, as well as two principal components of social capital (constructed from the above four variables). Our outcome measure was self-reported crime victimization in the last five years. In the spatial Durbin model, we found that neighborhood generalized trust, reciprocity, supportive networks and two principal components of social capital were each inversely associated with crime victimization. By contrast, a multilevel regression performed with the same data (using administrative neighborhood boundaries) found generally null associations between neighborhood social capital and crime. Spatial regression methods may be more appropriate for investigating the contextual influence of social capital in homogeneous cultural settings such as Japan. PMID
Regression Commonality Analysis: A Technique for Quantitative Theory Building
ERIC Educational Resources Information Center
Nimon, Kim; Reio, Thomas G., Jr.
2011-01-01
When it comes to multiple linear regression analysis (MLR), it is common for social and behavioral science researchers to rely predominately on beta weights when evaluating how predictors contribute to a regression model. Presenting an underutilized statistical technique, this article describes how organizational researchers can use commonality…
The Precision Efficacy Analysis for Regression Sample Size Method.
ERIC Educational Resources Information Center
Brooks, Gordon P.; Barcikowski, Robert S.
The general purpose of this study was to examine the efficiency of the Precision Efficacy Analysis for Regression (PEAR) method for choosing appropriate sample sizes in regression studies used for precision. The PEAR method, which is based on the algebraic manipulation of an accepted cross-validity formula, essentially uses an effect size to…
PRINCIPAL COMPONENTS ANALYSIS AND PARTIAL LEAST SQUARES REGRESSION
The mathematics behind the techniques of principal component analysis and partial least squares regression is presented in detail, starting from the appropriate extreme conditions. The meaning of the resultant vectors and many of their mathematical interrelationships are also pres...
3D Regression Heat Map Analysis of Population Study Data.
Klemm, Paul; Lawonn, Kai; Glaßer, Sylvia; Niemann, Uli; Hegenscheid, Katrin; Völzke, Henry; Preim, Bernhard
2016-01-01
Epidemiological studies comprise heterogeneous data about a subject group to define disease-specific risk factors. These data contain information (features) about a subject's lifestyle, medical status as well as medical image data. Statistical regression analysis is used to evaluate these features and to identify feature combinations indicating a disease (the target feature). We propose an analysis approach of epidemiological data sets by incorporating all features in an exhaustive regression-based analysis. This approach combines all independent features w.r.t. a target feature. It provides a visualization that reveals insights into the data by highlighting relationships. The 3D Regression Heat Map, a novel 3D visual encoding, acts as an overview of the whole data set. It shows all combinations of two to three independent features with a specific target disease. Slicing through the 3D Regression Heat Map allows for the detailed analysis of the underlying relationships. Expert knowledge about disease-specific hypotheses can be included into the analysis by adjusting the regression model formulas. Furthermore, the influences of features can be assessed using a difference view comparing different calculation results. We applied our 3D Regression Heat Map method to a hepatic steatosis data set to reproduce results from a data mining-driven analysis. A qualitative analysis was conducted on a breast density data set. We were able to derive new hypotheses about relations between breast density and breast lesions with breast cancer. With the 3D Regression Heat Map, we present a visual overview of epidemiological data that allows for the first time an interactive regression-based analysis of large feature sets with respect to a disease. PMID:26529689
Linear regression analysis of survival data with missing censoring indicators.
Wang, Qihua; Dinse, Gregg E
2011-04-01
Linear regression analysis has been studied extensively in a random censorship setting, but typically all of the censoring indicators are assumed to be observed. In this paper, we develop synthetic data methods for estimating regression parameters in a linear model when some censoring indicators are missing. We define estimators based on regression calibration, imputation, and inverse probability weighting techniques, and we prove all three estimators are asymptotically normal. The finite-sample performance of each estimator is evaluated via simulation. We illustrate our methods by assessing the effects of sex and age on the time to non-ambulatory progression for patients in a brain cancer clinical trial. PMID:20559722
NASA Astrophysics Data System (ADS)
Ahn, Kuk-Hyun; Palmer, Richard
2016-09-01
Despite wide use of regression-based regional flood frequency analysis (RFFA) methods, the majority are based on either ordinary least squares (OLS) or generalized least squares (GLS). This paper proposes 'spatial proximity' based RFFA methods using the spatial lagged model (SLM) and spatial error model (SEM). The proposed methods are represented by two frameworks: the quantile regression technique (QRT) and parameter regression technique (PRT). The QRT develops prediction equations for flooding quantiles in average recurrence intervals (ARIs) of 2, 5, 10, 20, and 100 years whereas the PRT provides prediction of three parameters for the selected distribution. The proposed methods are tested using data incorporating 30 basin characteristics from 237 basins in Northeastern United States. Results show that generalized extreme value (GEV) distribution properly represents flood frequencies in the study gages. Also, basin area, stream network, and precipitation seasonality are found to be the most effective explanatory variables in prediction modeling by the QRT and PRT. 'Spatial proximity' based RFFA methods provide reliable flood quantile estimates compared to simpler methods. Compared to the QRT, the PRT may be recommended due to its accuracy and computational simplicity. The results presented in this paper may serve as one possible guidepost for hydrologists interested in flood analysis at ungaged sites.
Association of COX-2 -765G>C genetic polymorphism with coronary artery disease: a meta-analysis
Zhang, Ming-Ming; Xie, Xiang; Ma, Yi-Tong; Zheng, Ying-Ying; Yang, Yi-Ning; Li, Xiao-Mei; Fu, Zhen-Yan; Liu, Fen; Chen, Bang-Dang
2015-01-01
Background: Previous studies suggested the single nucleotide polymorphism (SNP) of COX-2 -765G>C (rs20417) is associated with coronary artery disease (CAD), but the results were conflicting. In order to derive a more precise estimation of the associations, we performed a meta-analysis of the relationship between rs20417 and CAD in all published studies. Method: Databases including PubMed, Web of Science, Wanfang, SinoMed and CNKI were systematically searched. Data were extracted using standardized methods. The association was assessed by odds ratio (OR) with 95% confidence intervals (CIs). The statistical tests were performed using Review Manager 5.3.3 and Stata 12.0 software. Results: We identified a total of 14 studies involving a total of 18227 subjects. The pooled odds ratio (OR) for the association between COX-2 -765G>C and CAD and its corresponding 95% confidence interval (95% CI) were evaluated by random or fixed effect model. A significant statistical association between COX-2 -765G>C and CAD was observed in an allelic model (P=0.02, OR=0.64, 95% CI: 0.43-0.94), dominant model (P=0.04, OR=0.74, 95% CI: 0.56-0.99), and recessive model (P=0.02, OR=0.46, 95% CI: 0.23-0.90). Conclusion: This meta-analysis suggested that COX-2 -765G>C is a protective factor for CAD. PMID:26221283
Background stratified Poisson regression analysis of cohort data
Langholz, Bryan
2012-01-01
Background stratified Poisson regression is an approach that has been used in the analysis of data derived from a variety of epidemiologically important studies of radiation-exposed populations, including uranium miners, nuclear industry workers, and atomic bomb survivors. We describe a novel approach to fit Poisson regression models that adjust for a set of covariates through background stratification while directly estimating the radiation-disease association of primary interest. The approach makes use of an expression for the Poisson likelihood that treats the coefficients for stratum-specific indicator variables as ‘nuisance’ variables and avoids the need to explicitly estimate the coefficients for these stratum-specific parameters. Log-linear models, as well as other general relative rate models, are accommodated. This approach is illustrated using data from the Life Span Study of Japanese atomic bomb survivors and data from a study of underground uranium miners. The point estimate and confidence interval obtained from this ‘conditional’ regression approach are identical to the values obtained using unconditional Poisson regression with model terms for each background stratum. Moreover, it is shown that the proposed approach allows estimation of background stratified Poisson regression models of non-standard form, such as models that parameterize latency effects, as well as regression models in which the number of strata is large, thereby overcoming the limitations of previously available statistical software for fitting background stratified Poisson regression models. PMID:22193911
Background stratified Poisson regression analysis of cohort data.
Richardson, David B; Langholz, Bryan
2012-03-01
Background stratified Poisson regression is an approach that has been used in the analysis of data derived from a variety of epidemiologically important studies of radiation-exposed populations, including uranium miners, nuclear industry workers, and atomic bomb survivors. We describe a novel approach to fit Poisson regression models that adjust for a set of covariates through background stratification while directly estimating the radiation-disease association of primary interest. The approach makes use of an expression for the Poisson likelihood that treats the coefficients for stratum-specific indicator variables as 'nuisance' variables and avoids the need to explicitly estimate the coefficients for these stratum-specific parameters. Log-linear models, as well as other general relative rate models, are accommodated. This approach is illustrated using data from the Life Span Study of Japanese atomic bomb survivors and data from a study of underground uranium miners. The point estimate and confidence interval obtained from this 'conditional' regression approach are identical to the values obtained using unconditional Poisson regression with model terms for each background stratum. Moreover, it is shown that the proposed approach allows estimation of background stratified Poisson regression models of non-standard form, such as models that parameterize latency effects, as well as regression models in which the number of strata is large, thereby overcoming the limitations of previously available statistical software for fitting background stratified Poisson regression models. PMID:22193911
Regression Model Optimization for the Analysis of Experimental Data
NASA Technical Reports Server (NTRS)
Ulbrich, N.
2009-01-01
A candidate math model search algorithm was developed at Ames Research Center that determines a recommended math model for the multivariate regression analysis of experimental data. The search algorithm is applicable to classical regression analysis problems as well as wind tunnel strain gage balance calibration analysis applications. The algorithm compares the predictive capability of different regression models using the standard deviation of the PRESS residuals of the responses as a search metric. This search metric is minimized during the search. Singular value decomposition is used during the search to reject math models that lead to a singular solution of the regression analysis problem. Two threshold dependent constraints are also applied. The first constraint rejects math models with insignificant terms. The second constraint rejects math models with near-linear dependencies between terms. The math term hierarchy rule may also be applied as an optional constraint during or after the candidate math model search. The final term selection of the recommended math model depends on the regressor and response values of the data set, the user's function class combination choice, the user's constraint selections, and the result of the search metric minimization. A frequently used regression analysis example from the literature is used to illustrate the application of the search algorithm to experimental data.
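The search metric named above, the standard deviation of the PRESS (leave-one-out prediction) residuals, can be illustrated for the simplest possible case. The sketch below assumes a single-regressor model with an intercept, which is far narrower than the multivariate models the abstract covers, and the function names and the sample standard deviation formula are assumptions for this example. It uses the standard identity that a PRESS residual equals the ordinary residual divided by one minus the leverage.

```python
import math

def press_residuals(x, y):
    """PRESS residuals for the fit y = a + b*x, without refitting n times.

    Uses e_i / (1 - h_ii), where h_ii = 1/n + (x_i - mean)^2 / Sxx
    is the leverage of point i in simple linear regression.
    """
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    slope = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y)) / sxx
    intercept = y_mean - slope * x_mean
    result = []
    for xi, yi in zip(x, y):
        residual = yi - (intercept + slope * xi)
        leverage = 1.0 / n + (xi - x_mean) ** 2 / sxx
        result.append(residual / (1.0 - leverage))
    return result

def press_std(x, y):
    """Sample standard deviation of the PRESS residuals (candidate metric)."""
    r = press_residuals(x, y)
    mean = sum(r) / len(r)
    return math.sqrt(sum((ri - mean) ** 2 for ri in r) / (len(r) - 1))
```

A model search in this spirit would evaluate such a metric for each candidate term combination and prefer the smallest value, subject to the constraints listed in the abstract.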
Joint regression analysis and AMMI model applied to oat improvement
NASA Astrophysics Data System (ADS)
Oliveira, A.; Oliveira, T. A.; Mejza, S.
2012-09-01
In our work we present an application of some biometrical methods useful in genotype stability evaluation, namely the AMMI model, Joint Regression Analysis (JRA) and multiple comparison tests. A genotype stability analysis of oat (Avena sativa L.) grain yield was carried out using data from the Portuguese Plant Breeding Board for a sample of 22 different genotypes grown during the years 2002, 2003 and 2004 in six locations. Ferreira et al. (2006) state the relevance of regression models and of the Additive Main Effects and Multiplicative Interactions (AMMI) model for studying and estimating phenotypic stability effects. As computational techniques we use the Zigzag algorithm to estimate the regression coefficients and the agricolae package available in R for AMMI model analysis.
Time series analysis using semiparametric regression on oil palm production
NASA Astrophysics Data System (ADS)
Yundari, Pasaribu, U. S.; Mukhaiyar, U.
2016-04-01
This paper presents a semiparametric kernel regression method, which is flexible and computationally simple, especially for estimating density and regression functions. The kernel function is continuous and produces a smooth estimate. The classical kernel density estimator is constructed by completely nonparametric analysis and works reasonably well for all forms of function. Here, we discuss parameter estimation in time series analysis. First, we assume that the parameters exist; then we use nonparametric estimation, and the combined approach is called semiparametric. The optimum bandwidth is selected by considering an approximation of the Mean Integrated Squared Error (MISE).
Analysis of Sting Balance Calibration Data Using Optimized Regression Models
NASA Technical Reports Server (NTRS)
Ulbrich, N.; Bader, Jon B.
2010-01-01
Calibration data of a wind tunnel sting balance was processed using a candidate math model search algorithm that recommends an optimized regression model for the data analysis. During the calibration the normal force and the moment at the balance moment center were selected as independent calibration variables. The sting balance itself had two moment gages. Therefore, after analyzing the connection between calibration loads and gage outputs, it was decided to choose the difference and the sum of the gage outputs as the two responses that best describe the behavior of the balance. The math model search algorithm was applied to these two responses. An optimized regression model was obtained for each response. Classical strain gage balance load transformations and the equations of the deflection of a cantilever beam under load are used to show that the search algorithm's two optimized regression models are supported by a theoretical analysis of the relationship between the applied calibration loads and the measured gage outputs. The analysis of the sting balance calibration data set is a rare example of a situation when terms of a regression model of a balance can directly be derived from first principles of physics. In addition, it is interesting to note that the search algorithm recommended the correct regression model term combinations using only a set of statistical quality metrics that were applied to the experimental data during the algorithm's term selection process.
Accounting for the correlation between fellow eyes in regression analysis.
Glynn, R J; Rosner, B
1992-03-01
Regression techniques that appropriately use all available eyes have infrequently been applied in the ophthalmologic literature, despite advances both in the development of statistical models and in the availability of computer software to fit these models. We considered the general linear model and polychotomous logistic regression approaches of Rosner and the estimating equation approach of Liang and Zeger, applied to both linear and logistic regression. Methods were illustrated with the use of two real data sets: (1) impairment of visual acuity in patients with retinitis pigmentosa and (2) overall visual field impairment in elderly patients evaluated for glaucoma. We discuss the interpretation of coefficients from these models and the advantages of these approaches compared with alternative approaches, such as treating individuals rather than eyes as the unit of analysis, separate regression analyses of right and left eyes, or utilization of ordinary regression techniques without accounting for the correlation between fellow eyes. Specific advantages include enhanced statistical power, more interpretable regression coefficients, greater precision of estimation, and less sensitivity to missing data for some eyes. We concluded that these models should be used more frequently in ophthalmologic research, and we provide guidelines for choosing between alternative models. PMID:1543458
Regression analysis for solving diagnosis problem of children's health
NASA Astrophysics Data System (ADS)
Cherkashina, Yu A.; Gerget, O. M.
2016-04-01
This paper presents research on the application of statistical techniques, namely regression analysis, to assess the health status of children in the neonatal period based on medical data (hemostatic parameters, blood test parameters, gestational age, vascular endothelial growth factor) measured at 3-5 days of life. A detailed description of the studied medical data is given, and a binary logistic regression procedure is discussed. Basic results of the research are presented. A classification table of predicted and observed values is shown, and the overall percentage of correct recognition is determined. Regression equation coefficients are calculated, and the general regression equation is written from them. Based on the results of the logistic regression, ROC analysis was performed: the sensitivity and specificity of the model were calculated and ROC curves were constructed. These mathematical techniques allow the health of children to be diagnosed with a high quality of recognition. The results make a significant contribution to the development of evidence-based medicine and have high practical importance in the professional activity of the author.
Regression Analysis: Instructional Resource for Cost/Managerial Accounting
ERIC Educational Resources Information Center
Stout, David E.
2015-01-01
This paper describes a classroom-tested instructional resource, grounded in principles of active learning and constructivism, that embraces two primary objectives: "demystify" for accounting students technical material from statistics regarding ordinary least-squares (OLS) regression analysis--material that students may find obscure or…
Analysis of Sting Balance Calibration Data Using Optimized Regression Models
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert; Bader, Jon B.
2009-01-01
Calibration data of a wind tunnel sting balance was processed using a search algorithm that identifies an optimized regression model for the data analysis. The selected sting balance had two moment gages that were mounted forward and aft of the balance moment center. The difference and the sum of the two gage outputs were fitted in the least squares sense using the normal force and the pitching moment at the balance moment center as independent variables. The regression model search algorithm predicted that the difference of the gage outputs should be modeled using the intercept and the normal force. The sum of the two gage outputs, on the other hand, should be modeled using the intercept, the pitching moment, and the square of the pitching moment. Equations of the deflection of a cantilever beam are used to show that the search algorithm's two recommended math models can also be obtained after performing a rigorous theoretical analysis of the deflection of the sting balance under load. The analysis of the sting balance calibration data set is a rare example of a situation when regression models of balance calibration data can directly be derived from first principles of physics and engineering. In addition, it is interesting to see that the search algorithm recommended the same regression models for the data analysis using only a set of statistical quality metrics.
Quantile Regression with Censored Data
ERIC Educational Resources Information Center
Lin, Guixian
2009-01-01
The Cox proportional hazards model and the accelerated failure time model are frequently used in survival data analysis. They are powerful, yet have limitations due to their model assumptions. Quantile regression offers a semiparametric approach to model data with possible heterogeneity. It is particularly powerful for censored responses, where the…
A SAS macro for residual deviance of ordinal regression analysis.
Wan, J Y; Wang, W; Bromberg, J
1994-12-01
In this paper, a SAS macro is described for calculating the likelihood of the 'saturated' model in the analysis of ordinal regression. The outcome variable is multinomial on an ordinal scale, while the explanatory variables can be nominal or ordinal. Several ordinal regression models may be fitted to the data. One method of testing for the goodness of fit of these regression models is by comparing the residual deviance with the chi-square distribution. In SAS, PROC LOGISTIC may be used to fit this type of data with a proportional odds model. Unfortunately, the residual deviance is not available from the output. Our SAS macro will supplement the SAS output so that the residual deviance test may be carried out. The data from an ongoing HIV study is used as an illustration. PMID:7736732
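The residual deviance test described above can be written out as follows. This is a sketch in standard notation rather than the macro's own; the symbols $G$ (number of distinct covariate patterns), $K$ (number of ordinal categories), $n_{gk}$ (count in pattern $g$, category $k$), and $p$ (number of slope parameters) are assumptions introduced here, and the sign convention for the linear predictor varies between software packages.

```latex
% Proportional odds model for an ordinal outcome Y with K categories
\operatorname{logit} P(Y \le k \mid x) = \alpha_k + x^{\top}\beta,
\qquad k = 1, \dots, K-1

% Log-likelihood of the saturated model (with 0 \log 0 := 0)
\ell_{\text{sat}} = \sum_{g=1}^{G} \sum_{k=1}^{K} n_{gk}
  \log\!\left(\frac{n_{gk}}{n_{g+}}\right),
\qquad n_{g+} = \sum_{k=1}^{K} n_{gk}

% Residual deviance and its reference distribution
D = 2\left(\ell_{\text{sat}} - \ell_{\text{model}}\right)
  \;\overset{\text{approx.}}{\sim}\; \chi^2_{\nu},
\qquad \nu = G(K-1) - \bigl[(K-1) + p\bigr]
```

The chi-square approximation is usually considered reasonable only when the counts within each covariate pattern are not too sparse.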
Kolenda, Rafał; Ugorski, Maciej; Bednarski, Michał
2014-08-01
Sarcocysts from four Polish roe deer were collected and examined by light microscopy, small subunit ribosomal RNA (ssu rRNA), and the subunit I of cytochrome oxidase (cox1) sequence analysis. This resulted in identification of Sarcocystis gracilis, Sarcocystis oviformis, and Sarcocystis silva. However, we were unable to detect Sarcocystis capreolicanis, the fourth Sarcocystis species found previously in Norwegian roe deer. Polish sarcocysts isolated from various tissues differed in terms of their shape and size and were larger than the respective Norwegian isolates. Analysis of the ssu rRNA gene revealed the lack of differences between Sarcocystis isolates belonging to one species and a very low degree of genetic diversity between Polish and Norwegian sarcocysts, ranging from 0.1% for Sarcocystis gracilis and Sarcocystis oviformis to 0.44% for Sarcocystis silva. Contrary to the results of the ssu rRNA analysis, small intraspecies differences in cox1 sequences were found among Polish Sarcocystis gracilis and Sarcocystis silva isolates. The comparison of Polish and Norwegian cox1 sequences representing the same Sarcocystis species revealed a similar degree of sequence identity, namely 99.72% for Sarcocystis gracilis, 98.76% for Sarcocystis silva, and 99.85% for Sarcocystis oviformis. Phylogenetic reconstruction and genetic population analyses showed an unexpectedly high degree of identity between Polish and Norwegian isolates. Moreover, cox1 gene sequences turned out to be more accurate than ssu rRNA when used to reveal phylogenetic relationships among closely related species. The results of our study revealed that the same Sarcocystis species isolated from the same hosts living in different geographic regions show a very high level of genetic similarity. PMID:24948101
Agogo, George O; van der Voet, Hilko; Van't Veer, Pieter; van Eeuwijk, Fred A; Boshuizen, Hendriek C
2016-07-01
Dietary questionnaires are prone to measurement error, which biases the perceived association between dietary intake and risk of disease. Short-term measurements are required to adjust for the bias in the association. For foods that are not consumed daily, the short-term measurements are often characterized by excess zeroes. Via a simulation study, the performance of a two-part calibration model that was developed for a single-replicate study design was assessed by mimicking leafy vegetable intake reports from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) study. In part I of the fitted two-part calibration model, a logistic distribution was assumed; in part II, a gamma distribution was assumed. The model was assessed with respect to the magnitude of the correlation between the consumption probability and the consumed amount (hereafter, cross-part correlation), the number and form of covariates in the calibration model, the percentage of zero response values, and the magnitude of the measurement error in the dietary intake. From the simulation study results, transforming the dietary variable in the regression calibration to an appropriate scale was found to be the most important factor for the model performance. Reducing the number of covariates in the model could be beneficial, but was not critical in large-sample studies. The performance was remarkably robust when fitting a one-part rather than a two-part model. The model performance was minimally affected by the cross-part correlation. PMID:27003183
Islam, Abul B M M K; Dave, Mandar; Amin, Sonia; Jensen, Roderick V; Amin, Ashok R
2016-04-01
The constitutively-expressed cyclooxygenase 1 (COX-1) and the inducible COX-2 are both involved in the conversion of arachidonic acid (AA) to prostaglandins (PGs). However, the functional roles of COX-1 at the cellular level remain unclear. We hypothesized that comparing differential gene expression and eicosanoid metabolism in lung fibroblasts from wild-type (WT) mice and COX-2(-/-) or COX-1(-/-) mice may help address the functional roles of COX-1 in inflammation and other cellular functions. Compared to WT, the number of specifically-induced transcripts was altered in descending order as follows: COX-2(-/-)>COX-1(-/-)>WT+IL-1β. COX-1(-/-) and COX-2(-/-) cells each shared about 50% of the induced transcripts with WT cells treated with IL-1β. An interactive "anti-inflammatory, proinflammatory, and redox-activated" signature in the protein-protein interactome map was observed in COX-2(-/-) cells. The augmented COX-1 mRNA (in COX-2(-/-) cells) was associated with the upregulation of mRNAs for glutathione S-transferase (GST), superoxide dismutase (SOD), NAD(P)H dehydrogenase quinone 1 (NQO1), aryl hydrocarbon receptor (AhR), peroxiredoxin, phospholipase, prostacyclin synthase, and prostaglandin E synthase, resulting in a significant increase in the levels of PGE2, PGD2, leukotriene B4 (LTB4), PGF1α, thromboxane B2 (TXB2), and PGF2α. COX-1 plays a dominant role in shifting AA toward the LTB4 pathway and anti-inflammatory activities. Compared to WT, the upregulated COX-1 mRNA in COX-2(-/-) cells generated an "eicosanoid storm". The genomic characteristics of COX-2(-/-) cells are similar to those of proinflammatory cells as observed in IL-1β induced WT cells. COX-1(-/-) and COX-2(-/-) cells exhibited compensation of various eicosanoids at the genomic and metabolic levels. PMID:27012456
Parra, Edwin Roger; Lin, Flavia; Martins, Vanessa; Rangel, Maristela Peres; Capelozzi, Vera Luiza
2013-01-01
OBJECTIVE: To study the expression of COX-1 and COX-2 in the remodeled lung in systemic sclerosis (SSc) and idiopathic pulmonary fibrosis (IPF) patients, correlating that expression with patient survival. METHODS: We examined open lung biopsy specimens from 24 SSc patients and 30 IPF patients, using normal lung tissue as a control. The histological patterns included fibrotic nonspecific interstitial pneumonia (NSIP) in SSc patients and usual interstitial pneumonia (UIP) in IPF patients. We used immunohistochemistry and histomorphometry to evaluate the expression of COX-1 and COX-2 in alveolar septa, vessels, and bronchioles. We then correlated that expression with pulmonary function test results and evaluated its impact on patient survival. RESULTS: The expression of COX-1 and COX-2 in alveolar septa was significantly higher in IPF-UIP and SSc-NSIP lung tissue than in the control tissue. No difference was found between IPF-UIP and SSc-NSIP tissue regarding COX-1 and COX-2 expression. Multivariate analysis based on the Cox regression model showed that the factors associated with a low risk of death were younger age, high DLCO/alveolar volume, IPF, and high COX-1 expression in alveolar septa, whereas those associated with a high risk of death were advanced age, low DLCO/alveolar volume, SSc (with NSIP), and low COX-1 expression in alveolar septa. CONCLUSIONS: Our findings suggest that strategies aimed at preventing low COX-1 synthesis will have a greater impact on SSc, whereas those aimed at preventing high COX-2 synthesis will have a greater impact on IPF. However, prospective randomized clinical trials are needed in order to confirm that. PMID:24473763
Robust regression applied to fractal/multifractal analysis.
NASA Astrophysics Data System (ADS)
Portilla, F.; Valencia, J. L.; Tarquis, A. M.; Saa-Requejo, A.
2012-04-01
Fractal and multifractal concepts have grown increasingly popular in soil analysis in recent years, along with the development of fractal models. A common step is to calculate the slope of a linear fit, usually by the least squares method. This should not be a particular problem; however, in many situations with experimental data the researcher has to select the range of scales to work with, neglecting the remaining points, to achieve the linearity that this type of analysis requires. Robust regression is a form of regression analysis designed to circumvent some limitations of traditional parametric and non-parametric methods. In this method we do not have to assume that an outlier is simply an extreme observation drawn from the tail of a normal distribution that does not compromise the validity of the regression results. In this work we have evaluated the capacity of robust regression to select the points in the experimental data, trying to avoid subjective choices. Based on this analysis we have developed a new methodology that involves two basic steps: • Evaluation of the improvement of the linear fit when consecutive points are eliminated, based on the R p-value. In this way we consider the implications of reducing the number of points. • Evaluation of the significance of the slope difference between the fit with the two extreme points and the fit with the available points. We compare the results of applying this methodology with those of the commonly used least squares method. The data selected for these comparisons come from experimental soil roughness transects and from simulations based on the midpoint displacement method with added trends and noise. The results are discussed, indicating the advantages and disadvantages of each methodology. Acknowledgements: Funding provided by CEIGRAM (Research Centre for the Management of Agricultural and Environmental Risks) and by Spanish Ministerio de Ciencia e Innovación (MICINN) through project no
The Consequences Of Model Misspecification In Regression Analysis.
Deegan, J
1976-04-01
In ordinary least squares regression analysis the desired property of unbiasedness in estimated coefficients is contingent upon the correspondence of the fitted model with the true underlying data generating process. This paper focuses on developing a systematic characterization of the error forms resulting from model misspecification in single equation models. The consequences of model misspecification, for the error forms identified, are also evaluated. PMID:26821674
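One familiar instance of the misspecification bias discussed above is the omitted-variable case, sketched here in standard matrix notation. The partition into $X_1$ (included regressors) and $X_2$ (wrongly omitted regressors) is an illustrative assumption and is not necessarily the characterization used in the paper.

```latex
% True data generating process
y = X_1 \beta_1 + X_2 \beta_2 + \varepsilon,
\qquad E[\varepsilon \mid X_1, X_2] = 0

% Fitted (misspecified) model omits X_2
b_1 = (X_1^{\top} X_1)^{-1} X_1^{\top} y

% Expectation of the estimator: biased unless X_1^{\top} X_2 = 0 or \beta_2 = 0
E[b_1 \mid X_1, X_2]
  = \beta_1 + (X_1^{\top} X_1)^{-1} X_1^{\top} X_2 \, \beta_2
```

The second term is the bias: it vanishes only when the omitted regressors are orthogonal to the included ones or have no effect on the response.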
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits
Xie, Dan; Liang, Meimei; Xiong, Momiao
2016-01-01
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remain fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI’s Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes. PMID:27104857
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.
Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao
2016-04-01
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remain fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes. PMID:27104857
Poisson Regression Analysis of Illness and Injury Surveillance Data
Frome E.L., Watkins J.P., Ellis E.D.
2012-12-12
The Department of Energy (DOE) uses illness and injury surveillance to monitor morbidity and assess the overall health of the work force. Data collected from each participating site include health events and a roster file with demographic information. The source data files are maintained in a relational data base, and are used to obtain stratified tables of health event counts and person time at risk that serve as the starting point for Poisson regression analysis. The explanatory variables that define these tables are age, gender, occupational group, and time. Typical response variables of interest are the number of absences due to illness or injury, i.e., the response variable is a count. Poisson regression methods are used to describe the effect of the explanatory variables on the health event rates using a log-linear main effects model. Results of fitting the main effects model are summarized in a tabular and graphical form and interpretation of model parameters is provided. An analysis of deviance table is used to evaluate the importance of each of the explanatory variables on the event rate of interest and to determine if interaction terms should be considered in the analysis. Although Poisson regression methods are widely used in the analysis of count data, there are situations in which over-dispersion occurs. This could be due to lack-of-fit of the regression model, extra-Poisson variation, or both. A score test statistic and regression diagnostics are used to identify over-dispersion. A quasi-likelihood method of moments procedure is used to evaluate and adjust for extra-Poisson variation when necessary. Two examples are presented using respiratory disease absence rates at two DOE sites to illustrate the methods and interpretation of the results. In the first example the Poisson main effects model is adequate. In the second example the score test indicates considerable over-dispersion and a more detailed analysis attributes the over-dispersion to extra
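The over-dispersion adjustment mentioned above can be sketched with a small example. It assumes the common Pearson chi-square moment estimator of the dispersion and an intercept-only Poisson model with a person-time offset; the abstract does not specify the exact estimator used, and the counts, person-years, and function name `pearson_dispersion` are hypothetical.

```python
import math

def pearson_dispersion(counts, fitted, n_params):
    """Moment estimate of the dispersion: Pearson chi-square / residual df."""
    chi2 = sum((y - mu) ** 2 / mu for y, mu in zip(counts, fitted))
    return chi2 / (len(counts) - n_params)

# Hypothetical absence counts and person-years for six strata
counts = [2, 9, 1, 12, 3, 15]
person_years = [100.0] * 6

# Intercept-only Poisson model with offset log(person_years):
# the maximum likelihood rate is total events / total person-time.
rate = sum(counts) / sum(person_years)
fitted = [rate * t for t in person_years]

phi = pearson_dispersion(counts, fitted, n_params=1)

# Poisson standard error of the log rate, then the quasi-likelihood version
# inflated by sqrt(phi) to allow for extra-Poisson variation.
se_poisson = 1.0 / math.sqrt(sum(counts))
se_adjusted = se_poisson * math.sqrt(phi)
```

A value of `phi` well above 1 suggests over-dispersion, in which case the unadjusted Poisson standard errors would be too small.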
Multivariate concentration determination using principal component regression with residual analysis
Keithley, Richard B.; Heien, Michael L.; Wightman, R. Mark
2009-01-01
Data analysis is an essential tenet of analytical chemistry, extending the possible information obtained from the measurement of chemical phenomena. Chemometric methods have grown considerably in recent years, but their wide use is hindered because some still consider them too complicated. The purpose of this review is to describe a multivariate chemometric method, principal component regression, in a simple manner from the point of view of an analytical chemist, to demonstrate the need for proper quality-control (QC) measures in multivariate analysis and to advocate the use of residuals as a proper QC method. PMID:20160977
Regression Analysis of Electric Power Price in California Power Exchange
NASA Astrophysics Data System (ADS)
Miyauchi, Hajime; Tatsuguchi, Genta; Misawa, Tetsuya
The liberalization of the electric power industry was implemented in the State of California from April 1998. Although this liberalization has been suspended because of extremely high bids and outages, the power price information from the power exchange is very valuable for investigating its structure and determining factors. From an accessible web site, we obtained hourly data on the zone prices and the total demand of California from April 1998 to September 2001, under the deregulation of the electric power industry. We analyze the prices by regression analysis. In this paper, we successfully construct simple regression equations by classifying the price data into four time zones. Next, we analyze the prices from June to September 2000, when the price cap was changed twice. The Chow test shows that structural changes in the power price occurred when the price cap was changed. Thus we identify the determining factors of the electric power price by regression analysis.
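The Chow test mentioned above compares one pooled regression with separate regressions before and after a candidate break. The sketch below assumes a simple price-on-demand regression with k = 2 parameters per segment; the data points, the break position, and the function names are hypothetical, since the abstract does not give the fitted equations.

```python
def rss_simple_linear(x, y):
    """Residual sum of squares of the OLS line y = a + b x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))

def chow_f(rss_pooled, rss_1, rss_2, n_1, n_2, k):
    """Chow F statistic; under no structural change it follows
    an F distribution with (k, n_1 + n_2 - 2k) degrees of freedom."""
    rss_split = rss_1 + rss_2
    return ((rss_pooled - rss_split) / k) / (rss_split / (n_1 + n_2 - 2 * k))

# Hypothetical demand (x) and price (y) before and after a price-cap change
x_before, y_before = [1, 2, 3, 4, 5], [2.1, 3.9, 6.2, 7.8, 10.1]
x_after, y_after = [1, 2, 3, 4, 5], [6.0, 6.9, 8.2, 8.8, 10.1]

f_stat = chow_f(
    rss_simple_linear(x_before + x_after, y_before + y_after),
    rss_simple_linear(x_before, y_before),
    rss_simple_linear(x_after, y_after),
    len(x_before), len(x_after), k=2,
)
```

A large `f_stat` relative to the F critical value would indicate that the regression coefficients differ between the two periods.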
2012-01-01
Background: Evidence is accumulating that chronic inflammation may have an important role in prostate cancer (PCa). The COX-2 polymorphism rs2745557 (+202 C/T) has been extensively investigated as a potential risk factor for PCa, but the results have thus far been inconclusive. This meta-analysis was performed to derive a more precise estimation of the association. Methods: A comprehensive search was conducted to identify all case-control studies of COX-2 rs2745557 polymorphism and PCa risk. We used odds ratios (ORs) to assess the strength of the association, with 95% confidence intervals (CIs) to indicate the precision of the estimate. Statistical analyses were performed with Review Manager, version 5.0, and Stata 10.0. Results: A total of 8 available studies were considered in the present meta-analysis, with 11356 patients and 11641 controls for rs2745557. When all groups were pooled, there was no evidence that rs2745557 had a significant association with PCa under co-dominant, recessive, over-dominant, and allelic models. However, our analysis suggested that rs2745557 was associated with a lower PCa risk under the dominant model in the overall population (OR = 0.85, 95%CI = 0.74-0.97, P = 0.02). When stratifying for race, there was a significant association between rs2745557 polymorphism and lower PCa risk in the dominant model comparison in the subgroup of Caucasians (OR = 0.86, 95%CI = 0.75-0.99, P = 0.04), but not in co-dominant, recessive, over-dominant and allelic comparisons. Conclusion: Based on our meta-analysis, COX-2 rs2745557 was associated with a lower PCa risk under the dominant model in Caucasians. PMID:22435969
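The odds ratios and confidence intervals reported above can be illustrated with a short sketch. It assumes a Wald interval on the log odds ratio and inverse-variance fixed-effect pooling; the abstract does not state which pooling model was used, and the 2x2 counts below are hypothetical rather than taken from the eight studies.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio with a Wald confidence interval from a 2x2 table.

    a, b: variant-allele carriers / non-carriers among cases
    c, d: variant-allele carriers / non-carriers among controls
    Returns (OR, lower, upper, log OR, standard error of log OR).
    """
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return (math.exp(log_or), math.exp(log_or - z * se),
            math.exp(log_or + z * se), log_or, se)

def pool_fixed_effect(log_ors, ses, z=1.96):
    """Inverse-variance weighted (fixed-effect) pooled odds ratio and interval."""
    weights = [1 / s ** 2 for s in ses]
    pooled = sum(w * l for w, l in zip(weights, log_ors)) / sum(weights)
    pooled_se = math.sqrt(1 / sum(weights))
    return (math.exp(pooled), math.exp(pooled - z * pooled_se),
            math.exp(pooled + z * pooled_se))

# Two hypothetical case-control studies
studies = [(20, 80, 10, 90), (30, 70, 25, 75)]
results = [odds_ratio_ci(*s) for s in studies]
pooled_or, pooled_lo, pooled_hi = pool_fixed_effect(
    [r[3] for r in results], [r[4] for r in results]
)
```

An interval that excludes 1, as in the dominant-model results quoted in the abstract, corresponds to a statistically significant association at the 5% level.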
Four cases of Taenia saginata infection with an analysis of COX1 gene.
Cho, Jaeeun; Jung, Bong-Kwang; Lim, Hyemi; Kim, Min-Jae; Yooyen, Thanapon; Lee, Dongmin; Eom, Keeseon S; Shin, Eun-Hee; Chai, Jong-Yil
2014-02-01
Human taeniases were not uncommon in the Republic of Korea (=Korea) until the 1980s. The prevalence decreased and a national survey in 2004 revealed no Taenia egg positive cases. However, a subsequent national survey in 2012 showed 0.04% (10 cases) prevalence of Taenia spp. eggs suggesting its resurgence in Korea. We recently encountered 4 cases of Taenia saginata infection who had symptoms of taeniasis that included discharge of proglottids. We obtained several proglottids from each case. Because the morphological features of T. saginata are almost indistinguishable from those of Taenia asiatica, molecular analyses using the PCR-RFLP and DNA sequencing of the cytochrome c oxidase subunit 1 (cox1) were performed to identify the species. The PCR-RFLP patterns of all 4 specimens were consistent with T. saginata, and the cox1 gene sequence showed 99.8-100% identity with that of T. saginata reported previously from Korea, Japan, China, and Cambodia. All 4 patients had a history of travel abroad, but its relation to contracting taeniasis was unclear. Our findings may suggest resurgence of T. saginata infection among people in Korea. PMID:24623887
Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan
2013-01-01
Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system. PMID:24009133
FRATS: Functional Regression Analysis of DTI Tract Statistics
Zhu, Hongtu; Styner, Martin; Tang, Niansheng; Liu, Zhexing; Lin, Weili; Gilmore, John H.
2010-01-01
Diffusion tensor imaging (DTI) provides important information on the structure of white matter fiber bundles as well as detailed tissue properties along these fiber bundles in vivo. This paper presents a functional regression framework, called FRATS, for the analysis of multiple diffusion properties along fiber bundle as functions in an infinite dimensional space and their association with a set of covariates of interest, such as age, diagnostic status and gender, in real applications. The functional regression framework consists of four integrated components: the local polynomial kernel method for smoothing multiple diffusion properties along individual fiber bundles, a functional linear model for characterizing the association between fiber bundle diffusion properties and a set of covariates, a global test statistic for testing hypotheses of interest, and a resampling method for approximating the p-value of the global test statistic. The proposed methodology is applied to characterizing the development of five diffusion properties including fractional anisotropy, mean diffusivity, and the three eigenvalues of diffusion tensor along the splenium of the corpus callosum tract and the right internal capsule tract in a clinical study of neurodevelopment. Significant age and gestational age effects on the five diffusion properties were found in both tracts. The resulting analysis pipeline can be used for understanding normal brain development, the neural bases of neuropsychiatric disorders, and the joint effects of environmental and genetic factors on white matter fiber bundles. PMID:20335089
Regression analysis exploring teacher impact on student FCI post scores
NASA Astrophysics Data System (ADS)
Mahadeo, Jonathan V.; Manthey, Seth R.; Brewe, Eric
2013-01-01
High School Modeling Workshops are designed to improve high school physics teachers' understanding of physics and how to teach using the Modeling method. The basic assumption is that the teacher plays a critical role in their students' physics education. This study investigated teacher impacts on students' Force Concept Inventory (FCI) scores, with the hope of identifying quantitative differences between teachers. This study examined student FCI scores from 18 teachers with at least a year of teaching high school physics. This data was then evaluated using a General Linear Model (GLM), which allowed for a regression equation to be fitted to the data. This regression equation was used to predict student post FCI scores, based on: teacher ID, student pre FCI score, gender, and representation. The results show that 12 out of 18 teachers significantly impact their students' post FCI scores. The GLM further revealed that of the 12 teachers only five have a positive impact on student post FCI scores. Given these differences among teachers, it is our intention to extend our analysis to investigate pedagogical differences between them.
Nonparametric survival analysis using Bayesian Additive Regression Trees (BART).
Sparapani, Rodney A; Logan, Brent R; McCulloch, Robert E; Laud, Purushottam W
2016-07-20
Bayesian additive regression trees (BART) provide a framework for flexible nonparametric modeling of relationships of covariates to outcomes. Recently, BART models have been shown to provide excellent predictive performance for both continuous and binary outcomes, exceeding that of their competitors. Software is also readily available for such outcomes. In this article, we introduce modeling that extends the usefulness of BART in medical applications by addressing needs arising in survival analysis. Simulation studies of one-sample and two-sample scenarios, in comparison with long-standing traditional methods, establish face validity of the new approach. We then demonstrate the model's ability to accommodate data from complex regression models with a simulation study of a nonproportional hazards scenario with crossing survival functions and survival function estimation in a scenario where hazards are multiplicatively modified by a highly nonlinear function of the covariates. Using data from a recently published study of patients undergoing hematopoietic stem cell transplantation, we illustrate the use and some advantages of the proposed method in medical investigations. Copyright © 2016 John Wiley & Sons, Ltd. PMID:26854022
A Visual Analytics Approach for Correlation, Classification, and Regression Analysis
Steed, Chad A; SwanII, J. Edward; Fitzpatrick, Patrick J.; Jankun-Kelly, T.J.
2012-02-01
New approaches that combine the strengths of humans and machines are necessary to equip analysts with the proper tools for exploring today's increasingly complex, multivariate data sets. In this paper, a novel visual data mining framework, called the Multidimensional Data eXplorer (MDX), is described that addresses the challenges of today's data by combining automated statistical analytics with a highly interactive parallel coordinates based canvas. In addition to several intuitive interaction capabilities, this framework offers a rich set of graphical statistical indicators, interactive regression analysis, visual correlation mining, automated axis arrangements and filtering, and data classification techniques. The current work provides a detailed description of the system as well as a discussion of key design aspects and critical feedback from domain experts.
Estimation of crown closure from AVIRIS data using regression analysis
NASA Technical Reports Server (NTRS)
Staenz, K.; Williams, D. J.; Truchon, M.; Fritz, R.
1993-01-01
Crown closure is one of the input parameters used for forest growth and yield modelling. Preliminary work by Staenz et al. indicates that imaging spectrometer data acquired with sensors such as the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) have some potential for estimating crown closure on a stand level. The objectives of this paper are: (1) to establish a relationship between AVIRIS data and the crown closure derived from aerial photography of a forested test site within the Interior Douglas Fir biogeoclimatic zone in British Columbia, Canada; (2) to investigate the impact of atmospheric effects and the forest background on the correlation between AVIRIS data and crown closure estimates; and (3) to improve this relationship using multiple regression analysis.
Moderated regression analysis and Likert scales: too coarse for comfort.
Russell, C J; Bobko, P
1992-06-01
One of the most commonly accepted models of relationships among three variables in applied industrial and organizational psychology is the simple moderator effect. However, many authors have expressed concern over the general lack of empirical support for interaction effects reported in the literature. We demonstrate in the current sample that use of a continuous, dependent-response scale instead of a discrete, Likert-type scale, causes moderated regression analysis effect sizes to increase an average of 93%. We suggest that use of relatively coarse Likert scales to measure fine dependent responses causes information loss that, although varying widely across subjects, greatly reduces the probability of detecting true interaction effects. Specific recommendations for alternate research strategies are made. PMID:1601825
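The simple moderator effect referred to above is conventionally tested with a product term in a regression equation. The notation below is a standard textbook form, with $x$ the predictor, $z$ the moderator, and $y$ the dependent response; these symbols are introduced here for illustration and do not appear in the abstract.

```latex
% Main-effects model
y = \beta_0 + \beta_1 x + \beta_2 z + \varepsilon

% Moderated model: the effect of x on y depends on z through \beta_3
y = \beta_0 + \beta_1 x + \beta_2 z + \beta_3 \, x z + \varepsilon

% Effect size of the interaction: increment in explained variance
\Delta R^2 = R^2_{\text{moderated}} - R^2_{\text{main effects}}
```

Coarsening $y$ onto a few Likert categories discards information about the response, which is the mechanism the authors propose for the reduced interaction effect sizes.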
A Visual Analytics Approach for Correlation, Classification, and Regression Analysis
Steed, Chad A; SwanII, J. Edward; Fitzpatrick, Patrick J.; Jankun-Kelly, T.J.
2013-01-01
New approaches that combine the strengths of humans and machines are necessary to equip analysts with the proper tools for exploring today's increasingly complex, multivariate data sets. In this paper, a visual data mining framework, called the Multidimensional Data eXplorer (MDX), is described that addresses the challenges of today's data by combining automated statistical analytics with a highly interactive parallel coordinates based canvas. In addition to several intuitive interaction capabilities, this framework offers a rich set of graphical statistical indicators, interactive regression analysis, visual correlation mining, automated axis arrangements and filtering, and data classification techniques. This chapter provides a detailed description of the system as well as a discussion of key design aspects and critical feedback from domain experts.
ADVANTAGES OF USING REGRESSION ANALYSIS TO CALCULATE RESULTS OF CHRONIC TOXICITY TESTS
Although it is traditional to calculate results of chronic toxicity tests using hypothesis testing to detect statistically significant differences from the control, calculation of results using regression analysis offers several major advantages. Regression analysis can directly ...
Spatial regression analysis of traffic crashes in Seoul.
Rhee, Kyoung-Ah; Kim, Joon-Ki; Lee, Young-ihn; Ulfarsson, Gudmundur F
2016-06-01
Traffic crashes can be spatially correlated events and the analysis of the distribution of traffic crash frequency requires evaluation of parameters that reflect spatial properties and correlation. Typically this spatial aspect of crash data is not used in everyday practice by planning agencies and this contributes to a gap between research and practice. A database of traffic crashes in Seoul, Korea, in 2010 was developed at the traffic analysis zone (TAZ) level with a number of GIS developed spatial variables. Practical spatial models using available software were estimated. The spatial error model was determined to be better than the spatial lag model and an ordinary least squares baseline regression. A geographically weighted regression model provided useful insights about localization of effects. The results found that an increased length of roads with speed limit below 30 km/h and a higher ratio of residents below age of 15 were correlated with lower traffic crash frequency, while a higher ratio of residents who moved to the TAZ, more vehicle-kilometers traveled, and a greater number of access points with speed limit difference between side roads and mainline above 30 km/h all increased the number of traffic crashes. This suggests, for example, that better control or design for merging lower speed roads with higher speed roads is important. A key result is that the length of bus-only center lanes had the largest effect on increasing traffic crashes. This is important as bus-only center lanes with bus stop islands have been increasingly used to improve transit times. Hence the potential negative safety impacts of such systems need to be studied further and mitigated through improved design of pedestrian access to center bus stop islands. PMID:26994374
Standardized Regression Coefficients as Indices of Effect Sizes in Meta-Analysis
ERIC Educational Resources Information Center
Kim, Rae Seon
2011-01-01
When conducting a meta-analysis, it is common to find many collected studies that report regression analyses, because multiple regression analysis is widely used in many fields. Meta-analysis uses effect sizes drawn from individual studies as a means of synthesizing a collection of results. However, indices of effect size from regression analyses…
Integrated analysis of incidence, progression, regression and disappearance probabilities
Huang, Guan-Hua
2008-01-01
Background Age-related maculopathy (ARM) is a leading cause of vision loss in people aged 65 or older. ARM is distinctive in that it is a disease which can transition through incidence, progression, regression and disappearance. The purpose of this study is to develop methodologies for studying the relationship of risk factors with different transition probabilities. Methods Our framework for studying this relationship includes two different analytical approaches. In the first approach, one can define, model and estimate the relationship between each transition probability and risk factors separately. This approach is similar to constraining a population to a certain disease status at the baseline, and then analyzing the probability of the constrained population to develop a different status. While this approach is intuitive, one risks losing available information while at the same time running into the problem of insufficient sample size. The second approach specifies a transition model for analyzing such a disease. This model provides the conditional probability of a current disease status based upon a previous status, and can therefore jointly analyze all transition probabilities. Throughout the paper, an analysis to determine the birth cohort effect on ARM is used as an illustration. Results and conclusion This study has found parallel separate and joint analyses to be more enlightening than any analysis in isolation. By implementing both approaches, one can obtain more reliable and more efficient results. PMID:18577235
Mixed-effects Poisson regression analysis of adverse event reports
Gibbons, Robert D.; Segawa, Eisuke; Karabatsos, George; Amatya, Anup K.; Bhaumik, Dulal K.; Brown, C. Hendricks; Kapur, Kush; Marcus, Sue M.; Hur, Kwan; Mann, J. John
2008-01-01
SUMMARY A new statistical methodology is developed for the analysis of spontaneous adverse event (AE) reports from post-marketing drug surveillance data. The method involves both empirical Bayes (EB) and fully Bayes estimation of rate multipliers for each drug within a class of drugs, for a particular AE, based on a mixed-effects Poisson regression model. Both parametric and semiparametric models for the random-effect distribution are examined. The method is applied to data from Food and Drug Administration (FDA)’s Adverse Event Reporting System (AERS) on the relationship between antidepressants and suicide. We obtain point estimates and 95 per cent confidence (posterior) intervals for the rate multiplier for each drug (e.g. antidepressants), which can be used to determine whether a particular drug has an increased risk of association with a particular AE (e.g. suicide). Confidence (posterior) intervals that do not include 1.0 provide evidence for either significant protective or harmful associations of the drug and the adverse effect. We also examine EB, parametric Bayes, and semiparametric Bayes estimators of the rate multipliers and associated confidence (posterior) intervals. Results of our analysis of the FDA AERS data revealed that newer antidepressants are associated with lower rates of suicide adverse event reports compared with older antidepressants. We recommend improvements to the existing AERS system, which are likely to improve its public health value as an early warning system. PMID:18404622
Risk factors for temporomandibular disorder: Binary logistic regression analysis
Magalhães, Bruno G.; de-Sousa, Stéphanie T.; de Mello, Victor V C.; da-Silva-Barbosa, André C.; de-Assis-Morais, Mariana P L.; Barbosa-Vasconcelos, Márcia M V.
2014-01-01
Objectives: To analyze the influence of socioeconomic and demographic factors (gender, economic class, age and marital status) on the occurrence of temporomandibular disorder. Study Design: One hundred individuals from urban areas in the city of Recife (Brazil) registered at Family Health Units were examined using Axis I of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD), which addresses myofascial pain and joint problems (disc displacement, arthralgia, osteoarthritis and osteoarthrosis). The Brazilian Economic Classification Criteria (CCEB) were used for the collection of socioeconomic and demographic data. Economic class was then categorized as Class A (high social class), Classes B/C (middle class) and Classes D/E (very poor social class). The results were analyzed using Pearson’s chi-square test for proportions, Fisher’s exact test, the nonparametric Mann-Whitney test and binary logistic regression analysis. Results: None of the participants belonged to Class A, 72% belonged to Classes B/C and 28% belonged to Classes D/E. The multivariate analysis revealed that participants from Classes D/E had a 4.35-fold greater chance of exhibiting myofascial pain and an 11.3-fold greater chance of exhibiting joint problems. Conclusions: Poverty is an important condition for exhibiting myofascial pain and joint problems. Key words: Temporomandibular joint disorders, risk factors, prevalence. PMID:24316706
Janssen, I.; Stebbings, J.H.
1990-01-01
In environmental epidemiology, trace and toxic substance concentrations frequently have very highly skewed distributions ranging over one or more orders of magnitude, and prediction by conventional regression is often poor. Classification and Regression Tree Analysis (CART) is an alternative in such contexts. To compare the techniques, two Pennsylvania data sets and three independent variables are used: house radon progeny (RnD) and gamma levels as predicted by construction characteristics in 1330 houses; and approximately 200 house radon (Rn) measurements as predicted by topographic parameters. CART may identify structural variables of interest not identified by conventional regression, and vice versa, but in general the regression models are similar. CART has major advantages in dealing with other common characteristics of environmental data sets, such as missing values, continuous variables requiring transformations, and large sets of potential independent variables. CART is most useful in the identification and screening of independent variables, greatly reducing the need for cross-tabulations and nested breakdown analyses. There is no need to discard cases with missing values for the independent variables because surrogate variables are intrinsic to CART. The tree-structured approach is also independent of the scale on which the independent variables are measured, so that transformations are unnecessary. CART identifies important interactions as well as main effects. The major advantages of CART appear to be in exploring data. Once the important variables are identified, conventional regressions seem to lead to results similar but more interpretable by most audiences. 12 refs., 8 figs., 10 tabs.
Analysis of retirement income adequacy using quantile regression: A case study in Malaysia
NASA Astrophysics Data System (ADS)
Alaudin, Ros Idayuwati; Ismail, Noriszura; Isa, Zaidi
2015-09-01
Quantile regression is a statistical method that does not restrict attention to the conditional mean and therefore permits approximation of the whole conditional distribution of a response variable. It is also more robust to outliers than mean regression models. In this paper, we demonstrate how the quantile regression approach can be used to analyze the ratio of projected wealth to needs (wealth-needs ratio) during retirement.
The Variance Normalization Method of Ridge Regression Analysis.
ERIC Educational Resources Information Center
Bulcock, J. W.; And Others
The testing of contemporary sociological theory often calls for the application of structural-equation models to data which are inherently collinear. It is shown that simple ridge regression, which is commonly used for controlling the instability of ordinary least squares regression estimates in ill-conditioned data sets, is not a legitimate…
An Effect Size for Regression Predictors in Meta-Analysis
ERIC Educational Resources Information Center
Aloe, Ariel M.; Becker, Betsy Jane
2012-01-01
A new effect size representing the predictive power of an independent variable from a multiple regression model is presented. The index, denoted as r[subscript sp], is the semipartial correlation of the predictor with the outcome of interest. This effect size can be computed when multiple predictor variables are included in the regression model…
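As a hedged illustration of the index described above: for a model with only two predictors, the semipartial correlation of the first predictor with the outcome can be computed from the three pairwise correlations, and its square equals the gain in R-squared from adding that predictor. The function and argument names below are illustrative assumptions, not taken from the article.

```python
import math


def semipartial_r(r_y1, r_y2, r_12):
    """Semipartial correlation of predictor 1 with outcome y.

    Predictor 2 is partialled out of predictor 1 only (not out of y).
    r_y1, r_y2: correlations of y with predictors 1 and 2.
    r_12: correlation between the two predictors.
    """
    return (r_y1 - r_y2 * r_12) / math.sqrt(1.0 - r_12 ** 2)


def r_squared_two_predictors(r_y1, r_y2, r_12):
    """Squared multiple correlation for y regressed on both predictors."""
    return (r_y1 ** 2 + r_y2 ** 2 - 2.0 * r_y1 * r_y2 * r_12) / (1.0 - r_12 ** 2)


# Illustrative (made-up) correlations.
r_sp = semipartial_r(0.5, 0.3, 0.6)
# The squared semipartial is the R-squared increment over the model
# that contains predictor 2 alone (whose R-squared is r_y2 ** 2).
increment = r_squared_two_predictors(0.5, 0.3, 0.6) - 0.3 ** 2
```

With more than two predictors the same quantity would normally be obtained as the square root of the R-squared increment, with the sign taken from the predictor's regression coefficient.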
Dunstan, H. M.; Green-Willms, N. S.; Fox, T. D.
1997-01-01
We have used mutational and revertant analysis to study the elements of the 54-nucleotide COX2 5'-untranslated leader involved in translation initiation in yeast mitochondria and in activation by the COX2 translational activator, Pet111p. We generated a collection of mutants with substitutions spanning the entire COX2 5'-UTL by in vitro mutagenesis followed by mitochondrial transformation and gene replacement. The phenotypes of these mutants delimit a 31-nucleotide segment, from -16 to -46, that contains several short sequence elements necessary for COX2 5'-UTL function in translation. The sequences from -16 to -47 were shown to be partially sufficient to promote translation in a foreign context. Analysis of revertants of both the series of linker-scanning alleles and two short deletion/insertion alleles has refined the positions of several possible functional elements of the COX2 5'-untranslated leader, including a putative RNA stem-loop structure that functionally interacts with Pet111p and an octanucleotide sequence present in all S. cerevisiae mitochondrial mRNA 5'-UTLs that is a potential rRNA binding site. PMID:9286670
A Novel Multiobjective Evolutionary Algorithm Based on Regression Analysis
Song, Zhiming; Wang, Maocai; Dai, Guangming; Vasile, Massimiliano
2015-01-01
As is known, the Pareto set of a continuous multiobjective optimization problem with m objective functions is a piecewise continuous (m − 1)-dimensional manifold in the decision space under some mild conditions. However, how to utilize the regularity to design multiobjective optimization algorithms has become the research focus. In this paper, based on this regularity, a model-based multiobjective evolutionary algorithm with regression analysis (MMEA-RA) is put forward to solve continuous multiobjective optimization problems with variable linkages. In the algorithm, the optimization problem is modelled as a promising area in the decision space by a probability distribution, and the centroid of the probability distribution is (m − 1)-dimensional piecewise continuous manifold. The least squares method is used to construct such a model. A selection strategy based on the nondominated sorting is used to choose the individuals to the next generation. The new algorithm is tested and compared with NSGA-II and RM-MEDA. The result shows that MMEA-RA outperforms RM-MEDA and NSGA-II on the test instances with variable linkages. At the same time, MMEA-RA has higher efficiency than the other two algorithms. A few shortcomings of MMEA-RA have also been identified and discussed in this paper. PMID:25874246
A flexible count data regression model for risk analysis.
Guikema, Seth D; Coffelt, Jeremy P; Goffelt, Jeremy P
2008-02-01
In many cases, risk and reliability analyses involve estimating the probabilities of discrete events such as hardware failures and occurrences of disease or death. There is often additional information in the form of explanatory variables that can be used to help estimate the likelihood of different numbers of events in the future through the use of an appropriate regression model, such as a generalized linear model. However, existing generalized linear models (GLM) are limited in their ability to handle the types of variance structures often encountered in using count data in risk and reliability analysis. In particular, standard models cannot handle both underdispersed data (variance less than the mean) and overdispersed data (variance greater than the mean) in a single coherent modeling framework. This article presents a new GLM based on a reformulation of the Conway-Maxwell Poisson (COM) distribution that is useful for both underdispersed and overdispersed count data and demonstrates this model by applying it to the assessment of electric power system reliability. The results show that the proposed COM GLM can fit overdispersed data sets as well as the commonly used existing models while outperforming these commonly used models for underdispersed data sets. PMID:18304118
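The dispersion behaviour mentioned above can be sketched numerically. The snippet below is a minimal illustration that assumes the standard Conway-Maxwell Poisson parameterization, P(Y = y) proportional to lam**y / (y!)**nu, not the reformulated version used in the article; the function names and the truncation point max_y are illustrative assumptions.

```python
import math


def com_poisson_pmf(lam, nu, max_y=200):
    """Return [P(Y=0), ..., P(Y=max_y)] for a COM-Poisson distribution.

    The infinite normalizing series is truncated at max_y, which is an
    approximation that is adequate only when the tail mass is negligible.
    """
    # Work on the log scale to avoid overflow in lam**y and (y!)**nu.
    log_terms = [y * math.log(lam) - nu * math.lgamma(y + 1)
                 for y in range(max_y + 1)]
    shift = max(log_terms)
    weights = [math.exp(t - shift) for t in log_terms]
    total = sum(weights)
    return [w / total for w in weights]


def mean_and_variance(pmf):
    """Mean and variance of a distribution given as a list of probabilities."""
    mean = sum(y * p for y, p in enumerate(pmf))
    var = sum((y - mean) ** 2 * p for y, p in enumerate(pmf))
    return mean, var


# nu = 1 recovers the ordinary Poisson (variance equals the mean);
# nu > 1 gives underdispersion and nu < 1 gives overdispersion.
poisson_like = mean_and_variance(com_poisson_pmf(3.0, 1.0))
under = mean_and_variance(com_poisson_pmf(3.0, 2.0))
over = mean_and_variance(com_poisson_pmf(3.0, 0.5))
```

A single shape parameter therefore covers both variance regimes, which is the property the abstract highlights.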
Elghafghuf, Adel; Dufour, Simon; Reyher, Kristen; Dohoo, Ian; Stryhn, Henrik
2014-12-01
Mastitis is a complex disease affecting dairy cows and is considered to be the most costly disease of dairy herds. The hazard of mastitis is a function of many factors, both managerial and environmental, making its control a difficult issue to milk producers. Observational studies of clinical mastitis (CM) often generate datasets with a number of characteristics which influence the analysis of those data: the outcome of interest may be the time to occurrence of a case of mastitis, predictors may change over time (time-dependent predictors), the effects of factors may change over time (time-dependent effects), there are usually multiple hierarchical levels, and datasets may be very large. Analysis of such data often requires expansion of the data into the counting-process format - leading to larger datasets - thus complicating the analysis and requiring excessive computing time. In this study, a nested frailty Cox model with time-dependent predictors and effects was applied to Canadian Bovine Mastitis Research Network data in which 10,831 lactations of 8035 cows from 69 herds were followed through lactation until the first occurrence of CM. The model was fit to the data as a Poisson model with nested normally distributed random effects at the cow and herd levels. Risk factors associated with the hazard of CM during the lactation were identified, such as parity, calving season, herd somatic cell score, pasture access, fore-stripping, and proportion of treated cases of CM in a herd. The analysis showed that most of the predictors had a strong effect early in lactation and also demonstrated substantial variation in the baseline hazard among cows and between herds. A small simulation study for a setting similar to the real data was conducted to evaluate the Poisson maximum likelihood estimation approach with both Gaussian quadrature method and Laplace approximation. Further, the performance of the two methods was compared with the performance of a widely used estimation
Kammarnjesadakul, Patcharee; Palaga, Tanapat; Sritunyalucksana, Kallaya; Mendoza, Leonel; Krajaejun, Theerapong; Vanittanakom, Nongnuch; Tongchusak, Songsak; Denduangboripant, Jessada; Chindamporn, Ariya
2011-04-01
To investigate the phylogenetic relationship among Pythium insidiosum isolates in Thailand, we investigated the genomic DNA of 31 P. insidiosum strains isolated from humans and environmental sources from Thailand, and two from North and Central America. We used PCR to amplify the partial COX II DNA coding sequences and the ITS regions of these isolates. The nucleotide sequences of both amplicons were analyzed by the Bioedit program. Phylogenetic analysis using genetic distance method with Neighbor Joining (NJ) approach was performed using the MEGA4 software. Additional sequences of three other Pythium species, Phytophthora sojae and Lagenidium giganteum were employed as outgroups. The sizes of the COX II amplicons varied from 558-564 bp, whereas the ITS products varied from approximately 871-898 bp. Corrected sequence divergences with Kimura 2-parameter model calculated for the COX II and the ITS DNA sequences ranged between 0.0000-0.0608 and 0.0000-0.2832, respectively. Phylogenetic analysis using both the COX II and the ITS DNA sequences showed similar trees, where we found three sister groups (A(TH), B(TH), and C(TH)) among P. insidiosum strains. All Thai isolates from clinical cases and environmental sources were placed in two separate sister groups (B(TH) and C(TH)), whereas the Americas isolates were grouped into A(TH). Although the phylogenetic tree based on both regions showed similar distribution, the COX II phylogenetic tree showed higher resolution than the one using the ITS sequences. Our study indicates that COX II gene is the better of the two alternatives to study the phylogenetic relationships among P. insidiosum strains. PMID:20818919
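For readers unfamiliar with the Kimura 2-parameter correction mentioned above, the following is a minimal sketch of the pairwise distance for two pre-aligned DNA sequences. It is not the MEGA4 implementation; the function name, the handling of gaps (sites with any non-ACGT symbol are skipped) and the example sequences are assumptions made for illustration.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a, seq_b):
    """Kimura 2-parameter distance between two aligned DNA sequences.

    d = -1/2 * ln(1 - 2P - Q) - 1/4 * ln(1 - 2Q), where P and Q are the
    proportions of sites showing transitions and transversions.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: skip gaps and ambiguous bases
        sites += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; everything else is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites
    q = transversions / sites
    # math.log raises ValueError if the sequences are too divergent
    # for the correction to be defined.
    return -0.5 * math.log(1.0 - 2.0 * p - q) - 0.25 * math.log(1.0 - 2.0 * q)


# Made-up 10-site example with a single A->G transition.
d = k2p_distance("ACGTACGTAC", "GCGTACGTAC")
```

A matrix of such pairwise distances is the usual input to a neighbor-joining tree builder.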
Regression analysis of technical parameters affecting nuclear power plant performances
Ghazy, R.; Ricotti, M. E.; Trueco, P.
2012-07-01
Since the 80's many studies have been conducted in order to explain good and bad performances of commercial nuclear power plants (NPPs), yet no defined correlation has been found to be fully representative of plant operational experience. In early works, data availability and the number of operating power stations were both limited; therefore, results showed that specific technical characteristics of NPPs were supposed to be the main causal factors for successful plant operation. Although these aspects keep on assuming a significant role, later studies and observations showed that other factors concerning management and organization of the plant could instead be predominant when comparing utilities' operational and economic results. Utility quality, in a word, can be used to summarize all the managerial and operational aspects that seem to be effective in determining plant performance. In this paper operational data of a consistent sample of commercial nuclear power stations, out of the total 433 operating NPPs, are analyzed, mainly focusing on the last decade of operational experience. The sample consists of PWR and BWR technology, operated by utilities located in different countries, including the U.S., Japan, France, Germany, and Finland. Multivariate regression is performed using Unit Capability Factor (UCF) as the dependent variable; this factor reflects the effectiveness of plant programs and practices in maximizing the available electrical generation and consequently provides an overall indication of how well plants are operated and maintained. Aspects that may not be real causal factors but which can have a consistent impact on the UCF, such as technology design, supplier, size and age, are included in the analysis as independent variables. (authors)
Prognostic models in coronary artery disease: Cox and network approaches
Mora, Antonio; Sicari, Rosa; Cortigiani, Lauro; Carpeggiani, Clara; Picano, Eugenio; Capobianco, Enrico
2015-01-01
Predictive assessment of the risk of developing cardiovascular diseases is usually provided by computational approaches centred on Cox models. The complex interdependence structure underlying clinical data patterns can limit the performance of Cox analysis and complicate the interpretation of results, thus calling for complementary and integrative methods. Prognostic models are proposed for studying the risk associated with patients with known or suspected coronary artery disease (CAD) undergoing vasodilator stress echocardiography, an established technique for CAD detection and prognostication. In order to complement standard Cox models, network inference is considered a possible solution to quantify the complex relationships between heterogeneous data categories. In particular, a mutual information network is designed to explore the paths linking patient-associated variables to endpoint events, to reveal prognostic factors and to identify the best possible predictors of death. Data from a prospective, multicentre, observational study are available from a previous study, based on 4313 patients (2532 men; 64±11 years) with known (n=1547) or suspected (n=2766) CAD, who underwent high-dose dipyridamole (0.84 mg kg−1 over 6 min) stress echocardiography with coronary flow reserve (CFR) evaluation of left anterior descending (LAD) artery by Doppler. The overall mortality was the only endpoint analysed by Cox models. The estimated connectivity between clinical variables assigns a complementary value to the proposed network approach in relation to the established Cox model, for instance revealing connectivity paths. Depending on the use of multiple metrics, the constraints of regression analysis in measuring the association strength among clinical variables can be relaxed, and identification of communities and prognostic paths can be provided. On the basis of evidence from various model comparisons, we show in this CAD study that there may be characteristic
Prognostic models in coronary artery disease: Cox and network approaches.
Mora, Antonio; Sicari, Rosa; Cortigiani, Lauro; Carpeggiani, Clara; Picano, Eugenio; Capobianco, Enrico
2015-02-01
Predictive assessment of the risk of developing cardiovascular diseases is usually provided by computational approaches centred on Cox models. The complex interdependence structure underlying clinical data patterns can limit the performance of Cox analysis and complicate the interpretation of results, thus calling for complementary and integrative methods. Prognostic models are proposed for studying the risk associated with patients with known or suspected coronary artery disease (CAD) undergoing vasodilator stress echocardiography, an established technique for CAD detection and prognostication. In order to complement standard Cox models, network inference is considered a possible solution to quantify the complex relationships between heterogeneous data categories. In particular, a mutual information network is designed to explore the paths linking patient-associated variables to endpoint events, to reveal prognostic factors and to identify the best possible predictors of death. Data from a prospective, multicentre, observational study are available from a previous study, based on 4313 patients (2532 men; 64±11 years) with known (n=1547) or suspected (n=2766) CAD, who underwent high-dose dipyridamole (0.84 mg kg(-1) over 6 min) stress echocardiography with coronary flow reserve (CFR) evaluation of left anterior descending (LAD) artery by Doppler. The overall mortality was the only endpoint analysed by Cox models. The estimated connectivity between clinical variables assigns a complementary value to the proposed network approach in relation to the established Cox model, for instance revealing connectivity paths. Depending on the use of multiple metrics, the constraints of regression analysis in measuring the association strength among clinical variables can be relaxed, and identification of communities and prognostic paths can be provided. On the basis of evidence from various model comparisons, we show in this CAD study that there may be characteristic
Quantile regression provides a fuller analysis of speed data.
Hewson, Paul
2008-03-01
Considerable interest already exists in terms of assessing percentiles of speed distributions, for example monitoring the 85th percentile speed is a common feature of the investigation of many road safety interventions. However, unlike the mean, where t-tests and ANOVA can be used to provide evidence of a statistically significant change, inference on these percentiles is much less common. This paper examines the potential role of quantile regression for modelling the 85th percentile, or any other quantile. Given that crash risk may increase disproportionately with increasing relative speed, it may be argued these quantiles are of more interest than the conditional mean. In common with the more usual linear regression, quantile regression admits a simple test as to whether the 85th percentile speed has changed following an intervention in an analogous way to using the t-test to determine if the mean speed has changed by considering the significance of parameters fitted to a design matrix. Having briefly outlined the technique and briefly examined an application with a widely published dataset concerning speed measurements taken around the introduction of signs in Cambridgeshire, this paper will demonstrate the potential for quantile regression modelling by examining recent data from Northamptonshire collected in conjunction with a "community speed watch" programme. Freely available software is used to fit these models and it is hoped that the potential benefits of using quantile regression methods when examining and analysing speed data are demonstrated. PMID:18329400
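To make the idea of modelling a percentile concrete, the sketch below shows the asymmetric "check" (pinball) loss that quantile regression minimizes, applied to the simplest case of an intercept-only model, where the minimizer is a sample quantile. The speed values are made up for illustration and the function names are assumptions; this is not the software or the data used in the paper.

```python
def pinball_loss(q, ys, tau):
    """Total check loss of predicting the constant q for observations ys.

    Residuals above q are weighted by tau and residuals below q by
    (1 - tau), so for tau = 0.85 under-prediction is penalized more.
    """
    total = 0.0
    for y in ys:
        u = y - q
        total += u * tau if u >= 0 else u * (tau - 1.0)
    return total


def sample_quantile_by_check_loss(ys, tau):
    """Pick the observation that minimizes the check loss."""
    return min(ys, key=lambda q: pinball_loss(q, ys, tau))


# Illustrative (made-up) spot speeds in mph.
speeds = [28, 31, 33, 35, 36, 38, 40, 42, 47, 55]
p85 = sample_quantile_by_check_loss(speeds, 0.85)  # 47 for this sample
```

In a full quantile regression the constant q is replaced by a linear predictor built from a design matrix, so a before/after indicator column gives a direct estimate of the change in the 85th percentile.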
Seyedmajidi, Maryam; Shafaee, Shahryar; Siadati, Sepideh; Moghaddam, Elham Alizadeh; Ghasemi, Nafiseh; Bijani, Ali; Najafi, Mostafa
2015-01-01
Background: Cyclo-oxygenase-2 (COX-2) is an early response gene that is induced by growth factors, oncogenes and carcinogens and its expression is increased in various tumors. Increased expression of COX-2 plays a significant role in the development and growth of tumors by interfering in biological processes such as cell division, cellular immunity, cell adhesion, apoptosis, and angiogenesis. This study aimed to investigate the immunohistochemical expression of COX-2 in keratocystic odontogenic tumor (KOT) in comparison with ameloblastoma and dentigerous cyst with regards to different clinical behavior and histopathological features of these lesions. Materials and Methods: Paraffined blocks of 45 cases including 15 cases of dentigerous cyst, 15 cases of KOT and 15 cases of ameloblastoma were stained with immunohistochemical method for COX-2. Five high-power fields of each sample were evaluated to determine the percentage of stained cells and the intensity of staining. Degree of immunoreactivity was obtained from the sum of two. Statistical evaluation was performed by the Kruskal-Wallis and ANOVA Mann-Whitney test (P < 0.05). Results: Overexpression of COX-2 in ameloblastoma and KOT was observed compared with dentigerous cyst (P < 0.001). However, no significant difference was observed between the expression of COX-2 in ameloblastoma and KOT (P = 0.148). Conclusion: The COX-2 expression in odontogenic tumors such as ameloblastoma and cystic neoplasm with aggressive behavior such as KOT increases. However, it does not seem that COX-2 affects the development and growth of cysts with noninvasive behavior like dentigerous cyst. PMID:26005470
Technology Transfer Automated Retrieval System (TEKTRAN)
Selective principal component regression analysis (SPCR) uses a subset of the original image bands for principal component transformation and regression. For optimal band selection before the transformation, this paper used genetic algorithms (GA). In this case, the GA process used the regression co...
Exact Analysis of Squared Cross-Validity Coefficient in Predictive Regression Models
ERIC Educational Resources Information Center
Shieh, Gwowen
2009-01-01
In regression analysis, the notion of population validity is of theoretical interest for describing the usefulness of the underlying regression model, whereas the presumably more important concept of population cross-validity represents the predictive effectiveness for the regression equation in future research. It appears that the inference…
NASA Technical Reports Server (NTRS)
Parsons, Vickie S.
2009-01-01
The request to conduct an independent review of regression models, developed for determining the expected Launch Commit Criteria (LCC) External Tank (ET)-04 cycle count for the Space Shuttle ET tanking process, was submitted to the NASA Engineering and Safety Center (NESC) on September 20, 2005. The NESC team performed an independent review of regression models documented in Prepress Regression Analysis, Tom Clark and Angela Krenn, 10/27/05. This consultation consisted of a peer review by statistical experts of the proposed regression models provided in the Prepress Regression Analysis. This document is the consultation's final report.
Teaching Quantitative Literacy through a Regression Analysis of Exam Performance
ERIC Educational Resources Information Center
Lindner, Andrew M.
2012-01-01
Quantitative literacy is increasingly essential for both informed citizenship and a variety of careers. Though regression is one of the most common methods in quantitative sociology, it is rarely taught until late in students' college careers. In this article, the author describes a classroom-based activity introducing students to regression…
Analysis and Interpretation of Findings Using Multiple Regression Techniques
ERIC Educational Resources Information Center
Hoyt, William T.; Leierer, Stephen; Millington, Michael J.
2006-01-01
Multiple regression and correlation (MRC) methods form a flexible family of statistical techniques that can address a wide variety of different types of research questions of interest to rehabilitation professionals. In this article, we review basic concepts and terms, with an emphasis on interpretation of findings relevant to research questions…
Growth in Mathematics Achievement: Analysis with Classification and Regression Trees
ERIC Educational Resources Information Center
Ma, Xin
2005-01-01
A recently developed statistical technique, often referred to as classification and regression trees (CART), holds great potential for researchers to discover how student-level (and school-level) characteristics interactively affect growth in mathematics achievement. CART is a host of advanced statistical methods that statistically cluster…
Grades, Gender, and Encouragement: A Regression Discontinuity Analysis
ERIC Educational Resources Information Center
Owen, Ann L.
2010-01-01
The author employs a regression discontinuity design to provide direct evidence on the effects of grades earned in economics principles classes on the decision to major in economics and finds a differential effect for male and female students. Specifically, for female students, receiving an A for a final grade in the first economics class is…
HIGH RESOLUTION FOURIER ANALYSIS WITH AUTO-REGRESSIVE LINEAR PREDICTION
Barton, J.; Shirley, D.A.
1984-04-01
Auto-regressive linear prediction is adapted to double the resolution of Angle-Resolved Photoemission Extended Fine Structure (ARPEFS) Fourier transforms. Even with the optimal taper (weighting function), the commonly used taper-and-transform Fourier method has limited resolution: it assumes the signal is zero beyond the limits of the measurement. By seeking the Fourier spectrum of an infinite extent oscillation consistent with the measurements but otherwise having maximum entropy, the errors caused by finite data range can be reduced. Our procedure developed to implement this concept adapts auto-regressive linear prediction to extrapolate the signal in an effective and controllable manner. Difficulties encountered when processing actual ARPEFS data are discussed. A key feature of this approach is the ability to convert improved measurements (signal-to-noise or point density) into improved Fourier resolution.
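The extrapolation idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' procedure: a plain least-squares fit of a second-order auto-regressive model stands in for their maximum-entropy approach, and the test signal, the names `fit_ar2` and `extrapolate`, and the frequency `omega` are all hypothetical. The sketch fits the prediction coefficients on a finite record and then extends the record beyond the measured range instead of assuming it is zero there.

```python
import math

def fit_ar2(signal):
    """Least-squares estimate of a1, a2 in x[n] = a1*x[n-1] + a2*x[n-2]."""
    s11 = s12 = s22 = r1 = r2 = 0.0
    for n in range(2, len(signal)):
        p1, p2, target = signal[n - 1], signal[n - 2], signal[n]
        s11 += p1 * p1
        s12 += p1 * p2
        s22 += p2 * p2
        r1 += p1 * target
        r2 += p2 * target
    # Solve the 2x2 normal equations with Cramer's rule.
    det = s11 * s22 - s12 * s12
    a1 = (r1 * s22 - r2 * s12) / det
    a2 = (s11 * r2 - s12 * r1) / det
    return a1, a2

def extrapolate(signal, a1, a2, extra):
    """Append `extra` predicted samples using the fitted recursion."""
    out = list(signal)
    for _ in range(extra):
        out.append(a1 * out[-1] + a2 * out[-2])
    return out

# Hypothetical measurement: a noise-free cosine observed over 40 points.
omega = 0.3
measured = [math.cos(omega * n) for n in range(40)]
a1, a2 = fit_ar2(measured)
extended = extrapolate(measured, a1, a2, 40)  # record is now 80 points long
```

A pure sinusoid satisfies a second-order recursion exactly, which is why order 2 suffices here; real data with noise and several frequencies would need a higher order and a more careful estimator.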
Yang, Man; Wang, Hong-Tao; Zhao, Miao; Meng, Wen-Bo; Ou, Jin-Qing; He, Jun-Hui; Zou, Bing; Lei, Ping-Guang
2015-01-01
Currently, 2 different classes of cyclooxygenase (COX)-2 inhibitors, coxibs and relatively selective COX-2 inhibitors, are available for patients requiring nonsteroidal anti-inflammatory drug (NSAID) therapy; their gastroprotective effects have rarely been compared directly. The aim of this study was to compare the gastroprotective effect of relatively selective COX-2 inhibitors with coxibs. MEDLINE, EMBASE, and the Cochrane Library (from their inception to March 2015) were searched for potential eligible studies. We included randomized controlled trials comparing coxibs (celecoxib, etoricoxib, parecoxib, and lumiracoxib), relatively selective COX-2 inhibitors (nabumetone, meloxicam, and etodolac), and nonselective NSAIDs with a study duration ≥4 weeks. Comparative effectiveness and safety data were pooled by Bayesian network meta-analysis. The primary outcomes were ulcer complications and symptomatic ulcer. Summary effect-size was calculated as risk ratio (RR), together with the 95% confidence interval (CI). This study included 36 trials with a total of 112,351 participants. Network meta-analyses indicated no significant difference between relatively selective COX-2 inhibitors and coxibs regarding ulcer complications (RR, 1.38; 95% CI, 0.47–3.27), symptomatic ulcer (RR, 1.02; 95% CI, 0.09–3.92), and endoscopic ulcer (RR, 1.18; 95% CI, 0.37–2.96). Network meta-analyses adjusting potential influential factors (age, sex, previous ulcer disease, and follow-up time), and sensitivity analyses did not reveal any major change to the main results. Network meta-analyses suggested that relatively selective COX-2 inhibitors and coxibs were associated with comparable incidences of total adverse events (AEs) (RR, 1.09; 95% CI, 0.93–1.31), gastrointestinal AEs (RR, 1.04; 95% CI, 0.87–1.25), total withdrawals (RR, 1.00; 95% CI, 0.74–1.33), and gastrointestinal AE-related withdrawals (RR, 1.02; 95% CI, 0.57–1.74). Relatively selective COX-2 inhibitors appear to be
Using Robust Standard Errors to Combine Multiple Regression Estimates with Meta-Analysis
ERIC Educational Resources Information Center
Williams, Ryan T.
2012-01-01
Combining multiple regression estimates with meta-analysis has continued to be a difficult task. A variety of methods have been proposed and used to combine multiple regression slope estimates with meta-analysis, however, most of these methods have serious methodological and practical limitations. The purpose of this study was to explore the use…
Analysis of apoptosis during hair follicle regression (catagen)
Lindner, G.; Botchkarev, V. A.; Botchkareva, N. V.; Ling, G.; van der Veen, C.; Paus, R.
1997-01-01
Keratinocyte apoptosis is a central element in the regulation of hair follicle regression (catagen), yet the exact location and the control of follicular keratinocyte apoptosis remain obscure. To generate an "apoptomap" of the hair follicle, we have studied selected apoptosis-associated parameters in the C57BL/6 mouse model for hair research during normal and pharmacologically manipulated, pathological catagen development. As assessed by terminal deoxynucleotide transferase dUTP fluorescein nick end-labeling (TUNEL) stain, apoptotic cells not only appeared in the regressing proximal follicle epithelium but, surprisingly, were also seen in the central inner root sheath, in the bulge/isthmus region, and in the secondary germ, but never in the dermal papilla. These apoptosis hot spots during catagen development correlated largely with a down-regulation of the Bcl-2/Bax ratio but only poorly with the expression patterns of interleukin-1beta converting enzyme, p55TNFR, and Fas/Apo-1 immunoreactivity. Instead, a higher correlation was found with p75NTR expression. During cyclophosphamide-induced follicle dystrophy and alopecia, massive keratinocyte apoptosis occurred in the entire proximal hair bulb, except in the dermal papilla, despite a strong up-regulation of Bax and p75NTR immunoreactivity. Selected receptors of the tumor necrosis factor/nerve growth factor family and members of the Bcl-2 family may also play a key role in the control of follicular keratinocyte apoptosis in situ. PMID:9403711
Striker, Lora K.; Medalie, Laura
1997-01-01
This report provides the results of a detailed Level II analysis of scour potential at structure MORETH00010021 on Town Highway 1 crossing Cox Brook, Moretown, Vermont (figures 1–8). A Level II study is a basic engineering analysis of the site, including a quantitative analysis of stream stability and scour (U.S. Department of Transportation, 1993). Results of a Level I scour investigation also are included in Appendix E of this report. A Level I investigation provides a qualitative geomorphic characterization of the study site. Information on the bridge, gleaned from Vermont Agency of Transportation (VTAOT) files, was compiled prior to conducting Level I and Level II analyses and is found in Appendix D. The site is in the Green Mountain section of the New England physiographic province in north-central Vermont. The 2.85-mi² drainage area is in a predominantly rural and forested basin. In the vicinity of the study site, the surface cover is predominantly forested. In the study area, Cox Brook has an incised, sinuous channel with a slope of approximately 0.02 ft/ft, an average channel top width of 23 ft and an average bank height of 4 ft. The channel bed material ranges from gravel to cobble with a median grain size (D50) of 47.5 mm (0.156 ft). The geomorphic assessment at the time of the Level I and Level II site visit on July 18, 1996, indicated that the reach was stable. The Town Highway 1 crossing of Cox Brook is a 29-ft-long, two-lane bridge consisting of one 27-foot steel-beam span (Vermont Agency of Transportation, written communication, October 13, 1995). The opening length of the structure parallel to the bridge face is 24.8 ft. The bridge is supported by vertical, concrete abutments with wingwalls. The channel is skewed approximately 60 degrees to the opening while the measured opening-skew-to-roadway is 40 degrees. A scour hole 1.0 ft deeper than the mean thalweg depth was observed along the left abutment downstream during the Level I assessment. The
COX7AR is a Stress-inducible Mitochondrial COX Subunit that Promotes Breast Cancer Malignancy.
Zhang, Kezhong; Wang, Guohui; Zhang, Xuebao; Hüttemann, Philipp P; Qiu, Yining; Liu, Jenney; Mitchell, Allison; Lee, Icksoo; Zhang, Chao; Lee, Jin-Sook; Pecina, Petr; Wu, Guojun; Yang, Zeng-Quan; Hüttemann, Maik; Grossman, Lawrence I
2016-01-01
Cytochrome c oxidase (COX), the terminal enzyme of the mitochondrial respiratory chain, plays a key role in regulating mitochondrial energy production and cell survival. COX subunit VIIa polypeptide 2-like protein (COX7AR) is a novel COX subunit that was recently found to be involved in mitochondrial supercomplex assembly and mitochondrial respiration activity. Here, we report that COX7AR is expressed in high energy-demanding tissues, such as brain, heart, liver, and aggressive forms of human breast cancer cells. Under cellular stress that stimulates energy metabolism, COX7AR is induced and incorporated into the mitochondrial COX complex. Functionally, COX7AR promotes cellular energy production in human mammary epithelial cells. Gain- and loss-of-function analysis demonstrates that COX7AR is required for human breast cancer cells to maintain higher rates of proliferation, clone formation, and invasion. In summary, our study revealed that COX7AR is a stress-inducible mitochondrial COX subunit that facilitates human breast cancer malignancy. These findings have important implications in the understanding and treatment of human breast cancer and the diseases associated with mitochondrial energy metabolism. PMID:27550821
An improved multiple linear regression and data analysis computer program package
NASA Technical Reports Server (NTRS)
Sidik, S. M.
1972-01-01
NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
A regularized multivariate regression approach for eQTL analysis
Zhang, Hexin; Zhang, Yuzheng; Hsu, Li; Wang, Pei
2013-01-01
Expression quantitative trait loci (eQTLs) are genomic loci that regulate expression levels of mRNAs or proteins. Understanding these regulatory relationships provides important clues to biological pathways that underlie diseases. In this paper, we propose a new statistical method, GroupRemMap, for identifying eQTLs. We model the relationship between gene expression and single nucleotide variants (SNVs) through multivariate linear regression models, in which gene expression levels are responses and SNV genotypes are predictors. To handle the high-dimensionality as well as to incorporate the intrinsic group structure of SNVs, we introduce a new regularization scheme to (1) control the overall sparsity of the model; (2) encourage the group selection of SNVs from the same gene; and (3) facilitate the detection of trans-hub-eQTLs. We apply the proposed method to the colorectal and breast cancer data sets from The Cancer Genome Atlas (TCGA), and identify several biologically interesting eQTLs. These findings may provide insight into biological processes associated with cancers and generate hypotheses for future studies. PMID:26085849
Development of a User Interface for a Regression Analysis Software Tool
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert Manfred; Volden, Thomas R.
2010-01-01
An easy-to-use user interface was implemented in a highly automated regression analysis tool. The user interface was developed from the start to run on computers that use the Windows, Macintosh, Linux, or UNIX operating system. Many user interface features were specifically designed such that a novice or inexperienced user can apply the regression analysis tool with confidence. Therefore, the user interface's design minimizes interactive input from the user. In addition, reasonable default combinations are assigned to those analysis settings that influence the outcome of the regression analysis. These default combinations will lead to a successful regression analysis result for most experimental data sets. The user interface comes in two versions. The text user interface version is used for the ongoing development of the regression analysis tool. The official release of the regression analysis tool, on the other hand, has a graphical user interface that is more efficient to use. This graphical user interface displays all input file names, output file names, and analysis settings for a specific software application mode on a single screen which makes it easier to generate reliable analysis results and to perform input parameter studies. An object-oriented approach was used for the development of the graphical user interface. This choice keeps future software maintenance costs to a reasonable limit. Examples of both the text user interface and graphical user interface are discussed in order to illustrate the user interface's overall design approach.
A Noncentral "t" Regression Model for Meta-Analysis
ERIC Educational Resources Information Center
Camilli, Gregory; de la Torre, Jimmy; Chiu, Chia-Yi
2010-01-01
In this article, three multilevel models for meta-analysis are examined. Hedges and Olkin suggested that effect sizes follow a noncentral "t" distribution and proposed several approximate methods. Raudenbush and Bryk further refined this model; however, this procedure is based on a normal approximation. In the current research literature, this…
Advanced GIS Exercise: Predicting Rainfall Erosivity Index Using Regression Analysis
ERIC Educational Resources Information Center
Post, Christopher J.; Goddard, Megan A.; Mikhailova, Elena A.; Hall, Steven T.
2006-01-01
Graduate students from a variety of agricultural and natural resource fields are incorporating geographic information systems (GIS) analysis into their graduate research, creating a need for teaching methodologies that help students understand advanced GIS topics for use in their own research. Graduate-level GIS exercises help students understand…
ANOVA Versus Regression Analysis of ATI Designs: An Empirical Investigation.
ERIC Educational Resources Information Center
Thompson, Bruce
1986-01-01
This paper reports a Monte Carlo study of differences induced by different analysis choices over selected types of aptitude treatment interaction (ATI) data (nine combinations of three sample sizes and three population parameter effect sizes). Generally, ANOVA methods tended to overestimate smaller effect sizes and to underestimate larger effect…
NASA Astrophysics Data System (ADS)
Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa
2011-08-01
In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600 nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with human skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.
Analysis of Maryland Poisoning Deaths Using Classification And Regression Tree (CART) Analysis
Pamer, Carol; Serpi, Tracey; Finkelstein, Joseph
2008-01-01
Our study is a cross-sectional analysis of Maryland poisoning deaths for years 2003 and 2004. We used Classification and Regression Tree (CART) methodology to classify 1,204 Maryland undetermined intent poisoning deaths as either unintentional or suicidal poisonings. The predictive ability of the selected set of variables (i.e., poisoned in the home or workplace, location type where poisoned, place of death, poison type, victim race and age, year of death) was extremely good. Of the 301 test cases, only eight were misclassified by the CART regression tree. Of 1,204 undetermined intent poisoning deaths, CART classified 903 as suicides and 301 as unintentional deaths. The major strength of our study is the use of CART to differentiate with a high degree of accuracy between unintentional and suicidal poisoning deaths among Maryland undetermined intent poisoning deaths. PMID:18999168
Analysis for Regression Model Behavior by Sampling Strategy for Annual Pollutant Load Estimation.
Park, Youn Shik; Engel, Bernie A
2015-11-01
Water quality data are typically collected less frequently than streamflow data due to the cost of collection and analysis, and therefore water quality data may need to be estimated for additional days. Regression models are applicable to interpolate water quality data associated with streamflow data and have come to be extensively used, requiring relatively small amounts of data. There is a need to evaluate how well the regression models represent pollutant loads from intermittent water quality data sets. Both the specific regression model and water quality data frequency are important factors in pollutant load estimation. In this study, nine regression models from the Load Estimator (LOADEST) and one regression model from the Web-based Load Interpolation Tool (LOADIN) were evaluated with subsampled water quality data sets from daily measured water quality data sets for N, P, and sediment. Each water quality parameter had different correlations with streamflow, and the subsampled water quality data sets had various proportions of storm samples. The behaviors of the regression models differed not only by water quality parameter but also by proportion of storm samples. The regression models from LOADEST provided accurate and precise annual sediment and P load estimates using the water quality data of 20 to 40% storm samples. LOADIN provided more accurate and precise annual N load estimates than LOADEST. In addition, the results indicate that avoidance of water quality data extrapolation and availability of water quality data from storm events were crucial in annual pollutant load estimation using pollutant regression models. PMID:26641336
Barresi, Vincenza; Trovato-Salinaro, Angela; Spampinato, Giorgia; Musso, Nicolò; Castorina, Sergio; Rizzarelli, Enrico; Condorelli, Daniele Filippo
2016-08-01
Copper homeostasis and distribution is strictly regulated by a network of transporters and intracellular chaperones encoded by a group of genes collectively known as copper homeostasis genes (CHGs). In this work, analysis of The Cancer Genome Atlas database for somatic point mutations in colorectal cancer revealed that inactivating mutations are absent or extremely rare in CHGs. Using oligonucleotide microarrays, we found a strong increase in mRNA levels of the membrane copper transporter 1 protein [CTR1; encoded by the solute carrier family 31 member 1 gene (SLC31A1 gene)] in our series of colorectal carcinoma samples. CTR1 is the main copper influx transporter and changes in its expression are able to induce modifications of cellular copper accumulation. The increased SLC31A1 mRNA level is accompanied by a parallel increase in transcript levels for copper efflux pump ATP7A, copper metabolism Murr1 domain containing 1 (COMMD1), the cytochrome C oxidase assembly factors [synthesis of cytochrome c oxidase 1 (SCO1) and cytochrome c oxidase copper chaperone 11 (COX11)], the cupric reductase six transmembrane epithelial antigen of the prostate (STEAP3), and the metal-regulatory transcription factors (MTF1, MTF2) and specificity protein 1 (SP1). The significant correlation between SLC31A1, SCO1, and COX11 mRNA levels suggests that this transcriptional upregulation might be part of a coordinated program of gene regulation. Transcript-level upregulation of SLC31A1, SCO1, and COX11 was also confirmed by the analysis of different colon carcinoma cell lines (Caco-2, HT116, HT29) and cancer cell lines of different tissue origin (MCF7, PC3). Finally, exon-level expression analysis of SLC31A1 reveals differential expression of alternative transcripts in colorectal cancer and normal colonic mucosa. PMID:27516958
Deng, Yangyang; Parajuli, Prem B.
2011-08-10
Evaluation of economic feasibility of a bio-gasification facility needs understanding of its unit cost under different production capacities. The objective of this study was to evaluate the unit cost of syngas production at capacities from 60 through 1,800 Nm³/h using an economic model with three regression analysis techniques (simple regression, reciprocal regression, and log-log regression). The preliminary result of this study showed that the reciprocal regression analysis technique had the best fit curve between per unit cost and production capacity, with a sum of error squares (SES) lower than 0.001 and a coefficient of determination (R²) of 0.996. The regression analysis techniques determined the minimum unit cost of syngas production for micro-scale bio-gasification facilities of $0.052/Nm³, under the capacity of 2,880 Nm³/h. The results of this study suggest that to reduce cost, facilities should run at a high production capacity. In addition, the contribution of this technique could be the new categorical criterion to evaluate micro-scale bio-gasification facility from the perspective of economic analysis.
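The comparison of the three curve forms named in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the authors' economic model: the capacities and unit costs are synthetic values generated from a reciprocal curve purely so the example has something to fit, and the function names are hypothetical. Each form is fitted by ordinary least squares on transformed variables and then scored by the sum of squared errors on the original cost scale.

```python
import math

def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope

def sum_sq_error(predicted, observed):
    return sum((p - o) ** 2 for p, o in zip(predicted, observed))

# Synthetic data (assumption): unit cost follows cost = 0.05 + 6 / capacity.
capacity = [60.0, 120.0, 300.0, 600.0, 1200.0, 1800.0]
unit_cost = [0.05 + 6.0 / c for c in capacity]

errors = {}

# Simple regression: cost = a + b * capacity
a, b = fit_line(capacity, unit_cost)
errors["simple"] = sum_sq_error([a + b * c for c in capacity], unit_cost)

# Reciprocal regression: cost = a + b / capacity
a, b = fit_line([1.0 / c for c in capacity], unit_cost)
errors["reciprocal"] = sum_sq_error([a + b / c for c in capacity], unit_cost)

# Log-log regression: ln(cost) = a + b * ln(capacity)
a, b = fit_line([math.log(c) for c in capacity], [math.log(u) for u in unit_cost])
errors["log-log"] = sum_sq_error(
    [math.exp(a + b * math.log(c)) for c in capacity], unit_cost
)

best_form = min(errors, key=errors.get)
```

Because the synthetic data were generated from a reciprocal curve, that form wins here by construction; with real cost data the ranking has to be read from the errors.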
Regression Models for Demand Reduction based on Cluster Analysis of Load Profiles
Yamaguchi, Nobuyuki; Han, Junqiao; Ghatikar, Girish; Piette, Mary Ann; Asano, Hiroshi; Kiliccote, Sila
2009-06-28
This paper provides new regression models for demand reduction of Demand Response programs for the purpose of ex ante evaluation of the programs and screening for recruiting customer enrollment into the programs. The proposed regression models employ load sensitivity to outside air temperature and representative load pattern derived from cluster analysis of customer baseline load as explanatory variables. The performance of the proposed models was examined from the viewpoint of validity of explanatory variables and fitness of regressions, using actual load profile data of Pacific Gas and Electric Company's commercial and industrial customers who participated in the 2008 Critical Peak Pricing program including Manual and Automated Demand Response.
ERIC Educational Resources Information Center
Barringer, Mary S.
Researchers are becoming increasingly aware of the advantages of using multiple regression as opposed to analysis of variance (ANOVA) or analysis of covariance (ANCOVA). Multiple regression is more versatile and does not force the researcher to throw away variance by categorizing intervally scaled data. Polynomial regression analysis offers the…
Exploratory regression analysis: a tool for selecting models and determining predictor importance.
Braun, Michael T; Oswald, Frederick L
2011-06-01
Linear regression analysis is one of the most important tools in a researcher's toolbox for creating and testing predictive models. Although linear regression analysis indicates how strongly a set of predictor variables, taken together, will predict a relevant criterion (i.e., the multiple R), the analysis cannot indicate which predictors are the most important. Although there is no definitive or unambiguous method for establishing predictor variable importance, there are several accepted methods. This article reviews those methods for establishing predictor importance and provides a program (in Excel) for implementing them (available for direct download at http://dl.dropbox.com/u/2480715/ERA.xlsm?dl=1). The program investigates all 2^p - 1 submodels and produces several indices of predictor importance. This exploratory approach to linear regression, similar to other exploratory data analysis techniques, has the potential to yield both theoretical and practical benefits. PMID:21298571
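The exhaustive enumeration described in the abstract above can be sketched in a few lines. This is not the authors' Excel program: the helper names, the predictor names `x1` to `x3`, and the data are hypothetical, and the sketch reports only R² for each non-empty predictor subset rather than the importance indices the article reviews.

```python
from itertools import combinations

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def r_squared(columns, y):
    """R^2 of an OLS fit of y on the given columns plus an intercept."""
    design = [[1.0] + [col[i] for col in columns] for i in range(len(y))]
    k = len(design[0])
    xtx = [[sum(row[p] * row[q] for row in design) for q in range(k)] for p in range(k)]
    xty = [sum(row[p] * yi for row, yi in zip(design, y)) for p in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(bj * xj for bj, xj in zip(beta, row)) for row in design]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot

def all_submodels(predictors, y):
    """Map every non-empty subset of predictor names to its R^2."""
    names = sorted(predictors)
    return {
        subset: r_squared([predictors[name] for name in subset], y)
        for size in range(1, len(names) + 1)
        for subset in combinations(names, size)
    }

# Hypothetical data: y depends on x1 and x3 only.
predictors = {
    "x1": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "x2": [5.0, 3.0, 8.0, 1.0, 9.0, 2.0],
    "x3": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0],
}
y = [6.0, 5.0, 10.0, 9.0, 14.0, 13.0]
results = all_submodels(predictors, y)
```

The number of submodels doubles with each added predictor, so this brute-force approach is practical only for a modest number of predictors.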
Guidelines for the use of structural versus regression analysis in geomorphic studies
Osterkamp, W.R.; McNellis, Jesse M.; Jordan, Paul Robert
1978-01-01
Regression analysis is a useful curve-fitting technique, but it often is misapplied to geomorphic data sets. When error components can be identified for both variables, the statistical technique of structural analysis is preferred. If regression results are available, conversion to a structural analysis can be made either manually or by computer. Use of computer-generated data sets permits the construction of curves relating variation between regression and structural analyses to the range of data of the independent variable. The data have randomly imposed error components of specified standard deviation and a slope of the linear relation that simulates gradient-discharge relations of natural alluvial streams. The empirically developed curves can be used to determine the need for structural analysis of real geomorphic data sets. (Woodard-USGS)
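To make the distinction in the abstract above concrete, the sketch below fits a straight line when both variables carry error. It assumes the structural relation takes the form commonly called Deming regression, with a known ratio `delta` of the y-error variance to the x-error variance; the abstract does not say which formulation the authors use, and the function names and data are hypothetical.

```python
import math

def moments(x, y):
    """Means and sample (co)variances of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x) / (n - 1)
    syy = sum((yi - my) ** 2 for yi in y) / (n - 1)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / (n - 1)
    return mx, my, sxx, syy, sxy

def ols_fit(x, y):
    """Ordinary regression of y on x: all error is attributed to y."""
    mx, my, sxx, _, sxy = moments(x, y)
    slope = sxy / sxx
    return slope, my - slope * mx

def deming_fit(x, y, delta=1.0):
    """Errors-in-both-variables fit; delta = var(y error) / var(x error).

    Requires a nonzero sample covariance between x and y.
    """
    mx, my, sxx, syy, sxy = moments(x, y)
    diff = syy - delta * sxx
    slope = (diff + math.sqrt(diff * diff + 4.0 * delta * sxy * sxy)) / (2.0 * sxy)
    return slope, my - slope * mx

# Hypothetical paired measurements, both subject to error.
x_obs = [1.2, 1.9, 3.1, 3.8, 5.2, 6.1]
y_obs = [3.1, 5.2, 6.8, 9.1, 10.9, 13.2]
ols_slope, ols_intercept = ols_fit(x_obs, y_obs)
str_slope, str_intercept = deming_fit(x_obs, y_obs, delta=1.0)
```

As `delta` grows large (negligible error in x), the structural slope approaches the ordinary regression slope, which is one way to see when the simpler analysis is adequate.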
Criteria for the use of regression analysis for remote sensing of sediment and pollutants
NASA Technical Reports Server (NTRS)
Whitlock, C. H.; Kuo, C. Y.; Lecroy, S. R. (Principal Investigator)
1982-01-01
Data analysis procedures for quantification of water quality parameters that are already identified and are known to exist within the water body are considered. The linear multiple-regression technique was examined as a procedure for defining and calibrating data analysis algorithms for such instruments as spectrometers and multispectral scanners.
Partitioning Predicted Variance into Constituent Parts: A Primer on Regression Commonality Analysis.
ERIC Educational Resources Information Center
Amado, Alfred J.
Commonality analysis is a method of decomposing the R squared in a multiple regression analysis into the proportion of explained variance of the dependent variable associated with each independent variable uniquely and the proportion of explained variance associated with the common effects of one or more independent variables in various…
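For the two-predictor case, the decomposition described above can be sketched as follows. The data and function names are hypothetical. The sketch uses the standard identities: the unique contribution of a predictor is the full-model R² minus the R² of the model without it, and the common contribution is the sum of the single-predictor R² values minus the full-model R².

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    saa = sum((ai - ma) ** 2 for ai in a)
    sbb = sum((bi - mb) ** 2 for bi in b)
    sab = sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b))
    return sab / math.sqrt(saa * sbb)

def commonality_two_predictors(x1, x2, y):
    """Split the two-predictor R^2 into unique and common parts."""
    r_y1, r_y2, r_12 = pearson(x1, y), pearson(x2, y), pearson(x1, x2)
    r2_x1 = r_y1 ** 2  # R^2 with x1 alone
    r2_x2 = r_y2 ** 2  # R^2 with x2 alone
    # Closed form for R^2 with both predictors (requires |r_12| < 1).
    r2_full = (r2_x1 + r2_x2 - 2.0 * r_y1 * r_y2 * r_12) / (1.0 - r_12 ** 2)
    return {
        "unique_x1": r2_full - r2_x2,
        "unique_x2": r2_full - r2_x1,
        "common": r2_x1 + r2_x2 - r2_full,
        "total": r2_full,
    }

# Hypothetical data with two correlated predictors.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]
y = [3.0, 3.5, 7.0, 6.5, 11.0, 12.0]
parts = commonality_two_predictors(x1, x2, y)
```

The three parts always sum to the full-model R². The common part can be negative when one predictor acts as a suppressor, so it should not be read as a literal share of variance in every case.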
Regression Analysis of Physician Distribution to Identify Areas of Need: Some Preliminary Findings.
ERIC Educational Resources Information Center
Morgan, Bruce B.; And Others
A regression analysis was conducted of factors that help to explain the variance in physician distribution and which identify those factors that influence the maldistribution of physicians. Models were developed for different geographic areas to determine the most appropriate unit of analysis for the Western Missouri Area Health Education Center…
Modeling of retardance in ferrofluid with Taguchi-based multiple regression analysis
NASA Astrophysics Data System (ADS)
Lin, Jing-Fung; Wu, Jyh-Shyang; Sheu, Jer-Jia
2015-03-01
The citric acid (CA) coated Fe3O4 ferrofluids are prepared by a co-precipitation method and the magneto-optical retardance property is measured by a Stokes polarimeter. Optimization and multiple regression of retardance in ferrofluids are executed by combining Taguchi method and Excel. From the nine tests for four parameters, including pH of suspension, molar ratio of CA to Fe3O4, volume of CA, and coating temperature, influence sequence and excellent program are found. Multiple regression analysis and F-test on the significance of regression equation are performed. It is found that the model F value is much larger than Fcritical and significance level P <0.0001. So it can be concluded that the regression model has statistically significant predictive ability. Substituting excellent program into equation, retardance is obtained as 32.703°, higher than the highest value in tests by 11.4%.
Hu, W; Yu, X G; Wu, S; Tan, L P; Song, M R; Abdulahi, A Y; Wang, Z; Jiang, B; Li, G Q
2016-07-01
Ancylostoma ceylanicum is a common zoonotic nematode. Cats act as natural reservoirs of the hookworm and are involved in transmitting infection to humans, thus posing a potential risk to public health. The prevalence of feline A. ceylanicum in Guangzhou (South China) was surveyed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). In total, 112 faecal samples were examined; 34.8% (39/112) and 43.8% (49/112) samples were positive with hookworms by microscopy and PCR method, respectively. Among them, 40.8% of samples harboured A. ceylanicum. Twelve positive A. ceylanicum samples were selected randomly and used for cox 1 sequence analysis. Sequencing results revealed that they had 97-99% similarity with A. ceylanicum cox 1 gene sequences deposited in GenBank. A phylogenetic tree showed that A. ceylanicum isolates were divided into two groups: one comprising four isolates from Guangzhou (South China), and the other comprising those from Malaysia, Cambodia and Guangzhou. In the latter group, all A. ceylanicum isolates from Guangzhou were clustered into a minor group again. The results indicate that the high prevalence of A. ceylanicum in stray cats in South China poses a potential risk of hookworm transmission from pet cats to humans, and that A. ceylanicum may be a species complex worldwide. PMID:26123649
Trend Analysis of Cancer Mortality and Incidence in Panama, Using Joinpoint Regression Analysis
Politis, Michael; Higuera, Gladys; Chang, Lissette Raquel; Gomez, Beatriz; Bares, Juan; Motta, Jorge
2015-01-01
Abstract Cancer is one of the leading causes of death worldwide and its incidence is expected to increase in the future. In Panama, cancer is also one of the leading causes of death. In 1964, a nationwide cancer registry was started and it was restructured and improved in 2012. The aim of this study is to utilize Joinpoint regression analysis to study the trends of the incidence and mortality of cancer in Panama in the last decade. Cancer mortality was estimated from the Panamanian National Institute of Census and Statistics Registry for the period 2001 to 2011. Cancer incidence was estimated from the Panamanian National Cancer Registry for the period 2000 to 2009. The Joinpoint Regression Analysis program, version 4.0.4, was used to calculate trends by age-adjusted incidence and mortality rates for selected cancers. Overall, the trend of age-adjusted cancer mortality in Panama has declined over the last 10 years (−1.12% per year). The cancers for which there was a significant increase in the trend of mortality were female breast cancer and ovarian cancer; while the highest increases in incidence were shown for breast cancer, liver cancer, and prostate cancer. Significant decrease in the trend of mortality was evidenced for the following: prostate cancer, lung and bronchus cancer, and cervical cancer; with respect to incidence, only oral and pharynx cancer in both sexes had a significant decrease. Some cancers showed no significant trends in incidence or mortality. This study reveals contrasting trends in cancer incidence and mortality in Panama in the last decade. Although Panama is considered an upper middle income nation, this study demonstrates that some cancer mortality trends, like the ones seen in cervical and lung cancer, behave similarly to the ones seen in high income countries. In contrast, other types, like breast cancer, follow a pattern seen in countries undergoing a transition to a developed economy with its associated lifestyle, nutrition, and
Regression analysis in interlaboratory surveys: a case study with cholesterol and triglycerides.
Munster, D J; Lever, M; Walmsley, T A
1978-10-01
1. A new interlaboratory survey design, that uses regression analysis to compare results from each laboratory with target values, was tested using cholesterol and triglyceride analyses. The fifty New Zealand laboratories involved showed considerable interlaboratory variation (CV = 8% to 27% for cholesterol, 13% to 113% for triglycerides), 30% and 40% of which was associated with systematic differences between laboratories. 2. End-of-period summaries using regression analysis confirmed the presence of systematic errors. These were either simple types caused apparently by incorrect standardisation (regression slope, B not equal to 1.0) or inappropriate blank correction (intercept, A not equal to zero) or complex types presumably due to nonlinearity or nonspecificity. Graphical display of results from each laboratory aided fault diagnosis and allowed the detection of between-run standardisation differences. 3. Method comparison studies were made: the only highly significant result being lower precision achieved by enzymatic cholesterol methods compared with other colorimetric methods. PMID:729161
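The comparison of a laboratory's results with target values can be sketched as a straight-line fit, where a slope B different from 1.0 points to a standardisation error and an intercept A different from zero points to a blank-correction error. The numbers below are invented for illustration, and `compare_to_targets` is a hypothetical helper rather than the survey's actual procedure.

```python
import numpy as np

def compare_to_targets(target, reported):
    """Fit reported = A + B * target and return (B, A)."""
    slope, intercept = np.polyfit(target, reported, deg=1)
    return slope, intercept

# Hypothetical target values (e.g. mmol/L) for five survey samples.
target = np.array([3.0, 4.5, 6.0, 7.5, 9.0])
# A simulated laboratory reading 10% high with a constant offset of 0.20.
reported = 1.10 * target + 0.20

slope, intercept = compare_to_targets(target, reported)
```

With real survey data the fit would carry noise, so in practice the slope and intercept would be judged against their confidence intervals rather than compared exactly with 1.0 and zero.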
2014-01-01
Background and aim Altered glucose metabolism, oxidative stress, lipid levels and inflammatory markers are important risk factors in diabetes, cardiovascular, and many other diseases. Cocoa has been shown to exert antioxidant and anti-inflammatory effects. The aim of this study is twofold: to assess the effect of Cocoa on the lipid profile and peroxidation in addition to the inflammatory markers in type 2 diabetic patients, and to represent a virtual model of probable action mechanism of observed clinical effects of Cocoa consumption using in silico analysis and bioinformatics data. Methods One hundred subjects with type 2 diabetes were included in a randomized clinical control trial. Fifty treatment subjects received 10 grams cocoa powder and 10 grams milk powder dissolved in 250 ml of boiling water, and the other fifty control subjects received only 10 grams milk powder dissolved in 250 ml boiling water. Both groups were on the mentioned regimen twice daily for 6 weeks. Blood samples were obtained prior to Cocoa consumption and 6 weeks after intervention. Serum lipids and lipoproteins profile, malondialdehyde and inflammatory markers including tumor necrosis factor-α (TNF-α), interleukin-6 (IL-6) and high sensitive C-reactive protein (hs-CRP) were measured. For statistical analysis two independent and paired samples t-test and linear regression were used. Bioinformatics and virtual analysis were performed using string data base and Molegro virtual software. Results Cocoa consumption lowered blood cholesterol, triglyceride, LDL-cholesterol, and TNF-α, hs-CRP, IL-6 significantly (P < 0.01). The results showed that the levels of HDL-cholesterol decreased significantly (P < 0.05) but Cocoa inhibited lipid peroxidation in the treatment group compared with the control group (P < 0.0001). Virtual analysis showed that the most frequent Cocoa ingredients, (+)-Catechin and (−)-Epicatechin, can dock to the enzyme COX-2. Conclusion These data support the beneficial effect
Larsson, A
1997-08-01
The objective of this study was to investigate the conditions for regression analysis of data from equilibrium experiments. One important issue was to recognize that Kd and the binding site concentration (A) are not of equal nature, although both are parameters in the regression analysis. Whereas Kd approximates to a true constant, A is subject to experimental variation due to pipetting errors and in solid-phase experiments also to uneven coating properties. While recognizing that the ideal assumptions for ordinary regression analysis are poorly satisfied, different regression models were evaluated by extensive simulations. It was first established by a 'worst case' investigation that a limited error (8%) in the dependent variable is not critical for the results obtained at curve-fitting to Langmuir's equation. Seven different equations were compared for the calculation of data representing a solid-phase equilibrium experiment with statistical but no systematic errors. All the equations are rearrangements of the law of mass action. In this setting the Scatchard plot gave the best result, but also the double reciprocal and the Woolf plots worked well in weighted analysis. Langmuir's equation gave the best result of the 4 nonlinear regression models tested. The influence of one type of systematic error was also investigated. This assumed that 10% of the label was positioned on particles other than the functional ligand molecules. This systematic error was amplified, which resulted in a substantial bias. The calculated Kd-values varied slightly with the regression method used and were almost 24% too high in the best methods. PMID:9328576
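One of the linear rearrangements of the law of mass action mentioned here, the Scatchard plot, can be sketched as follows. Starting from Langmuir's equation, bound = A * free / (Kd + free), the ratio bound/free is a straight line in bound with slope -1/Kd and intercept A/Kd. The sketch is unweighted and noise-free, and the parameter values and the name `scatchard_fit` are assumptions made for illustration only.

```python
import numpy as np

def scatchard_fit(bound, free):
    """Estimate Kd and the binding site concentration A from a Scatchard plot.

    Fits bound/free = A/Kd - bound/Kd by unweighted least squares.
    """
    ratio = bound / free
    slope, intercept = np.polyfit(bound, ratio, deg=1)
    kd = -1.0 / slope
    sites = intercept * kd
    return kd, sites

# Hypothetical true parameters and free-ligand concentrations.
kd_true, sites_true = 2.0, 10.0
free = np.array([0.5, 1.0, 2.0, 4.0, 8.0, 16.0])
bound = sites_true * free / (kd_true + free)  # Langmuir's equation, no error added

kd_est, sites_est = scatchard_fit(bound, free)
```

Because both axes of a Scatchard plot contain the measured bound concentration, experimental error propagates into both variables, which is one reason the abstract evaluates weighted analysis and compares several rearrangements by simulation.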
Quantile regression for the statistical analysis of immunological data with many non-detects
2012-01-01
Background Immunological parameters are hard to measure. A well-known problem is the occurrence of values below the detection limit, the non-detects. Non-detects are a nuisance, because classical statistical analyses, like ANOVA and regression, cannot be applied. The more advanced statistical techniques currently available for the analysis of datasets with non-detects can only be used if a small percentage of the data are non-detects. Methods and results Quantile regression, a generalization of percentiles to regression models, models the median or higher percentiles and tolerates very high numbers of non-detects. We present a non-technical introduction and illustrate it with an implementation to real data from a clinical trial. We show that by using quantile regression, groups can be compared and that meaningful linear trends can be computed, even if more than half of the data consists of non-detects. Conclusion Quantile regression is a valuable addition to the statistical methods that can be used for the analysis of immunological datasets with non-detects. PMID:22769433
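The reason percentile-based methods tolerate non-detects can be shown with a small sketch: as long as the percentile of interest lies above the detection limit, its value does not depend on what is substituted for the non-detects, whereas the mean does. The data, the detection limit, and the function name below are hypothetical, and the sketch covers only a single group's percentile, not a full quantile regression model.

```python
import numpy as np

def quantile_with_nondetects(values, detection_limit, q, fill):
    """Replace values below the detection limit by `fill`, then take quantile q."""
    censored = np.where(values < detection_limit, fill, values)
    return np.quantile(censored, q)

# Synthetic skewed measurements (assumed data for illustration only).
rng = np.random.default_rng(1)
values = rng.lognormal(mean=0.0, sigma=1.0, size=401)
# About 60% of the values fall below this hypothetical detection limit.
lod = np.quantile(values, 0.60)

q75_true = np.quantile(values, 0.75)
q75_zero = quantile_with_nondetects(values, lod, 0.75, fill=0.0)
q75_half = quantile_with_nondetects(values, lod, 0.75, fill=lod / 2)

# The mean, by contrast, changes with the substituted value.
mean_zero = np.where(values < lod, 0.0, values).mean()
mean_half = np.where(values < lod, lod / 2, values).mean()
```

Here the 75th percentile is the same for both substitution rules and equals the percentile of the uncensored data, even though more than half of the values are non-detects.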
Family Background Variables as Instruments for Education in Income Regressions: A Bayesian Analysis
ERIC Educational Resources Information Center
Hoogerheide, Lennart; Block, Joern H.; Thurik, Roy
2012-01-01
The validity of family background variables instrumenting education in income regressions has been much criticized. In this paper, we use data from the 2004 German Socio-Economic Panel and Bayesian analysis to analyze to what degree violations of the strict validity assumption affect the estimation results. We show that, in case of moderate direct…
ERIC Educational Resources Information Center
Preacher, Kristopher J.; Curran, Patrick J.; Bauer, Daniel J.
2006-01-01
Simple slopes, regions of significance, and confidence bands are commonly used to evaluate interactions in multiple linear regression (MLR) models, and the use of these techniques has recently been extended to multilevel or hierarchical linear modeling (HLM) and latent curve analysis (LCA). However, conducting these tests and plotting the…
Factor Regression Analysis: A New Method for Weighting Predictors. Final Report.
ERIC Educational Resources Information Center
Curtis, Ervin W.
The optimum weighting of variables to predict a dependent-criterion variable is an important problem in nearly all of the social and natural sciences. Although the predominant method, multiple regression analysis (MR), yields optimum weights for the sample at hand, these weights are not generally optimum in the population from which the sample was…
Multiple Logistic Regression Analysis of Cigarette Use among High School Students
ERIC Educational Resources Information Center
Adwere-Boamah, Joseph
2011-01-01
A binary logistic regression analysis was performed to predict high school students' cigarette smoking behavior from selected predictors from 2009 CDC Youth Risk Behavior Surveillance Survey. The specific target student behavior of interest was frequent cigarette use. Five predictor variables included in the model were: a) race, b) frequency of…
Isolating the Effects of Training Using Simple Regression Analysis: An Example of the Procedure.
ERIC Educational Resources Information Center
Waugh, C. Keith
This paper provides a case example of simple regression analysis, a forecasting procedure used to isolate the effects of training from an identified extraneous variable. This case example focuses on results of a three-day sales training program to improve bank loan officers' knowledge, skill-level, and attitude regarding solicitation and sale of…
ERIC Educational Resources Information Center
Campbell, S. Duke; Greenberg, Barry
The development of a predictive equation capable of explaining a significant percentage of enrollment variability at Florida International University is described. A model utilizing trend analysis and a multiple regression approach to enrollment forecasting was adapted to investigate enrollment dynamics at the university. Four independent…
Catching up with Harvard: Results from Regression Analysis of World Universities League Tables
ERIC Educational Resources Information Center
Li, Mei; Shankar, Sriram; Tang, Kam Ki
2011-01-01
This paper uses regression analysis to test if the universities performing less well according to Shanghai Jiao Tong University's world universities league tables are able to catch up with the top performers, and to identify national and institutional factors that could affect this catching up process. We have constructed a dataset of 461…
Ultrasound-enhanced bioscouring of greige cotton: regression analysis of process factors
Technology Transfer Automated Retrieval System (TEKTRAN)
Process factors of enzyme concentration, time, power and frequency were investigated for ultrasound-enhanced bioscouring of greige cotton. A fractional factorial experimental design and subsequent regression analysis of the process factors were employed to determine the significance of each factor a...
Passing the Test: Ecological Regression Analysis in the Los Angeles County Case and Beyond.
ERIC Educational Resources Information Center
Lichtman, Allan J.
1991-01-01
Statistical analysis of racially polarized voting prepared for the Garza v County of Los Angeles (California) (1990) voting rights case is reviewed to demonstrate that ecological regression is a flexible, robust technique that illuminates the reality of ethnic voting, and superior to the neighborhood model supported by the defendants. (SLD)
Some Classroom Experiences in the Teaching of Empirical Model Building and Regression Analysis.
ERIC Educational Resources Information Center
Utter, Merlin; Wilkinson, John W.
The use of the digital computer for the presentation of the topics of empirical model building and regression analysis is discussed. The author concentrates upon a description of computing exercises which are employed to provide the students with experience in model building and evaluation in a controlled situation. The types of exercises given…
Predictive Discriminant Analysis Versus Logistic Regression in Two-Group Classification Problems.
ERIC Educational Resources Information Center
Meshbane, Alice; Morris, John D.
A method for comparing the cross-validated classification accuracies of predictive discriminant analysis and logistic regression classification models is presented under varying data conditions for the two-group classification problem. With this method, separate-group, as well as total-sample proportions of the correct classifications, can be…
What Satisfies Students?: Mining Student-Opinion Data with Regression and Decision Tree Analysis
ERIC Educational Resources Information Center
Thomas, Emily H.; Galambos, Nora
2004-01-01
To investigate how students' characteristics and experiences affect satisfaction, this study uses regression and decision tree analysis with the CHAID algorithm to analyze student-opinion data. A data mining approach identifies the specific aspects of students' university experience that most influence three measures of general satisfaction. The…
Using Refined Regression Analysis To Assess The Ecological Services Of Restored Wetlands
A hierarchical approach to regression analysis of wetland water treatment was conducted to determine which factors are the most appropriate for characterizing wetlands of differing structure and function. We used this approach in an effort to identify the types and characteristi...
A use of regression analysis in acoustical diagnostics of gear drives
NASA Technical Reports Server (NTRS)
Balitskiy, F. Y.; Genkin, M. D.; Ivanova, M. A.; Kobrinskiy, A. A.; Sokolova, A. G.
1973-01-01
A study is presented of components of the vibration spectrum as the filtered first and second harmonics of the tooth frequency which permits information to be obtained on the physical characteristics of the vibration excitation process, and an approach to be made to comparison of models of the gearing. Regression analysis of two random processes has shown a strong dependence of the second harmonic on the first, and independence of the first from the second. The nature of change in the regression line, with change in loading moment, gives rise to the idea of a variable phase shift between the first and second harmonics.
Regression analysis of non-contact acousto-thermal signature data
NASA Astrophysics Data System (ADS)
Criner, Amanda; Schehl, Norman
2016-05-01
The non-contact acousto-thermal signature (NCATS) is a nondestructive evaluation technique with potential to detect fatigue in materials such as noisy titanium and polymer matrix composites. The underlying physical mechanisms and properties may be determined by parameter estimation via nonlinear regression. The nonlinear regression analysis formulation, including the underlying models, is discussed. Several models and associated data analyses are given along with the assumptions implicit in the underlying model. The results are anomalous. These anomalous results are evaluated with respect to the accuracy of the implicit assumptions.
Detrended fluctuation analysis as a regression framework: Estimating dependence at different scales
NASA Astrophysics Data System (ADS)
Kristoufek, Ladislav
2015-02-01
We propose a framework combining detrended fluctuation analysis with standard regression methodology. The method is built on detrended variances and covariances and it is designed to estimate regression parameters at different scales and under potential nonstationarity and power-law correlations. The former feature allows for distinguishing between effects for a pair of variables from different temporal perspectives. The latter ones make the method a significant improvement over the standard least squares estimation. Theoretical claims are supported by Monte Carlo simulations. The method is then applied on selected examples from physics, finance, environmental science, and epidemiology. For most of the studied cases, the relationship between variables of interest varies strongly across scales.
NASA Astrophysics Data System (ADS)
Pradhan, B.; Buchroithner, M. F.; Mansor, S.
2009-04-01
This paper presents the assessment results of three spatially based probabilistic models using Geoinformation Techniques (GIT) for landslide susceptibility analysis at Penang Island in Malaysia. Landslide locations within the study area were identified by interpreting aerial photographs and satellite images, supported by field surveys. Maps of the topography, soil type, lineaments and land cover were constructed from the spatial data sets. Nine landslide-related factors were extracted from the spatial database, and the neural network, frequency ratio and logistic regression coefficients of each factor were computed. Landslide susceptibility maps were drawn for the study area using neural network, frequency ratio and logistic regression models. For verification, the results of the analyses were compared with actual landslide locations in the study area. The verification results show that the frequency ratio model provides higher prediction accuracy than the ANN and regression models.
Wang, Wen-Cheng; Cho, Wen-Chien; Chen, Yin-Jen
2014-01-01
It is estimated that mainland Chinese tourists travelling to Taiwan can bring annual revenues of 400 billion NTD to the Taiwan economy. Thus, how the Taiwanese Government formulates relevant measures to satisfy both sides is the focus of most concern. Taiwan must improve the facilities and service quality of its tourism industry so as to attract more mainland tourists. This paper conducted a questionnaire survey of mainland tourists and used grey relational analysis in grey mathematics to analyze the satisfaction performance of all satisfaction question items. The first eight satisfaction items were used as independent variables, and the overall satisfaction performance was used as a dependent variable for quantile regression model analysis to discuss the relationship between the dependent variable under different quantiles and independent variables. Finally, this study further discussed the predictive accuracy of the least mean regression model and each quantile regression model, as a reference for research personnel. The analysis results showed that other variables could also affect the overall satisfaction performance of mainland tourists, in addition to occupation and age. The overall predictive accuracy of quantile regression model Q0.25 was higher than that of the other three models. PMID:24574916
NASA Technical Reports Server (NTRS)
Rummler, D. R.
1976-01-01
The results are presented of investigations to apply regression techniques to the development of methodology for creep-rupture data analysis. Regression analysis techniques are applied to the explicit description of the creep behavior of materials for space shuttle thermal protection systems. A regression analysis technique is compared with five parametric methods for analyzing three simulated and twenty real data sets, and a computer program for the evaluation of creep-rupture data is presented.
The estimation of Aerosol Optical Depth in eastern China based on regression analysis
NASA Astrophysics Data System (ADS)
Wang, Jing; Shi, Runhe; Liu, Chaoshun; Zhou, Cong
2015-09-01
Atmospheric pollution and air quality issues are getting worse in China, and the formation mechanism of aerosols and their environmental effects have attracted more and more attention. Aerosol Optical Depth (AOD) is one of the most important parameters indicating atmospheric turbidity and aerosol load. High-quality AOD data are significant for the study of the atmospheric environment (i.e., air quality). This paper used MODIS/Terra AOD in 2008 to improve the coverage of MODIS/Aqua AOD, based on a linear regression analysis model. The RMSE between the estimated value and the Aqua AOD detected by satellite is 0.132. The average value of the test data was 0.812. The average of the regression result was 0.807. This showed that the regression model between AODTerra and AODAqua worked well. Also, we built two sets of estimation models (MODIS AOD and OMI AOD) through a stepwise regression analysis model. One uses OMI AOD and meteorological elements to estimate MODIS AOD. The RMSE was 0.113, which represents 13.916% of the average (R2=0.782). The other uses MODIS AOD and meteorological elements to estimate OMI AOD. The RMSE of the model is 0.132, which represents 18.182% of the average (R2=0.726).
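The error figures quoted in this abstract combine an RMSE with its size relative to a mean value. A minimal sketch of both calculations follows; the function names are hypothetical. Dividing the reported RMSE of 0.113 by a mean of 0.812 reproduces the quoted 13.916%, which suggests, as an inference not stated in the abstract, that the same test-set mean was used as the reference.

```python
import numpy as np

def rmse(observed, estimated):
    """Root-mean-square error between observed and estimated values."""
    observed = np.asarray(observed, dtype=float)
    estimated = np.asarray(estimated, dtype=float)
    return float(np.sqrt(np.mean((observed - estimated) ** 2)))

def relative_rmse_percent(rmse_value, mean_value):
    """Express an RMSE as a percentage of a reference mean."""
    return 100.0 * rmse_value / mean_value

# Figures taken from the abstract; pairing them is an assumption.
reported_percent = relative_rmse_percent(0.113, 0.812)

# Tiny made-up example of the RMSE itself.
example_rmse = rmse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
```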
A deformation analysis method of stepwise regression for bridge deflection prediction
NASA Astrophysics Data System (ADS)
Shen, Yueqian; Zeng, Ying; Zhu, Lei; Huang, Teng
2015-12-01
Large-scale bridges are among the most important infrastructures, and their safe condition concerns people's daily activities and life safety. Monitoring of large-scale bridges is crucial since deformation might have occurred. How to obtain the deformation information and then judge the safe condition are the key and difficult problems in the bridge deformation monitoring field. Deflection is an important index for the evaluation of bridge safety. This paper proposes a forecasting model based on stepwise regression analysis. Based on the deflection monitoring data of the Yangtze River Bridge, the main factors influencing deflection deformation are studied. The authors use the monitoring data to forecast the deformation value of the bridge deflection at different times from the perspective of non-bridge structure, and compare the results to the forecasting of gray relational analysis based on linear regression. The results show that the accuracy and reliability of stepwise regression analysis are high, which provides a scientific basis for bridge operation management. Above all, the ideas of this research provide an effective method for bridge deformation analysis.
Lo, Benjamin W. Y.; Fukuda, Hitoshi; Angle, Mark; Teitelbaum, Jeanne; Macdonald, R. Loch; Farrokhyar, Forough; Thabane, Lehana; Levine, Mitchell A. H.
2016-01-01
Background: Classification and regression tree analysis involves the creation of a decision tree by recursive partitioning of a dataset into more homogeneous subgroups. Thus far, there is scarce literature on using this technique to create clinical prediction tools for aneurysmal subarachnoid hemorrhage (SAH). Methods: The classification and regression tree analysis technique was applied to the multicenter Tirilazad database (3551 patients) in order to create the decision-making algorithm. In order to elucidate prognostic subgroups in aneurysmal SAH, neurologic, systemic, and demographic factors were taken into account. The dependent variable used for analysis was the dichotomized Glasgow Outcome Score at 3 months. Results: Classification and regression tree analysis revealed seven prognostic subgroups. Neurological grade, occurrence of post-admission stroke, occurrence of post-admission fever, and age represented the explanatory nodes of this decision tree. Split sample validation revealed classification accuracy of 79% for the training dataset and 77% for the testing dataset. In addition, the occurrence of fever at 1-week post-aneurysmal SAH is associated with increased odds of post-admission stroke (odds ratio: 1.83, 95% confidence interval: 1.56–2.45, P < 0.01). Conclusions: A clinically useful classification tree was generated, which serves as a prediction tool to guide bedside prognostication and clinical treatment decision making. This prognostic decision-making algorithm also shed light on the complex interactions between a number of risk factors in determining outcome after aneurysmal SAH. PMID:27512607
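The recursive partitioning step at the heart of classification and regression tree analysis can be sketched for a single numeric predictor and a binary outcome. The sketch below searches for the threshold that minimises the weighted Gini impurity of the two resulting subgroups; a full tree would repeat this search within each subgroup. The ages and outcomes are invented, and the function names are hypothetical rather than part of the cited study.

```python
import numpy as np

def gini(labels):
    """Gini impurity of a set of 0/1 labels."""
    if len(labels) == 0:
        return 0.0
    p = np.mean(labels)
    return 2.0 * p * (1.0 - p)

def best_split(x, y):
    """Return (threshold, weighted_gini) for the best split on one predictor."""
    order = np.argsort(x)
    x, y = x[order], y[order]
    best = (None, np.inf)
    for i in range(1, len(x)):
        if x[i] == x[i - 1]:
            continue  # no threshold can separate identical values
        threshold = (x[i] + x[i - 1]) / 2.0
        left, right = y[:i], y[i:]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
        if score < best[1]:
            best = (threshold, score)
    return best

# Hypothetical patients: age in years and a dichotomized poor-outcome flag.
age = np.array([35, 42, 50, 58, 63, 70, 77, 81], dtype=float)
poor_outcome = np.array([0, 0, 0, 0, 1, 1, 1, 1])

threshold, impurity = best_split(age, poor_outcome)
```

In this toy data set the outcome separates perfectly at an age of 60.5 years, so the impurity after the split is zero; real clinical data would rarely split so cleanly, and a stopping or pruning rule would be needed to avoid overfitting.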
Gao, Jun; Lavergne, M. Ruth; McIntyre, Paul
2013-01-01
Classification and regression tree (CART) analysis was used to identify subpopulations with lower palliative care program (PCP) enrolment rates. CART analysis uses recursive partitioning to group predictors. The PCP enrolment rate was 72 percent for the 6,892 adults who died of cancer from 2000 to 2005 in two counties in Nova Scotia, Canada. The lowest PCP enrolment rates were for nursing home residents over 82 years (27 percent), a group residing more than 43 kilometres from the PCP (31 percent), and another group living less than two weeks after their cancer diagnosis (37 percent). The highest rate (86 percent) was for the 2,118 persons who received palliative radiation. Findings from multiple logistic regression (MLR) were provided for comparison. CART findings identified low PCP enrolment subpopulations that were defined by interactions among demographic, social, medical, and health system predictors. PMID:21805944
Forecasting municipal solid waste generation using prognostic tools and regression analysis.
Ghinea, Cristina; Drăgoi, Elena Niculina; Comăniţă, Elena-Diana; Gavrilescu, Marius; Câmpean, Teofil; Curteanu, Silvia; Gavrilescu, Maria
2016-11-01
For adequate planning of waste management systems, accurate forecasting of waste generation is an essential step, since various factors can affect waste trends. Predictive and prognosis models are useful tools that provide reliable support for decision making processes. In this paper some indicators such as: number of residents, population age, urban life expectancy, total municipal solid waste were used as input variables in prognostic models in order to predict the amount of solid waste fractions. We applied Waste Prognostic Tool, regression analysis and time series analysis to forecast municipal solid waste generation and composition by considering the Iasi, Romania case study. Regression equations were determined for six solid waste fractions (paper, plastic, metal, glass, biodegradable and other waste). Accuracy measures were calculated and the results showed that the S-curve trend model is the most suitable for municipal solid waste (MSW) prediction. PMID:27454099
Inhibition of cyclooxygenase (COX)-2 affects endothelial progenitor cell proliferation
Colleselli, Daniela; Bijuklic, Klaudija; Mosheimer, Birgit A.; Kaehler, Christian M. . E-mail: C.M.Kaehler@uibk.ac.at
2006-09-10
Growing evidence indicates that inducible cyclooxygenase-2 (COX-2) is involved in the pathogenesis of inflammatory disorders and various types of cancer. Endothelial progenitor cells recruited from the bone marrow have been shown to be involved in the formation of new vessels in malignancies and discussed for being a key point in tumour progression and metastasis. However, until now, nothing is known about an interaction between COX and endothelial progenitor cells (EPC). Expression of COX-1 and COX-2 was detected by semiquantitative RT-PCR and Western blot. Proliferation kinetics, cell cycle distribution and rate of apoptosis were analysed by MTT test and FACS analysis. Further analyses revealed an implication of Akt phosphorylation and caspase-3 activation. Both COX-1 and COX-2 expression can be found in bone-marrow-derived endothelial progenitor cells in vitro. COX-2 inhibition leads to a significant reduction in proliferation of endothelial progenitor cells by an increase in apoptosis and cell cycle arrest. COX-2 inhibition leads further to an increased cleavage of caspase-3 protein and inversely to inhibition of Akt activation. Highly proliferating endothelial progenitor cells can be targeted by selective COX-2 inhibition in vitro. These results indicate that upcoming therapy strategies in cancer patients targeting COX-2 may be effective in inhibiting tumour vasculogenesis as well as angiogenic processes.
Genetic analysis of tolerance to infections using random regressions: a simulation study.
Kause, Antti
2011-08-01
Tolerance to infections is the ability of a host to limit the impact of a given pathogen burden on host performance. This simulation study demonstrated the merit of using random regressions to estimate unbiased genetic variances for tolerance slope and its genetic correlations with other traits, which could not be obtained using the previously implemented statistical methods. Genetic variance in tolerance was estimated as genetic variance in regression slopes of host performance along an increasing pathogen burden level. Random regressions combined with covariance functions allowed genetic variance for host performance to be estimated at any point along the pathogen burden trajectory, providing a novel means to analyse infection-induced changes in genetic variation of host performance. Yet, the results implied that decreasing family size as well as a non-zero environmental or genetic correlation between initial host performance before infection and pathogen burden led to biased estimates for tolerance genetic variance. In both cases, genetic correlation between tolerance slope and host performance in a pathogen-free environment became artificially negative, implying a genetic trade-off when it did not exist. Moreover, recording a normally distributed pathogen burden as a threshold trait is not a realistic way of obtaining unbiased estimates for tolerance genetic variance. The results show that random regressions are suitable for the genetic analysis of tolerance, given suitable data structure collected either under field or experimental conditions. PMID:21767462
Augmented kludge waveforms and Gaussian process regression for EMRI data analysis
NASA Astrophysics Data System (ADS)
Chua, Alvin J. K.
2016-05-01
Extreme-mass-ratio inspirals (EMRIs) will be an important type of astrophysical source for future space-based gravitational-wave detectors. There is a trade-off between accuracy and computational speed for the EMRI waveform templates required in the analysis of data from these detectors. We discuss how the systematic error incurred by using faster templates may be reduced with improved models such as augmented kludge waveforms, and marginalised over with statistical techniques such as Gaussian process regression.
Non-Stationary Hydrologic Frequency Analysis using B-Splines Quantile Regression
NASA Astrophysics Data System (ADS)
Nasri, B.; St-Hilaire, A.; Bouezmarni, T.; Ouarda, T.
2015-12-01
Hydrologic frequency analysis is commonly used by engineers and hydrologists to provide the basic information on planning, design and management of hydraulic structures and water resources system under the assumption of stationarity. However, with increasing evidence of changing climate, it is possible that the assumption of stationarity would no longer be valid and the results of conventional analysis would become questionable. In this study, we consider a framework for frequency analysis of extreme flows based on B-Splines quantile regression, which allows non-stationary data that depend on covariates to be modelled. Such covariates may have linear or nonlinear dependence. A Markov Chain Monte Carlo (MCMC) algorithm is used to estimate quantiles and their posterior distributions. A coefficient of determination for quantile regression is proposed to evaluate the estimation of the proposed model for each quantile level. The method is applied to annual maximum and minimum streamflow records in Ontario, Canada. Climate indices are considered to describe the non-stationarity in these variables and to estimate the quantiles in this case. The results show large differences between the non-stationary quantiles and their stationary equivalents for annual maximum and minimum discharge with high annual non-exceedance probabilities. Keywords: Quantile regression, B-Splines functions, MCMC, Streamflow, Climate indices, non-stationarity.
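Quantile regression is usually built on the check (pinball) loss. The sketch below is a simplified illustration, not the MCMC procedure of the study: it shows that the constant minimising the average check loss is an empirical quantile. In a B-spline quantile regression the constant would be replaced by a spline function of the covariates. The function name and the flow values are hypothetical.

```python
def pinball_loss(y, q, tau):
    """Average check (pinball) loss of predicting the constant q for the data y."""
    total = 0.0
    for value in y:
        residual = value - q
        # Under-prediction is weighted by tau, over-prediction by (1 - tau).
        total += tau * residual if residual >= 0 else (tau - 1.0) * residual
    return total / len(y)


# Hypothetical annual maximum flows (arbitrary units).
flows = [120.0, 95.0, 150.0, 110.0, 180.0, 130.0, 105.0, 160.0, 140.0, 125.0]
tau = 0.75

# Search the observed values for the constant with the smallest loss.
best_q = min(flows, key=lambda q: pinball_loss(flows, q, tau))
print(best_q)
```

With ten observations and tau = 0.75 the minimiser is the eighth smallest value, which is why the asymmetric weighting yields a quantile rather than a mean.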
Repeated-measures regression designs and analysis for environmental effects monitoring programs
NASA Astrophysics Data System (ADS)
Paine, Michael D.; Skinner, Marc A.; Kilgour, Bruce W.; DeBlois, Elisabeth M.; Tracy, Ellen
2014-12-01
This paper provides a general overview of repeated-measures (RM) regression designs and analysis for marine monitoring programs, in support of sediment chemistry, particle size and benthic macroinvertebrate community analyses provided as part of this series. In RM regression designs, the same n replicates (usually stations in monitoring programs) are re-sampled (i.e., repeatedly measured) at t>1 Times (usually years). The stations provide variation in the predictor, or X variables. In the Terra Nova environmental effects monitoring (EEM) program, n=48 stations were sampled in each of t=7 years from 2000 to 2010. Two distance measures from five drill centres (sources of drilling wastes) were fixed predictor variables. RM regression designs are rarely used in environmental monitoring programs, but are often suitable and would be appropriate if applied to data from many monitoring programs. For the Terra Nova EEM program, carry-over effects, or persistent and usually small-scale variations among stations unrelated to distance, were strong for most sediment quality variables. Whenever natural carry-over effects are strong, RM designs and analysis will usually be more powerful and suitable than alternative approaches to the analysis.
Gulbransen, Dana J; McGlathery, Karen J; Marklund, Maria; Norris, James N; Gurgel, Carlos Frederico D
2012-10-01
Gracilaria vermiculophylla (Ohmi) Papenfuss is an invasive alga that is native to Southeast Asia and has invaded many estuaries in North America and Europe. It is difficult to differentiate G. vermiculophylla from native forms using morphology and therefore molecular techniques are needed. In this study, we used three molecular markers (rbcL, cox2-cox3 spacer, cox1) to identify G. vermiculophylla at several locations in the western Atlantic. RbcL and cox2-cox3 spacer markers confirmed the presence of G. vermiculophylla on the east coast of the USA from Massachusetts to South Carolina. We used a 507 base pair region of cox1 mtDNA to (i) verify the widespread distribution of G. vermiculophylla in the Virginia (VA) coastal bays and (ii) determine the intraspecific diversity of these algae. Cox1 haplotype richness in the VA coastal bays was much higher than that previously found in other invaded locations, as well as some native locations. This difference is likely attributed to the more intensive sampling design used in this study, which was able to detect richness created by multiple, diverse introductions. On the basis of our results, we recommend that future studies take differences in sampling design into account when comparing haplotype richness and diversity between native and non-native studies in the literature. PMID:27011285
Raharjo, Sentot Joko; Mahdi, Chanif; Nurdiana, Nurdiana; Kikuchi, Takheshi; Fatchiyah, Fatchiyah
2014-01-01
To understand the structural features that dictate the selectivity of the two isoforms of the prostaglandin H2 synthase (PGHS/COX), the three-dimensional (3D) structure of COX-1/COX-2 was assessed by means of binding energy calculations from virtual molecular dynamics using alpha-Patchouli alcohol isomers as ligands. Molecular interaction studies with COX-1 and COX-2 were performed with the molecular docking tool Hex 8.0. Interactions were further visualized using the Discovery Studio Client 3.5 software tool. The binding energy of molecular interaction was calculated by AMBER12 and Virtual Molecular Dynamic 1.9.1 software. The analysis suggested that all alpha-Patchouli alcohol isomers are inhibitors of COX-1 and COX-2. Collectively, the binding energy scoring (with the PBSA solvent model) suggested the alpha-Patchouli alcohol isomers CID442384, CID6432585, CID3080622, CID10955174, and CID56928117 as candidate selective COX-1 inhibitors and CID521903 as a nonselective COX-1/COX-2 inhibitor. PMID:25484897
GENE-LEVEL PHARMACOGENETIC ANALYSIS ON SURVIVAL OUTCOMES USING GENE-TRAIT SIMILARITY REGRESSION
Tzeng, Jung-Ying; Lu, Wenbin; Hsu, Fang-Chi
2014-01-01
Gene/pathway-based methods are drawing significant attention due to their usefulness in detecting rare and common variants that affect disease susceptibility. The biological mechanism of drug responses indicates that a gene-based analysis has even greater potential in pharmacogenetics. Motivated by a study from the Vitamin Intervention for Stroke Prevention (VISP) trial, we develop a gene-trait similarity regression for survival analysis to assess the effect of a gene or pathway on time-to-event outcomes. The similarity regression has a general framework that covers a range of survival models, such as the proportional hazards model and the proportional odds model. The inference procedure developed under the proportional hazards model is robust against model misspecification. We derive the equivalence between the similarity survival regression and a random effects model, which further unifies the current variance-component based methods. We demonstrate the effectiveness of the proposed method through simulation studies. In addition, we apply the method to the VISP trial data to identify the genes that exhibit an association with the risk of a recurrent stroke. TCN2 gene was found to be associated with the recurrent stroke risk in the low-dose arm. This gene may impact recurrent stroke risk in response to cofactor therapy. PMID:25018788
Yao, Yan; Wang, Chang-yue; Liu, Hui-jun; Tang, Jian-bin; Cai, Jin-hui; Wang, Jing-jun
2015-07-01
Forest bio-fuel, a new type of renewable energy, has attracted increasing attention as a promising alternative. In this study, a new method called Sparse Partial Least Squares Regression (SPLS) is used, in combination with near-infrared (NIR) spectroscopy, to construct a proximate analysis model of the fuel characteristics of sawdust. Moisture, ash, volatile matter and fixed carbon percentages of 80 samples were measured by traditional proximate analysis. Spectroscopic data were collected with a Nicolet NIR spectrometer. After filtering by wavelet transform, the samples were divided into a training set and a validation set according to sample category and producing area. SPLS, Principal Component Regression (PCR), Partial Least Squares Regression (PLS) and Least Absolute Shrinkage and Selection Operator (LASSO) were used to construct prediction models. The results indicated that SPLS can select grouped wavelengths and improve prediction performance. The absorption peaks of moisture are covered by the selected wavelengths, while those of the other components have not yet been confirmed. In short, SPLS can reduce the dimensionality of complex data sets and interpret the relationship between spectroscopic data and composition concentration, and it will play an increasingly important role in the field of NIR applications. PMID:26717741
Analysis of ontogenetic spectra of populations of plants and lichens via ordinal regression
NASA Astrophysics Data System (ADS)
Sofronov, G. Yu.; Glotov, N. V.; Ivanov, S. M.
2015-03-01
Ontogenetic spectra of plants and lichens tend to vary across the populations. This means that if several subsamples within a sample (or a population) were collected, then the subsamples would not be homogeneous. Consequently, the statistical analysis of the aggregated data would not be correct, which could potentially lead to false biological conclusions. In order to take into account the heterogeneity of the subsamples, we propose to use ordinal regression, which is a type of generalized linear regression. In this paper, we study the populations of cowberry Vaccinium vitis-idaea L. and epiphytic lichens Hypogymnia physodes (L.) Nyl. and Pseudevernia furfuracea (L.) Zopf. We obtain estimates for the proportions of between-sample variability in the total variability of the ontogenetic spectra of the populations.
NASA Astrophysics Data System (ADS)
Dervilis, N.; Worden, K.; Cross, E. J.
2015-07-01
In the data-based approach to structural health monitoring (SHM), the absence of data from damaged structures in many cases forces a dependence on novelty detection as a means of diagnosis. Unfortunately, this means that benign variations in the operating or environmental conditions of the structure must be handled very carefully, lest they lead to false alarms. If novelty detection is implemented in terms of outlier detection, the outliers may arise in the data as the result of both benign and malign causes and it is important to understand their sources. Comparatively recent developments in the field of robust regression have the potential to provide ways of exploring and visualising SHM data as a means of shedding light on the different origins of outliers. The current paper will illustrate the use of robust regression for SHM data analysis through experimental data acquired from the Z24 and Tamar Bridges, although the methods are general and not restricted to SHM or civil infrastructure.
Cao, Han-Han; Du, Ruo-Fei; Yang, Jia-Ning; Feng, Yi
2014-03-01
In this paper, microcrystalline cellulose WJ101 was used as a model material to investigate the effect of individual process parameters on granule yield and friability after dry granulation in single-factor experiments, as well as the combined effect of the process parameters, and the correlation between process parameters and granule quality was then established. Regression equations relating the process parameters to granule yield and friability were established by multiple regression analysis; the order of influence of the process parameters on granule yield and friability was: roller speed > roller pressure > horizontal feed speed. Granule yield was positively correlated with roller pressure and horizontal feed speed and negatively correlated with roller speed, while the opposite held for friability. Fitted and measured values were essentially the same, with no significant differences (P > 0.05), indicating high precision and reliability. PMID:24961115
NASA Astrophysics Data System (ADS)
Urrutia, J. D.; Bautista, L. A.; Baccay, E. B.
2014-04-01
The aim of this study was to develop mathematical models for estimating earthquake casualties such as death, number of injured persons, affected families and total cost of damage. Regression models were built to quantify the direct damage from earthquakes to human beings and property given the magnitude, intensity, depth of focus, location of epicentre and time duration. The researchers formulated the models through regression analysis using matrices and used α = 0.01. The study considered thirty destructive earthquakes that hit the Philippines from 1968 to 2012 inclusive. Relevant data about these earthquakes were obtained from the Philippine Institute of Volcanology and Seismology. Data on damages and casualties were gathered from the records of the National Disaster Risk Reduction and Management Council. The mathematical models made are as follows: This study will be of great value in emergency planning, initiating and updating programs for earthquake hazard reduction in the Philippines, which is an earthquake-prone country.
Alados, C.L.; Pueyo, Y.; Giner, M.L.; Navarro, T.; Escos, J.; Barroso, F.; Cabezudo, B.; Emlen, J.M.
2003-01-01
We studied the effect of grazing on the degree of regression of successional vegetation dynamic in a semi-arid Mediterranean matorral. We quantified the spatial distribution patterns of the vegetation by fractal analyses, using the fractal information dimension and spatial autocorrelation measured by detrended fluctuation analyses (DFA). It is the first time that fractal analysis of plant spatial patterns has been used to characterize the regressive ecological succession. Plant spatial patterns were compared over a long-term grazing gradient (low, medium and heavy grazing pressure) and on ungrazed sites for two different plant communities: A middle dense matorral of Chamaerops and Periploca at Sabinar-Romeral and a middle dense matorral of Chamaerops, Rhamnus and Ulex at Requena-Montano. The two communities differed also in the microclimatic characteristics (sea oriented at the Sabinar-Romeral site and inland oriented at the Requena-Montano site). The information fractal dimension increased as we moved from a middle dense matorral to discontinuous and scattered matorral and, finally to the late regressive succession, at Stipa steppe stage. At this stage a drastic change in the fractal dimension revealed a change in the vegetation structure, accurately indicating end successional vegetation stages. Long-term correlation analysis (DFA) revealed that an increase in grazing pressure leads to unpredictability (randomness) in species distributions, a reduction in diversity, and an increase in cover of the regressive successional species, e.g. Stipa tenacissima L. These comparisons provide a quantitative characterization of the successional dynamic of plant spatial patterns in response to grazing perturbation gradient. © 2002 Elsevier Science B.V. All rights reserved.
ERIC Educational Resources Information Center
Johns, Stephanie
2010-01-01
Kathy Cox, the superintendent of schools for Georgia, believes "excellence is not an accident". She made a name for herself by winning $1 million proving she was smarter than a fifth-grader on a popular television show. This article presents a profile of Cox, her family, her role as school superintendent, and her accomplishments. Although she…
Validation of a heteroscedastic hazards regression model.
Wu, Hong-Dar Isaac; Hsieh, Fushing; Chen, Chen-Hsin
2002-03-01
A Cox-type regression model accommodating heteroscedasticity, with a power factor of the baseline cumulative hazard, is investigated for analyzing data with crossing hazards behavior. Since the approach of partial likelihood cannot eliminate the baseline hazard, an overidentified estimating equation (OEE) approach is introduced in the estimation procedure. Its by-product, a model checking statistic, is presented to test for the overall adequacy of the heteroscedastic model. Further, under the heteroscedastic model setting, we propose two statistics to test the proportional hazards assumption. Implementation of this model is illustrated in a data analysis of a cancer clinical trial. PMID:11878222
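The abstract does not write the model out. One form consistent with its description (a Cox-type model with a power factor on the baseline cumulative hazard) is sketched below; the notation is an assumption introduced here, with \Lambda_0 the baseline cumulative hazard, z and x covariate vectors, and \beta and \gamma the corresponding coefficient vectors.

```latex
\Lambda(t \mid z, x) \;=\; \bigl\{ \Lambda_0(t) \bigr\}^{\exp(\gamma^{\top} x)} \, \exp(\beta^{\top} z)
```

In this form, setting \gamma = 0 recovers the usual proportional hazards model \Lambda(t \mid z) = \Lambda_0(t)\exp(\beta^{\top} z), which suggests why a test of the proportional hazards assumption can be framed as a test on the power-factor coefficients. A non-zero \gamma lets the hazards of two covariate groups cross over time.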
Meadows, Cheyney; Rajala-Schultz, Päivi J; Frazer, Grant S; Meiring, Richard W; Hoblet, Kent H
2006-12-18
An observational study was conducted in order to assess the impact of a contract breeding program on the reproductive performance in a selected group of Ohio dairies using event-time analysis. The contract breeding program was offered by a breeding co-operative and featured tail chalking and daily evaluation of cows for insemination by co-operative technicians. Dairy employees no longer handled estrus detection activities. Between early 2002 and mid-2004, test-day records related to production and reproduction were obtained for 16,453 lactations representing 11,398 cows in a non-random sample of 31 dairies identified as well-managed client herds of the breeding co-operative. Of the 31 herds, 15 were using the contract breeding at the start of the data acquisition period, having started in the previous 2 years. The remaining 16 herds managed their own breeding program and used the co-operative for semen purchase. Cox proportional hazards modeling techniques were used to estimate the association of the contract breeding, as well as the effect of other significant predictors, with the hazard of pregnancy. Two separate Cox models were developed and compared: one that only considered fixed covariates and a second that included both fixed and time-varying covariates. Estimates of effects were expressed as the hazard ratio (HR) for pregnancy. Results of the fixed covariates model indicated that, controlling for breed, herd size, use of ovulation synchronization protocols in the herd, whether somatic cell score exceeded 4.5 prior to pregnancy or censoring, parity, calving season, and maximum test-day milk prior to pregnancy or censoring, the contract breeding program was associated with an increased hazard of pregnancy (HR=1.315; 95% CI 1.261-1.371). The results of the time-varying covariates model, which controlled for breed, herd size, use of ovulation synchronization protocols, somatic cell score above 4.5, parity, calving season, and testing season also found that the
Dhanya, S; Kumari Roshni, V S
2016-01-01
Textures play an important role in image classification. This paper proposes a high performance texture classification method using a combination of multiresolution analysis tool and linear regression modelling by channel elimination. The correlation between different frequency regions has been validated as a sort of effective texture characteristic. This method is motivated by the observation that there exists a distinctive correlation between the image samples belonging to the same kind of texture, at different frequency regions obtained by a wavelet transform. Experimentally, it is observed that this correlation differs across textures. The linear regression modelling is employed to analyze this correlation and extract texture features that characterize the samples. Our method considers not only the frequency regions but also the correlation between these regions. This paper primarily focuses on applying the Dual Tree Complex Wavelet Packet Transform and the Linear Regression model for classification of the obtained texture features. Additionally the paper also presents a comparative assessment of the classification results obtained from the above method with two more types of wavelet transform methods namely the Discrete Wavelet Transform and the Discrete Wavelet Packet Transform. PMID:26835234
Oil and gas pipeline construction cost analysis and developing regression models for cost estimation
NASA Astrophysics Data System (ADS)
Thaduri, Ravi Kiran
In this study, cost data for 180 pipelines and 136 compressor stations have been analyzed. On the basis of the distribution analysis, regression models have been developed. Material, labor, right-of-way (ROW) and miscellaneous costs make up the total cost of pipeline construction. The pipelines are analyzed based on pipeline length, diameter, location, pipeline volume and year of completion. In pipeline construction, labor costs dominate the total costs with a share of about 40%. Multiple non-linear regression models are developed to estimate the component costs of pipelines for various cross-sectional areas, lengths and locations. The compressor stations are analyzed based on capacity, year of completion and location. Unlike the pipeline costs, material costs dominate the total costs in the construction of compressor stations, with an average share of about 50.6%. Land costs have very little influence on the total costs. Similar regression models are developed to estimate the component costs of compressor stations for various capacities and locations.
Irrechukwu, Onyi N; Reiter, David A; Lin, Ping-Chang; Roque, Remigio A; Fishbein, Kenneth W; Spencer, Richard G
2012-06-01
Increased sensitivity in the characterization of cartilage matrix status by magnetic resonance (MR) imaging, through the identification of surrogate markers for tissue quality, would be of great use in the noninvasive evaluation of engineered cartilage. Recent advances in MR evaluation of cartilage include multiexponential and multiparametric analysis, which we now extend to engineered cartilage. We studied constructs which developed from chondrocytes seeded in collagen hydrogels. MR measurements of transverse relaxation times were performed on samples after 1, 2, 3, and 4 weeks of development. Corresponding biochemical measurements of sulfated glycosaminoglycan (sGAG) were also performed. sGAG per wet weight increased from 7.74±1.34 μg/mg in week 1 to 21.06±4.14 μg/mg in week 4. Using multiexponential T₂ analysis, we detected at least three distinct water compartments, with T₂ values and weight fractions of (45 ms, 3%), (200 ms, 4%), and (500 ms, 97%), respectively. These values are consistent with known properties of engineered cartilage and previous studies of native cartilage. Correlations between sGAG and MR measurements were examined using conventional univariate analysis with T₂ data from monoexponential fits with individual multiexponential compartment fractions and sums of these fractions, through multiple linear regression based on linear combinations of fractions, and, finally, with multivariate analysis using the support vector regression (SVR) formalism. The phenomenological relationship between T₂ from monoexponential fitting and sGAG exhibited a correlation coefficient of r²=0.56, comparable to the more physically motivated correlations between individual fractions or sums of fractions and sGAG; the correlation based on the sum of the two proteoglycan-associated fractions was r²=0.58. Correlations between measured sGAG and those calculated using standard linear regression were more modest, with r² in the range 0
COX2 Inhibition Reduces Aortic Valve Calcification In Vivo
Wirrig, Elaine E.; Gomez, M. Victoria; Hinton, Robert B.; Yutzey, Katherine E.
2016-01-01
Objective: Calcific aortic valve disease (CAVD) is a significant cause of morbidity and mortality, which affects approximately 1% of the US population and is characterized by calcific nodule formation and stenosis of the valve. Klotho-deficient mice were used to study the molecular mechanisms of CAVD as they develop robust aortic valve (AoV) calcification. Through microarray analysis of AoV tissues from klotho-deficient and wild type mice, increased expression of the gene encoding cyclooxygenase 2/COX2 (Ptgs2) was found. COX2 activity contributes to bone differentiation and homeostasis, thus the contribution of COX2 activity to AoV calcification was assessed. Approach and Results: In klotho-deficient mice, COX2 expression is increased throughout regions of valve calcification and is induced in the valvular interstitial cells (VICs) prior to calcification formation. Similarly, COX2 expression is increased in human diseased AoVs. Treatment of cultured porcine aortic VICs with osteogenic media induces bone marker gene expression and calcification in vitro, which is blocked by inhibition of COX2 activity. In vivo, genetic loss of function of COX2 cyclooxygenase activity partially rescues AoV calcification in klotho-deficient mice. Moreover, pharmacologic inhibition of COX2 activity in klotho-deficient mice via celecoxib-containing diet reduces AoV calcification and blocks osteogenic gene expression. Conclusions: COX2 expression is upregulated in CAVD and its activity contributes to osteogenic gene induction and valve calcification in vitro and in vivo. PMID:25722432
NASA Astrophysics Data System (ADS)
Liu, Pudong; Shi, Runhe; Wang, Hong; Bai, Kaixu; Gao, Wei
2014-10-01
Leaf pigments are key elements for plant photosynthesis and growth. Traditional manual sampling of these pigments is labor-intensive and costly, and has difficulty capturing their temporal and spatial characteristics. The aim of this work is to estimate photosynthetic pigments at large scale by remote sensing. For this purpose, inverse models were proposed with the aid of stepwise multiple linear regression (SMLR) analysis. A leaf radiative transfer model (the PROSPECT model) was employed to simulate leaf reflectance at wavelengths from 400 to 780 nm at 1 nm intervals, and these values were treated as data from remote sensing observations. Simulated chlorophyll concentration (Cab), carotenoid concentration (Car) and their ratio (Cab/Car) were each taken as the target of a regression model. In this study, a total of 4000 samples were simulated via PROSPECT with different Cab, Car and leaf mesophyll structures; 70% of these samples were used for training and the remaining 30% for model validation. Reflectance (r) and its mathematical transformations (1/r and log (1/r)) were each employed to build regression models. Results showed fair agreement between pigments and simulated reflectance, with all adjusted coefficients of determination (R2) larger than 0.8 when 6 wavebands were selected to build the SMLR model. The largest values of R2 for Cab, Car and Cab/Car were 0.8845, 0.876 and 0.8765, respectively. Mathematical transformations of reflectance had little influence on regression accuracy. We concluded that it is feasible to estimate chlorophyll, carotenoids and their ratio with a statistical model based on leaf reflectance data.
Analysis of sparse data in logistic regression in medical research: A newer approach
Devika, S; Jeyaseelan, L; Sebastian, G
2016-01-01
Background and Objective: In the analysis of dichotomous type response variable, logistic regression is usually used. However, the performance of logistic regression in the presence of sparse data is questionable. In such a situation, a common problem is the presence of high odds ratios (ORs) with very wide 95% confidence interval (CI) (OR: >999.999, 95% CI: <0.001, >999.999). In this paper, we addressed this issue by using penalized logistic regression (PLR) method. Materials and Methods: Data from case-control study on hyponatremia and hiccups conducted in Christian Medical College, Vellore, Tamil Nadu, India was used. The outcome variable was the presence/absence of hiccups and the main exposure variable was the status of hyponatremia. Simulation dataset was created with different sample sizes and with a different number of covariates. Results: A total of 23 cases and 50 controls were used for the analysis of ordinary and PLR methods. The main exposure variable hyponatremia was present in nine (39.13%) of the cases and in four (8.0%) of the controls. Of the 23 hiccup cases, all were males and among the controls, 46 (92.0%) were males. Thus, the complete separation between gender and the disease group led into an infinite OR with 95% CI (OR: >999.999, 95% CI: <0.001, >999.999) whereas there was a finite and consistent regression coefficient for gender (OR: 5.35; 95% CI: 0.42, 816.48) using PLR. After adjusting for all the confounding variables, hyponatremia entailed 7.9 (95% CI: 2.06, 38.86) times higher risk for the development of hiccups as was found using PLR whereas there was an overestimation of risk OR: 10.76 (95% CI: 2.17, 53.41) using the conventional method. Simulation experiment shows that the estimated coverage probability of this method is near the nominal level of 95% even for small sample sizes and for a large number of covariates. Conclusions: PLR is almost equal to the ordinary logistic regression when the sample size is large and is superior in
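The counts reported above are enough to reproduce the crude (unadjusted) odds ratios and to see why complete separation breaks the ordinary estimate. The sketch below uses only the cross-product formula for a 2x2 table; the function name is hypothetical, and the result is not comparable to the adjusted estimates quoted in the abstract, which come from multivariable models.

```python
def odds_ratio(exposed_cases, unexposed_cases, exposed_controls, unexposed_controls):
    """Cross-product odds ratio of a 2x2 table; infinite if the denominator is zero."""
    numerator = exposed_cases * unexposed_controls
    denominator = unexposed_cases * exposed_controls
    if denominator == 0:
        return float("inf")
    return numerator / denominator


# Hyponatremia: present in 9 of 23 cases and in 4 of 50 controls.
crude_or_hyponatremia = odds_ratio(9, 23 - 9, 4, 50 - 4)

# Gender: all 23 cases were male, and 46 of 50 controls were male.
crude_or_male = odds_ratio(23, 0, 46, 4)

print(round(crude_or_hyponatremia, 2), crude_or_male)
```

Because no case is female, one cell of the gender table is zero and the crude odds ratio is unbounded. This is the situation in which ordinary maximum likelihood reports an extreme estimate with a very wide interval, whereas a penalized fit returns a finite one.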
NASA Astrophysics Data System (ADS)
Goovaerts, Pierre
2013-06-01
Analyzing temporal trends in health outcomes can provide a more comprehensive picture of the burden of a disease like cancer and generate new insights about the impact of various interventions. In the United States such an analysis is increasingly conducted using joinpoint regression outside a spatial framework, which overlooks the existence of significant variation among U.S. counties and states with regard to the incidence of cancer. This paper presents several innovative ways to account for space in joinpoint regression: (1) prior filtering of noise in the data by binomial kriging and use of the kriging variance as a measure of reliability in weighted least-squares regression, (2) detection of significant boundaries between adjacent counties based on tests of parallelism of time trends and confidence intervals of annual percent change of rates, and (3) creation of spatially compact groups of counties with similar temporal trends through the application of hierarchical cluster analysis to the results of boundary analysis. The approach is illustrated using time series of proportions of prostate cancer late-stage cases diagnosed yearly in every county of Florida since the 1980s. The annual percent change (APC) in late-stage diagnosis and the onset years for significant declines vary greatly across Florida. Most counties with non-significant average APC are located in the north-western part of Florida, known as the Panhandle, which is more rural than other parts of Florida. The number of significant boundaries peaked in the early 1990s when the prostate-specific antigen (PSA) test became widely available, a temporal trend that suggests the existence of geographical disparities in the implementation and/or impact of the new screening procedure, in particular as it became available.
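To make the joinpoint and APC terminology concrete, here is a minimal non-spatial sketch. It assumes a single joinpoint, a log-linear trend, an unweighted least-squares fit, and a grid search over candidate years; none of these choices is taken from the paper, which additionally uses kriging-based weights and significance tests. The function name `fit_one_joinpoint` and the synthetic, noise-free series are hypothetical.

```python
import numpy as np

def fit_one_joinpoint(years, log_rate, candidates):
    """Grid-search a single joinpoint for a continuous piecewise-linear trend."""
    best = None
    for knot in candidates:
        # Columns: intercept, slope before the knot, change in slope after it
        design = np.column_stack([
            np.ones_like(years),
            years,
            np.maximum(years - knot, 0.0),
        ])
        coef, *_ = np.linalg.lstsq(design, log_rate, rcond=None)
        sse = float(np.sum((log_rate - design @ coef) ** 2))
        if best is None or sse < best[0]:
            best = (sse, knot, coef)
    _, knot, coef = best
    # On the log scale, APC = 100 * (exp(slope) - 1)
    apc_before = 100.0 * (np.exp(coef[1]) - 1.0)
    apc_after = 100.0 * (np.exp(coef[1] + coef[2]) - 1.0)
    return knot, apc_before, apc_after

# Synthetic series: rising 2% per year (log scale) until year 10, then falling.
years = np.arange(25, dtype=float)
log_rate = 0.02 * years - 0.07 * np.maximum(years - 10.0, 0.0)

knot, apc_before, apc_after = fit_one_joinpoint(years, log_rate, range(2, 23))
print(knot, apc_before, apc_after)
```

A weighted variant along the lines of item (1) would scale each row of the design matrix and response by a reliability weight before the least-squares solve.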
Hofland, G.S.; Barton, C.C.
1990-10-01
The computer program FREQFIT is designed to perform regression and statistical chi-squared goodness of fit analysis on one-dimensional or two-dimensional data. The program features an interactive user dialogue, numerous help messages, an option for screen or line printer output, and the flexibility to use practically any commercially available graphics package to create plots of the program's results. FREQFIT is written in Microsoft QuickBASIC, for IBM-PC compatible computers. A listing of the QuickBASIC source code for the FREQFIT program, a user manual, and sample input data, output, and plots are included. 6 refs., 1 fig.
NASA Astrophysics Data System (ADS)
Sugihara, Shigemitsu; Shinozaki, Tsuguhiro; Ohishi, Hiroyuki; Araki, Yoshinori; Furukawa, Kohei
It is difficult to deregulate sediment-related disaster warning information because it is difficult to quantify the risk of disaster after heavy rain. If the risk can be quantified according to the rain situation, it can serve as an indicator for deregulation. In this study, using logistic regression analysis, we quantified the risk according to the rain situation as the probability of disaster occurrence. We also analyzed the setup of the resolutive criterion for sediment-related disaster warning information. As a result, we can improve the convenience of the method for evaluating the probability of disaster occurrence, which is useful for providing information on imminent situations.
NASA Technical Reports Server (NTRS)
Waller, M. C.
1976-01-01
An electro-optical device called an oculometer which tracks a subject's lookpoint as a time function has been used to collect data in a real-time simulation study of instrument landing system (ILS) approaches. The data describing the scanning behavior of a pilot during the instrument approaches have been analyzed by use of a stepwise regression analysis technique. A statistically significant correlation between pilot workload, as indicated by pilot ratings, and scanning behavior has been established. In addition, it was demonstrated that parameters derived from the scanning behavior data can be combined in a mathematical equation to provide a good representation of pilot workload.
Regression Models for the Analysis of Longitudinal Gaussian Data from Multiple Sources
O’Brien, Liam M.; Fitzmaurice, Garrett M.
2006-01-01
We present a regression model for the joint analysis of longitudinal multiple source Gaussian data. Longitudinal multiple source data arise when repeated measurements are taken from two or more sources, and each source provides a measure of the same underlying variable and on the same scale. This type of data generally produces a relatively large number of observations per subject; thus estimation of an unstructured covariance matrix often may not be possible. We consider two methods by which parsimonious models for the covariance can be obtained for longitudinal multiple source data. The methods are illustrated with an example of multiple informant data arising from a longitudinal interventional trial in psychiatry. PMID:15726666
Amene, E; Hanson, L A; Zahn, E A; Wild, S R; Döpfer, D
2016-07-01
The purpose of this study was to apply a novel statistical method for variable selection and a model-based approach for filling data gaps in mortality rates associated with foodborne diseases using the WHO Vital Registration mortality dataset. Correlation analysis and elastic net regularization methods were applied to drop redundant variables and to select the most meaningful subset of predictors. Whenever predictor data were missing, multiple imputation was used to fill in plausible values. Cluster analysis was applied to identify similar groups of countries based on the values of the predictors. Finally, a Bayesian hierarchical regression model was fit to the final dataset for predicting mortality rates. From 113 potential predictors, 32 were retained after correlation analysis. Out of these 32 predictors, eight with non-zero coefficients were selected using the elastic net regularization method. Based on the values of these variables, four clusters of countries were identified. The uncertainty of predictions was large for countries within clusters lacking mortality rates, and it was low for a cluster that had mortality rate information. Our results demonstrated that, using Bayesian hierarchical regression models, a data-driven clustering of countries and a meaningful subset of predictors can be used to fill data gaps in foodborne disease mortality. PMID:26785774
Ziemssen, Tjalf; Reimann, Manja; Gasch, Julia; Rüdiger, Heinz
2013-09-01
Biological rhythms, describing the temporal variation of biological processes, are a characteristic feature of complex systems. The analysis of biological rhythms can provide important insights into the pathophysiology of different diseases, especially, in cardiovascular medicine. In the field of the autonomic nervous system, heart rate variability (HRV) and baroreflex sensitivity (BRS) describe important fluctuations of blood pressure and heart rate which are often analyzed by Fourier transformation. However, these parameters are stochastic with overlaying rhythmical structures. R-R intervals as independent variables of time are not equidistant. That is why the trigonometric regressive spectral (TRS) analysis--reviewed in this paper--was introduced, considering both the statistical and rhythmical features of such time series. The data segments required for TRS analysis can be as short as 20 s allowing for dynamic evaluation of heart rate and blood pressure interaction over longer periods. Beyond HRV, TRS also estimates BRS based on linear regression analyses of coherent heart rate and blood pressure oscillations. An additional advantage is that all oscillations are analyzed by the same (maximal) number of R-R intervals thereby providing a high number of individual BRS values. This ensures a high confidence level of BRS determination which, along with short recording periods, may be of profound clinical relevance. The dynamic assessment of heart rate and blood pressure spectra by TRS allows a more precise evaluation of cardiovascular modulation under different settings as has already been demonstrated in different clinical studies. PMID:23812502
NASA Astrophysics Data System (ADS)
Mandal, Nilrudra; Doloi, Biswanath; Mondal, Biswanath
2016-01-01
In the present study, an attempt has been made to apply the Taguchi parameter design method and regression analysis for optimizing the cutting conditions on surface finish while machining AISI 4340 steel with the help of the newly developed yttria based Zirconia Toughened Alumina (ZTA) inserts. These inserts are prepared through wet chemical co-precipitation route followed by powder metallurgy process. Experiments have been carried out based on an orthogonal array L9 with three parameters (cutting speed, depth of cut and feed rate) at three levels (low, medium and high). Based on the mean response and signal to noise ratio (SNR), the best optimal cutting condition has been arrived at A3B1C1 i.e. cutting speed is 420 m/min, depth of cut is 0.5 mm and feed rate is 0.12 m/min considering the condition smaller is the better approach. Analysis of Variance (ANOVA) is applied to find out the significance and percentage contribution of each parameter. The mathematical model of surface roughness has been developed using regression analysis as a function of the above mentioned independent variables. The predicted values from the developed model and experimental values are found to be very close to each other justifying the significance of the model. A confirmation run has been carried out with 95 % confidence level to verify the optimized result and the values obtained are within the prescribed limit.
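For reference, the "smaller is the better" signal-to-noise ratio used in Taguchi designs is SNR = -10 log10(mean(y^2)), so the run with the highest SNR has the lowest roughness. A minimal sketch follows; the roughness values and run labels are hypothetical and are not data from the study.

```python
import math

def sn_smaller_is_better(values):
    """Taguchi smaller-the-better signal-to-noise ratio in dB."""
    mean_square = sum(v * v for v in values) / len(values)
    return -10.0 * math.log10(mean_square)

# Hypothetical surface-roughness replicates for three runs of an array
roughness_runs = {
    "run_1": [1.2, 1.3],
    "run_2": [0.8, 0.9],
    "run_3": [2.0, 2.1],
}

sn_by_run = {run: sn_smaller_is_better(vals) for run, vals in roughness_runs.items()}
best_run = max(sn_by_run, key=sn_by_run.get)  # highest SNR = smoothest surface
print(sn_by_run, best_run)
```

In a full L9 analysis the SNR values would then be averaged per factor level to pick the best level of each parameter.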
Selenium Exposure and Cancer Risk: an Updated Meta-analysis and Meta-regression
Cai, Xianlei; Wang, Chen; Yu, Wanqi; Fan, Wenjie; Wang, Shan; Shen, Ning; Wu, Pengcheng; Li, Xiuyang; Wang, Fudi
2016-01-01
The objective of this study was to investigate the associations between selenium exposure and cancer risk. We identified 69 studies and applied meta-analysis, meta-regression and dose-response analysis to obtain available evidence. The results indicated that high selenium exposure had a protective effect on cancer risk (pooled OR = 0.78; 95%CI: 0.73–0.83). The results of linear and nonlinear dose-response analysis indicated that high serum/plasma selenium and toenail selenium were effective for cancer prevention. However, we did not find a protective efficacy of selenium supplements. High selenium exposure may have different effects on specific types of cancer. It decreased the risk of breast cancer, lung cancer, esophageal cancer, gastric cancer, and prostate cancer, but it was not associated with colorectal cancer, bladder cancer, and skin cancer. PMID:26786590
Bareth, Bettina; Dennerlein, Sven; Mick, David U.; Nikolov, Miroslav; Urlaub, Henning
2013-01-01
Cox1, the core subunit of the cytochrome c oxidase, receives two heme a cofactors during assembly of the 13-subunit enzyme complex. However, at which step of the assembly process and how heme is inserted into Cox1 have remained an enigma. Shy1, the yeast SURF1 homolog, has been implicated in heme transfer to Cox1, whereas the heme a synthase, Cox15, catalyzes the final step of heme a synthesis. Here we performed a comprehensive analysis of cytochrome c oxidase assembly intermediates containing Shy1. Our analyses suggest that Cox15 displays a role in cytochrome c oxidase assembly, which is independent of its functions as the heme a synthase. Cox15 forms protein complexes with Shy1 and also associates with Cox1-containing complexes independently of Shy1 function. These findings indicate that Shy1 does not serve as a mobile heme carrier between the heme a synthase and maturing Cox1 but rather cooperates with Cox15 for heme transfer and insertion in early assembly intermediates of cytochrome c oxidase. PMID:23979592
NASA Astrophysics Data System (ADS)
Rajab, Jasim M.; MatJafri, M. Z.; Lim, H. S.
2013-06-01
This study encompasses columnar ozone modelling in the peninsular Malaysia. Data of eight atmospheric parameters [air surface temperature (AST), carbon monoxide (CO), methane (CH4), water vapour (H2Ovapour), skin surface temperature (SSKT), atmosphere temperature (AT), relative humidity (RH), and mean surface pressure (MSP)] data set, retrieved from NASA's Atmospheric Infrared Sounder (AIRS), for the entire period (2003-2008) was employed to develop models to predict the value of columnar ozone (O3) in study area. The combined method, which is based on using both multiple regressions combined with principal component analysis (PCA) modelling, was used to predict columnar ozone. This combined approach was utilized to improve the prediction accuracy of columnar ozone. Separate analysis was carried out for north east monsoon (NEM) and south west monsoon (SWM) seasons. The O3 was negatively correlated with CH4, H2Ovapour, RH, and MSP, whereas it was positively correlated with CO, AST, SSKT, and AT during both the NEM and SWM season periods. Multiple regression analysis was used to fit the columnar ozone data using the atmospheric parameter's variables as predictors. A variable selection method based on high loading of varimax rotated principal components was used to acquire subsets of the predictor variables to be comprised in the linear regression model of the atmospheric parameter's variables. It was found that the increase in columnar O3 value is associated with an increase in the values of AST, SSKT, AT, and CO and with a drop in the levels of CH4, H2Ovapour, RH, and MSP. The result of fitting the best models for the columnar O3 value using eight of the independent variables gave about the same values of the R (≈0.93) and R2 (≈0.86) for both the NEM and SWM seasons. The common variables that appeared in both regression equations were SSKT, CH4 and RH, and the principal precursor of the columnar O3 value in both the NEM and SWM seasons was SSKT.
Ma, Ya-Nan; Wang, Jing; Dong, Guang-Hui; Liu, Miao-Miao; Wang, Da; Liu, Yu-Qin; Zhao, Yang; Ren, Wan-Hui; Lee, Yungling Leo; Zhao, Ya-Dong; He, Qin-Cheng
2013-01-01
Background There have been few published studies on spirometric reference values for healthy children in China. We hypothesize that there would have been changes in lung function that would not have been precisely predicted by the existing spirometric reference equations. The objective of the study was to develop more accurate predictive equations for spirometric reference values for children aged 9 to 15 years in Northeast China. Methodology/Principal Findings Spirometric measurements were obtained from 3,922 children, including 1,974 boys and 1,948 girls, who were randomly selected from five cities of Liaoning province, Northeast China, using the ATS (American Thoracic Society) and ERS (European Respiratory Society) standards. The data was then randomly split into a training subset containing 2078 cases and a validation subset containing 1844 cases. Predictive equations used multiple linear regression techniques with three predictor variables: height, age and weight. Model goodness of fit was examined using the coefficient of determination or the R2 and adjusted R2. The predicted values were compared with those obtained from the existing spirometric reference equations. The results showed the prediction equations using linear regression analysis performed well for most spirometric parameters. Paired t-tests were used to compare the predicted values obtained from the developed and existing spirometric reference equations based on the validation subset. The t-test for males was not statistically significant (p>0.01). The predictive accuracy of the developed equations was higher than the existing equations and the predictive ability of the model was also validated. Conclusion/Significance We developed prediction equations using linear regression analysis of spirometric parameters for children aged 9–15 years in Northeast China. These equations represent the first attempt at predicting lung function for Chinese children following the ATS/ERS Task Force 2005
Automated particle identification through regression analysis of size, shape and colour
NASA Astrophysics Data System (ADS)
Rodriguez Luna, J. C.; Cooper, J. M.; Neale, S. L.
2016-04-01
Rapid point of care diagnostic tests and tests to provide therapeutic information are now available for a range of specific conditions, from the measurement of blood glucose levels for diabetes to card agglutination tests for parasitic infections. Due to a lack of specificity, these tests are often then backed up by more conventional lab based diagnostic methods; for example, a card agglutination test may be carried out for a suspected parasitic infection in the field and, if positive, a blood sample can then be sent to a lab for confirmation. The eventual diagnosis is often achieved by microscopic examination of the sample. In this paper we propose a computerized vision system for aiding in the diagnostic process; this system uses a novel particle recognition algorithm to improve specificity and speed during the diagnostic process. We will show the detection and classification of different types of cells in a diluted blood sample using regression analysis of their size, shape and colour. The first step is to define the objects to be tracked by a Gaussian Mixture Model for background subtraction and binary opening and closing for noise suppression. After subtracting the objects of interest from the background, the next challenge is to predict if a given object belongs to a certain category or not. This is a classification problem, and the output of the algorithm is a Boolean value (true/false). As such, the computer program should be able to "predict" with a reasonable level of confidence if a given particle belongs to the kind we are looking for or not. We show the use of a binary logistic regression analysis with three continuous predictors: size, shape and color histogram. The results suggest these variables could be very useful in a logistic regression equation, as they proved to have a relatively high predictive value on their own.
Regression analysis of growth responses to water depth in three wetland plant species
Sorrell, Brian K.; Tanner, Chris C.; Brix, Hans
2012-01-01
Background and aims Plant species composition in wetlands and on lakeshores often shows dramatic zonation, which is frequently ascribed to differences in flooding tolerance. This study compared the growth responses to water depth of three species (Phormium tenax, Carex secta and Typha orientalis) differing in depth preferences in wetlands, using non-linear and quantile regression analyses to establish how flooding tolerance can explain field zonation. Methodology Plants were established for 8 months in outdoor cultures in waterlogged soil without standing water, and then randomly allocated to water depths from 0 to 0.5 m. Morphological and growth responses to depth were followed for 54 days before harvest, and then analysed by repeated-measures analysis of covariance, and non-linear and quantile regression analysis (QRA), to compare flooding tolerances. Principal results Growth responses to depth differed between the three species, and were non-linear. Phormium tenax growth decreased rapidly in standing water >0.25 m depth, C. secta growth increased initially with depth but then decreased at depths >0.30 m, accompanied by increased shoot height and decreased shoot density, and T. orientalis was unaffected by the 0- to 0.50-m depth range. In P. tenax the decrease in growth was associated with a decrease in the number of leaves produced per ramet and in C. secta the effect of water depth was greatest for the tallest shoots. Allocation patterns were unaffected by depth. Conclusions The responses are consistent with the principle that zonation in the field is primarily structured by competition in shallow water and by physiological flooding tolerance in deep water. Regression analyses, especially QRA, proved to be powerful tools in distinguishing genuine phenotypic responses to water depth from non-phenotypic variation due to size and developmental differences. PMID:23259044
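Quantile regression, as used in the abstract above, replaces the squared-error loss of ordinary regression with the asymmetric "pinball" (check) loss. The sketch below shows only that loss and verifies, on a toy sample, that the constant minimizing it is a sample quantile; a full quantile regression would minimize the same loss over the coefficients of a linear predictor. The function name `pinball_loss` and the toy values are illustrative, not taken from the study.

```python
import numpy as np

def pinball_loss(residuals, tau):
    """Check loss: tau * u for u >= 0 and (tau - 1) * u for u < 0, summed."""
    residuals = np.asarray(residuals, dtype=float)
    return float(np.sum(residuals * (tau - (residuals < 0))))

# Toy response values 1..10 (hypothetical, e.g. shoot heights in arbitrary units)
y = np.arange(1.0, 11.0)
tau = 0.75

# Evaluate the loss for each candidate constant taken from the sample itself
losses = [pinball_loss(y - c, tau) for c in y]
best_constant = float(y[int(np.argmin(losses))])
print(best_constant)
```

Choosing a high `tau` targets the upper edge of the response distribution, which is the sense in which quantile regression can follow the tallest shoots rather than the average plant.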
JOINT STRUCTURE SELECTION AND ESTIMATION IN THE TIME-VARYING COEFFICIENT COX MODEL
Xiao, Wei; Lu, Wenbin; Zhang, Hao Helen
2016-01-01
Time-varying coefficient Cox model has been widely studied and popularly used in survival data analysis due to its flexibility for modeling covariate effects. It is of great practical interest to accurately identify the structure of covariate effects in a time-varying coefficient Cox model, i.e. covariates with null effect, constant effect and truly time-varying effect, and estimate the corresponding regression coefficients. Combining the ideas of local polynomial smoothing and group nonnegative garrote, we develop a new penalization approach to achieve such goals. Our method is able to identify the underlying true model structure with probability tending to one and simultaneously estimate the time-varying coefficients consistently. The asymptotic normalities of the resulting estimators are also established. We demonstrate the performance of our method using simulations and an application to the primary biliary cirrhosis data. PMID:27540275
NASA Astrophysics Data System (ADS)
Buck, J. A.; Underhill, P. R.; Morelli, J.; Krause, T. W.
2016-02-01
Nuclear steam generators (SGs) are a critical component for ensuring safe and efficient operation of a reactor. Life management strategies are implemented in which SG tubes are regularly inspected by conventional eddy current testing (ECT) and ultrasonic testing (UT) technologies to size flaws, and safe operating life of SGs is predicted based on growth models. ECT, the more commonly used technique due to the rapidity with which full SG tube wall inspection can be performed, is challenged when inspecting ferromagnetic support structure materials in the presence of magnetite sludge and multiple overlapping degradation modes. In this work, an emerging inspection method, pulsed eddy current (PEC), is being investigated to address some of these particular inspection conditions. Time-domain signals were collected by an 8 coil array PEC probe in which ferromagnetic drilled support hole diameter, depth of rectangular tube frets and 2D tube off-centering were varied. Data sets were analyzed with a modified principal components analysis (MPCA) to extract dominant signal features. Multiple linear regression models were applied to MPCA scores to size hole diameter as well as rectangular outer diameter tube frets. Models were improved through exploratory factor analysis, which was applied to MPCA scores to refine the selection of regression model inputs by removing nonessential information.
Survival regression analysis: a powerful tool for evaluating fighting and assessment.
Moya-Laraño; Wise
2000-09-01
Theoretical models of animal contests frequently generate predictions about how asymmetries (e.g. differences in size, residence status) between contestants affect fight duration. Linear regression and nonparametric correlation analyses are commonly used to test the fit of data to such models. We show how survival regression analysis (SRA) is a powerful technique for studying the effect of asymmetries on the duration of contests. SRA, which is under-utilized by students of animal behaviour, offers several advantages over more frequently used procedures. It provides unbiased parameter estimates even when including censored data (i.e. results of contests that have not ended at the time when observations are stopped). The analysis of hazard functions, which is a component of SRA, is an easy way to test for consistency with predictions of the sequential assessment game model. These and other advantages of SRA are illustrated by using SRA and more conventional methods to analyse the effect of asymmetries on contest duration for encounters between female Mediterranean tarantulas, Lycosa tarentula (L.). It is hoped that this example of the advantages of SRA will encourage more widespread use of this powerful technique. Copyright 2000 The Association for the Study of Animal Behaviour. PMID:11007639
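The point about censored contests can be illustrated with the simplest parametric survival model. This is a minimal sketch assuming a constant hazard (exponential durations), for which the maximum-likelihood hazard is the number of observed endings divided by the total observed time; the paper's own models may differ, and the function name and the durations are hypothetical.

```python
def exponential_hazard(durations, ended):
    """MLE of a constant hazard with right-censoring: events / total exposure."""
    events = sum(ended)
    exposure = sum(durations)  # censored contests still contribute observed time
    return events / exposure

# Hypothetical contest durations (minutes); ended=0 marks a contest still
# running when observation stopped (right-censored).
durations = [2.0, 4.0, 6.0, 8.0]
ended = [1, 1, 0, 1]

hazard_censored = exponential_hazard(durations, ended)

# Naive alternative: discard the censored contest entirely
kept = [d for d, e in zip(durations, ended) if e == 1]
hazard_naive = len(kept) / sum(kept)

print(hazard_censored, hazard_naive)
```

Discarding the censored contest overstates the hazard, that is, it makes fights look shorter than they were, which is the bias the abstract says survival regression avoids.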
Poisson regression analysis of mortality among male workers at a thorium-processing plant
Liu, Zhiyuan; Lee, Tze-San; Kotek, T.J.
1991-12-31
Analyses of mortality among a cohort of 3119 male workers employed between 1915 and 1973 at a thorium-processing plant were updated to the end of 1982. Of the whole group, 761 men were deceased and 2161 men were still alive, while 197 men were lost to follow-up. A total of 250 deaths was added to the 511 deaths observed in the previous study. The standardized mortality ratio (SMR) for all causes of death was 1.12 with 95% confidence interval (CI) of 1.05-1.21. The SMRs were also significantly increased for all malignant neoplasms (SMR = 1.23, 95% CI = 1.04-1.43) and lung cancer (SMR = 1.36, 95% CI = 1.02-1.78). Poisson regression analysis was employed to evaluate the joint effects of job classification, duration of employment, time since first employment, age and year at first employment on mortality of all malignant neoplasms and lung cancer. A comparison of internal and external analyses with the Poisson regression model was also conducted and showed no obvious difference in fitting the data on lung cancer mortality of the thorium workers. The results of the multivariate analysis showed that there was no significant effect of all the study factors on mortality due to all malignant neoplasms and lung cancer. Therefore, further study is needed for the former thorium workers.
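As a reminder of how an SMR and its confidence interval are formed, the sketch below uses Byar's approximation to the exact Poisson limits. The abstract reports neither the expected number of deaths nor the interval method, so the expected count here is a hypothetical value back-calculated from the reported SMR of 1.12 purely for illustration, and the function name is likewise an assumption.

```python
import math

def smr_with_byar_ci(observed, expected, z=1.96):
    """SMR = observed / expected, with Byar's approximate Poisson limits."""
    smr = observed / expected
    lower = observed * (
        1.0 - 1.0 / (9.0 * observed) - z / (3.0 * math.sqrt(observed))
    ) ** 3 / expected
    o1 = observed + 1
    upper = o1 * (
        1.0 - 1.0 / (9.0 * o1) + z / (3.0 * math.sqrt(o1))
    ) ** 3 / expected
    return smr, lower, upper

observed_deaths = 761                  # from the abstract
expected_deaths = 761 / 1.12           # hypothetical: implied by the reported SMR

smr, lower, upper = smr_with_byar_ci(observed_deaths, expected_deaths)
print(smr, lower, upper)
```

The resulting interval is close to, but need not be identical with, the reported 1.05-1.21, since the original calculation may have used a different method or unrounded inputs.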
Error analysis of leaf area estimates made from allometric regression models
NASA Technical Reports Server (NTRS)
Feiveson, A. H.; Chhikara, R. S.
1986-01-01
Biological net productivity, measured in terms of the change in biomass with time, affects global productivity and the quality of life through biochemical and hydrological cycles and by its effect on the overall energy balance. Estimating leaf area for large ecosystems is one of the more important means of monitoring this productivity. For a particular forest plot, the leaf area is often estimated by a two-stage process. In the first stage, known as dimension analysis, a small number of trees are felled so that their areas can be measured as accurately as possible. These leaf areas are then related to non-destructive, easily-measured features such as bole diameter and tree height, by using a regression model. In the second stage, the non-destructive features are measured for all or for a sample of trees in the plots and then used as input into the regression model to estimate the total leaf area. Because both stages of the estimation process are subject to error, it is difficult to evaluate the accuracy of the final plot leaf area estimates. This paper illustrates how a complete error analysis can be made, using an example from a study made on aspen trees in northern Minnesota. The study was a joint effort by NASA and the University of California at Santa Barbara known as COVER (Characterization of Vegetation with Remote Sensing).
Regression-based adaptive sparse polynomial dimensional decomposition for sensitivity analysis
NASA Astrophysics Data System (ADS)
Tang, Kunkun; Congedo, Pietro; Abgrall, Remi
2014-11-01
Polynomial dimensional decomposition (PDD) is employed in this work for global sensitivity analysis and uncertainty quantification of stochastic systems subject to a large number of random input variables. Due to the intimate structure between PDD and Analysis-of-Variance, PDD is able to provide simpler and more direct evaluation of the Sobol' sensitivity indices, when compared to polynomial chaos (PC). Unfortunately, the number of PDD terms grows exponentially with respect to the size of the input random vector, which makes the computational cost of the standard method unaffordable for real engineering applications. In order to address this curse of dimensionality, this work proposes a variance-based adaptive strategy aiming to build a cheap meta-model by sparse-PDD with PDD coefficients computed by regression. During this adaptive procedure, the model representation by PDD contains only a few terms, so that the cost of repeatedly solving the linear system of the least-squares regression problem is negligible. The size of the final sparse-PDD representation is much smaller than the full PDD, since only significant terms are eventually retained. Consequently, far fewer calls to the deterministic model are required to compute the final PDD coefficients.
A least trimmed square regression method for second level FMRI effective connectivity analysis.
Li, Xingfeng; Coyle, Damien; Maguire, Liam; McGinnity, Thomas Martin
2013-01-01
We present a least trimmed square (LTS) robust regression method to combine different runs/subjects for second/high level effective connectivity analysis. The basic idea of this method is to treat the extreme nonlinear model variability as outliers if they exceed a certain threshold. A bootstrap method for the LTS estimation is employed to detect model outliers. We compared the LTS robust method with a non-robust method using simulated and real datasets. The difference between LTS and the non-robust method for second level effective connectivity analysis is significant, suggesting the conventional non-robust method is easily affected by the model variability from the first level analysis. In addition, after these outliers are detected and excluded for the high level analysis, the model coefficients of the second level are combined within the framework of a mixed model. The variance of the mixed model is estimated using the Newton-Raphson (NR) type Levenberg-Marquardt algorithm. Three sets of real data are adopted to compare conventional methods which do not include random effects in the analysis with a mixed model for second level effective connectivity analysis. The results show that the conventional method is significantly different from the mixed model when greater model variability exists, suggesting there is a strong random effect, and the mixed model should be employed for the second level effective connectivity analysis. PMID:23093379
Analysis of changes in extreme temperature and precipitation using quantile regression
NASA Astrophysics Data System (ADS)
Lee, Kyoungmi; Baek, Hee-Jeong; Cho, ChunHo
2013-04-01
One of the important research areas in climatology is to identify whether the long-period tendencies of change in meteorological variables appear. In the past, the analysis has been limited by the estimation of long-period trends for annual or seasonal average values on meteorological variables. However, recently, the interest in the trends regarding the whole range of values for meteorological variables, including the extreme ones, has arisen. The quantile regression is the regression analysis method for estimating the regression slopes for the values of any quantile from 0 to 1 of dependent variable distributions. This method provides a more complete picture for the conditional distribution of the dependent variable given the independent variable when both lower and upper or all quantiles are of interest. This study examines the changes in regional extreme temperature and precipitation in South Korea using quantile regression, which is applied to analyze trends, not only in the mean but in all parts of the data distribution. The results show considerable diversity across space and quantile level in South Korea. For daily temperatures in winter, the slopes in lower quantiles generally have a more distinct increase trend compared to the upper quantiles. The time series for daily minimum temperature during the winter season only shows a significant increasing trend in the lower quantile. In case of summer, most sites show an increase trend in both lower and upper quantiles for daily minimum temperature, while there are a number of sites with a decrease trend for daily maximum temperature. It was also found that the increase trend of extreme low temperature in large urban areas (0.80°C/decade) is much larger than in rural areas (0.54°C/decade) due to the effects of urbanization. Extreme climate events can have greater negative impacts on society, economy and natural environments than changes in climate means. The fast growth of population and industrialization in
NASA Astrophysics Data System (ADS)
Păniţă, Ovidiu
2015-09-01
In the years 2012-2014, an assortment of 25 isogenic lines of wheat (Triticum aestivum ssp. vulgare) was tested at Banu-Maracine DRS. The characters analyzed were the number of seeds/spike, seed weight/spike (g), number of spikes/m2, weight of a thousand seeds (WTS) (g) and number of emerged plants/m2. Based on the recorded data and their statistical processing, a number of links between these characters were identified. Regression models were also identified between some of the studied characters. Based on component analysis, the number of seeds/spike and seed weight/spike are components that account for more than 88% of the variance, with a total of seven genotypes having positive scores for both factors.
Modelling and analysis of turbulent datasets using Auto Regressive Moving Average processes
Faranda, Davide; Dubrulle, Bérengère; Daviaud, François; Pons, Flavio Maria Emanuele; Saint-Michel, Brice; Herbert, Éric; Cortet, Pierre-Philippe
2014-10-15
We introduce a novel way to extract information from turbulent datasets by applying an Auto Regressive Moving Average (ARMA) statistical analysis. Such analysis goes well beyond the analysis of the mean flow and of the fluctuations and links the behavior of the recorded time series to a discrete version of a stochastic differential equation which is able to describe the correlation structure in the dataset. We introduce a new index Υ that measures the difference between the resulting analysis and the Obukhov model of turbulence, the simplest stochastic model reproducing both Richardson law and the Kolmogorov spectrum. We test the method on datasets measured in a von Kármán swirling flow experiment. We found that the ARMA analysis is well correlated with spatial structures of the flow, and can discriminate between two different flows with comparable mean velocities, obtained by changing the forcing. Moreover, we show that the Υ is highest in regions where shear layer vortices are present, thereby establishing a link between deviations from the Kolmogorov model and coherent structures. These deviations are consistent with the ones observed by computing the Hurst exponents for the same time series. We show that some salient features of the analysis are preserved when considering global instead of local observables. Finally, we analyze flow configurations with multistability features where the ARMA technique is efficient in discriminating different stability branches of the system.
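As a rough illustration of the kind of process an ARMA analysis fits, the sketch below simulates an ARMA(1,1) series and compares its sample lag-1 autocorrelation with the textbook value. The coefficients, series length, and function names are assumptions chosen for illustration; nothing here reproduces the authors' data, their fitting procedure, or the index Υ.

```python
import random


def simulate_arma11(phi, theta, n, seed=0):
    """Simulate x_t = phi*x_{t-1} + e_t + theta*e_{t-1} with unit-variance Gaussian noise."""
    rng = random.Random(seed)
    x_prev, e_prev = 0.0, 0.0
    series = []
    for _ in range(n):
        e = rng.gauss(0.0, 1.0)
        x = phi * x_prev + e + theta * e_prev
        series.append(x)
        x_prev, e_prev = x, e
    return series


def autocorr(series, lag):
    """Sample autocorrelation of the series at the given lag."""
    n = len(series)
    mean = sum(series) / n
    var = sum((v - mean) ** 2 for v in series)
    cov = sum((series[i] - mean) * (series[i + lag] - mean) for i in range(n - lag))
    return cov / var


def theoretical_rho1(phi, theta):
    """Lag-1 autocorrelation of a stationary ARMA(1,1) process."""
    return (1 + phi * theta) * (phi + theta) / (1 + 2 * phi * theta + theta ** 2)


# Illustrative coefficients only (not taken from the paper).
series = simulate_arma11(phi=0.6, theta=0.3, n=20000, seed=1)
print(autocorr(series, 1), theoretical_rho1(0.6, 0.3))
```

The autoregressive term carries the memory of the signal and the moving-average term the memory of the noise, which is why the correlation structure, and not only the mean and variance, distinguishes one series from another.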
Modelling and analysis of turbulent datasets using Auto Regressive Moving Average processes
NASA Astrophysics Data System (ADS)
Faranda, Davide; Pons, Flavio Maria Emanuele; Dubrulle, Bérengère; Daviaud, François; Saint-Michel, Brice; Herbert, Éric; Cortet, Pierre-Philippe
2014-10-01
We introduce a novel way to extract information from turbulent datasets by applying an Auto Regressive Moving Average (ARMA) statistical analysis. Such analysis goes well beyond the analysis of the mean flow and of the fluctuations and links the behavior of the recorded time series to a discrete version of a stochastic differential equation which is able to describe the correlation structure in the dataset. We introduce a new index Υ that measures the difference between the resulting analysis and the Obukhov model of turbulence, the simplest stochastic model reproducing both Richardson law and the Kolmogorov spectrum. We test the method on datasets measured in a von Kármán swirling flow experiment. We found that the ARMA analysis is well correlated with spatial structures of the flow, and can discriminate between two different flows with comparable mean velocities, obtained by changing the forcing. Moreover, we show that the Υ is highest in regions where shear layer vortices are present, thereby establishing a link between deviations from the Kolmogorov model and coherent structures. These deviations are consistent with the ones observed by computing the Hurst exponents for the same time series. We show that some salient features of the analysis are preserved when considering global instead of local observables. Finally, we analyze flow configurations with multistability features where the ARMA technique is efficient in discriminating different stability branches of the system.
The Impact of Outliers on Net-Benefit Regression Model in Cost-Effectiveness Analysis.
Wen, Yu-Wen; Tsai, Yi-Wen; Wu, David Bin-Chia; Chen, Pei-Fen
2013-01-01
Ordinary least squares (OLS) regression has been widely used to analyze patient-level data in cost-effectiveness analysis (CEA). However, the estimates, inference and decision making in an economic evaluation based on OLS estimation may be biased by the presence of outliers. Robust estimation, by contrast, can remain unaffected and provide results that are resistant to outliers. The objective of this study is to explore the impact of outliers on net-benefit regression (NBR) in CEA using OLS and to propose a potential solution by using robust estimations, i.e. Huber M-estimation, Hampel M-estimation, Tukey's bisquare M-estimation, MM-estimation and least trimmed squares estimation. Simulations under different outlier-generating scenarios and an empirical example were used to obtain the regression estimates of NBR by OLS and five robust estimations. Empirical size and empirical power of both OLS and robust estimations were then compared in the context of hypothesis testing. Simulations showed that the five robust approaches compared with OLS estimation led to lower empirical sizes and achieved higher empirical powers in testing cost-effectiveness. Using a real example of antiplatelet therapy, the estimated incremental net-benefit by OLS estimation was lower than those by robust approaches because of outliers in cost data. Robust estimations demonstrated higher probability of cost-effectiveness compared to OLS estimation. The presence of outliers can bias the results of NBR and its interpretations. It is recommended that robust estimation be used in NBR as an appropriate method to avoid such biased decision making. PMID:23840378
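To make the contrast between OLS and a robust fit concrete, here is a minimal sketch of Huber M-estimation for a single-predictor line, computed by iteratively reweighted least squares. The data are synthetic, the tuning constant 1.345 is the conventional default, and the helper names are hypothetical; this is not the net-benefit regression or the simulation design used in the study.

```python
from statistics import median


def wls_line(x, y, w):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return my - b * mx, b


def huber_line(x, y, k=1.345, n_iter=50):
    """Huber M-estimate of a line via iteratively reweighted least squares."""
    w = [1.0] * len(x)
    for _ in range(n_iter):
        a, b = wls_line(x, y, w)
        r = [yi - (a + b * xi) for xi, yi in zip(x, y)]
        med = median(r)
        # Robust scale estimate: median absolute deviation, rescaled for normal errors.
        s = median(abs(ri - med) for ri in r) / 0.6745
        if s == 0:
            break
        # Huber weights: full weight for small residuals, down-weighted beyond k*s.
        w = [1.0 if abs(ri) <= k * s else k * s / abs(ri) for ri in r]
    return wls_line(x, y, w)


# Synthetic data (illustrative only): true slope 0.5, one gross outlier at the end.
x = list(range(20))
y = [2 + 0.5 * xi + (0.1 if xi % 2 == 0 else -0.1) for xi in x]
y[-1] += 50

print("OLS   (a, b):", wls_line(x, y, [1.0] * len(x)))
print("Huber (a, b):", huber_line(x, y))
```

The single outlier pulls the OLS slope well away from 0.5, while the Huber fit stays close to it because the outlying point ends up with a very small weight.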
Huang, Dong; Cabral, Ricardo; De la Torre, Fernando
2016-02-01
Discriminative methods (e.g., kernel regression, SVM) have been extensively used to solve problems such as object recognition, image alignment and pose estimation from images. These methods typically map image features ( X) to continuous (e.g., pose) or discrete (e.g., object category) values. A major drawback of existing discriminative methods is that samples are directly projected onto a subspace and hence fail to account for outliers common in realistic training sets due to occlusion, specular reflections or noise. It is important to notice that existing discriminative approaches assume the input variables X to be noise free. Thus, discriminative methods experience significant performance degradation when gross outliers are present. Despite its obvious importance, the problem of robust discriminative learning has been relatively unexplored in computer vision. This paper develops the theory of robust regression (RR) and presents an effective convex approach that uses recent advances on rank minimization. The framework applies to a variety of problems in computer vision including robust linear discriminant analysis, regression with missing data, and multi-label classification. Several synthetic and real examples with applications to head pose estimation from images, image and video classification and facial attribute classification with missing data are used to illustrate the benefits of RR. PMID:26761740
Rubio, Francisco J; Genton, Marc G
2016-06-30
We study Bayesian linear regression models with skew-symmetric scale mixtures of normal error distributions. These kinds of models can be used to capture departures from the usual assumption of normality of the errors in terms of heavy tails and asymmetry. We propose a general noninformative prior structure for these regression models and show that the corresponding posterior distribution is proper under mild conditions. We extend these propriety results to cases where the response variables are censored. The latter scenario is of interest in the context of accelerated failure time models, which are relevant in survival analysis. We present a simulation study that demonstrates good frequentist properties of the posterior credible intervals associated with the proposed priors. This study also sheds some light on the trade-off between increased model flexibility and the risk of over-fitting. We illustrate the performance of the proposed models with real data. Although we focus on models with univariate response variables, we also present some extensions to the multivariate case in the Supporting Information. Copyright © 2016 John Wiley & Sons, Ltd. PMID:26856806
NASA Astrophysics Data System (ADS)
Simms, Laura E.; Engebretson, Mark J.; Pilipenko, Viacheslav; Reeves, Geoffrey D.; Clilverd, Mark
2016-04-01
The daily maximum relativistic electron flux at geostationary orbit can be predicted well with a set of daily averaged predictor variables including previous day's flux, seed electron flux, solar wind velocity and number density, AE index, IMF Bz, Dst, and ULF and VLF wave power. As predictor variables are intercorrelated, we used multiple regression analyses to determine which are the most predictive of flux when other variables are controlled. Empirical models produced from regressions of flux on measured predictors from 1 day previous were reasonably effective at predicting novel observations. Adding previous flux to the parameter set improves the prediction of the peak of the increases but delays its anticipation of an event. Previous day's solar wind number density and velocity, AE index, and ULF wave activity are the most significant explanatory variables; however, the AE index, measuring substorm processes, shows a negative correlation with flux when other parameters are controlled. This may be due to the triggering of electromagnetic ion cyclotron waves by substorms that cause electron precipitation. VLF waves show lower, but significant, influence. The combined effect of ULF and VLF waves shows a synergistic interaction, where each increases the influence of the other on flux enhancement. Correlations between observations and predictions for this 1 day lag model ranged from 0.71 to 0.89 (average: 0.78). A path analysis of correlations between predictors suggests that solar wind and IMF parameters affect flux through intermediate processes such as ring current (Dst), AE, and wave activity.
A cautionary note on the use of EESC-based regression analysis for ozone trend studies
NASA Astrophysics Data System (ADS)
Kuttippurath, J.; Bodeker, G. E.; Roscoe, H. K.; Nair, P. J.
2015-01-01
Equivalent effective stratospheric chlorine (EESC) construct of ozone regression models attributes ozone changes to EESC changes using a single value of the sensitivity of ozone to EESC over the whole period. Using space-based total column ozone (TCO) measurements, and a synthetic TCO time series constructed such that EESC does not fall below its late 1990s maximum, we demonstrate that the EESC-based estimates of ozone changes in the polar regions (70-90°) after 2000 may, falsely, suggest an EESC-driven increase in ozone over this period. An EESC-based regression of our synthetic "failed Montreal Protocol with constant EESC" time series suggests a positive TCO trend that is statistically significantly different from zero over 2001-2012 when, in fact, no recovery has taken place. Our analysis demonstrates that caution needs to be exercised when using explanatory variables, with a single fit coefficient, fitted to the entire data record, to interpret changes in only part of the record.
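The pitfall described above can be reproduced with a toy calculation. In the sketch below, a synthetic ozone series follows a proxy that rises and then stays constant, yet it is regressed on a proxy that declines after 2000 using one coefficient for the whole record. All numbers (years, proxy shapes, the 300 and 50 scaling) are invented for illustration and are not the TCO or EESC data used by the authors; the sketch shows only the sign of the spurious change, not its statistical significance.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = a + b*x; returns (a, b)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
    return my - b * mx, b


years = list(range(1980, 2013))
ramp = [(yr - 1980) / 20 for yr in years]

# Hypothetical EESC-like proxy: rises to 1.0 in 2000, then declines to 0.8 by 2012.
eesc = [r if yr <= 2000 else 1.0 - 0.2 * (yr - 2000) / 12 for yr, r in zip(years, ramp)]

# "Failed protocol" proxy: same rise, but it never falls below its 2000 maximum.
eesc_const = [min(r, 1.0) for r in ramp]

# Synthetic ozone driven by the constant proxy, so there is no recovery after 2000.
ozone = [300.0 - 50.0 * e for e in eesc_const]

# A single sensitivity coefficient fitted to the entire record.
a, b = fit_line(eesc, ozone)

i0, i1 = years.index(2001), years.index(2012)
attributed_change = b * (eesc[i1] - eesc[i0])  # change the regression assigns to EESC
true_change = ozone[i1] - ozone[i0]            # actual change in the synthetic series

print(b, attributed_change, true_change)
```

Because the coefficient is dominated by the long decline before 2000, multiplying it by the post-2000 drop in the proxy yields a positive "recovery" even though the synthetic ozone is flat over that period.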
NASA Astrophysics Data System (ADS)
Nordemann, D. J. R.; Rigozo, N. R.; de Souza Echer, M. P.; Echer, E.
2008-11-01
We present here an implementation of a least squares iterative regression method applied to the sine functions embedded in the principal components extracted from geophysical time series. This method seems to represent a useful improvement for the non-stationary time series periodicity quantitative analysis. The principal components determination followed by the least squares iterative regression method was implemented in an algorithm written in the Scilab (2006) language. The main result of the method is to obtain the set of sine functions embedded in the series analyzed in decreasing order of significance, from the most important ones, likely to represent the physical processes involved in the generation of the series, to the less important ones that represent noise components. Taking into account the need of a deeper knowledge of the Sun's past history and its implication to global climate change, the method was applied to the Sunspot Number series (1750-2004). With the threshold and parameter values used here, the application of the method leads to a total of 441 explicit sine functions, among which 65 were considered as being significant and were used for a reconstruction that gave a normalized mean squared error of 0.146.
Improved Regression Analysis of Temperature-Dependent Strain-Gage Balance Calibration Data
NASA Technical Reports Server (NTRS)
Ulbrich, N.
2015-01-01
An improved approach is discussed that may be used to directly include first and second order temperature effects in the load prediction algorithm of a wind tunnel strain-gage balance. The improved approach was designed for the Iterative Method that fits strain-gage outputs as a function of calibration loads and uses a load iteration scheme during the wind tunnel test to predict loads from measured gage outputs. The improved approach assumes that the strain-gage balance is at a constant uniform temperature when it is calibrated and used. First, the method introduces a new independent variable for the regression analysis of the balance calibration data. The new variable is designed as the difference between the uniform temperature of the balance and a global reference temperature. This reference temperature should be the primary calibration temperature of the balance so that, if needed, a tare load iteration can be performed. Then, two temperature-dependent terms are included in the regression models of the gage outputs. They are the temperature difference itself and the square of the temperature difference. Simulated temperature-dependent data obtained from Triumph Aerospace's 2013 calibration of NASA's ARC-30K five component semi-span balance is used to illustrate the application of the improved approach.
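The structure of the regression model described above might be written as follows. This is a sketch under stated assumptions: the symbols R (a gage output), F_i (calibration loads), T and T_ref (balance and reference temperatures) and the coefficient names are introduced here for illustration, and the load-dependent part is shown only as a generic linear sum, since the abstract does not list the load terms actually used.

```latex
\Delta T = T - T_{\mathrm{ref}}

R = c_0 + \sum_{i} c_i F_i
      + \underbrace{c_T \, \Delta T + c_{TT} \, (\Delta T)^2}_{\text{two added temperature-dependent terms}}
```

When the balance is at the reference temperature, \Delta T = 0 and both added terms vanish, so the model reduces to its load-only form at the primary calibration temperature.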
NASA Astrophysics Data System (ADS)
Liu, Pao-Wen Grace; Tsai, Jiun-Horng; Lai, Hsin-Chih; Tsai, Der-Min; Li, Li-Wei
2013-11-01
The sensitivity of air quality to meteorological variation has attracted attention since climate change became a world issue. The goal of this study is to investigate the sensitivity of ground-level ozone concentrations to temperature variation in Taiwan. Several multivariate regression models were built based on historical data of ozone and meteorological variables at three cities located in northern, mid-western, and southern Taiwan. Results of descriptive statistics indicate that pollution severity, from the highest to the lowest, follows the order of the southern (Pingtung), mid-western (Fengyuan), and northern (Hsichih) sites. Multiple regression models containing a principal component trigger variable effectively simulated the historical ozone exceedance during 2004-2009. Inclusion of the PC trigger improved R2, from the lowest value of 0.38 to the highest of 0.58. High probability of detection and critical success index (mostly between 85% and 90%) and low false alarm rates (0-2.6%) were achieved for predicting the high ozone days (≥100 ppb). The results of sensitivity analysis indicated that (1) the ozone sensitivity was positively correlated with the temperature variation, (2) the sensitivity levels were opposite to that of the ozone problem severity, (3) the sensitivity was mostly apparent in ozone seasons, and (4) the sensitivity strongly depended on the seasonality in the urban cities Hsichih and Fengyuan, but weakly depended on seasonality in the rural city Pingtung.
Combining regression analysis and air quality modelling to predict benzene concentration levels
NASA Astrophysics Data System (ADS)
Vlachokostas, Ch.; Achillas, Ch.; Chourdakis, E.; Moussiopoulos, N.
2011-05-01
State of the art epidemiological research has found consistent associations between traffic-related air pollution and various outcomes, such as respiratory symptoms and premature mortality. However, many urban areas are characterised by the absence of the necessary monitoring infrastructure, especially for benzene (C6H6), which is a known human carcinogen. The use of environmental statistics combined with air quality modelling can be of vital importance in order to assess air quality levels of traffic-related pollutants in an urban area in the case where there are no available measurements. This paper aims at developing and presenting a reliable approach, in order to forecast C6H6 levels in urban environments, demonstrated for Thessaloniki, Greece. Multiple stepwise regression analysis is used and a strong statistical relationship is detected between C6H6 and CO. The adopted regression model is validated in order to depict its applicability and representativeness. The presented results demonstrate that the adopted approach is capable of capturing C6H6 concentration trends and should be considered as complementary to air quality monitoring.
Meta-analysis and meta-regression of transcriptomic responses to water stress in Arabidopsis.
Rest, Joshua S; Wilkins, Olivia; Yuan, Wei; Purugganan, Michael D; Gurevitch, Jessica
2016-02-01
The large amounts of transcriptome data available for Arabidopsis thaliana make a compelling case for the need to generalize results across studies and extract the most robust and meaningful information possible from them. The results of various studies seeking to identify water stress-responsive genes only partially overlap. The aim of this work was to combine transcriptomic studies in a systematic way that identifies commonalities in response, taking into account variation among studies due to batch effects as well as sampling variation, while also identifying the effect of study-specific variables, such as the method of applying water stress, and the part of the plant the mRNA was extracted from. We used meta-analysis, the quantitative synthesis of independent research results, to summarize expression responses to water stress across studies, and meta-regression to model the contribution of covariates that may affect gene expression. We found that some genes with small but consistent differential responses become evident only when results are synthesized across experiments, and are missed in individual studies. We also identified genes with expression responses that are attributable to use of different plant parts and alternative methods for inducing water stress. Our results indicate that meta-analysis and meta-regression provide a powerful approach for identifying a robust gene set that is less sensitive to idiosyncratic results and for quantifying study characteristics that result in contrasting gene expression responses across studies. Combining meta-analysis with individual analyses may contribute to a richer understanding of the biology of water stress responses, and may prove valuable in other gene expression studies. PMID:26756945
Inferring genetic networks from DNA microarray data by multiple regression analysis.
Kato, M; Tsunoda, T; Takagi, T
2000-01-01
Inferring gene regulatory networks by differential equations from the time series data of a DNA microarray is one of the most challenging tasks in the post-genomic era. However, there have been no studies actually inferring gene regulatory networks by differential equations from genome-level data. The reason for this is that the number of parameters in the equations exceeds the number of measured time points. We here succeeded in executing the inference, not by directly determining parameters but by applying multiple regression analysis to our equations. We derived our differential equations and steady state equations from the rate equations of transcriptional reactions in an organism. Verification with a number of genes related to respiration indicated the validity and effectiveness of our method. Moreover, the steady state equations were more appropriate than the differential equations for the microarray data used. PMID:11700593
On-line contextual influences during reading normal text: a multiple-regression analysis.
Pynte, Joel; New, Boris; Kennedy, Alan
2008-09-01
On-line contextual influences during reading were examined in a series of multiple-regression analyses conducted on a large-scale corpus of eye-movement data, using Latent Semantic Analysis (LSA) to assess the degree of contextual constraints exerted on a given target word by the immediately prior word and by the prior sentence fragment. A decrease in inspection time was observed as contextual constraints increased. Word-level constraints exerted their influence both forward (on both single-fixation and gaze durations) and backward (on gaze duration only). An independent sentence-level effect was only visible in the forward direction, and only for gaze duration. Gaze duration was also sensitive to the depth of embedding of the target word in the syntactic structure. We conclude that both low-level and high-level contextual constraints can translate in the eye-movement record. PMID:18701125
Tam, Vivian W Y; Wang, K; Tam, C M
2008-04-01
Recycled demolished concrete (DC) as recycled aggregate (RA) and recycled aggregate concrete (RAC) is generally suitable for most construction applications. Low-grade applications, including sub-base and roadwork, have been implemented in many countries; however, higher-grade activities are rarely considered. This paper examines relationships among DC characteristics, properties of their RA and strength of their RAC using regression analysis. Ten samples collected from demolition sites are examined. The results show strong correlation among the DC samples, properties of RA and RAC. It should be highlighted that inferior quality of DC will lower the quality of RA and thus their RAC. Prediction of RAC strength is also formulated from the DC characteristics and the RA properties. From that, the RAC performance from DC and RA can be estimated. In addition, RAC design requirements can also be developed at the initial stage of concrete demolition. Recommendations are also given to improve the future concreting practice. PMID:17764837
Sparse regression analysis of task-relevant information distribution in the brain
NASA Astrophysics Data System (ADS)
Rish, Irina; Cecchi, Guillermo A.; Heuton, Kyle; Baliki, Marwan N.; Apkarian, A. Vania
2012-02-01
One of the key topics in fMRI analysis is the discovery of task-related brain areas. We focus on predictive accuracy as a better relevance measure than traditional univariate voxel activations that miss important multivariate voxel interactions. We use sparse regression (more specifically, the Elastic Net) to learn predictive models simultaneously with selection of predictive voxel subsets, and to explore the transition from task-relevant to task-irrelevant areas. Exploring the space of sparse solutions reveals a much wider spread of task-relevant information in the brain than is typically suggested by univariate correlations. This happens for several tasks we considered, and is most noticeable in the case of complex tasks such as pain rating; however, for certain simpler tasks, a clear separation between a small subset of relevant voxels and the rest of the brain is observed even with a multivariate approach to measuring relevance.
Imai, Chisato; Hashizume, Masahiro
2015-01-01
Background: Time series analysis is suitable for investigations of relatively direct and short-term effects of exposures on outcomes. In environmental epidemiology studies, this method has been one of the standard approaches to assess impacts of environmental factors on acute non-infectious diseases (e.g. cardiovascular deaths), with conventionally generalized linear or additive models (GLM and GAM). However, the same analysis practices are often observed with infectious diseases despite the substantial differences from non-infectious diseases that may result in analytical challenges. Methods: Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines, a systematic review was conducted to elucidate important issues in assessing the associations between environmental factors and infectious diseases using time series analysis with GLM and GAM. Published studies on the associations between weather factors and malaria, cholera, dengue, and influenza were targeted. Findings: Our review raised issues regarding the estimation of susceptible population and exposure lag times, the adequacy of seasonal adjustments, the presence of strong autocorrelations, and the lack of a smaller observation time unit of outcomes (i.e. daily data). These concerns may be attributable to features specific to infectious diseases, such as transmission among individuals and complicated causal mechanisms. Conclusion: The consequence of not taking adequate measures to address these issues is distortion of the appropriate risk quantifications of exposure factors. Future studies should pay careful attention to details and examine alternative models or methods that improve studies using time series regression analysis for environmental determinants of infectious diseases. PMID:25859149
Japanese elderly persons walk faster than non-Asian elderly persons: a meta-regression analysis
Ando, Masataka; Kamide, Naoto
2015-01-01
[Purpose] The purpose of this study was to clarify ethnic differences in walking speed by comparing walking speed in both Japanese and non-Asian elderly individuals and to investigate the necessity of consideration of ethnic differences in walking speed. [Subjects and Methods] Articles that reported comfortable walking speeds for community-dwelling elderly individuals were identified from electronic databases. Articles that involved community-dwelling individuals who were 60 years old or older and well functioning were included in the study. Articles that involved Asians were excluded. Weighted means for 5-m walking times were calculated as walking speeds from the Japanese and non-Asian sample data. The effects of age, gender, and ethnicity on 5-m walking times were then investigated using meta-regression analysis. [Results] Twenty studies (34 groups) were included for Japanese, and 16 studies (28 groups) were included for non-Asians. The weighted mean 5-m walking time was estimated to be 4.15 sec (95% confidence interval [CI]: 3.87–4.44) for Japanese and 4.24 sec (95% CI: 4.09–4.40) for non-Asians. Furthermore, using meta-regression analysis adjusted for age and gender, the 5-m walking time was 0.40 sec faster (95% CI: 0.03–0.77) for Japanese than for non-Asian elderly individuals. [Conclusion] Walking speed appeared faster for Japanese community-dwelling elderly individuals than for non-Asian elderly individuals. PMID:26696722
The Analysis of Internet Addiction Scale Using Multivariate Adaptive Regression Splines
Kayri, M
2010-01-01
Background: Determining the real effects on internet dependency with an unbiased and robust statistical method is crucial. MARS is a new non-parametric method used in the literature for parameter estimation in cause and effect based research. MARS can both obtain legible model curves and make unbiased parametric predictions. Methods: In order to examine the performance of MARS, MARS findings are compared to Classification and Regression Tree (C&RT) findings, which are considered in the literature to be efficient in revealing correlations between variables. The data set for the study is taken from “The Internet Addiction Scale” (IAS), which attempts to reveal addiction levels of individuals. The population of the study consists of 754 secondary school students (301 female, 443 male students with 10 missing data). The MARS 2.0 trial version was used for the MARS analysis and the C&RT analysis was done in SPSS. Results: MARS obtained six basis functions for the model. As a common result of these six functions, the regression equation of the model was found. For the predicted variable, MARS showed that the predictors of average daily Internet-use time, the purpose of Internet use, grade of students and occupations of mothers had a significant effect (P< 0.05). In this comparative study, MARS obtained different findings from C&RT in dependency level prediction. Conclusion: This study observed that MARS revealed the extent to which a variable considered significant changes the character of the model. PMID:23113038
Bode, Manuela; Woellhaf, Michael W.; Bohnert, Maria; van der Laan, Martin; Sommer, Frederik; Jung, Martin; Zimmermann, Richard; Schroda, Michael; Herrmann, Johannes M.
2015-01-01
Members of the twin Cx9C protein family constitute the largest group of proteins in the intermembrane space (IMS) of mitochondria. Despite their conserved nature and their essential role in the biogenesis of the respiratory chain, the molecular function of twin Cx9C proteins is largely unknown. We performed a SILAC-based quantitative proteomic analysis to identify interaction partners of the conserved twin Cx9C protein Cox19. We found that Cox19 interacts in a dynamic manner with Cox11, a copper transfer protein that facilitates metalation of the Cu(B) center of subunit 1 of cytochrome c oxidase. The interaction with Cox11 is critical for the stable accumulation of Cox19 in mitochondria. Cox19 consists of a helical hairpin structure that forms a hydrophobic surface characterized by two highly conserved tyrosine-leucine dipeptides. These residues are essential for Cox19 function and its specific binding to a cysteine-containing sequence in Cox11. Our observations suggest that an oxidative modification of this cysteine residue of Cox11 stimulates Cox19 binding, pointing to a redox-regulated interplay of Cox19 and Cox11 that is critical for copper transfer in the IMS and thus for biogenesis of cytochrome c oxidase. PMID:25926683
Regression analysis of "current status" life tables on duration of breastfeeding in Sri Lanka.
Smith, D P
1985-01-01
Provided that women report the dates of their children's births with reasonable accuracy, it is possible to derive good estimates of the duration of breastfeeding from women's breastfeeding status at the time of the interview. This paper illustrates the application of conventional regression techniques to the analysis of breastfeeding rates derived in this manner. Construction of current status rates is explained and a comparison between open interval, closed interval, and current status breastfeeding life tables is presented, indicating the extent of bias to which tables of the former types are open. Birth-weighted rates are used for WFS data from Sri Lanka; the variables entered into the regression equation include parity, educational level, residence, work experience since marriage and use of contraception since the birth. Contraception is not found to influence net breastfeeding rates in the 1st interval (1-16 months), although it is about as prevalent as in later intervals. The positive coefficients at intervals beyond the 1st also imply that contraceptive use is not a substitute for lactation in Sri Lanka or not a predominant one. Lifetime urban residence is associated with short durations of breastfeeding, and being an urban migrant is associated with intermediate durations relative to those of rural women. The effects of residence on breastfeeding are especially pronounced in the 1st interval. By parity as by contraception, differences in breastfeeding rates are not significant at short durations but become so with time as lower parity women reach pregnancy. Patterns by age are similar, but less sharp. Middle school attendance and work at home are both strongly associated with lactation behavior, the former negatively and the latter to about an equal degree positively. Working outside the home seems not to influence breastfeeding to any great extent. In the multiple attribute regressions, middle schooling depresses breastfeeding durations about as
NASA Astrophysics Data System (ADS)
Díaz, S.; Deferrari, G.; Martinioni, D.; Oberto, A.
2000-05-01
Factors affecting UV radiation at the earth's surface include the solar zenith angle, earth-sun distance, clouds, aerosols, altitude, ozone and the ground's albedo. The variation of some factors, such as solar zenith angle and earth-sun distance, is well established. Total column ozone and UV radiation are inversely related, but the presence of clouds may affect the resulting UV in such a way that a depletion in the total column ozone may not always lead to an increase in the radiation at the earth's surface. The aim of this paper is to determine the contribution to the variation of the biologically effective irradiance by geometric factors, clouds and ozone, jointly and separately, in Ushuaia (54°49'S, 68°19'W, sea level), and the seasonal variation of this relationship, given the magnitude and seasonal distribution of the ozone depletion and the frequent presence of high cloud cover in this site. For this purpose, multivariate and simple regression analyses of daily and monthly integrated irradiances weighted by the DNA damage action spectrum as a function of total column ozone and the integrated irradiances in the band 337-342 nm (as a proxy for cloud cover and geometric factors) have been performed. For the analysed period (September 1989-December 1996) more than 97% of the variation of the DNA damage weighted daily integrated irradiances is described by changes in ozone, clouds and geometric factors. Simple regression analysis for daily integrated irradiances, grouped by month, shows that most of this variation is explained by clouds and geometric factors, except in spring, when strong ozone depletion occurs intermittently over this area. When monthly trends are removed, similar results are observed, except for late winter.
NASA Astrophysics Data System (ADS)
Jolly, William H.
1992-05-01
Relationships defining the ballistic limit of Space Station Freedom's (SSF) dual wall protection systems have been determined. These functions were regressed from empirical data found in Marshall Space Flight Center's (MSFC) Hypervelocity Impact Testing Summary (HITS) for the velocity range between three and seven kilometers per second. A stepwise linear least squares regression was used to determine the coefficients of several expressions that define a ballistic limit surface. Using statistical significance indicators and graphical comparisons to other limit curves, a final set of expressions is recommended for potential use in Probability of No Critical Flaw (PNCF) calculations for Space Station. The three equations listed below represent the mean curves for normal, 45 degree, and 65 degree obliquity ballistic limits, respectively, for a dual wall protection system consisting of a thin 6061-T6 aluminum bumper spaced 4.0 inches from a 0.125 inch thick 2219-T87 rear wall with multiple layer thermal insulation installed between the two walls. Normal obliquity is $d_c = 1.0514\, v^{0.2983}\, t_1^{0.5228}$. Forty-five degree obliquity is $d_c = 0.8591\, v^{0.0428}\, t_1^{0.2063}$. Sixty-five degree obliquity is $d_c = 0.2824\, v^{0.1986}\, t_1^{-0.3874}$. Plots of these curves are provided. A sensitivity study on the effects of using these new equations in the probability of no critical flaw analysis indicated a negligible increase in the performance of the dual wall protection system for SSF over the current baseline. The magnitude of the increase was 0.17 percent over 25 years on the MB-7 configuration run with the Bumper II program code.
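The three fitted curves in this abstract share the power-law form d_c = a · v^b · t_1^c, so they can be evaluated with one small helper. What follows is a minimal sketch, not code from the report: the function name, dictionary, and key labels are hypothetical, and because the abstract does not define d_c, v, and t_1 or give their units, the sketch assumes v is the impact velocity in km/s (the range over which the fit was regressed) and treats t_1 as a positive number in whatever unit the original fit used.

```python
# Coefficients (a, b, c) of d_c = a * v**b * t1**c, copied from the abstract.
# The keys are illustrative labels, not names used in the report.
BALLISTIC_LIMIT_FITS = {
    "normal": (1.0514, 0.2983, 0.5228),
    "45deg": (0.8591, 0.0428, 0.2063),
    "65deg": (0.2824, 0.1986, -0.3874),
}


def ballistic_limit(v, t1, obliquity="normal"):
    """Evaluate the mean ballistic-limit curve d_c for one obliquity.

    Assumption: v is in km/s and must lie in the 3-7 km/s range the
    regression covered; t1 must be positive because one exponent is negative.
    """
    if not 3.0 <= v <= 7.0:
        raise ValueError("fit was regressed only for 3 <= v <= 7 km/s")
    if t1 <= 0:
        raise ValueError("t1 must be positive")
    a, b, c = BALLISTIC_LIMIT_FITS[obliquity]
    return a * v**b * t1**c


if __name__ == "__main__":
    for name in BALLISTIC_LIMIT_FITS:
        print(name, ballistic_limit(5.0, 1.0, name))
```

Rejecting velocities outside 3 to 7 km/s is a deliberate choice here: a regressed power law gives no guarantee outside the range of the data it was fitted to.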
Andrianov, B V; Goryacheva, I I; Vlasov, S V; Gorelova, T V; Harutyunova, M V; Harutyunova, K V; Mayilyan, K R; Zakharov, I A
2015-03-01
Black flies (Diptera, Simuliidae) are well known for their medical, environmental, and veterinary importance. The simuliid fauna of Armenia includes 53 species. A number of dominant species are of ecological importance. Complex analysis, which involved morphometric, cytogenetic, and molecular genetic approaches, was conducted to characterize the species status of black flies inhabiting the territory of Armenia. It was shown that the predominant simuliid species, Simulium paraequinum and Simulium kiritshenkoi, belong to a group of species with minimal variability of the cox1 gene. The recently discovered species, Simulium noellery and Simulium [B.] erythrocephalum, which are new to Armenia, can be considered as potentially invasive, which is supported by the low level of variability of the cox1 gene. PMID:26027374
Gibbons, Robert D; Segawa, Eisuke; Karabatsos, George; Amatya, Anup K; Bhaumik, Dulal K; Brown, C Hendricks; Kapur, Kush; Marcus, Sue M; Hur, Kwan; Mann, J John
2008-05-20
A new statistical methodology is developed for the analysis of spontaneous adverse event (AE) reports from post-marketing drug surveillance data. The method involves both empirical Bayes (EB) and fully Bayes estimation of rate multipliers for each drug within a class of drugs, for a particular AE, based on a mixed-effects Poisson regression model. Both parametric and semiparametric models for the random-effect distribution are examined. The method is applied to data from Food and Drug Administration (FDA)'s Adverse Event Reporting System (AERS) on the relationship between antidepressants and suicide. We obtain point estimates and 95 per cent confidence (posterior) intervals for the rate multiplier for each drug (e.g. antidepressants), which can be used to determine whether a particular drug has an increased risk of association with a particular AE (e.g. suicide). Confidence (posterior) intervals that do not include 1.0 provide evidence for either significant protective or harmful associations of the drug and the adverse effect. We also examine EB, parametric Bayes, and semiparametric Bayes estimators of the rate multipliers and associated confidence (posterior) intervals. Results of our analysis of the FDA AERS data revealed that newer antidepressants are associated with lower rates of suicide adverse event reports compared with older antidepressants. We recommend improvements to the existing AERS system, which are likely to improve its public health value as an early warning system. PMID:18404622
Binary Logistic Regression Analysis of Foramen Magnum Dimensions for Sex Determination
Kamath, Venkatesh Gokuldas; Asif, Muhammed; Shetty, Radhakrishna; Avadhani, Ramakrishna
2015-01-01
Purpose. The structural integrity of foramen magnum is usually preserved in fire accidents and explosions due to its resistant nature and secluded anatomical position and this study attempts to determine its sexing potential. Methods. The sagittal and transverse diameters and area of foramen magnum of seventy-two skulls (41 male and 31 female) from south Indian population were measured. The analysis was done using Student's t-test, linear correlation, histogram, Q-Q plot, and Binary Logistic Regression (BLR) to obtain a model for sex determination. The predicted probabilities of BLR were analysed using Receiver Operating Characteristic (ROC) curve. Result. BLR analysis and ROC curve revealed that the predictability of the dimensions in sexing the crania was 69.6% for sagittal diameter, 66.4% for transverse diameter, and 70.3% for area of foramen. Conclusion. The sexual dimorphism of foramen magnum dimensions is established. However, due to considerable overlapping of male and female values, it is unwise to singularly rely on the foramen measurements. However, considering the high sex predictability percentage of its dimensions in the present study and the studies preceding it, the foramen measurements can be used to supplement other sexing evidence available so as to precisely ascertain the sex of the skeleton. PMID:26346917
NASA Astrophysics Data System (ADS)
Rajab, Jasim Mohammed; Jafri, Mohd. Zubir Mat; Lim, Hwee San; Abdullah, Khiruddin
2012-10-01
This study encompasses air surface temperature (AST) modeling in the lower atmosphere. Data on four atmospheric pollutant gases (CO, O3, CH4, and H2O), retrieved from the National Aeronautics and Space Administration Atmospheric Infrared Sounder (AIRS) for 2003 to 2008, were used to develop a model to predict AST in the Malaysian peninsula using the multiple regression method. For the entire period, the pollutants were highly correlated (R=0.821) with predicted AST. Comparisons among five stations in 2009 showed close agreement between the predicted AST and the observed AST from AIRS, especially in the southwest monsoon (SWM) season, within 1.3 K, and for in situ data, within 1 to 2 K. The validation of predicted AST against AST from AIRS showed high correlation coefficients (R=0.845 to 0.918), indicating the model's efficiency and accuracy. Statistical analysis in terms of β showed that H2O (0.565 to 1.746) tended to contribute significantly to high AST values during the northeast monsoon season. Generally, these results clearly indicate the advantage of using the satellite AIRS data and a correlation analysis study to investigate the impact of atmospheric greenhouse gases on AST over the Malaysian peninsula. A model was developed that is capable of retrieving AST over the Malaysian peninsula in all weather conditions, with total uncertainties ranging between 1 and 2 K.
Risky decision making in Attention-Deficit/Hyperactivity Disorder: A meta-regression analysis.
Dekkers, Tycho J; Popma, Arne; Agelink van Rentergem, Joost A; Bexkens, Anika; Huizenga, Hilde M
2016-04-01
ADHD has been associated with various forms of risky real life decision making, for example risky driving, unsafe sex and substance abuse. However, results from laboratory studies on decision making deficits in ADHD have been inconsistent, probably because of between study differences. We therefore performed a meta-regression analysis in which 37 studies (n ADHD=1175; n Control=1222) were included, containing 52 effect sizes. The overall analysis yielded a small to medium effect size (standardized mean difference=.36, p<.001, 95% CI [.22, .51]), indicating that groups with ADHD showed more risky decision making than control groups. There was a trend for a moderating influence of co-morbid Disruptive Behavior Disorders (DBD): studies including more participants with co-morbid DBD had larger effect sizes. No moderating influence of co-morbid internalizing disorders, age or task explicitness was found. These results indicate that ADHD is related to increased risky decision making in laboratory settings, which tended to be more pronounced if ADHD is accompanied by DBD. We therefore argue that risky decision making should have a more prominent role in research on the neuropsychological and -biological mechanisms of ADHD, which can be useful in ADHD assessment and intervention. PMID:26978323
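As a quick consistency check on the pooled estimate reported in this abstract, the standard error can be recovered from the confidence interval and turned into a z statistic. This is a minimal sketch that assumes a symmetric normal-approximation (Wald) interval, which the abstract does not confirm; the variable names are illustrative.

```python
from math import erfc, sqrt

# Values reported in the abstract.
smd = 0.36                    # pooled standardized mean difference
ci_low, ci_high = 0.22, 0.51  # reported 95% confidence interval

# Assumption: the interval is estimate +/- 1.96 * SE (normal approximation).
Z_95 = 1.959963984540054

se = (ci_high - ci_low) / (2 * Z_95)  # half-width divided by the critical value
z = smd / se                          # Wald z statistic
p_two_sided = erfc(z / sqrt(2))       # two-sided p-value under a standard normal

print(f"SE ~ {se:.3f}, z ~ {z:.2f}, p ~ {p_two_sided:.1e}")
```

Under that assumption the implied p-value falls well below .001, which agrees with the reported p<.001. The interval is not exactly centered on .36, most likely because the published figures are rounded.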
Generalized multilevel function-on-scalar regression and principal component analysis.
Goldsmith, Jeff; Zipunnikov, Vadim; Schrack, Jennifer
2015-06-01
This manuscript considers regression models for generalized, multilevel functional responses: functions are generalized in that they follow an exponential family distribution and multilevel in that they are clustered within groups or subjects. This data structure is increasingly common across scientific domains and is exemplified by our motivating example, in which binary curves indicating physical activity or inactivity are observed for nearly 600 subjects over 5 days. We use a generalized linear model to incorporate scalar covariates into the mean structure, and decompose subject-specific and subject-day-specific deviations using multilevel functional principal components analysis. Thus, functional fixed effects are estimated while accounting for within-function and within-subject correlations, and major directions of variability within and between subjects are identified. Fixed effect coefficient functions and principal component basis functions are estimated using penalized splines; model parameters are estimated in a Bayesian framework using Stan, a programming language that implements a Hamiltonian Monte Carlo sampler. Simulations designed to mimic the application have good estimation and inferential properties with reasonable computation times for moderate datasets, in both cross-sectional and multilevel scenarios; code is publicly available. In the application we identify effects of age and BMI on the time-specific change in probability of being active over a 24-hour period; in addition, the principal components analysis identifies the patterns of activity that distinguish subjects and days within subjects. PMID:25620473
PUMA: A Unified Framework for Penalized Multiple Regression Analysis of GWAS Data
Hoffman, Gabriel E.; Logsdon, Benjamin A.; Mezey, Jason G.
2013-01-01
Penalized Multiple Regression (PMR) can be used to discover novel disease associations in GWAS datasets. In practice, proposed PMR methods have not been able to identify well-supported associations in GWAS that are undetectable by standard association tests and thus these methods are not widely applied. Here, we present a combined algorithmic and heuristic framework for PUMA (Penalized Unified Multiple-locus Association) analysis that solves the problems of previously proposed methods including computational speed, poor performance on genome-scale simulated data, and identification of too many associations for real data to be biologically plausible. The framework includes a new minorize-maximization (MM) algorithm for generalized linear models (GLM) combined with heuristic model selection and testing methods for identification of robust associations. The PUMA framework implements the penalized maximum likelihood penalties previously proposed for GWAS analysis (i.e. Lasso, Adaptive Lasso, NEG, MCP), as well as a penalty that has not been previously applied to GWAS (i.e. LOG). Using simulations that closely mirror real GWAS data, we show that our framework has high performance and reliably increases power to detect weak associations, while existing PMR methods can perform worse than single marker testing in overall performance. To demonstrate the empirical value of PUMA, we analyzed GWAS data for type 1 diabetes, Crohn's disease, and rheumatoid arthritis, three autoimmune diseases from the original Wellcome Trust Case Control Consortium. Our analysis replicates known associations for these diseases and we discover novel etiologically relevant susceptibility loci that are invisible to standard single marker tests, including six novel associations implicating genes involved in pancreatic function, insulin pathways and immune-cell function in type 1 diabetes; three novel associations implicating genes in pro- and anti-inflammatory pathways in Crohn's disease; and one
The Arabidopsis COX11 Homolog is Essential for Cytochrome c Oxidase Activity
Radin, Ivan; Mansilla, Natanael; Rödel, Gerhard; Steinebrunner, Iris
2015-01-01
Members of the ubiquitous COX11 (cytochrome c oxidase 11) protein family are involved in copper delivery to the COX complex. In this work, we characterize the Arabidopsis thaliana COX11 homolog (encoded by locus At1g02410). Western blot analyses and confocal microscopy identified Arabidopsis COX11 as an integral mitochondrial protein. Despite sharing high sequence and structural similarities, the Arabidopsis COX11 is not able to functionally replace the Saccharomyces cerevisiae COX11 homolog. Nevertheless, further analysis confirmed the hypothesis that Arabidopsis COX11 is essential for COX activity. Disturbance of COX11 expression through knockdown (KD) or overexpression (OE) affected COX activity. In KD lines, the activity was reduced by ~50%, resulting in root growth inhibition, smaller rosettes and leaf curling. In OE lines, the reduction was less pronounced (~80% of the wild type), still resulting in root growth inhibition. Additionally, pollen germination was impaired in COX11 KD and OE plants. This effect on pollen germination can only partially be attributed to COX deficiency and may indicate a possible auxiliary role of COX11 in ROS metabolism. In agreement with its role in energy production, the COX11 promoter is highly active in cells and tissues with high-energy demand for example shoot and root meristems, or vascular tissues of source and sink organs. In COX11 KD lines, the expression of the plasma-membrane copper transporter COPT2 and of several copper chaperones was altered, indicative of a retrograde signaling pathway pertinent to copper homeostasis. Based on our data, we postulate that COX11 is a mitochondrial chaperone, which plays an important role for plant growth and pollen germination as an essential COX complex assembly factor. PMID:26734017
Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results
ERIC Educational Resources Information Center
Warne, Russell T.
2011-01-01
Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…
Nonlinear regression on Riemannian manifolds and its applications to Neuro-image analysis
Banerjee, Monami; Chakraborty, Rudrasis; Ofori, Edward; Vaillancourt, David
2016-01-01
Regression in its most common form, where independent and dependent variables are in ℝⁿ, is a ubiquitous tool in Sciences and Engineering. Recent advances in Medical Imaging have led to widespread availability of manifold-valued data, leading to problems where the independent variables are manifold-valued and the dependent variables are real-valued, or vice versa. The most common method of regression on a manifold is geodesic regression, which is the counterpart of linear regression in Euclidean space. Often, the relation between the variables is highly complex, and the commonly used geodesic regression can prove to be inaccurate. Thus, it is necessary to resort to a non-linear model for regression. In this work we present a novel kernel-based non-linear regression method for when the mapping to be estimated is either from M → ℝⁿ or ℝⁿ → M, where M is a Riemannian manifold. A key advantage of this approach is that there is no requirement for the manifold-valued data to necessarily inherit an ordering from the data in ℝⁿ. We present several synthetic and real data experiments along with comparisons to the state-of-the-art geodesic regression method in the literature, thus validating the effectiveness of the proposed algorithm. PMID:27110601
ERIC Educational Resources Information Center
Kaplan, David
2005-01-01
This article considers the problem of estimating dynamic linear regression models when the data are generated from finite mixture probability density function where the mixture components are characterized by different dynamic regression model parameters. Specifically, conventional linear models assume that the data are generated by a single…
CATEGORICAL REGRESSION ANALYSIS OF ACUTE INHALATION TOXICITY DATA FOR HYDROGEN SULFIDE
Categorical regression is one of the tools offered by the U.S. EPA for derivation of acute reference exposures (AREs), which are dose-response assessments for acute exposures to inhaled chemicals. Categorical regression is used as a meta-analytical technique to calculate probabi...
Regression Analysis of Combined Gene Expression Regulation in Acute Myeloid Leukemia
Li, Yue; Liang, Minggao; Zhang, Zhaolei
2014-01-01
Gene expression is a combinatorial function of genetic/epigenetic factors such as copy number variation (CNV), DNA methylation (DM), transcription factors (TF) occupancy, and microRNA (miRNA) post-transcriptional regulation. At the maturity of microarray/sequencing technologies, large amounts of data measuring the genome-wide signals of those factors became available from Encyclopedia of DNA Elements (ENCODE) and The Cancer Genome Atlas (TCGA). However, there is a lack of an integrative model to take full advantage of these rich yet heterogeneous data. To this end, we developed RACER (Regression Analysis of Combined Expression Regulation), which fits the mRNA expression as response using as explanatory variables, the TF data from ENCODE, and CNV, DM, miRNA expression signals from TCGA. Briefly, RACER first infers the sample-specific regulatory activities by TFs and miRNAs, which are then used as inputs to infer specific TF/miRNA-gene interactions. Such a two-stage regression framework circumvents a common difficulty in integrating ENCODE data measured in generic cell-line with the sample-specific TCGA measurements. As a case study, we integrated Acute Myeloid Leukemia (AML) data from TCGA and the related TF binding data measured in K562 from ENCODE. As a proof-of-concept, we first verified our model formalism by 10-fold cross-validation on predicting gene expression. We next evaluated RACER on recovering known regulatory interactions, and demonstrated its superior statistical power over existing methods in detecting known miRNA/TF targets. Additionally, we developed a feature selection procedure, which identified 18 regulators, whose activities clustered consistently with cytogenetic risk groups. One of the selected regulators is miR-548p, whose inferred targets were significantly enriched for leukemia-related pathway, implicating its novel role in AML pathogenesis. Moreover, survival analysis using the inferred activities identified C-Fos as a potential AML
Demenais, F M; Laing, A E; Bonney, G E
1992-01-01
Segregation analysis of discrete traits can be conducted by the classical mixed model and the recently introduced regressive models. The mixed model assumes an underlying liability to the disease, to which a major gene, a multifactorial component, and random environment contribute independently. Affected persons have a liability exceeding a threshold. The regressive logistic models assume that the logarithm of the odds of being affected is a linear function of major genotype effects, the phenotypes of older relatives, and other covariates. A formulation of the regressive models, based on an underlying liability model, has been recently proposed. The regression coefficients on antecedents are expressed in terms of the relevant familial correlations and a one-to-one correspondence with the parameters of the mixed model can thus be established. Computer simulations are conducted to evaluate the fit of the two formulations of the regressive models to the mixed model on nuclear families. The two forms of the class D regressive model provide a good fit to a generated mixed model, in terms of both hypothesis testing and parameter estimation. The simpler class A regressive model, which assumes that the outcomes of children depend solely on the outcomes of parents, is not robust against a sib-sib correlation exceeding that specified by the model, emphasizing testing class A against class D. The studies reported here show that if the true state of nature is that described by the mixed model, then a regressive model will do just as well. Moreover, the regressive models, allowing for more patterns of family dependence, provide a flexible framework to understand gene-environment interactions in complex diseases. PMID:1487139
Montanaro, Fabio; Ceppi, Marcello; Puntoni, Riccardo; Silvano, Stefania; Gennaro, Valerio
2004-04-01
The authors investigated the relationship between asbestos exposure and respiratory cancer mortality among maintenance workers and other blue-collar workers at an Italian oil refinery. The cohort contained 931 men, 29,511 person-years, and 489 deaths. Poisson regression analysis using white-collar workers as an internal referent group provided relative risk estimates (RRs) for main causes of death, adjusted for age, age at hiring, calendar period, length of exposure, and latency. Among maintenance workers, RRs for all tumors (RR = 1.50), digestive system cancers (RR = 1.41), lung cancers (RR = 1.53), and nonmalignant respiratory diseases (RR = 1.71) were significantly increased (p < 0.05); no significant excess was found for all causes among maintenance workers (RR = 1.12) or other blue-collar workers (RR = 1.01). Results confirm the increased risk of death from respiratory diseases and cancer among maintenance workers exposed to asbestos, whereas other smoking-related diseases (circulatory system) were not statistically different among groups. PMID:16189991
Shayan, Zahra; Mezerji, Naser Mohammad Gholi; Shayan, Leila; Naseri, Parisa
2016-01-01
Background: Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables are used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. Methods: This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. Results: CE revealed a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. Conclusion: The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.
Predicting pesticide removal efficacy of vegetated filter strips: A meta-regression analysis.
Chen, Huajin; Grieneisen, Michael L; Zhang, Minghua
2016-04-01
Vegetated Filter Strips (VFS's) are widely used for alleviating agricultural pesticide loadings to surface water bodies. However, effective tools are lacking to quantify the performance of VFS's in reducing off-site pesticide transport. In this study, we applied meta-regression to develop a model for predicting VFS pesticide retention efficiency based on hydrologic responses of VFS's, incoming pollutant characteristics and the interaction within and between these two factor groups (R² = 0.83). In cross-validation analysis, our model (Q² = 0.81) outperformed the existing pesticide retention module of VFSMOD (Q² = 0.72) by explicitly accounting for interaction effect and the categorical effect of pesticide adsorption properties. Based on the 181 data points studied, infiltration had a leading, positive influence on pesticide retention, followed by sedimentation and interaction between the two. Interaction between infiltration and pesticide adsorption properties was also prominent, as the influence of infiltration was significantly lower for strongly adsorbed pesticides. In addition, the clay content of incoming sediment was negatively associated with pesticide retention. Our model is not only valuable in predicting VFS performance, but also provides a quantitative characterization of the interacting VFS processes, thereby facilitating a deeper understanding of the underlying mechanisms. PMID:26802340
NASA Astrophysics Data System (ADS)
Elnasir, Selma; Shamsuddin, Siti Mariyam; Farokhi, Sajad
2015-01-01
Palm vein recognition (PVR) is a promising new biometric that has been applied successfully as a method of access control by many organizations, which has even further potential in the field of forensics. The palm vein pattern has highly discriminative features that are difficult to forge because of its subcutaneous position in the palm. Despite considerable progress and a few practical issues, providing accurate palm vein readings has remained an unsolved issue in biometrics. We propose a robust and more accurate PVR method based on the combination of wavelet scattering (WS) with spectral regression kernel discriminant analysis (SRKDA). As the dimension of WS generated features is quite large, SRKDA is required to reduce the extracted features to enhance the discrimination. The results based on two public databases (the PolyU Hyper Spectral Palmprint public database and PolyU Multi Spectral Palmprint) show the high performance of the proposed scheme in comparison with state-of-the-art methods. The proposed approach scored a 99.44% identification rate and a 99.90% verification rate [equal error rate (EER)=0.1%] for the hyperspectral database and a 99.97% identification rate and a 99.98% verification rate (EER=0.019%) for the multispectral database.
Coons, D M; Boulton, R B; Bisson, L F
1995-01-01
The kinetics of glucose uptake in Saccharomyces cerevisiae are complex. An Eadie-Hofstee (rate of uptake versus rate of uptake over substrate concentration) plot of glucose uptake shows a nonlinear form typical of a multicomponent system. The nature of the constituent components is a subject of debate. It has recently been suggested that this nonlinearity is due to either a single saturable component together with free diffusion of glucose or a single constitutive component with a variable Km, rather than the action of multiple hexose transporters. Genetic data support the existence of a family of differentially regulated glucose transporters, encoded by the HXT genes. In this work, kinetic expressions and nonlinear regression analysis, based on an improved zero trans-influx assay, were used to address the nature of the components of the transport system. The results indicate that neither one component with free diffusion nor a single permease with a variable Km can explain the observed uptake rates. Results of uptake experiments, including the use of putative alternative substrates as inhibitory compounds, support the model derived from genetic analyses of a multicomponent system with at least two components, one a high-affinity carrier and the other a low-affinity carrier. This approach was extended to characterize the activity of the SNF3 protein and identify its role in the depression of high-affinity uptake. The kinetic data support a role of SNF3 as a regulatory protein that may not itself be a transporter. PMID:7768825
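As a rough illustration of why an Eadie-Hofstee plot separates the single-carrier and multicomponent cases, the sketch below models uptake as a sum of Michaelis-Menten terms and checks the slope of rate versus rate over substrate concentration. The kinetic parameters, substrate concentrations, and function names are hypothetical values chosen for illustration; they are not the fitted values or the regression procedure from the study.

```python
def uptake_rate(s, components):
    """Total uptake rate at substrate concentration s.

    components is a list of (vmax, km) pairs, one Michaelis-Menten term each.
    """
    return sum(vmax * s / (km + s) for vmax, km in components)


def eadie_hofstee_slopes(concs, components):
    """Slopes between consecutive (v/S, v) points of an Eadie-Hofstee plot."""
    pts = []
    for s in concs:
        v = uptake_rate(s, components)
        pts.append((v / s, v))
    return [(y2 - y1) / (x2 - x1) for (x1, y1), (x2, y2) in zip(pts, pts[1:])]


# Hypothetical substrate concentrations (arbitrary units).
concs = [0.1, 1.0, 10.0, 100.0]

# One carrier: v = Vmax - Km * (v/S), so every slope equals -Km.
one_carrier = eadie_hofstee_slopes(concs, [(10.0, 2.0)])

# High-affinity plus low-affinity carrier: the slope changes along the curve.
two_carriers = eadie_hofstee_slopes(concs, [(5.0, 1.0), (20.0, 20.0)])

print("one carrier:", one_carrier)
print("two carriers:", two_carriers)
```

With a single saturable carrier the slopes are constant, whereas the two-carrier case bends from a shallow slope at low concentration to a steep one at high concentration, which is the nonlinear form the abstract describes.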
Savescu, Roxana Florenta; Laba, Marian
2016-06-01
This paper highlights the statistical methodology used in a dissection experiment carried out in Romania to calibrate and standardize two classification devices, OptiGrade PRO (OGP) and Fat-o-Meat'er (FOM). One hundred forty-five carcasses were measured using the two probes and dissected according to the European reference method. To derive prediction formulas for each device, multiple linear regression analysis was performed on the relationship between the reference lean meat percentage and the back fat and muscle thicknesses, using the ordinary least squares technique. The root mean squared error of prediction calculated using the leave-one-out cross validation met European Commission (EC) requirements. The application of the new prediction equations reduced the gap between the lean meat percentage measured with the OGP and FOM from 2.43% (average for the period Q3/2006-Q2/2008) to 0.10% (average for the period Q3/2008-Q4/2014), providing the basis for a fair payment system for the pig producers. PMID:26835835
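The procedure described above (ordinary least squares on two thickness measurements, scored by leave-one-out cross-validation) can be sketched as follows. This is a minimal standard-library sketch, not the authors' software: the helper names are hypothetical, and the six data rows are made-up numbers used only to make the example runnable.

```python
import math


def ols_fit(X, y):
    """Least-squares coefficients (intercept first) from the normal equations."""
    rows = [[1.0] + list(x) for x in X]
    p = len(rows[0])
    # Augmented system [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))] for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def predict(beta, x):
    """Predicted response for one observation x."""
    return beta[0] + sum(b * xi for b, xi in zip(beta[1:], x))


def loocv_rmsep(X, y):
    """Root mean squared error of prediction by leave-one-out cross-validation."""
    squared_errors = []
    for i in range(len(y)):
        beta = ols_fit(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        squared_errors.append((y[i] - predict(beta, X[i])) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up illustrative data: (back fat mm, muscle mm) -> lean meat percentage.
X = [(12.0, 55.0), (18.0, 50.0), (15.0, 60.0),
     (22.0, 48.0), (10.0, 62.0), (20.0, 52.0)]
y = [60.5, 55.0, 59.5, 51.5, 63.0, 54.0]

print("coefficients:", ols_fit(X, y))
print("LOOCV RMSEP:", loocv_rmsep(X, y))
```

Each carcass is predicted from a model fitted without it, so the resulting error reflects out-of-sample performance rather than goodness of fit on the calibration data.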
Tahsin, Subrina; Chang, Ni-Bin
2016-02-01
Stormwater wet detention ponds have been a commonly employed best management practice for stormwater management throughout the world for many years. In the past, the trophic state index values have been used to evaluate seasonal changes in water quality and rank lakes within a region or between several regions; yet, to date, there is no similar index for stormwater wet detention ponds. This study aimed to develop a new multivariate trophic state index (MTSI) suitable for conducting a rapid eutrophication assessment of stormwater wet detention ponds under uncertainty with respect to three typical physical and chemical properties. Six stormwater wet detention ponds in Florida were selected for demonstration of the new MTSI with respect to total phosphorus (TP), total nitrogen (TN), and Secchi disk depth (SDD) as cognitive assessment metrics to sense eutrophication potential collectively and inform the environmental impact holistically. Due to the involvement of multiple endogenous variables (i.e., TN, TP, and SDD) for the eutrophication assessment simultaneously under uncertainty, fuzzy synthetic evaluation was applied to first standardize and synchronize the sources of uncertainty in the decision analysis. The ordered probit regression model was then formulated for assessment based on the concept of MTSI with the inputs from the fuzzy synthetic evaluation. It is indicative that the severe eutrophication condition is present during fall, which might be due to frequent heavy summer storm events contributing to high-nutrient inputs in these six ponds. PMID:26733470
Anomalous particle pinch and scaling of vin/D based on transport analysis and multiple regression
NASA Astrophysics Data System (ADS)
Becker, G.; Kardaun, O.
2007-01-01
Predictions of density profiles in current tokamaks and ITER require a validated scaling relation for vin/D where vin is the anomalous inward drift velocity and D is the anomalous diffusion coefficient. Transport analysis is necessary for determining the anomalous particle pinch from measured density profiles and for separating the impact of particle sources. A set of discharges in ASDEX Upgrade, DIII-D, JET and ASDEX is analysed using a special version of the 1.5-D BALDUR transport code. Profiles of ρsvin/D with ρs the effective separatrix radius, five other dimensionless parameters and many further quantities in the confinement zone are compiled, resulting in the dataset VIND1.dat, which covers a wide parameter range. Weighted multiple regression is applied to the ASDEX Upgrade subset which leads to a two-term scaling \rho_s v_{\mathrm{in}}(x')/D(x') = 0.0432 [ (L_{T_{\mathrm{e}}}(\bar{x}')/\rho_s)^{-2.58} + 7.13 \, U_L^{1.55}
Ismail, Abbas; Josephat, Peter
2014-01-01
Tuberculosis (TB) is one of the most important public health problems in Tanzania and was declared a national public health emergency in 2006. Community and individual knowledge and perceptions are critical factors in the control of the disease. The objective of this study was to analyze knowledge and perception of the transmission of TB in Tanzania. Multinomial logistic regression analysis was used to quantify the impact of knowledge and perception on TB. The data were taken as secondary data from a larger national survey, the 2007-08 Tanzania HIV/AIDS and Malaria Indicator Survey. The findings across groups revealed that knowledge of TB transmission increased with age and level of education. People in rural areas had less knowledge regarding tuberculosis transmission than those in urban areas [OR = 0.7]. People with access to radio [OR = 1.7] were more knowledgeable about tuberculosis transmission than those who did not have access to radio. People who did not have a telephone [OR = 0.6] were less knowledgeable about the tuberculosis route of transmission than those who had a telephone. The findings showed that socio-demographic factors such as age, education, place of residence and owning a telephone or radio varied systematically with knowledge of tuberculosis transmission. PMID:26867270
A New Global Regression Analysis Method for the Prediction of Wind Tunnel Model Weight Corrections
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert Manfred; Bridge, Thomas M.; Amaya, Max A.
2014-01-01
A new global regression analysis method is discussed that predicts wind tunnel model weight corrections for strain-gage balance loads during a wind tunnel test. The method determines corrections by combining "wind-on" model attitude measurements with least squares estimates of the model weight and center of gravity coordinates that are obtained from "wind-off" data points. The method treats the least squares fit of the model weight separately from the fit of the center of gravity coordinates. Therefore, it performs two fits of "wind-off" data points and uses the least squares estimator of the model weight as an input for the fit of the center of gravity coordinates. Explicit equations for the least squares estimators of the weight and center of gravity coordinates are derived that simplify the implementation of the method in the data system software of a wind tunnel. In addition, recommendations for sets of "wind-off" data points are made that take typical model support system constraints into account. Explicit equations of the confidence intervals on the model weight and center of gravity coordinates and two different error analyses of the model weight prediction are also discussed in the appendices of the paper.
Wong, Y Joel; Owen, Jesse; Shea, Munyi
2012-01-01
How are specific dimensions of masculinity related to psychological distress in specific groups of men? To address this question, the authors used latent class regression to assess the optimal number of latent classes that explained differential relationships between conformity to masculine norms and psychological distress in a racially diverse sample of 223 men. The authors identified a 2-class solution. Both latent classes demonstrated very different associations between conformity to masculine norms and psychological distress. In Class 1 (labeled risk avoiders; n = 133), conformity to the masculine norm of risk-taking was negatively related to psychological distress. In Class 2 (labeled detached risk-takers; n = 90), conformity to the masculine norms of playboy, self-reliance, and risk-taking was positively related to psychological distress, whereas conformity to the masculine norm of violence was negatively related to psychological distress. A post hoc analysis revealed that younger men and Asian American men (compared with Latino and White American men) had significantly greater odds of being in Class 2 versus Class 1. The implications of these findings for future research and clinical practice are examined. PMID:22229799
A systematic review and meta-regression analysis of mivacurium for tracheal intubation.
Vanlinthout, L E H; Mesfin, S H; Hens, N; Vanacker, B F; Robertson, E N; Booij, L H D J
2014-12-01
We systematically reviewed factors associated with intubation conditions in randomised controlled trials of mivacurium, using random-effects meta-regression analysis. We included 29 studies of 1050 healthy participants. Four factors explained 72.9% of the variation in the probability of excellent intubation conditions: mivacurium dose, 24.4%; opioid use, 29.9%; time to intubation and age together, 18.6%. The odds ratio (95% CI) for excellent intubation was 3.14 (1.65-5.73) for doubling the mivacurium dose, 5.99 (2.14-15.18) for adding opioids to the intubation sequence, and 6.55 (6.01-7.74) for increasing the delay between mivacurium injection and airway insertion from 1 to 2 min in subjects aged 25 years and 2.17 (2.01-2.69) for subjects aged 70 years, p < 0.001 for all. We conclude that good conditions for tracheal intubation are more likely by delaying laryngoscopy after injecting a higher dose of mivacurium with an opioid, particularly in older people. PMID:25040541
Hou, J
1989-01-01
Cixian county, one of the high-risk counties for esophageal cancer in the world, had a standardized mortality of 142.19/10^5 population in 1969-1971. The incidence of esophageal cancer dropped year by year from 1974 to 1982. The significance of the incidence tendency was studied. The results are highly significant (P < 0.001). Suspected causative factors of esophageal cancer, comprising five independent variables, X1 (number of people taking sanitized water), X2 (number of people on pickled Chinese cabbage), X3 (annual output of fruit), X4 (annual output of fresh vegetable) and X5 (annual output of sweet potato), and one dependent variable Y (morbidity of esophageal cancer), were studied by correlative analysis and multiple stepwise regression. Three correlative factors (X1, X2, and X5) with significant effect on esophageal cancer were selected from the five suspected factors. The result indicated that taking sanitized water, reducing the number of people on pickled Chinese cabbage, changing the structure of food and keeping the nutrient balance might decrease the incidence of esophageal cancer. PMID:2789130
Psychosocial Variables and Time to Injury Onset: A Hurdle Regression Analysis Model
Sibold, Jeremy; Zizzi, Samuel
2012-01-01
Context: Psychological variables have been shown to be related to athletic injury and time missed from participation in sport. We are unaware of any empirical examination of the influence of psychological variables on time to onset of injury. Objective: To examine the influence of orthopaedic and psychosocial variables on time to injury in college athletes. Patients or Other Participants: One hundred seventy-seven (men = 116, women = 61; age = 19.45 ± 1.39 years) National Collegiate Athletic Association Division II athletes. Main Outcome Measure(s): Hurdle regression analysis (HRA) was used to determine the influence of predictor variables on days to first injury. Results: Worry (z = 2.98, P = .003), concentration disruption (z = −3.95, P < .001), and negative life-event stress (z = 5.02, P < .001) were robust predictors of days to injury. Orthopaedic risk score was not a predictor (z = 1.28, P = .20). Conclusions: These findings support previous research on the stress-injury relationship, and our group is the first to use HRA in athletic injury data. These data support the addition of psychological screening as part of preseason health examinations for collegiate athletes. PMID:23068591
Power Law Regression Analysis of Heat Flux Width in Type I ELMs
NASA Astrophysics Data System (ADS)
Stephens, C. D.; Makowski, M. A.; Leonard, A. W.; Osborne, T. H.
2014-10-01
In this project, a database of Type I ELM characteristics has been assembled and will be used to investigate possible dependencies of the heat flux width on physics and engineering parameters. At the edge near the divertor, high impulsive heat loads are imparted onto the surface. The impact of these ELMs can cause a reduction in divertor lifetime if the heat flux is great enough due to material erosion. A program will be used to analyze data, extract relevant, measurable quantities, and record the quantities in the table. Care is taken to accurately capture the complex space/time structure of the ELM. Then correlations between discharge and equilibrium parameters will be investigated. Power law regression analysis will be used to help determine the dependence of the heat flux width on these various measurable quantities and parameters. This will enable us to better understand the physics of heat flux at the edge. Work supported in part by the National Undergraduate Fellowship Program in Plasma Physics and Fusion Energy Sciences and the US DOE under DE-FG02-04ER54761, DE-AC52-07NA27344, DE-FC02-04ER54698.
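A power law regression of the kind mentioned above is commonly fitted as a straight line in log-log space. The sketch below shows that idea for a single predictor; the study considers several parameters, and the function name and sample values here are hypothetical and serve only to illustrate the transformation.

```python
import math


def fit_power_law(x, y):
    """Fit y = a * x**b by least squares on (log x, log y).

    All x and y values must be positive.
    """
    lx = [math.log(v) for v in x]
    ly = [math.log(v) for v in y]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((u - mean_x) ** 2 for u in lx)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(lx, ly))
    b = sxy / sxx                       # exponent = slope in log-log space
    a = math.exp(mean_y - b * mean_x)   # prefactor from the intercept
    return a, b


# Hypothetical sample values, not data from the ELM database.
x = [0.5, 1.0, 1.5, 2.0, 3.0]
y = [4.1, 3.0, 2.4, 2.2, 1.7]
a, b = fit_power_law(x, y)
print("prefactor a =", a, "exponent b =", b)
```

Fitting in log space weights relative rather than absolute errors, which is one reason the approach is a starting point rather than a substitute for a full nonlinear fit.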
An adaptive regression mixture model for fMRI cluster analysis.
Oikonomou, Vangelis P; Blekas, Konstantinos
2013-04-01
Functional magnetic resonance imaging (fMRI) has become one of the most important techniques for studying the human brain in action. A common problem in fMRI analysis is the detection of activated brain regions in response to an experimental task. In this work we propose a novel clustering approach for addressing this issue using an adaptive regression mixture model. The main contribution of our method is the employment of both spatial and sparse properties over the body of the mixture model. Thus, the clustering approach is converted into a maximum a posteriori estimation approach, where the expectation-maximization algorithm is applied for model training. Special care is also given to estimate the kernel scalar parameter per cluster of the design matrix by presenting a multi-kernel scheme. In addition, an incremental training procedure is presented so as to make the approach independent of the initialization of the model parameters. The latter also allows us to introduce an efficient stopping criterion of the process for determining the optimum brain activation area. To assess the effectiveness of our method, we have conducted experiments with simulated and real fMRI data, where we have demonstrated its ability to produce improved performance and functional activation detection capabilities. PMID:23047865
The basis function regression in pharmaceutical analysis. Theory and example of application.
Komsta, Lukasz; Skibiński, Robert; Paryło, Marta; Dudek, Karolina
2008-08-01
The BFR (Basis Function Regression) is an interesting alternative to common techniques (such as PCR or PLS) in chemometrics. It is based on projecting the spectral information onto some number of equally spaced spline bases, then obtaining integrals of the resulting curves. Existing references show that in certain cases it can nearly halve the RMSEP values. As this technique is neither popular in chemometrics nor applied in pharmaceutical analysis, it is desirable to present its theoretical considerations and implementation (with example MATLAB/Octave code). As an illustrative example we present the chemometric model for content recognition of a tablet (12 possible compounds in binary or ternary combinations) from the UV spectrum of its methanolic extract. The BFR technique gave the lowest prediction error, and the estimators obtained have a more substantive meaning than in the case of PCR, PLS and other techniques used for comparison. In our opinion this technique should be considered in any chemometric approach as it often shows better performance. PMID:18450403
Comparison of linear discriminant analysis and logistic regression for data classification
NASA Astrophysics Data System (ADS)
Liong, Choong-Yeun; Foo, Sin-Fan
2013-04-01
Linear discriminant analysis (LDA) and logistic regression (LR) are often used for the purpose of classifying populations or groups using a set of predictor variables. Assumptions of multivariate normality and equal variance-covariance matrices across groups are required before proceeding with LDA, but such assumptions are not required for LR and hence LR is considered to be much more robust than LDA. In this paper, several real datasets which differ in terms of normality, number of independent variables and sample size are used to study the performance of both methods. The methods are compared based on the percentage of correct classification and the B index. The results show that overall, LR performs better regardless of whether the distribution of the data is normal or nonnormal. However, LR needs longer computing time than LDA as the sample size increases. The performance of LDA was also tested by using various prior probabilities. The results show that the average percentage of correct classification and the B index are higher when the prior probability is set based on the group size rather than using equal probabilities for all groups.
NASA Astrophysics Data System (ADS)
Gizaw, Mesgana Seyoum; Gan, Thian Yew
2016-07-01
Regional Flood Frequency Analysis (RFFA) is a statistical method widely used to estimate flood quantiles of catchments with limited streamflow data. In addition, to estimate the flood quantiles of ungauged sites, only a limited number of stations with complete datasets may be available from hydrologically similar, surrounding catchments. Besides traditional regression based RFFA methods, recent applications of machine learning algorithms such as the artificial neural network (ANN) have shown encouraging results in regional flood quantile estimations. Another novel machine learning technique that is becoming widely applicable in the hydrologic community is the Support Vector Regression (SVR). In this study, an RFFA model based on SVR was developed to estimate regional flood quantiles for two study areas, one with 26 catchments located in southeastern British Columbia (BC) and another with 23 catchments located in southern Ontario (ON), Canada. The SVR-RFFA model for both study sites was developed from 13 sets of physiographic and climatic predictors for the historical period. The Ef (Nash Sutcliffe coefficient) and R2 of the SVR-RFFA model were about 0.7 when estimating flood quantiles of 10, 25, 50 and 100 year return periods, which indicates satisfactory model performance in both study areas. In addition, the SVR-RFFA model also performed well based on other goodness-of-fit statistics such as BIAS (mean bias) and BIASr (relative BIAS). If the amount of data available for training RFFA models is limited, the SVR-RFFA model was found to perform better than an ANN based RFFA model, and with significantly lower median CV (coefficient of variation) of the estimated flood quantiles. The SVR-RFFA model was then used to project changes in flood quantiles over the two study areas under the impact of climate change using the RCP4.5 and RCP8.5 climate projections of five Coupled Model Intercomparison Project (CMIP5) GCMs (Global Climate Models) for the 2041
A Bayesian ridge regression analysis of congestion's impact on urban expressway safety.
Shi, Qi; Abdel-Aty, Mohamed; Lee, Jaeyoung
2016-03-01
With the rapid growth of traffic in urban areas, concerns about congestion and traffic safety have been heightened. This study leveraged both Automatic Vehicle Identification (AVI) system and Microwave Vehicle Detection System (MVDS) installed on an expressway in Central Florida to explore how congestion impacts the crash occurrence in urban areas. Multiple congestion measures from the two systems were developed. To ensure more precise estimates of the congestion's effects, the traffic data were aggregated into peak and non-peak hours. Multicollinearity among traffic parameters was examined. The results showed the presence of multicollinearity especially during peak hours. As a response, ridge regression was introduced to cope with this issue. Poisson models with uncorrelated random effects, correlated random effects, and both correlated random effects and random parameters were constructed within the Bayesian framework. It was proven that correlated random effects could significantly enhance model performance. The random parameters model has similar goodness-of-fit compared with the model with only correlated random effects. However, by accounting for the unobserved heterogeneity, more variables were found to be significantly related to crash frequency. The models indicated that congestion increased crash frequency during peak hours while during non-peak hours it was not a major crash contributing factor. Using the random parameter model, the three congestion measures were compared. It was found that all congestion indicators had similar effects while Congestion Index (CI) derived from MVDS data was a better congestion indicator for safety analysis. Also, analyses showed that the segments with higher congestion intensity could not only increase property damage only (PDO) crashes, but also more severe crashes. In addition, the issues regarding the necessity to incorporate specific congestion indicator for congestion's effects on safety and to take care of the
NASA Astrophysics Data System (ADS)
Pradhan, Biswajeet
Recently, in 2006 and 2007, heavy monsoon rainfall triggered floods along Malaysia's east coast as well as in the southern state of Johor. The hardest hit areas are along the east coast of peninsular Malaysia in the states of Kelantan, Terengganu and Pahang. In the south, the city of Johor was particularly hard hit. The floods cost nearly a billion ringgit in property damage and many lives. The extent of damage could have been reduced or minimized if an early warning system had been in place. This paper deals with flood susceptibility analysis using a logistic regression model. We have evaluated the flood susceptibility and the effect of flood-related factors along the Kelantan river basin using the Geographic Information System (GIS) and remote sensing data. Previously flooded areas were extracted from archived radarsat images using image processing tools. Flood susceptibility mapping was conducted in the study area along the Kelantan River using radarsat imagery and then enlarged to 1:25,000 scale. Topographical, hydrological, geological data and satellite images were collected, processed, and constructed into a spatial database using GIS and image processing. The factors chosen that influence flood occurrence were: topographic slope, topographic aspect, topographic curvature, DEM and distance from river drainage, all from the topographic database; flow direction, flow accumulation, extracted from hydrological database; geology and distance from lineament, taken from the geologic database; land use from SPOT satellite images; soil texture from soil database; and the vegetation index value from SPOT satellite images. Flood susceptible areas were analyzed and mapped using the probability-logistic regression model. Results indicate that mapping of flood prone areas can be performed at 1:25,000, which is comparable to some conventional flood hazard map scales. The flood prone areas delineated on these maps correspond to areas that would be inundated by significant flooding
Wagner, Philippe; Ghith, Nermin; Leckie, George
2016-01-01
Background and Aim: Many multilevel logistic regression analyses of “neighbourhood and health” focus on interpreting measures of associations (e.g., odds ratio, OR). In contrast, multilevel analysis of variance is rarely considered. We propose an original stepwise analytical approach that distinguishes between “specific” (measures of association) and “general” (measures of variance) contextual effects. Performing two empirical examples we illustrate the methodology, interpret the results and discuss the implications of this kind of analysis in public health. Methods: We analyse 43,291 individuals residing in 218 neighbourhoods in the city of Malmö, Sweden in 2006. We study two individual outcomes (psychotropic drug use and choice of private vs. public general practitioner, GP) for which the relative importance of neighbourhood as a source of individual variation differs substantially. In Step 1 of the analysis, we evaluate the OR and the area under the receiver operating characteristic (AUC) curve for individual-level covariates (i.e., age, sex and individual low income). In Step 2, we assess general contextual effects using the AUC. Finally, in Step 3 the OR for a specific neighbourhood characteristic (i.e., neighbourhood income) is interpreted jointly with the proportional change in variance (i.e., PCV) and the proportion of ORs in the opposite direction (POOR) statistics. Results: For both outcomes, information on individual characteristics (Step 1) provides a low discriminatory accuracy (AUC = 0.616 for psychotropic drugs; = 0.600 for choosing a private GP). Accounting for neighbourhood of residence (Step 2) only improved the AUC for choosing a private GP (+0.295 units). High neighbourhood income (Step 3) was strongly associated with choosing a private GP (OR = 3.50) but the PCV was only 11% and the POOR 33%. Conclusion: Applying an innovative stepwise multilevel analysis, we observed that, in Malmö, the neighbourhood context per se had a negligible
The Mitochondrial Genome of Conus textile, coxI-coxII Intergenic Sequences and Conoidean Evolution
Bandyopadhyay, Pradip K; Stevenson, Bradford J.; Ownby, John-Paul; Cady, Matthew T.; Watkins, Maren; Olivera, Baldomero M.
2009-01-01
The cone snails belong to the superfamily Conoidea, comprising ∼10,000 venomous marine gastropods. We determined the complete mitochondrial DNA sequence of Conus textile. The gene order is identical in Conus textile, Lophiotoma cerithiformis (another Conoidean gastropod), and the neogastropod Ilyanassa obsoleta (not in the superfamily Conoidea). However, the intergenic interval between the coxI/coxII genes was much longer in C. textile (165 bp) than in any other previously analyzed gastropod. We used the intergenic region to evaluate evolutionary patterns. In most neogastropods and three conoidean families the intergenic interval is small (<30 nucleotides). Within Conus, the variation is from 130-170 bp, and each different clade within Conus has a narrower size distribution. In Conasprella, a subgenus traditionally assigned to Conus, the intergenic regions vary between 200-500 bp, suggesting that the species in Conasprella are not congeneric with Conus. The intergenic region was used for phylogenetic analysis of a group of fish-hunting Conus; despite the short length, resolution was better than with standard markers. Thus, the coxI/coxII intergenic region can be used both to define evolutionary relationships between species in a clade, and to understand broad evolutionary patterns across the large superfamily Conoidea. PMID:17936021
Nakasone, Yutaka; Ikeda, Osamu; Yamashita, Yasuyuki; Kudoh, Kouichi; Shigematsu, Yoshinori; Harada, Kazunori
2007-09-15
We applied multivariate analysis to the clinical findings in patients with acute gastrointestinal (GI) hemorrhage and compared the relationship between these findings and angiographic evidence of extravasation. Our study population consisted of 46 patients with acute GI bleeding. They were divided into two groups. In group 1 we retrospectively analyzed 41 angiograms obtained in 29 patients (age range, 25-91 years; average, 71 years). Their clinical findings, including the shock index (SI), diastolic blood pressure, hemoglobin, platelet counts, and age, were quantitatively analyzed. In group 2, consisting of 17 patients (age range, 21-78 years; average, 60 years), we prospectively applied statistical analysis by a logistic regression model to their clinical findings and then assessed 21 angiograms obtained in these patients to determine whether our model was useful for predicting the presence of angiographic evidence of extravasation. On 18 of 41 (43.9%) angiograms in group 1 there was evidence of extravasation; in 3 patients it was demonstrated only by selective angiography. Factors significantly associated with angiographic visualization of extravasation were the SI and patient age. For differentiation between cases with and cases without angiographic evidence of extravasation, the maximum cutoff point was between 0.51 and 0.53. Of the 21 angiograms obtained in group 2, 13 (61.9%) showed evidence of extravasation; in 1 patient it was demonstrated only on selective angiograms. We found that in 90% of the cases, the prospective application of our model correctly predicted the angiographically confirmed presence or absence of extravasation. We conclude that in patients with GI hemorrhage, angiographic visualization of extravasation is associated with the pre-embolization SI. Patients with a high SI value should undergo study to facilitate optimal treatment planning.
Regression Analysis of Top of Descent Location for Idle-thrust Descents
NASA Technical Reports Server (NTRS)
Stell, Laurel; Bronsvoort, Jesper; McDonald, Greg
2013-01-01
In this paper, multiple regression analysis is used to model the top of descent (TOD) location of user-preferred descent trajectories computed by the flight management system (FMS) on over 1000 commercial flights into Melbourne, Australia. The independent variables cruise altitude, final altitude, cruise Mach, descent speed, wind, and engine type were also recorded or computed post-operations. Both first-order and second-order models are considered, where cross-validation, hypothesis testing, and additional analysis are used to compare models. This identifies the models that should give the smallest errors if used to predict TOD location for new data in the future. A model that is linear in TOD altitude, final altitude, descent speed, and wind gives an estimated standard deviation of 3.9 nmi for TOD location given the trajectory parameters, which means about 80% of predictions would have error less than 5 nmi in absolute value. This accuracy is better than demonstrated by other ground automation predictions using kinetic models. Furthermore, this approach would enable online learning of the model. Additional data or further knowledge of algorithms is necessary to conclude definitively that no second-order terms are appropriate. Possible applications of the linear model are described, including enabling arriving aircraft to fly optimized descents computed by the FMS even in congested airspace. In particular, a model for TOD location that is linear in the independent variables would enable decision support tool human-machine interfaces for which a kinetic approach would be computationally too slow.
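The step from a 3.9 nmi standard deviation to "about 80% of predictions within 5 nmi" can be checked with a short calculation. The sketch below assumes that the prediction errors are normally distributed with zero mean, which the abstract does not state explicitly; the function name is illustrative only.

```python
import math

def normal_coverage(sd, bound):
    """Return P(|error| < bound) for a zero-mean normal error with standard deviation sd."""
    # For a normal variable, P(|e| < b) = erf(b / (sd * sqrt(2))).
    return math.erf(bound / (sd * math.sqrt(2.0)))

# Values quoted in the abstract: sd = 3.9 nmi, error bound = 5 nmi.
coverage = normal_coverage(3.9, 5.0)
print(f"Share of predictions within 5 nmi: {coverage:.3f}")
```

Under the normality assumption the result is roughly 0.80, consistent with the figure quoted in the abstract.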
Banno, Masahiro; Koide, Takayoshi; Aleksic, Branko; Okada, Takashi; Kikuchi, Tsutomu; Kohmura, Kunihiro; Adachi, Yasunori; Kawano, Naoko; Iidaka, Tetsuya; Ozaki, Norio
2012-01-01
Objectives This study investigated what clinical and sociodemographic factors affected Wisconsin Card Sorting Test (WCST) factor scores of patients with schizophrenia to evaluate parameters or items of the WCST. Design Cross-sectional study. Setting Patients with schizophrenia from three hospitals participated. Participants Participants were recruited from July 2009 to August 2011. 131 Japanese patients with schizophrenia (84 men and 47 women, 43.5±13.8 years (mean±SD)) entered and completed the study. Participants were recruited in the study if they (1) met DSM-IV criteria for schizophrenia; (2) were physically healthy and (3) had no mood disorders, substance abuse, neurodevelopmental disorders, epilepsy or mental retardation. We examined their basic clinical and sociodemographic factors (sex, age, education years, age of onset, duration of illness, chlorpromazine equivalent doses and the positive and negative syndrome scale (PANSS) scores). Primary and secondary outcome measures All patients carried out the WCST Keio version. Five indicators were calculated, including categories achieved (CA), perseverative errors in Milner (PEM) and Nelson (PEN), total errors (TE) and difficulties of maintaining set (DMS). From the principal component analysis, we identified two factors (1 and 2). We assessed the relationship between these factor scores and clinical and sociodemographic factors, using multiple logistic regression analysis. Results Factor 1 was mainly composed of CA, PEM, PEN and TE. Factor 2 was mainly composed of DMS. The factor 1 score was affected by age, education years and the PANSS negative scale score. The factor 2 score was affected by duration of illness. Conclusions Age, education years, PANSS negative scale score and duration of illness affected WCST factor scores in patients with schizophrenia. Using WCST factor scores may reduce the possibility of type I errors due to multiple comparisons. PMID:23135537
Pineda, Silvia; Real, Francisco X.; Kogevinas, Manolis; Carrato, Alfredo; Chanock, Stephen J.
2015-01-01
Omics data integration is becoming necessary to investigate the genomic mechanisms involved in complex diseases. During the integration process, many challenges arise such as data heterogeneity, the smaller number of individuals in comparison to the number of parameters, multicollinearity, and interpretation and validation of results due to their complexity and lack of knowledge about biological processes. To overcome some of these issues, innovative statistical approaches are being developed. In this work, we propose a permutation-based method to concomitantly assess significance and correct for multiple testing with the MaxT algorithm. This was applied with penalized regression methods (LASSO and ENET) when exploring relationships between common genetic variants, DNA methylation and gene expression measured in bladder tumor samples. The overall analysis flow consisted of three steps: (1) SNPs/CpGs were selected for each gene probe within a 1 Mb window upstream and downstream of the gene; (2) LASSO and ENET were applied to assess the association between each expression probe and the selected SNPs/CpGs in three multivariable models (SNP, CpG, and Global models, the latter integrating SNPs and CpGs); and (3) the significance of each model was assessed using the permutation-based MaxT method. We identified 48 genes whose expression levels were significantly associated with both SNPs and CpGs. Importantly, 36 (75%) of them were replicated in an independent data set (TCGA) and the performance of the proposed method was checked with a simulation study. We further support our results with a biological interpretation based on an enrichment analysis. The approach we propose allows reducing computational time and is flexible and easy to implement when analyzing several types of omics data. Our results highlight the importance of integrating omics data by applying appropriate statistical strategies to discover new insights into the complex genetic mechanisms involved in disease.
Lees, Mackenzie C.; Merani, Shaheed; Tauh, Keerit; Khadaroo, Rachel G.
2015-01-01
Background Older adults (≥ 65 yr) are the fastest growing population and are presenting in increasing numbers for acute surgical care. Emergency surgery is frequently life threatening for older patients. Our objective was to identify predictors of mortality and poor outcome among elderly patients undergoing emergency general surgery. Methods We conducted a retrospective cohort study of patients aged 65–80 years undergoing emergency general surgery between 2009 and 2010 at a tertiary care centre. Demographics, comorbidities, in-hospital complications, mortality and disposition characteristics of patients were collected. Logistic regression analysis was used to identify covariate-adjusted predictors of in-hospital mortality and discharge of patients home. Results Our analysis included 257 patients with a mean age of 72 years; 52% were men. In-hospital mortality was 12%. Mortality was associated with patients who had higher American Society of Anesthesiologists (ASA) class (odds ratio [OR] 3.85, 95% confidence interval [CI] 1.43–10.33, p = 0.008) and in-hospital complications (OR 1.93, 95% CI 1.32–2.83, p = 0.001). Nearly two-thirds of patients discharged home were younger (OR 0.92, 95% CI 0.85–0.99, p = 0.036), had lower ASA class (OR 0.45, 95% CI 0.27–0.74, p = 0.002) and fewer in-hospital complications (OR 0.69, 95% CI 0.53–0.90, p = 0.007). Conclusion American Society of Anesthesiologists class and in-hospital complications are perioperative predictors of mortality and disposition in the older surgical population. Understanding the predictors of poor outcome and the importance of preventing in-hospital complications in older patients will have important clinical utility in terms of preoperative counselling, improving health care and discharging patients home. PMID:26204143
Determinants for changing the treatment of COPD: a regression analysis from a clinical audit
López-Campos, Jose Luis; Abad Arranz, María; Calero Acuña, Carmen; Romero Valero, Fernando; Ayerbe García, Ruth; Hidalgo Molina, Antonio; Aguilar Perez-Grovas, Ricardo I; García Gil, Francisco; Casas Maldonado, Francisco; Caballero Ballesteros, Laura; Sánchez Palop, María; Pérez-Tejero, Dolores; Segado, Alejandro; Calvo Bonachera, Jose; Hernández Sierra, Bárbara; Doménech, Adolfo; Arroyo Varela, Macarena; González Vargas, Francisco; Cruz Rueda, Juan J
2016-01-01
Introduction This study is an analysis of a pilot COPD clinical audit that evaluated adherence to guidelines for patients with COPD in a stable disease phase during a routine visit in specialized secondary care outpatient clinics in order to identify the variables associated with the decision to step-up or step-down pharmacological treatment. Methods This study was a pilot clinical audit performed at hospital outpatient respiratory clinics in the region of Andalusia, Spain (eight provinces with over eight million inhabitants), in which 20% of centers in the area (catchment population 3,143,086 inhabitants) were invited to participate. Treatment changes were evaluated in terms of the number of prescribed medications and were classified as step-up, step-down, or no change. Three backward stepwise binominal multivariate logistic regression analyses were conducted to evaluate variables associated with stepping up, stepping down, and inhaled corticosteroids discontinuation. Results The present analysis evaluated 565 clinical records (91%) of the complete audit. Of those records, 366 (64.8%) cases saw no change in pharmacological treatment, while 99 patients (17.5%) had an increase in the number of drugs, 55 (9.7%) had a decrease in the number of drugs, and 45 (8.0%) noted a change to other medication for a similar therapeutic scheme. Exacerbations were the main factor in stepping up treatment, as were the symptoms themselves. In contrast, rather than symptoms, doctors used forced expiratory volume in 1 second and previous treatment with long-term antibiotics or inhaled corticosteroids as the key determinants to stepping down treatment. Conclusion The majority of doctors did not change the prescription. When changes were made, a number of related factors were noted. Future trials must evaluate whether these therapeutic changes impact clinically relevant outcomes at follow-up. PMID:27330285
Zhang, Man; Liu, Xu-Hua; He, Xiong-Kui; Zhang, Lu-Da; Zhao, Long-Lian; Li, Jun-Hui
2010-05-01
In the present paper, ridge regression was studied for quantitative near-infrared (NIR) spectroscopy analysis, using 66 wheat samples as test materials. An NIR-ridge regression model for determining protein content was established from the NIR spectral data of 44 wheat samples and used to predict the protein content of the other 22 samples. The average relative error between the predicted results and Kjeldahl's values (chemical analysis values) was 0.01518. The predicted results were also compared with those obtained by the partial least squares (PLS) method, showing that ridge regression is a suitable choice for quantitative NIR spectroscopy analysis. Furthermore, one effective way to reduce the disturbance that irrelevant information causes to the predictive capacity of a quantitative analysis model is to screen the wavelength information. To select the spectral information that carries more content information and correlates more strongly with the composition or nature of the samples, and thereby improve the model's predictive accuracy, ridge regression was used to select wavelength information in this paper. An NIR-ridge regression model was established with the spectral information at 4 wavelength points, selected from 1297 wavelength points, to predict the protein content of the 22 samples. The average relative error was 0.0137 and the correlation coefficient reached 0.9817 between the predicted results and Kjeldahl's values. The results showed that ridge regression was able to screen the essential wavelength information from a large amount of spectral information. It not only simplifies the model and effectively reduces the disturbance resulting from collinear information, but also has practical significance for designing special-purpose NIR analysis instruments for analyzing specific components in particular samples. PMID:20672604
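For readers unfamiliar with ridge regression, the following is a minimal sketch of the generic closed-form estimator, which solves (X'X + lambda*I) b = X'y. It is not the authors' NIR procedure or their wavelength-selection step; the two nearly collinear "absorbance" columns, the response values, and the penalty lambda = 1 are invented for illustration, and no intercept is fitted.

```python
def ridge_fit(X, y, lam):
    """Solve (X^T X + lam * I) b = X^T y by Gaussian elimination (no intercept)."""
    p = len(X[0])
    # Penalized normal equations: A = X^T X + lam * I, rhs = X^T y.
    A = [[sum(row[i] * row[j] for row in X) + (lam if i == j else 0.0)
          for j in range(p)] for i in range(p)]
    rhs = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    # Forward elimination with partial pivoting.
    for col in range(p):
        piv = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[piv] = A[piv], A[col]
        rhs[col], rhs[piv] = rhs[piv], rhs[col]
        for r in range(col + 1, p):
            f = A[r][col] / A[col][col]
            for c in range(col, p):
                A[r][c] -= f * A[col][c]
            rhs[r] -= f * rhs[col]
    # Back substitution.
    coef = [0.0] * p
    for i in reversed(range(p)):
        s = rhs[i] - sum(A[i][j] * coef[j] for j in range(i + 1, p))
        coef[i] = s / A[i][i]
    return coef

# Hypothetical data: two nearly collinear predictor columns (made up, not from the paper).
X = [[1.0, 1.01], [2.0, 1.98], [3.0, 3.02], [4.0, 3.99]]
y = [2.0, 4.1, 5.9, 8.1]

print("least squares (lam=0):", ridge_fit(X, y, 0.0))
print("ridge (lam=1):        ", ridge_fit(X, y, 1.0))
```

With collinear predictors the unpenalized coefficients tend to be large and unstable, while the penalty shrinks them toward zero; this is the property the abstract relies on when it refers to reducing the disturbance from collinear information.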
Zhang, Lu-da; Zhao, Li-li; Zhao, Long-lian; Li, Jun-hui; Yan, Yan-lu
2005-08-01
This paper introduces the principle and method for establishing quantitative analysis models for Fourier transform near-infrared (NIR) spectroscopy using the MAXR regression procedure. The authors selected the wavelength information with a program written in Matlab in order to establish the quantitative analysis models. Taking sixty-six wheat samples as experimental materials, quantitative analysis models to determine protein content were established with thirty-three samples. For the two models, which include two and three wavelengths respectively, the correlation coefficients were 0.9771 and 0.9765 and the standard errors were 0.335 and 0.340 between the predicted results and Kjeldahl's values for the protein content of the other thirty-three wheat samples. When selecting the wavelength information, the MAXR regression procedure can establish the optimum regression models containing 1, 2, ..., or k wavelengths respectively. The MAXR regression procedure is a useful method for selecting the optimum wavelength information because of its shorter computation time. The method can not only carefully select the essential wavelength information to establish quantitative NIR spectroscopy analysis models that resist disturbance from multicollinear information, but also lay the groundwork for selecting optimum wavelength information that can guide the design of special-purpose NIR analysis instruments for analyzing specific components in particular samples. PMID:16329486
NASA Astrophysics Data System (ADS)
Yang, Jianhong; Yi, Cancan; Xu, Jinwu; Ma, Xianghong
2015-05-01
A new LIBS quantitative analysis method based on analytical line adaptive selection and a Relevance Vector Machine (RVM) regression model is proposed. First, a scheme for adaptively selecting analytical lines is put forward in order to overcome the drawback of high dependency on a priori knowledge. The candidate analytical lines are automatically selected based on the built-in characteristics of spectral lines, such as spectral intensity, wavelength and width at half height. The analytical lines that will be used as input variables of the regression model are determined adaptively according to the samples for both training and testing. Second, an LIBS quantitative analysis method based on RVM is presented. The intensities of analytical lines and the elemental concentrations of certified standard samples are used to train the RVM regression model. The predicted elemental concentrations are given in the form of a confidence interval of a probability distribution, which is helpful for evaluating the uncertainty contained in the measured spectra. Chromium concentration analysis experiments on 23 certified standard high-alloy steel samples have been carried out. The multiple correlation coefficient of the prediction was up to 98.85%, and the average relative error of the prediction was 4.01%. The experimental results showed that the proposed LIBS quantitative analysis method achieved better prediction accuracy and better modeling robustness compared with the methods based on partial least squares regression, artificial neural network and standard support vector machine.
NASA Astrophysics Data System (ADS)
Denli, H. H.; Koc, Z.
2015-12-01
Estimation of real property values based on standards is difficult to apply across time and location. Regression analysis constructs mathematical models that describe or explain relationships that may exist between variables. The problem of identifying price differences of properties to obtain a price index can be converted into a regression problem, and standard techniques of regression analysis can be used to estimate the index. Applied to real estate valuation, using properties as they are presented in the actual market with their current characteristics and quantifiers, the method helps to find the effective factors or variables in the formation of the value. In this study, prices of housing for sale in Zeytinburnu, a district in Istanbul, are associated with its characteristics to find a price index, based on information received from a real estate web page. The variables used for the analysis are age, size in m2, number of floors in the building, floor number of the property, and number of rooms. The price of the property is the dependent variable, whereas the rest are independent variables. Prices of 60 properties were used for the analysis. Locations with the same price value were found and plotted on the map, and equivalence curves were drawn identifying the same-valued zones as lines.
Regression analysis to predict growth performance from dietary net energy in growing-finishing pigs.
Nitikanchana, S; Dritz, S S; Tokach, M D; DeRouchey, J M; Goodband, R D; White, B J
2015-06-01
Data from 41 trials with multiple energy levels (285 observations) were used in a meta-analysis to predict growth performance based on dietary NE concentration. Nutrient and energy concentrations in all diets were estimated using the NRC ingredient library. Predictor variables examined for best fit models using Akaike information criteria included linear and quadratic terms of NE, BW, CP, standardized ileal digestible (SID) Lys, crude fiber, NDF, ADF, fat, ash, and their interactions. The initial best fit models included interactions between NE and CP or SID Lys. After removal of the observations that fed SID Lys below the suggested requirement, these terms were no longer significant. Including dietary fat in the model with NE and BW significantly improved the G:F prediction model, indicating that NE may underestimate the influence of fat on G:F. The meta-analysis indicated that, as long as diets are adequate for other nutrients (i.e., Lys), dietary NE is adequate to predict changes in ADG across different dietary ingredients and conditions. The analysis indicates that ADG increases with increasing dietary NE and BW but decreases when BW is above 87 kg. The G:F ratio improves with increasing dietary NE and fat but decreases with increasing BW. The regression equations were then evaluated by comparing the actual and predicted performance of 543 finishing pigs in 2 trials fed 5 dietary treatments, included 3 different levels of NE by adding wheat middlings, soybean hulls, dried distillers grains with solubles (DDGS; 8 to 9% oil), or choice white grease (CWG) to a corn-soybean meal-based diet. Diets were 1) 30% DDGS, 20% wheat middlings, and 4 to 5% soybean hulls (low energy); 2) 20% wheat middlings and 4 to 5% soybean hulls (low energy); 3) a corn-soybean meal diet (medium energy); 4) diet 2 supplemented with 3.7% CWG to equalize the NE level to diet 3 (medium energy); and 5) a corn-soybean meal diet with 3.7% CWG (high energy). Only small differences were observed
Association of Hemostatic Markers with Atrial Fibrillation: A Meta-Analysis and Meta-Regression
Xiang, Ying; Wu, Long; Xu, Bin; Zhang, Yao; Ma, Xiangyu; Li, Yafei; Song, Zhiyuan; Zhong, Li
2015-01-01
Background There is growing evidence that indicates the presence of a prothrombotic state in atrial fibrillation (AF). However, the role of hemostatic markers in AF remains inconclusive. Methods We conducted a meta-analysis of observational studies to evaluate the association between hemostatic markers and AF. A meta-regression was performed to explore potential sources of heterogeneity. Results A total of 59 studies met our inclusion criteria for the meta-analysis. For platelet activation, increased circulating platelet factor-4, β-thromboglobulin (BTG) and P-selectin were significantly higher in AF cases compared with controls (standardized mean difference [SMD][95% confidence interval (CI)]: 1.72[0.96–2.49], 1.61[1.03–2.19] and 0.50[0.23–0.77], respectively). For coagulation activation, increased levels of plasma D-dimer, fibrinogen, thrombin-antithrombin, prothrombin fragment 1+2, and antithrombin-III were significantly associated with AF (SMD[95% CI]: 1.82[1.38–2.26], 0.72[0.55–0.89], 0.42[0.13–0.72], 1.00 [0.00–1.99] and 1.38[0.16–2.60], respectively). For fibrinolytic function, tissue-type plasminogen activator and plasminogen activator inhibitor-1 were significantly increased in AF cases compared with controls (SMD[95% CI]: 0.86[0.04–1.67] and 0.87[0.28–1.47], respectively) but the associations became nonsignificant after performing subgroup analysis by anticoagulants treatment status. For endothelial function, increased von Willebrand factor was significantly associated with AF (SMD, 0.79; 95% CI, 0.60–0.99); however, no association was observed for soluble thrombomodulin (SMD, 0.60; 95% CI, -0.13–1.33). Conclusions Increased circulating hemostatic factors (PF-4, BTG, P-selectin, D-dimer, fibrinogen, TAT, F1+2, AT- III, and vWf) are significantly associated with AF. Future research is necessary to elucidate the precise mechanism of the prothrombotic state and how hemostatic markers promote thromboembolism in AF. PMID:25884835
Liu, Bilan; Qiu, Xing; Zhu, Tong; Tian, Wei; Hu, Rui; Ekholm, Sven; Schifitto, Giovanni; Zhong, Jianhui
2016-01-01
Quantitative measurement of localized longitudinal changes in brain abnormalities at an individual level may offer critical information for disease diagnosis and treatment. The voxel-wise permutation-based method SPREAD/iSPREAD, which combines resampling and spatial regression of neighboring voxels, provides an effective and robust method for detecting subject-specific longitudinal changes within the whole brain, especially for longitudinal studies with a limited number of scans. As an extension of SPREAD/iSPREAD, we present a general method that facilitates analysis of serial Diffusion Tensor Imaging (DTI) measurements (with more than two time points) for testing localized changes in longitudinal studies. Two types of voxel-level test statistics (model-free test statistics, which measure intra-subject variability across time, and test statistics based on general linear model that incorporate specific lesion evolution models) were estimated and tested against the null hypothesis among groups of DTI data across time. The implementation and utility of the proposed statistical method were demonstrated by both Monte Carlo simulations and applications on clinical DTI data from human brain in vivo. By a design of test statistics based on the disease progression model, it was possible to apportion the true significant voxels attributed to the disease progression and those caused by underlying anatomical differences that cannot be explained by the model, which led to improvement in false positive (FP) control in the results. Extension of the proposed method to include other diseases or drug effect models, as well as the feasibility of global statistics, was discussed. The proposed statistical method can be extended to a broad spectrum of longitudinal studies with carefully designed test statistics, which helps to detect localized changes at the individual level. PMID:26977399
Ilic, Milena; Ilic, Irena
2014-01-01
Background Limited data on mortality from malignant lymphatic and hematopoietic neoplasms have been published for Serbia. Methods The study covered the population of Serbia during the 1991–2010 period. Mortality trends were assessed using joinpoint regression analysis. Results The trend for overall death rates from malignant lymphoid and haematopoietic neoplasms significantly decreased by −2.16% per year from 1991 through 1998, and then significantly increased by +2.20% per year for the 1998–2010 period. The growth during the entire period was on average +0.8% per year (95% CI 0.3 to 1.3). Mortality was higher among males than among females in all age groups. According to the comparability test, mortality trends from malignant lymphoid and haematopoietic neoplasms in men and women were parallel (final selected model failed to reject parallelism, P = 0.232). Among the younger Serbian population (0–44 years old), mortality declined in both sexes: trends significantly declined in males for the entire period, while in females 15–44 years of age mortality rates significantly declined only from 2003 onwards. The mortality trend significantly increased in the elderly in both sexes (by +1.7% in males and +1.5% in females in the 60–69 age group, and +3.8% in males and +3.6% in females in the 70+ age group). According to the comparability test, the mortality trend for Hodgkin's lymphoma differed significantly from mortality trends for all other types of malignant lymphoid and haematopoietic neoplasms (P<0.05). Conclusion The unfavourable mortality trend in Serbia requires targeted intervention for risk factor control, early diagnosis and modern therapy. PMID:25333862
Expert Involvement Predicts mHealth App Downloads: Multivariate Regression Analysis of Urology Apps
Osório, Luís; Cavadas, Vitor; Fraga, Avelino; Carrasquinho, Eduardo; Cardoso de Oliveira, Eduardo; Castelo-Branco, Miguel; Roobol, Monique J
2016-01-01
Background Urological mobile medical (mHealth) apps are gaining popularity with both clinicians and patients. mHealth is a rapidly evolving and heterogeneous field, with some urology apps being downloaded over 10,000 times and others not at all. The factors that contribute to medical app downloads have yet to be identified, including the hypothetical influence of expert involvement in app development. Objective The objective of our study was to identify predictors of the number of urology app downloads. Methods We reviewed urology apps available in the Google Play Store and collected publicly available data. Multivariate ordinal logistic regression evaluated the effect of publicly available app variables on the number of apps being downloaded. Results Of 129 urology apps eligible for study, only 2 (1.6%) had >10,000 downloads, with half having ≤100 downloads and 4 (3.1%) having none at all. Apps developed with expert urologist involvement (P=.003), optional in-app purchases (P=.01), higher user rating (P<.001), and more user reviews (P<.001) were more likely to be installed. App cost was inversely related to the number of downloads (P<.001). Only data from the Google Play Store and the developers’ websites, but not other platforms, were publicly available for analysis, and the level and nature of expert involvement was not documented. Conclusions The explicit participation of urologists in app development is likely to enhance its chances to have a higher number of downloads. This finding should help in the design of better apps and further promote urologist involvement in mHealth. Official certification processes are required to ensure app quality and user safety. PMID:27421338
Qian, Cheng; Wei, Baozhu; Ding, Jinye; Wu, Huiting; Cai, Xiaotao; Li, Benlei; Wang, Yanggan
2015-11-15
Rosuvastatin and atorvastatin both are high-intensity statins. However, which statin is more effective for the reversion of coronary atherosclerotic plaques remains inconclusive. We, therefore, conducted a meta-analysis to provide further evidence for proper statin selection. Pubmed, The Cochrane Library, Embase, Chinese BioMedicine, and China National Knowledge Infrastructure databases were systematically searched for eligible publications. We also manually reviewed the references from all relevant literature for more trials. Only studies that met our predefined inclusion criteria up to March 31, 2015, were enrolled. Five randomized controlled trials, 4 published in English and 1 in Chinese, were finally included in our study with a total of 1,556 participants, of whom 772 were in the rosuvastatin group and 784 in the atorvastatin group. The dose ratios of rosuvastatin versus atorvastatin were 1:2 in all included trials. Pooling across the studies demonstrated that compared with atorvastatin, rosuvastatin administration further reduced the total atheroma volume (weighted mean difference [WMD] -1.61 mm(3), 95% confidence interval [CI] -2.70 to -0.52; p = 0.004) and percent atheroma volume (WMD -0.34%, 95% CI -0.64 to -0.03; p = 0.03) and improved the lumen volume more significantly (WMD 2.10 mm(3), 95% CI 0.04 to 4.17; p = 0.046). The comparative regression of plaques was not different across subgroups. In conclusion, rosuvastatin is superior to atorvastatin in the reversion of coronary atherosclerotic plaques. PMID:26385518
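The reported p-values can be cross-checked against the reported confidence intervals. The sketch below assumes a symmetric 95% interval built from a normal (z) approximation, so that the standard error is the interval width divided by 2 × 1.96; the abstract does not state how the intervals were computed, and the function name is illustrative.

```python
import math

def p_from_ci(estimate, lower, upper, z_crit=1.96):
    """Recover the standard error, z statistic and two-sided p-value from a 95% CI."""
    se = (upper - lower) / (2.0 * z_crit)      # half-width divided by the critical value
    z = estimate / se
    p = math.erfc(abs(z) / math.sqrt(2.0))     # two-sided normal tail probability
    return se, z, p

# Weighted mean differences and 95% CIs quoted in the abstract.
reported = {
    "total atheroma volume (mm^3)": (-1.61, -2.70, -0.52),
    "percent atheroma volume (%)": (-0.34, -0.64, -0.03),
    "lumen volume (mm^3)": (2.10, 0.04, 4.17),
}

for name, (est, lo, hi) in reported.items():
    se, z, p = p_from_ci(est, lo, hi)
    print(f"{name}: SE={se:.3f}, z={z:.2f}, p={p:.4f}")
```

Under this approximation the recovered p-values land close to the quoted 0.004, 0.03 and 0.046, which suggests the intervals and p-values in the abstract are mutually consistent.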
NASA Astrophysics Data System (ADS)
Lu, Dan; Ye, Ming; Hill, Mary C.
2012-09-01
Confidence intervals based on classical regression theories augmented to include prior information and credible intervals based on Bayesian theories are conceptually different ways to quantify parametric and predictive uncertainties. Because both confidence and credible intervals are used in environmental modeling, we seek to understand their differences and similarities. This is of interest in part because calculating confidence intervals typically requires tens to thousands of model runs, while Bayesian credible intervals typically require tens of thousands to millions of model runs. Given multi-Gaussian distributed observation errors, our theoretical analysis shows that, for linear or linearized-nonlinear models, confidence and credible intervals are always numerically identical when consistent prior information is used. For nonlinear models, nonlinear confidence and credible intervals can be numerically identical if parameter confidence regions defined using the approximate likelihood method and parameter credible regions estimated using Markov chain Monte Carlo realizations are numerically identical and predictions are a smooth, monotonic function of the parameters. Both occur if intrinsic model nonlinearity is small. While the conditions of Gaussian errors and small intrinsic model nonlinearity are violated by many environmental models, heuristic tests using analytical and numerical models suggest that linear and nonlinear confidence intervals can be useful approximations of uncertainty even under significantly nonideal conditions. In the context of epistemic model error for a complex synthetic nonlinear groundwater problem, the linear and nonlinear confidence and credible intervals for individual models performed similarly enough to indicate that the computationally frugal confidence intervals can be useful in many circumstances. Experiences with these groundwater models are expected to be broadly applicable to many environmental models. We suggest that for
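The claim that confidence and credible intervals coincide for linear models with Gaussian errors and consistent prior information can be illustrated with a one-parameter sketch. The model y = b·x + e with known error standard deviation, the Gaussian prior on b, and all numbers below are assumptions made for illustration; none of them come from the paper. The first function treats the prior as one extra weighted observation in least squares, and the second applies the conjugate Bayesian update one observation at a time.

```python
import math

def augmented_ls_interval(x, y, sigma, prior_mean, prior_sd, z=1.96):
    """Confidence interval from weighted least squares with the prior as an extra observation."""
    w, w0 = 1.0 / sigma**2, 1.0 / prior_sd**2
    info = w * sum(xi * xi for xi in x) + w0                 # total information on b
    est = (w * sum(xi * yi for xi, yi in zip(x, y)) + w0 * prior_mean) / info
    half = z / math.sqrt(info)
    return est - half, est + half

def bayes_credible_interval(x, y, sigma, prior_mean, prior_sd, z=1.96):
    """Credible interval from sequential conjugate Gaussian updates of the prior on b."""
    mean, var = prior_mean, prior_sd**2
    for xi, yi in zip(x, y):
        gain = var * xi / (xi * xi * var + sigma**2)          # scalar Kalman-style gain
        mean = mean + gain * (yi - xi * mean)
        var = (1.0 - gain * xi) * var
    half = z * math.sqrt(var)
    return mean - half, mean + half

# Hypothetical observations and prior (illustrative values only).
x = [0.5, 1.0, 1.5, 2.0, 2.5]
y = [1.1, 1.9, 3.2, 3.9, 5.2]
ci = augmented_ls_interval(x, y, sigma=0.3, prior_mean=1.5, prior_sd=1.0)
cr = bayes_credible_interval(x, y, sigma=0.3, prior_mean=1.5, prior_sd=1.0)
print("confidence interval:", ci)
print("credible interval:  ", cr)
```

The two routes agree to floating-point precision in this linear Gaussian setting. For nonlinear models the agreement is no longer guaranteed, and the credible interval would typically need sampling such as Markov chain Monte Carlo, which is where the difference in the number of model runs arises.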
ERIC Educational Resources Information Center
Brabant, Marie-Eve; Hebert, Martine; Chagnon, Francois
2013-01-01
This study explored the clinical profiles of 77 female teenager survivors of sexual abuse and examined the association of abuse-related and personal variables with suicidal ideations. Analyses revealed that 64% of participants experienced suicidal ideations. Findings from classification and regression tree analysis indicated that depression,…
ERIC Educational Resources Information Center
Thomas, Emily H.; Galambos, Nora
To investigate how students' characteristics and experiences affect satisfaction, this study used regression and decision-tree analysis with the CHAID algorithm to analyze student opinion data from a sample of 1,783 college students. A data-mining approach identifies the specific aspects of students' university experience that most influence three…
ERIC Educational Resources Information Center
Muller, Veronica; Brooks, Jessica; Tu, Wei-Mo; Moser, Erin; Lo, Chu-Ling; Chan, Fong
2015-01-01
Purpose: The main objective of this study was to determine the extent to which physical and cognitive-affective factors are associated with fibromyalgia (FM) fatigue. Method: A quantitative descriptive design using correlation techniques and multiple regression analysis. The participants consisted of 302 members of the National Fibromyalgia &…
ERIC Educational Resources Information Center
Kanyongo, Gibbs Y.; Certo, Janine; Launcelot, Brown I.
2006-01-01
In this study, we report results of a study examining the relationship between home environment factors and reading achievement in Zimbabwe. The study utilised data collected by the Southern and Eastern Africa Consortium for Monitoring Educational Quality (SACMEQ). The data were submitted to linear regression analysis through structural equation…
ERIC Educational Resources Information Center
Fraas, John W.; Newman, Isadore
1996-01-01
In a conjoint-analysis consumer-preference study, researchers must determine whether the product factor estimates, which measure consumer preferences, should be calculated and interpreted for each respondent or collectively. Multiple regression models can determine whether to aggregate data by examining factor-respondent interaction effects. This…
Analysis of Multivariate Experimental Data Using A Simplified Regression Model Search Algorithm
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert M.
2013-01-01
A new regression model search algorithm was developed that may be applied to both general multivariate experimental data sets and wind tunnel strain-gage balance calibration data. The algorithm is a simplified version of a more complex algorithm that was originally developed for the NASA Ames Balance Calibration Laboratory. The new algorithm performs regression model term reduction to prevent overfitting of data. It has the advantage that it needs only about one tenth of the original algorithm's CPU time for the completion of a regression model search. In addition, extensive testing showed that the prediction accuracy of math models obtained from the simplified algorithm is similar to the prediction accuracy of math models obtained from the original algorithm. The simplified algorithm, however, cannot guarantee that search constraints related to a set of statistical quality requirements are always satisfied in the optimized regression model. Therefore, the simplified algorithm is not intended to replace the original algorithm. Instead, it may be used to generate an alternate optimized regression model of experimental data whenever the application of the original search algorithm fails or requires too much CPU time. Data from a machine calibration of NASA's MK40 force balance is used to illustrate the application of the new search algorithm.
Significant drivers of the virtual water trade evaluated with a multivariate regression analysis
NASA Astrophysics Data System (ADS)
Tamea, Stefania; Laio, Francesco; Ridolfi, Luca
2014-05-01
International trade of food is vital for the food security of many countries, which rely on trade to compensate for an agricultural production insufficient to feed the population. At the same time, food trade has implications on the distribution and use of water resources, because through the international trade of food commodities, countries virtually displace the water used for food production, known as "virtual water". Trade thus implies a network of virtual water fluxes from exporting to importing countries, which has been estimated to displace more than 2 billion m3 of water per year, or about 2% of the annual global precipitation above land. It is thus important to adequately identify the dynamics and the controlling factors of the virtual water trade in that it supports and enables the world food security. Using the FAOSTAT database of international trade and the virtual water content available from the Water Footprint Network, we reconstructed 25 years (1986-2010) of virtual water fluxes. We then analyzed the dependence of exchanged fluxes on a set of major relevant factors that include: population, gross domestic product, arable land, virtual water embedded in agricultural production and dietary consumption, and geographical distance between countries. Significant drivers have been identified by means of a multivariate regression analysis, applied separately to the export and import fluxes of each country; temporal trends are outlined and the relative importance of drivers is assessed by a commonality analysis. Results indicate that population, gross domestic product and geographical distance are the major drivers of virtual water fluxes, with a minor (but non-negligible) contribution given by the agricultural production of exporting countries. Such drivers have become relevant for an increasing number of countries throughout the years, with an increasing variance explained by the distance between countries and a decreasing role of the gross
Solving Logistic Regression with Group Cardinality Constraints for Time Series Analysis
Zhang, Yong; Pohl, Kilian M.
2016-01-01
We propose an algorithm to distinguish 3D+t images of healthy from diseased subjects by solving logistic regression based on cardinality-constrained group sparsity. This method reduces the risk of overfitting by providing an elegant solution to identifying anatomical regions most impacted by disease. It also ensures consistent identification across the time series by grouping each image feature across time and counting the number of non-zero groupings. While popular in medical imaging, group cardinality constrained problems are generally solved by relaxing counting with summing over the groupings. We instead solve the original problem by generalizing a penalty decomposition algorithm, which alternates between minimizing a logistic regression function with a regularizer based on the Frobenius norm and enforcing sparsity. Applied to 86 cine MRIs of healthy cases and subjects with Tetralogy of Fallot (TOF), our method correctly identifies regions impacted by TOF and obtains a statistically significantly higher classification accuracy than logistic regression without and with relaxed group sparsity constraints.
Ambient-temperature regression analysis for estimating retrofit savings in commercial buildings
Kissock, J.K.; Reddy, T.A.; Claridge, D.E.
1998-08-01
This paper describes a procedure for estimating weather-adjusted retrofit savings in commercial buildings using ambient-temperature regression models. The selection of ambient temperature as the sole independent regression variable is discussed. An approximate method for determining the uncertainty of savings and a method for identifying the data time scale which minimizes the uncertainty of savings are developed. The appropriate uses of both linear and change-point models for estimating savings based on expected heating and cooling relationships for common HVAC systems are described. A case study example illustrates the procedure.
Development of LACIE CCEA-1 weather/wheat yield models. [regression analysis
NASA Technical Reports Server (NTRS)
Strommen, N. D.; Sakamoto, C. M.; Leduc, S. K.; Umberger, D. E. (Principal Investigator)
1979-01-01
The advantages and disadvantages of the causal (phenological, dynamic, physiological), statistical regression, and analog approaches to modeling for grain yield are examined. Given LACIE's primary goal of estimating wheat production for the large areas of eight major wheat-growing regions, the statistical regression approach of correlating historical yield and climate data offered the Center for Climatic and Environmental Assessment the greatest potential return within the constraints of time and data sources. The basic equation for the first generation wheat-yield model is given. Topics discussed include truncation, trend variable, selection of weather variables, episodic events, strata selection, operational data flow, weighting, and model results.
2013-01-01
Background A tandem technique of hard equipment is often used for the chemical analysis of a single cell to first isolate and then detect the wanted identities. The first part is the separation of wanted chemicals from the bulk of a cell; the second part is the actual detection of the important identities. To identify the key structural modifications around ligand binding, the present study aims to develop a counterpart of tandem technique for cheminformatics. A statistical regression and its outliers act as a computational technique for separation. Results A PPARγ (peroxisome proliferator-activated receptor gamma) agonist cellular system was subjected to such an investigation. Results show that this tandem regression-outlier analysis, or the prioritization of the context equations tagged with features of the outliers, is an effective regression technique of cheminformatics to detect key structural modifications, as well as their tendency of impact to ligand binding. Conclusions The key structural modifications around ligand binding are effectively extracted or characterized out of cellular reactions. This is because molecular binding is the paramount factor in such ligand cellular system and key structural modifications around ligand binding are expected to create outliers. Therefore, such outliers can be captured by this tandem regression-outlier analysis. PMID:23627990
Nagatsuka, Kazuyuki; Miyata, Shigeki; Kada, Akiko; Kawamura, Atsushi; Nakagawara, Jyoji; Furui, Eisuke; Takiuchi, Shin; Taomoto, Katsushi; Kario, Kazuomi; Uchiyama, Shinichiro; Saito, Kozue; Nagao, Takehiko; Kitagawa, Kazuo; Hosomi, Naohisa; Tanaka, Keiji; Kaikita, Koichi; Katayama, Yasuo; Abumiya, Takeo; Nakane, Hiroshi; Wada, Hideo; Hattori, Akira; Kimura, Kazumi; Isshiki, Takaaki; Nishikawa, Masakatsu; Yamawaki, Takemori; Yonemoto, Naohiro; Okada, Hiromi; Ogawa, Hisao; Minematsu, Kazuo; Miyata, Toshiyuki
2016-08-01
Several studies have indicated that approximately 25 % of patients treated with aspirin exhibit high on-treatment platelet reactivity (HTPR), which is potentially associated with cardiovascular events (CVEs). However, this association is still controversial, since the mechanisms by which HTPR contributes to CVEs remain unclear and no standardised definition of HTPR has been established. To determine whether HTPR is associated with CVE recurrence and what type of assay would best predict CVE recurrence, we conducted a multicentre prospective cohort study of 592 stable cardiovascular outpatients treated with aspirin monotherapy for secondary prevention. Their HTPR was determined by arachidonic acid- or collagen-induced aggregation assays using two different agonist concentrations. Residual cyclooxygenase (COX)-1 activity was assessed by measuring serum thromboxane (TX)B2 or urinary 11-dehydro TXB2. Shear-induced platelet thrombus formation was also examined. We followed all patients for two years to evaluate how these seven indexes were related to the recurrence of CVEs (cerebral infarction, transient ischaemic attack, myocardial infarction, unstable angina, revascularisation, other arterial thrombosis, or cardiovascular death). Of 583 patients eligible for the analysis, CVEs occurred in 69 (11.8 %). A Cox regression model identified several classical risk factors associated with CVEs. However, neither HTPR nor high residual COX-1 activity was significantly associated with CVEs, even by applying cut-off values suggested in previous reports or a receiver-operating characteristic analysis. In conclusion, recurrence of CVEs occurred independently of HTPR and residual COX-1 activity. Thus, our findings do not support the use of platelet or COX-1 functional testing for predicting clinical outcomes in stable cardiovascular patients. PMID:27098431
ERIC Educational Resources Information Center
Wiley, Kristofor R.
2013-01-01
Many of the social and emotional needs that have historically been associated with gifted students have been questioned on the basis of recent empirical evidence. Research on the topic, however, is often limited by sample size, selection bias, or definition. This study addressed these limitations by applying linear regression methodology to data…
ERIC Educational Resources Information Center
Cohen, Ayala; Nahum-Shani, Inbal; Doveh, Etti
2010-01-01
In their seminal paper, Edwards and Parry (1993) presented the polynomial regression as a better alternative to applying difference score in the study of congruence. Although this method is increasingly applied in congruence research, its complexity relative to other methods for assessing congruence (e.g., difference score methods) was one of the…
ERIC Educational Resources Information Center
Baylor, Carolyn; Yorkston, Kathryn; Bamer, Alyssa; Britton, Deanna; Amtmann, Dagmar
2010-01-01
Purpose: To explore variables associated with self-reported communicative participation in a sample (n = 498) of community-dwelling adults with multiple sclerosis (MS). Method: A battery of questionnaires was administered online or on paper per participant preference. Data were analyzed using multiple linear backward stepwise regression. The…
Computation of major solute concentrations and loads in German rivers using regression analysis.
Steele, T.D.
1980-01-01
Regression functions between concentrations of several inorganic solutes and specific conductance and between specific conductance and stream discharge were derived from intermittent samples collected for 2 rivers in West Germany. These functions, in conjunction with daily records of streamflow, were used to determine monthly and annual solute loadings. -from Author
Using Robust Variance Estimation to Combine Multiple Regression Estimates with Meta-Analysis
ERIC Educational Resources Information Center
Williams, Ryan
2013-01-01
The purpose of this study was to explore the use of robust variance estimation for combining commonly specified multiple regression models and for combining sample-dependent focal slope estimates from diversely specified models. The proposed estimator obviates traditionally required information about the covariance structure of the dependent…
Decision Models for Admission into Teacher Preparation: An Application of Regression Analysis.
ERIC Educational Resources Information Center
Garcia, Ricardo A.; Denton, Jon J.
This investigation explores the feasibility of constructing regression models for predicting teaching success and classroom behavior utilizing various measures of attitudes, personality, and psychological factors as predictors. The purpose of the study is to devise decision models that predict a candidate's success in student teaching as…
Multiple linear regression models are often used to predict levels of fecal indicator bacteria (FIB) in recreational swimming waters based on independent variables (IVs) such as meteorologic, hydrodynamic, and water-quality measures. The IVs used for these analyses are traditiona...
NASA Astrophysics Data System (ADS)
Keat, Sim Chong; Chun, Beh Boon; San, Lim Hwee; Jafri, Mohd Zubir Mat
2015-04-01
Climate change due to carbon dioxide (CO2) emissions is one of the most complex challenges threatening our planet. The issue is considered a major international concern and is primarily attributed to different fossil fuels. In this paper, a regression model is used to analyze the causal relationship between CO2 emissions and energy consumption in Malaysia using time series data for the period 1980-2010. The equations were developed using a regression model based on the eight major sources that contribute to CO2 emissions: non-energy, Liquefied Petroleum Gas (LPG), diesel, kerosene, refinery gas, Aviation Turbine Fuel (ATF) and Aviation Gasoline (AV Gas), fuel oil, and motor petrol. Part of the data (1980-2000) was used to fit the regression model and the remainder (2001-2010) was used to validate it. Comparison of the model predictions with the measured data showed a high correlation coefficient (R2=0.9544), indicating the model's accuracy and efficiency. These results are accurate and can be used in early warning of the population to comply with air quality standards.
Multiple Regression Analysis of Factors that May Influence Middle School Science Scores
ERIC Educational Resources Information Center
Glover, Judith
2012-01-01
The purpose of this quantitative multiple regression study was to determine whether a relationship existed between Maryland State Assessment (MSA) reading scores, MSA math scores, gender, ethnicity, age, and MSA science scores. Also examined was if MSA reading scores, MSA math scores, gender, ethnicity, and age can be used in combination or alone…
ERIC Educational Resources Information Center
Luna, Andrew L.; Brennan, Kelly A.
2009-01-01
This study uses a regression model to determine if a significant difference exists between the actual budget allocation that an academic department received and the model's predicted budget allocation for that same department. Budget data from a Southeastern Master's/Comprehensive state university were used as the dependent variable, and the…
Analysis of Multivariate Experimental Data Using A Simplified Regression Model Search Algorithm
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert Manfred
2013-01-01
A new regression model search algorithm was developed in 2011 that may be used to analyze both general multivariate experimental data sets and wind tunnel strain-gage balance calibration data. The new algorithm is a simplified version of a more complex search algorithm that was originally developed at the NASA Ames Balance Calibration Laboratory. The new algorithm has the advantage that it needs only about one tenth of the original algorithm's CPU time for the completion of a search. In addition, extensive testing showed that the prediction accuracy of math models obtained from the simplified algorithm is similar to the prediction accuracy of math models obtained from the original algorithm. The simplified algorithm, however, cannot guarantee that search constraints related to a set of statistical quality requirements are always satisfied in the optimized regression models. Therefore, the simplified search algorithm is not intended to replace the original search algorithm. Instead, it may be used to generate an alternate optimized regression model of experimental data whenever the application of the original search algorithm either fails or requires too much CPU time. Data from a machine calibration of NASA's MK40 force balance is used to illustrate the application of the new regression model search algorithm.
Risk Factors of Falls in Community-Dwelling Older Adults: Logistic Regression Tree Analysis
ERIC Educational Resources Information Center
Yamashita, Takashi; Noe, Douglas A.; Bailer, A. John
2012-01-01
Purpose of the Study: A novel logistic regression tree-based method was applied to identify fall risk factors and possible interaction effects of those risk factors. Design and Methods: A nationally representative sample of American older adults aged 65 years and older (N = 9,592) in the Health and Retirement Study 2004 and 2006 modules was used.…
A Latent Class Regression Analysis of Men's Conformity to Masculine Norms and Psychological Distress
ERIC Educational Resources Information Center
Wong, Y. Joel; Owen, Jesse; Shea, Munyi
2012-01-01
How are specific dimensions of masculinity related to psychological distress in specific groups of men? To address this question, the authors used latent class regression to assess the optimal number of latent classes that explained differential relationships between conformity to masculine norms and psychological distress in a racially diverse…
On the Usefulness of a Multilevel Logistic Regression Approach to Person-Fit Analysis
ERIC Educational Resources Information Center
Conijn, Judith M.; Emons, Wilco H. M.; van Assen, Marcel A. L. M.; Sijtsma, Klaas
2011-01-01
The logistic person response function (PRF) models the probability of a correct response as a function of the item locations. Reise (2000) proposed to use the slope parameter of the logistic PRF as a person-fit measure. He reformulated the logistic PRF model as a multilevel logistic regression model and estimated the PRF parameters from this…
Genetic analysis of carcass traits in beef cattle using random regression models.
Englishby, T M; Banos, G; Moore, K L; Coffey, M P; Evans, R D; Berry, D P
2016-04-01
Livestock mature at different rates depending, in part, on their genetic merit; therefore, the optimal age at slaughter for progeny of certain sires may differ. The objective of the present study was to examine sire-level genetic profiles for carcass weight, carcass conformation, and carcass fat in cattle of multiple beef and dairy breeds, including crossbreeds. Slaughter records from 126,214 heifers and 124,641 steers aged between 360 and 1,200 d and from 86,089 young bulls aged between 360 and 720 d were used in the analysis; animals were from 15,127 sires. Variance components for each trait across age at slaughter were generated using sire random regression models that included quadratic polynomials for fixed and random effects; heterogeneous residual variances were assumed across ages. Heritability estimates across genders ranged from 0.08 (±0.02) to 0.34 (±0.02) for carcass weight, from 0.24 (±0.02) to 0.42 (±0.01) for conformation, and from 0.16 (±0.03) to 0.40 (±0.02) for fat score. Genetic correlations within each trait across ages weakened as the interval between ages compared lengthened but were all >0.64, suggesting a similar genetic background for each trait across different ages. Eigenvalues and eigenfunctions of the additive genetic covariance matrix revealed genetic variability among animals in their growth profiles for carcass traits, although most of the genetic variability was associated with the height of the growth profile. At the same age, a positive genetic correlation (0.60 to 0.78; SE ranged from 0.01 to 0.04) existed between carcass weight and conformation, whereas negative genetic correlations existed between fatness and both conformation (-0.46 to 0.08; SE ranged from 0.02 to 0.09) and carcass weight (-0.48 to -0.16; SE ranged from 0.02 to 0.14) at the same age. The estimated genetic parameters in the present study indicate genetic variability in the growth trajectory in cattle, which can be exploited through breeding programs and
Huntley, J D; Gould, R L; Liu, K; Smith, M; Howard, R J
2015-01-01
Objectives To review the efficacy of cognitive interventions on improving general cognition in dementia. Method Online literature databases and trial registers, previous systematic reviews and leading journals were searched for relevant randomised controlled trials. A systematic review, random-effects meta-analyses and meta-regression were conducted. Cognitive interventions were categorised as: cognitive stimulation (CS), involving a range of social and cognitive activities to stimulate multiple cognitive domains; cognitive training (CT), involving repeated practice of standardised tasks targeting a specific cognitive function; cognitive rehabilitation (CR), which takes a person-centred approach to target impaired function; or mixed CT and stimulation (MCTS). Separate analyses were conducted for general cognitive outcome measures and for studies using ‘active’ (designed to control for non-specific therapeutic effects) and non-active (minimal or no intervention) control groups. Results 33 studies were included. Significant positive effect sizes (Hedges’ g) were found for CS with the mini-mental state examination (MMSE) (g=0.51, 95% CI 0.29 to 0.69; p<0.001) compared to non-active controls and (g=0.35, 95% CI 0.06 to 0.65; p=0.019) compared to active controls. Significant benefit was also seen with the Alzheimer's disease Assessment Scale-Cognition (ADAS-Cog) (g=−0.26, 95% CI −0.445 to −0.08; p=0.005). There was no evidence that CT or MCTS produced significant improvements on general cognition outcomes and not enough CR studies for meta-analysis. The lowest accepted minimum clinically important difference was reached in 11/17 CS studies for the MMSE, but only 2/9 studies for the ADAS-Cog. Additionally, 95% prediction intervals suggested that although statistically significant, CS may not lead to benefits on the ADAS-Cog in all clinical settings. Conclusions CS improves scores on MMSE and ADAS-Cog in dementia, but benefits on the ADAS-Cog are generally
NASA Astrophysics Data System (ADS)
Tomczyk, Aleksandra; Ewertowski, Marek; White, Piran; Kasprzak, Leszek
2016-04-01
The dual role of many Protected Natural Areas in providing benefits for both conservation and recreation poses challenges for management. Although recreation-based damage to ecosystems can occur very quickly, restoration can take many years. The protection of conservation interests at the same time as providing for recreation requires decisions to be made about how to prioritise and direct management actions. Trails are commonly used to divert visitors from the most important areas of a site, but high visitor pressure can lead to increases in trail width and a concomitant increase in soil erosion. Here we use detailed field data on condition of recreational trails in Gorce National Park, Poland, as the basis for a regression tree analysis to determine the factors influencing trail deterioration, and link specific trail impacts with environmental, use related and managerial factors. We distinguished 12 types of trails, characterised by four levels of degradation: (1) trails with an acceptable level of degradation; (2) threatened trails; (3) damaged trails; and (4) heavily damaged trails. Damaged trails were the most vulnerable of all trails and should be prioritised for appropriate conservation and restoration. We also proposed five types of monitoring of recreational trail conditions: (1) rapid inventory of negative impacts; (2) monitoring visitor numbers and variation in type of use; (3) change-oriented monitoring focusing on sections of trail which were subjected to changes in type or level of use or subjected to extreme weather events; (4) monitoring of dynamics of trail conditions; and (5) full assessment of trail conditions, to be carried out every 10-15 years. The application of the proposed framework can enhance the ability of Park managers to prioritise their trail management activities, enhancing trail conditions and visitor safety, while minimising adverse impacts on the conservation value of the ecosystem. A.M.T. was supported by the Polish Ministry of
Regression analysis of recent changes in cardiovascular morbidity and mortality in The Netherlands.
Bonneux, L.; Looman, C. W.; Barendregt, J. J.; Van der Maas, P. J.
1997-01-01
OBJECTIVES: To test whether recent declines in mortality from coronary heart disease were associated with increased mortality from other cardiovascular diseases. DESIGN: Poisson regression analysis of national data on causes of death and hospital discharges. SETTING AND SUBJECTS: Population of the Netherlands, 1969-93. MAIN OUTCOME MEASURES: Annual changes in mortality from coronary heart disease, stroke, and other cardiovascular diseases and annual changes in hospital discharge rates for acute coronary events, stroke, and congestive heart failures. RESULTS: Patterns of cardiovascular mortality changed abruptly in 1987-93. Annual decline in mortality from coronary heart disease increased sharply for women and men: from -1.9% (95% confidence interval -2.2% to -1.6%) and -1.7% (-1.9% to -1.4%) respectively in 1979-86 to -3.1% (-3.5% to -2.6%) and -4.2% (-4.6% to -3.9%) in 1987-93. The longstanding decline in mortality from stroke levelled off: from annual change of -3.3% (-3.7% to -2.8%) and -3.2% (-3.7% to -2.8%) in 1979-86 to -0.1% (-0.7% to 0.4%) and -1.1% (-1.7% to -0.5%) in 1987-93. Mortality from other cardiovascular diseases, however, started to increase: from -2.0% (-2.4% to -1.6%) and -0.2% (-0.5% to 0.2%) in 1979-86 to 1.5% (1.0% to 2.0%) and 1.9% (1.5% to 2.3%) in 1987-93. Hospital discharge rates for acute coronary heart disease, congestive heart failure, and stroke increased during 1980-6. During 1987-93 discharge rates for stroke and coronary heart disease stabilised but rates for congestive heart failure increased. CONCLUSION: Improved management of coronary heart disease seems to have reduced mortality, but some of the gains are lost to deaths from stroke and other cardiovascular diseases. The increasing numbers of patients with coronary heart disease who survive will increase demands on health services for long term care. PMID:9080996
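As a rough illustration of how an annual percentage change can be read off a Poisson regression with a log link, the sketch below fits log(rate) = a + b*(year - first year) by Newton-Raphson and reports 100*(exp(b) - 1). The counts and the population are synthetic and hypothetical, constructed so that the rate falls by 4.2% per year; they are not the Dutch data, and the abstract does not give the authors' actual model specification.

```python
import math

def fit_poisson_trend(years, deaths, population, iterations=50):
    """Maximum-likelihood fit of deaths ~ Poisson(population * exp(a + b*t))."""
    t = [y - years[0] for y in years]               # time since the first year
    a = math.log(sum(deaths) / sum(population))     # start from the overall rate
    b = 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for ti, di, ni in zip(t, deaths, population):
            mu = ni * math.exp(a + b * ti)          # expected deaths
            g0 += di - mu                           # score for a
            g1 += (di - mu) * ti                    # score for b
            h00 += mu                               # information matrix entries
            h01 += mu * ti
            h11 += mu * ti * ti
        det = h00 * h11 - h01 * h01
        a += (h11 * g0 - h01 * g1) / det            # Newton step
        b += (h00 * g1 - h01 * g0) / det
    return a, b

# Hypothetical data: constant population, rate declining 4.2% per year.
years = list(range(1987, 1994))
population = [1_000_000] * len(years)
deaths = [1_000_000 * 0.002 * 0.958 ** k for k in range(len(years))]

a, b = fit_poisson_trend(years, deaths, population)
annual_change_pct = 100.0 * (math.exp(b) - 1.0)
print(f"annual change: {annual_change_pct:.1f}%")
```

Because the log link makes the yearly effect multiplicative, the slope b converts directly into a percentage change per year, which is the form in which the abstract reports its results.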
Regression analysis of time trends in perinatal mortality in Germany 1980-1993.
Scherb, H; Weigelt, E; Brüske-Hohlfeld, I
2000-01-01
Numerous investigations have been carried out on the possible impact of the Chernobyl accident on the prevalence of anomalies at birth and on perinatal mortality. In many cases the studies were aimed at the detection of differences of pregnancy outcome measurements between regions or time periods. Most authors conclude that there is no evidence of a detrimental physical effect on congenital anomalies or other outcomes of pregnancy following the accident. In this paper, we report on statistical analyses of time trends of perinatal mortality in Germany. Our main intention is to investigate whether perinatal mortality, as reflected in official records, was increased in 1987 as a possible effect of the Chernobyl accident. We show that, in Germany as a whole, there was a significantly elevated perinatal mortality proportion in 1987 as compared to the trend function. The increase is 4.8% (p = 0.0046) of the expected perinatal death proportion for 1987. Even more pronounced levels of 8.2% (p = 0.0458) and 8.5% (p = 0.0702) may be found in the higher contaminated areas of the former German Democratic Republic (GDR), including West Berlin, and of Bavaria, respectively. To investigate the impact of statistical models on results, we applied three standard regression techniques. The observed significant increase in 1987 is independent of the statistical model used. Stillbirth proportions show essentially the same behavior as perinatal death proportions, but the results for all of Germany are nonsignificant due to the smaller numbers involved. Analysis of the association of stillbirth proportions with the (137)Cs deposition on a district level in Bavaria discloses a significant relationship. Our results are in contrast to those of many analyses of the health consequences of the Chernobyl accident and contradict the present radiobiologic knowledge. As we are dealing with highly aggregated data, other causes or artifacts may explain the observed effects. Hence, the findings
NASA Astrophysics Data System (ADS)
Grégoire, G.
2014-12-01
Logistic regression was originally intended to explain the relationship between the probability of an event and a set of covariables. The model's coefficients can be interpreted via the odds and odds ratio, which are presented in the introduction of the chapter. When the observations are obtained individually, we speak of binary logistic regression; when they are grouped, the logistic regression is said to be binomial. In our presentation we mainly focus on the binary case. For statistical inference the main tool is the maximum likelihood methodology: we present the Wald, Rao and likelihood ratio results and their use to compare nested models. The problems we intend to deal with are essentially the same as in multiple linear regression: testing global effect, individual effect, selection of variables to build a model, measure of the fitness of the model, prediction of new values, and so on. The methods are demonstrated on data sets using R. Finally we briefly consider the binomial case and the situation where we are interested in several events, that is the polytomous (multinomial) logistic regression and the particular case of ordinal logistic regression.
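The link between logistic coefficients and odds ratios can be sketched in a few lines. The chapter itself works in R; the example below uses plain Python only to stay self-contained, and the 2x2 counts are hypothetical. It fits a binary logistic regression with one binary covariate by Newton-Raphson (maximum likelihood) and checks that exp(b1) reproduces the odds ratio computed directly from the table.

```python
import math

def fit_logistic(x, y, iterations=25):
    """Fit P(y=1) = 1 / (1 + exp(-(b0 + b1*x))) by Newton-Raphson."""
    b0 = b1 = 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p                 # score for the intercept
            g1 += (yi - p) * xi          # score for the slope
            h00 += w                     # information matrix entries
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det    # Newton step
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1

# Hypothetical data: 30 events among 50 exposed, 10 events among 50 unexposed.
x = [1] * 50 + [0] * 50
y = [1] * 30 + [0] * 20 + [1] * 10 + [0] * 40

b0, b1 = fit_logistic(x, y)
odds_ratio = math.exp(b1)                      # odds ratio from the model
table_odds_ratio = (30 / 20) / (10 / 40)       # odds ratio from the 2x2 table
print(round(odds_ratio, 3), table_odds_ratio)
```

With a single binary covariate the model is saturated, so the fitted odds ratio and the table odds ratio agree; exp(b0) is the odds of the event in the unexposed group.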
Criteria for the use of regression analysis for remote sensing of sediment and pollutants
NASA Technical Reports Server (NTRS)
Whitlock, C. H.; Kuo, C. Y.; Lecroy, S. R.
1982-01-01
An examination of limitations, requirements, and precision of the linear multiple-regression technique for quantification of marine environmental parameters is conducted. Both environmental and optical physics conditions have been defined for which an exact solution to the signal response equations is of the same form as the multiple regression equation. Various statistical parameters are examined to define criteria for selection of an unbiased fit when upwelled radiance values contain error and are correlated with each other. Field experimental data are examined to define data smoothing requirements in order to satisfy the criteria of Daniel and Wood (1971). Recommendations are made concerning improved selection of ground-truth locations to maximize variance and to minimize physical errors associated with the remote sensing experiment.
Lee, Soo Min; Lee, Jae-Won
2014-11-01
In this study, the optimal conditions for biomass torrefaction were determined by comparing the gain of energy content to the weight loss of biomass from the final products. Torrefaction experiments were performed at temperatures ranging from 220 to 280 °C using 20-80 min reaction times. Polynomial regression models ranging from the 1st to the 3rd order were used to determine a relationship between the severity factor (SF) and calorific value or weight loss. The intersection of the two regression models for calorific value and weight loss was determined and assumed to be the optimized SF. The optimized SFs for each biomass ranged from 6.056 to 6.372. Optimized torrefaction conditions were determined at various reaction times of 15, 30, and 60 min. The average optimized temperature was 248.55 °C in the studied biomass when torrefaction was performed for 60 min. PMID:25266685
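The idea of taking the intersection of two fitted polynomial curves as the optimum can be sketched as follows. This is only an illustration under stated assumptions: the two coefficient lists are hypothetical, not the fitted models of the study, and both curves are assumed to be expressed on a common scale so that an intersection is meaningful. The intersection is located by bisection on the difference of the two polynomials.

```python
def polyval(coeffs, x):
    """Evaluate a polynomial given coefficients from highest to lowest degree."""
    result = 0.0
    for c in coeffs:
        result = result * x + c  # Horner's scheme
    return result


def intersection(coeffs_a, coeffs_b, lo, hi, tol=1e-10):
    """Find x in [lo, hi] where the two polynomials take the same value."""
    def diff(x):
        return polyval(coeffs_a, x) - polyval(coeffs_b, x)

    f_lo, f_hi = diff(lo), diff(hi)
    if f_lo * f_hi > 0:
        raise ValueError("the curves do not cross inside the bracket")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = diff(mid)
        if f_lo * f_mid <= 0:
            hi = mid               # the crossing lies in the lower half
        else:
            lo, f_lo = mid, f_mid  # the crossing lies in the upper half
    return 0.5 * (lo + hi)


# Hypothetical models as functions of the severity factor (SF)
gain_model = [2.0, -10.0]       # 1st order: 2*SF - 10
loss_model = [1.0, 0.0, -34.0]  # 2nd order: SF**2 - 34

sf_opt = intersection(gain_model, loss_model, lo=5.0, hi=7.0)
print(round(sf_opt, 6))
```

Bisection needs a bracket in which the difference changes sign; for higher-order fits with several crossings, each crossing would need its own bracket.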
Sanford, Ward E.; Nelms, David L.; Pope, Jason P.; Selnick, David L.
2012-01-01
This study by the U.S. Geological Survey, prepared in cooperation with the Virginia Department of Environmental Quality, quantifies the components of the hydrologic cycle across the Commonwealth of Virginia. Long-term, mean fluxes were calculated for precipitation, surface runoff, infiltration, total evapotranspiration (ET), riparian ET, recharge, base flow (or groundwater discharge) and net total outflow. Fluxes of these components were first estimated on a number of real-time-gaged watersheds across Virginia. Specific conductance was used to distinguish and separate surface runoff from base flow. Specific-conductance data were collected every 15 minutes at 75 real-time gages for approximately 18 months between March 2007 and August 2008. Precipitation was estimated for 1971–2000 using PRISM climate data. Precipitation and temperature from the PRISM data were used to develop a regression-based relation to estimate total ET. The proportion of watershed precipitation that becomes surface runoff was related to physiographic province and rock type in a runoff regression equation. Component flux estimates from the watersheds were transferred to flux estimates for counties and independent cities using the ET and runoff regression equations. Only 48 of the 75 watersheds yielded sufficient data, and data from these 48 were used in the final runoff regression equation. The base-flow proportion for the 48 watersheds averaged 72 percent using specific conductance, a value that was substantially higher than the 61 percent average calculated using a graphical-separation technique (the USGS program PART). Final results for the study are presented as component flux estimates for all counties and independent cities in Virginia.
Predicting Student Success on the Texas Chemistry STAAR Test: A Logistic Regression Analysis
ERIC Educational Resources Information Center
Johnson, William L.; Johnson, Annabel M.; Johnson, Jared
2012-01-01
Background: The context is the new Texas STAAR end-of-course testing program. Purpose: The authors developed a logistic regression model to predict who would pass-or-fail the new Texas chemistry STAAR end-of-course exam. Setting: Robert E. Lee High School (5A) with an enrollment of 2700 students, Tyler, Texas. Date of the study was the 2011-2012…
NASA Astrophysics Data System (ADS)
Schlechtingen, Meik; Ferreira Santos, Ilmar
2011-07-01
This paper presents the research results of a comparison of three different model based approaches for wind turbine fault detection in online SCADA data, by applying developed models to five real measured faults and anomalies. The regression based model as the simplest approach to build a normal behavior model is compared to two artificial neural network based approaches, which are a full signal reconstruction and an autoregressive normal behavior model. Based on a real time series containing two generator bearing damages the capabilities of identifying the incipient fault prior to the actual failure are investigated. The period after the first bearing damage is used to develop the three normal behavior models. The developed or trained models are used to investigate how the second damage manifests in the prediction error. Furthermore the full signal reconstruction and the autoregressive approach are applied to further real time series containing gearbox bearing damages and stator temperature anomalies. The comparison revealed all three models being capable of detecting incipient faults. However, they differ in the effort required for model development and the remaining operational time after first indication of damage. The general nonlinear neural network approaches outperform the regression model. The remaining seasonality in the regression model prediction error makes it difficult to detect abnormality and leads to increased alarm levels and thus a shorter remaining operational period. For the bearing damages and the stator anomalies under investigation the full signal reconstruction neural network gave the best fault visibility and thus led to the highest confidence level.
Gmur, Stephan; Vogt, Daniel; Zabowski, Darlene; Moskal, L. Monika
2012-01-01
The characterization of soil attributes using hyperspectral sensors has revealed patterns in soil spectra that are known to respond to mineral composition, organic matter, soil moisture and particle size distribution. Soil samples from different soil horizons of replicated soil series from sites located within Washington and Oregon were analyzed with the FieldSpec Spectroradiometer to measure their spectral signatures across the electromagnetic range of 400 to 1,000 nm. Similarity rankings of individual soil samples reveal differences between replicate series as well as samples within the same replicate series. Using classification and regression tree statistical methods, regression trees were fitted to each spectral response using concentrations of nitrogen, carbon, carbonate and organic matter as the response variables. Statistics resulting from fitted trees were: nitrogen R2 0.91 (p < 0.01) at 403, 470, 687, and 846 nm spectral band widths, carbonate R2 0.95 (p < 0.01) at 531 and 898 nm band widths, total carbon R2 0.93 (p < 0.01) at 400, 409, 441 and 907 nm band widths, and organic matter R2 0.98 (p < 0.01) at 300, 400, 441, 832 and 907 nm band widths. Use of the 400 to 1,000 nm electromagnetic range utilizing regression trees provided a powerful, rapid and inexpensive method for assessing nitrogen, carbon, carbonate and organic matter for upper soil horizons in a nondestructive method. PMID:23112620
GIS-assisted regression analysis to identify sources of selenium in streams
See, Randolph B.; Naftz, David L.; Qualls, Charles L.
1992-01-01
Using a geographic information system, a regression model has been developed to identify and to assess potential sources of selenium in the Kendrick Reclamation Project Area, Wyoming. A variety of spatially distributed factors was examined to determine which factors are most likely to affect selenium discharge in tributaries to the North Platte River. Areas of Upper Cretaceous Cody Shale and Quaternary alluvial deposits and irrigated land, length of irrigation canals, and boundaries of hydrologic subbasins of the major tributaries to the North Platte River were digitized and stored in a geographic information system. Selenium concentrations in samples of soil, plant material, ground water, and surface water were determined and evaluated. The location of all sampling sites was digitized and stored in the geographic information system, together with the selenium concentrations in samples. A regression model was developed using stepwise multiple regression of median selenium discharges on the physical and chemical characteristics of hydrologic subbasins. Results indicate that the intensity of irrigation in a hydrologic subbasin, as determined by area of irrigated land and length of irrigation delivery canals, accounts for the largest variation in median selenium discharges among subbasins. Tributaries draining hydrologic subbasins with greater intensity of irrigation result in greater selenium discharges to the North Platte River than do tributaries draining subbasins with lesser intensity of irrigation.
Capacitance Regression Modelling Analysis on Latex from Selected Rubber Tree Clones
NASA Astrophysics Data System (ADS)
Rosli, A. D.; Hashim, H.; Khairuzzaman, N. A.; Mohd Sampian, A. F.; Baharudin, R.; Abdullah, N. E.; Sulaiman, M. S.; Kamaru'zzaman, M.
2015-11-01
This paper investigates the capacitance regression modelling performance of latex for various rubber tree clones, namely clones 2002, 2008, 2014 and 3001. Conventionally, rubber tree clones are identified by observing tree features such as the shape of the leaf, the trunk, the branching habit and the pattern of seed texture. This conventional method requires experts and is very time-consuming. Currently, there is no sensing device based on electrical properties that can be employed to distinguish different clones from latex samples. Hence, with the hypothesis that the dielectric constant of each clone varies, this paper discusses the development of a capacitance sensor based on a Capacitance Comparison Bridge to measure the output voltage of different latex samples. The proposed sensor is initially tested with 30 ml of latex sample, after which dilution water is gradually added. The output voltage and capacitance obtained from the test are recorded and analyzed using a Simple Linear Regression (SLR) model. The outcome of this work indicates that latex from clone 2002 produced the highest determination coefficient, 91.24%, and thus the most reliable linear regression line. In addition, the study also found that the capacitive elements in latex samples deteriorate when the latex is diluted with a higher volume of water.
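For readers unfamiliar with the determination coefficient reported here, the sketch below fits a simple linear regression by ordinary least squares and computes R squared. The data points are invented for illustration and are not the latex measurements of the study.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Invented example data (predictor and response), for illustration only
x = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [1.0, 3.5, 4.5, 7.5, 8.5]

slope, intercept = fit_line(x, y)
r2 = r_squared(x, y, slope, intercept)
print(slope, intercept, r2)
```

A value such as 91.24% corresponds to `r2` of about 0.9124, meaning that the fitted line accounts for roughly 91% of the variance in the response.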
External validation of a Cox prognostic model: principles and methods
2013-01-01
Background A prognostic model should not enter clinical practice unless it has been demonstrated that it performs a useful role. External validation denotes evaluation of model performance in a sample independent of that used to develop the model. Unlike for logistic regression models, external validation of Cox models is sparsely treated in the literature. Successful validation of a model means achieving satisfactory discrimination and calibration (prediction accuracy) in the validation sample. Validating Cox models is not straightforward because event probabilities are estimated relative to an unspecified baseline function. Methods We describe statistical approaches to external validation of a published Cox model according to the level of published information, specifically (1) the prognostic index only, (2) the prognostic index together with Kaplan-Meier curves for risk groups, and (3) the first two plus the baseline survival curve (the estimated survival function at the mean prognostic index across the sample). The most challenging task, requiring level 3 information, is assessing calibration, for which we suggest a method of approximating the baseline survival function. Results We apply the methods to two comparable datasets in primary breast cancer, treating one as derivation and the other as validation sample. Results are presented for discrimination and calibration. We demonstrate plots of survival probabilities that can assist model evaluation. Conclusions Our validation methods are applicable to a wide range of prognostic studies and provide researchers with a toolkit for external validation of a published Cox model. PMID:23496923
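The role of the baseline survival curve in level 3 validation can be illustrated with the standard Cox relation between a patient's prognostic index and predicted survival. The sketch below assumes, as the abstract describes, that the baseline is the survival function at the mean prognostic index; the numerical values are hypothetical and the function name is chosen here for illustration.

```python
import math


def predicted_survival(baseline_survival, prognostic_index, mean_index):
    """Cox-model survival probability at a fixed time point.

    baseline_survival is S0(t) evaluated at the mean prognostic index,
    so the prediction is S0(t) ** exp(PI - mean PI).
    """
    return baseline_survival ** math.exp(prognostic_index - mean_index)


# Hypothetical values: baseline 5-year survival of 0.80 at a mean index of 1.0
s0_5yr, pi_mean = 0.80, 1.0

for pi in (0.5, 1.0, 1.5):
    print(pi, round(predicted_survival(s0_5yr, pi, pi_mean), 4))
```

Without the baseline value only the ranking of patients by prognostic index is available, which is why discrimination can be assessed from level 1 information while calibration cannot.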
NASA Astrophysics Data System (ADS)
Coe, Rob; Dalrymple, Brent
More than 1000 friends, students, and colleagues from all over the country filled Stanford Memorial Chapel (Stanford, Calif.) on February 3, 1987, to join in “A Celebration of the Life of Allan Cox.” Allan died early on the morning of January 27 while bicycling, the sport he had come to love the most. Between pieces of his favorite music by Bach and Mozart, Stanford administrators and colleagues spoke in tribute of Allan's unique qualities as friend, scientist, teacher, and dean of the School of Earth Sciences. James Rosse, Vice President and Provost of Stanford University, struck a particularly resonant chord with his personal remarks: "Allan reached out to each person he knew with the warmth and attention that can only come from deep respect and affection for others. I never heard him speak ill of others, and I do not believe he was capable of doing anything that would harm another being. He cared too much to intrude where he was not wanted, but his curiosity about people and the loving care with which he approached them broke down reserve to create remarkable friendships. His enthusiasm and good humor made him a welcome guest in the hearts of the hundreds of students and colleagues who shared the opportunity of knowing Allan Cox as a person."
Brahma, K.C.; Pal, B.K.; Das, C.
2005-07-01
Different models of vibration studies are examined. A case analysis to determine the parameters governing the prediction of blast vibration in an opencast coal mine is described. A regression model was developed to evaluate peak particle velocity (PPV) of the blast. The results are applicable to forecasting ground vibration before blasting and to the design of various parameters in controlled blasting. 16 refs., 1 fig., 1 tab.
Biochemistry of cyclooxygenase (COX)-2 inhibitors and molecular pathology of COX-2 in neoplasia.
Fosslien, E
2000-10-01
Several types of human tumors overexpress cyclooxygenase (COX)-2 but not COX-1, and gene knockout transfection experiments demonstrate a central role of COX-2 in experimental tumorigenesis. COX-2 produces prostaglandins that inhibit apoptosis and stimulate angiogenesis and invasiveness. Selective COX-2 inhibitors reduce prostaglandin synthesis, restore apoptosis, and inhibit cancer cell proliferation. In animal studies they limit carcinogen-induced tumorigenesis. In contrast, aspirin-like nonselective NSAIDs such as sulindac and indomethacin inhibit not only the enzymatic action of the highly inducible, proinflammatory COX-2 but the constitutively expressed, cytoprotective COX-1 as well. Consequently, nonselective NSAIDs can cause platelet dysfunction, gastrointestinal ulceration, and kidney damage. For that reason, selective inhibition of COX-2 to treat neoplastic proliferation is preferable to nonselective inhibition. Selective COX-2 inhibitors, such as meloxicam, celecoxib (SC-58635), and rofecoxib (MK-0966), are NSAIDs that have been modified chemically to preferentially inhibit COX-2 but not COX-1. For instance, meloxicam inhibits the growth of cultured colon cancer cells (HCA-7 and Moser-S) that express COX-2 but has no effect on HCT-116 tumor cells that do not express COX-2. NS-398 induces apoptosis in COX-2 expressing LNCaP prostate cancer cells and, surprisingly, in colon cancer S/KS cells that do not express COX-2. This effect may be due to induction of apoptosis through uncoupling of oxidative phosphorylation and down-regulation of Bcl-2, as has been demonstrated for some nonselective NSAIDs, for instance, flurbiprofen. COX-2 mRNA and COX-2 protein are constitutively expressed in the kidney, brain, spinal cord, and ductus deferens, and in the uterus during implantation. In addition, COX-2 is constitutively and dominantly expressed in the pancreatic islet cells. These findings might somewhat limit the use of presently available selective COX-2 inhibitors
Spontaneous regression of testicular germ cell tumors: an analysis of 42 cases.
Balzer, Bonnie L; Ulbright, Thomas M
2006-07-01
Spontaneous regression of testicular germ cell tumors (GCTs) is a well-recognized phenomenon but has been incompletely characterized. Many pathologists are not familiar with the findings that support a diagnosis of a "burnt-out" primary in a patient with metastatic GCT. We therefore report the clinical, gross, and histologic findings in 42 cases of testicular GCT that showed either complete (26) or greater than 50% scarring (16). Thirty-seven patients (88%) had either known GCT metastasis or some residual testicular GCT, and none had treatment before orchiectomy. The patients were 17 to 67 years old, with a median of 32. Thirty presented with symptoms of metastasis, 7 with a testicular mass, 2 with elevated human chorionic gonadotropin, and 1 with testicular pain. In 2 patients the presentation was unknown. Two patients had prior orchiopexy; another had an intraabdominal testis, and 2 others had prior contralateral seminoma (20 and 42 years previously). Gross descriptions in 37 cases identified white to tan scars, 0.6 to 2.4 cm, in 33. These were circumscribed in 16, with 15 of these having nodular or multinodular configurations and 1 a band-like appearance. In 9 cases the scar was ill defined or stellate, and in 8 cases no further details concerning the scar configuration were available. In 4 cases no scar was apparent; 2 of these had received intraoperative biopsy. Microscopically, all cases showed circumscribed to irregular foci of scarring, distinct from the adjacent parenchyma, in association with widespread testicular atrophy. Other common features were lymphoplasmacytic infiltrates in the scars (37/42) and "ghost" tubules in scars (31/42). Less common features in the scars included angiomatous foci (22/42), siderophages (15/42), and coarse intratubular calcifications (6/42); in the surrounding testis they included intratubular germ cell neoplasia, unclassified (IGCNU) (22/42), Leydig cell prominence (18/42), and necrosis (5/42). Tubular microliths occurred in
Watanabe, K I; Ohama, T
2001-01-01
In the unicellular green alga, Chlamydomonas reinhardtii, cytochrome oxidase subunit 2 (cox2) and 3 (cox3) genes are missing from the mitochondrial genome. We isolated and sequenced a BAC clone that carries the whole cox3 gene and its corresponding cDNA. Almost the entire cox2 gene and its cDNA were also determined. Comparison of the genomic and the corresponding cDNA sequences revealed that the cox3 gene contains as many as nine spliceosomal introns and that cox2 bears six introns. Putative mitochondria targeting signals were predicted at each N terminal of the cox genes. These spliceosomal introns were typical GT-AG-type introns, which are very common not only in Chlamydomonas nuclear genes but also in diverse eukaryotic taxa. We found no particular distinguishing features in the cox introns. Comparative analysis of these genes with the various mitochondrial genes showed that 8 of the 15 introns were interrupting the conserved mature protein coding segments, while the other 7 introns were located in the N-terminal target peptide regions. Phylogenetic analysis of the evolutionary position of C. reinhardtii in Chlorophyta was carried out and the existence of the cox2 and cox3 genes in the mitochondrial genome was superimposed in the tree. This analysis clearly shows that these cox genes were relocated during the evolution of Chlorophyceae. It is apparent that long before the estimated period of relocation of these mitochondrial genes, the cytosol had lost the splicing ability for group II introns. Therefore, at least eight introns located in the mature protein coding region cannot be the direct descendant of group II introns. Here, we conclude that the presence of these introns is due to the invasion of spliceosomal introns, which occurred during the evolution of Chlorophyceae. This finding provides concrete evidence supporting the "intron-late" model, which rests largely on the mobility of spliceosomal introns. PMID:11675593
Zhang, Chen; Li, Xiaoming; Su, Shaobing; Hong, Yan; Zhou, Yuejiao; Tang, Zhenzhu; Shen, Zhiyong
2015-01-01
Limited data are available regarding risk factors that are related to intimate partner violence (IPV) against female sex workers (FSWs) in the context of stable partnerships. Out of the 1,022 FSWs, 743 reported ever having a stable partnership and 430 (more than half) of those reported experiencing IPV. Hierarchical multivariate regression revealed that some characteristics of stable partners (e.g., low education, alcohol use) and relationship stressors (e.g., frequent friction, concurrent partnerships) were independently predictive of IPV against FSWs. Public health professionals who design future violence prevention interventions targeting FSWs need to consider the influence of their stable partners. PMID:24730642
Pagnini, Francesco; Manzoni, Gian Mauro; Tagliaferri, Aurora; Gibbons, Chris J
2015-08-01
Depression in people with amyotrophic lateral sclerosis, a fatal and progressive neurodegenerative disorder, is a serious issue with important clinical consequences. However, physical impairment may confound the diagnosis when using generic questionnaires. We conducted a comprehensive review of literature. Mean scores from depression questionnaires were meta-regressed on study-level mean time since onset of symptoms. Data from 103 studies (3190 subjects) indicate that the Beck Depression Inventory and, to a lesser degree, the Hospital Anxiety and Depression Scale are influenced by the time since symptom onset, strongly related to physical impairment. Our results suggest that widely used depression scales overestimate depression due to confounding with physical symptoms. PMID:24764286
2013-01-01
Background Microarray technology can acquire information about thousands of genes simultaneously. We analyzed published breast cancer microarray databases to predict five-year recurrence and compared the performance of three data mining algorithms, artificial neural networks (ANN), decision trees (DT) and logistic regression (LR), and two composite models, DT-ANN and DT-LR. From the collection of microarray datasets in the Gene Expression Omnibus, four breast cancer datasets were pooled for predicting five-year breast cancer relapse. After data compilation, 757 subjects, 5 clinical variables and 13,452 genetic variables were aggregated. The bootstrap method, Mann–Whitney U test and 20-fold cross-validation were performed to investigate candidate genes with the 100 most significant p-values. The predictive powers of the DT, LR and ANN models were assessed using accuracy and the area under the ROC curve. The associated genes were evaluated using Cox regression. Results The DT models exhibited the lowest predictive power and the poorest extrapolation when applied to the test samples. The ANN models displayed the best predictive power and showed the best extrapolation. The 21 most-associated genes, as determined by integration of each model, were analyzed using Cox regression with a 3.53-fold (95% CI: 2.24-5.58) increased risk of breast cancer five-year recurrence… Conclusions The 21 selected genes can predict breast cancer recurrence. Among these genes, CCNB1, PLK1 and TOP2A are in the cell cycle G2/M DNA damage checkpoint pathway. Oncologists can offer this genetic information to patients when interpreting gene expression profiles related to breast cancer recurrence. PMID:23506640
ERIC Educational Resources Information Center
Phillips, Gary W.
The usefulness of path analysis as a means of better understanding various linear models is demonstrated. First, two linear models are presented in matrix form using linear structural relations (LISREL) notation. The two models, regression and factor analysis, are shown to be identical although the research question and data matrix to which these…
NASA Technical Reports Server (NTRS)
Patnaik, Surya N.; Guptill, James D.; Hopkins, Dale A.; Lavelle, Thomas M.
2000-01-01
The NASA Engine Performance Program (NEPP) can configure and analyze almost any type of gas turbine engine that can be generated through the interconnection of a set of standard physical components. In addition, the code can optimize engine performance by changing adjustable variables under a set of constraints. However, for engine cycle problems at certain operating points, the NEPP code can encounter difficulties: nonconvergence in the currently implemented Powell's optimization algorithm and deficiencies in the Newton-Raphson solver during engine balancing. A project was undertaken to correct these deficiencies. Nonconvergence was avoided through a cascade optimization strategy, and deficiencies associated with engine balancing were eliminated through neural network and linear regression methods. An approximation-interspersed cascade strategy was used to optimize the engine's operation over its flight envelope. Replacement of Powell's algorithm by the cascade strategy improved the optimization segment of the NEPP code. The performance of the linear regression and neural network methods as alternative engine analyzers was found to be satisfactory. This report considers two examples (a supersonic mixed-flow turbofan engine and a subsonic waverotor-topped engine) to illustrate the results, and it discusses insights gained from the improved version of the NEPP code.
Ordinal logistic regression analysis on the nutritional status of children in KarangKitri village
NASA Astrophysics Data System (ADS)
Ohyver, Margaretha; Yongharto, Kimmy Octavian
2015-09-01
Ordinal logistic regression is a statistical technique that can be used to describe the relationship between an ordinal response variable and one or more independent variables. This method has been used in various fields, including health. In this research, ordinal logistic regression is used to describe the relationship between the nutritional status of children and age, gender, height, and family status. The nutritional status of children in this research is divided into over nutrition, well nutrition, less nutrition, and malnutrition. The purpose of this research is to describe the characteristics of children in KarangKitri village and to determine the factors that influence their nutritional status. Three findings were obtained. First, there are still children who are not categorized as having good nutritional status. Second, there are children from families of sufficient economic level whose nutritional status is not normal. Third, the factors that affect the nutritional level of children are age, family status, and height.
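A common form of ordinal logistic regression is the proportional-odds (cumulative logit) model; the abstract does not state which variant the authors used, so the sketch below is only an assumption-laden illustration. It converts a linear predictor into probabilities for four ordered categories. The cutpoints and the linear predictor values are hypothetical.

```python
import math


def logistic(z):
    """Inverse logit function."""
    return 1.0 / (1.0 + math.exp(-z))


def category_probabilities(cutpoints, linear_predictor):
    """Category probabilities under a proportional-odds model.

    cutpoints are increasing thresholds theta_1 < ... < theta_{K-1};
    the model is P(Y <= j) = logistic(theta_j - linear_predictor).
    """
    cumulative = [logistic(theta - linear_predictor) for theta in cutpoints]
    cumulative.append(1.0)  # P(Y <= K) is always 1
    probabilities = []
    previous = 0.0
    for value in cumulative:
        probabilities.append(value - previous)  # P(Y = j) by differencing
        previous = value
    return probabilities


# Hypothetical thresholds for four ordered categories
cutpoints = [-1.0, 0.5, 2.0]

probs = category_probabilities(cutpoints, linear_predictor=0.3)
print([round(p, 4) for p in probs])
```

With this sign convention, a larger linear predictor shifts probability mass toward the higher categories, and one set of covariate coefficients is shared across all thresholds.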
Fakayode, Sayo O; Mitchell, Breanna S; Pollard, David A
2014-08-01
Accurate understanding of analyte boiling points (BP) is of critical importance in gas chromatographic (GC) separation and crude oil refinery operation in petrochemical industries. This study reported the first combined use of GC separation and partial-least-square (PLS1) multivariate regression analysis (MRA) of petrochemical structural activity relationship (SAR) for accurate BP determination of two commercially available (D3710 and MA VHP) calibration gas mix samples. The results of the BP determination using PLS1 multivariate regression were further compared with the results of the traditional simulated distillation method of BP determination. The developed PLS1 regression was able to correctly predict analyte BPs in the D3710 and MA VHP calibration gas mix samples, with a root-mean-square-%-relative-error (RMS%RE) of 6.4% and 10.8%, respectively. In contrast, the overall RMS%RE values of 32.9% and 40.4% obtained for BP determination in D3710 and MA VHP, respectively, using a traditional simulated distillation method were approximately four times larger than the corresponding RMS%RE of BP prediction using MRA, demonstrating the better predictive ability of MRA. The reported method is rapid, robust, and promising, and can be potentially used routinely for fast analysis, pattern recognition, and analyte BP determination in petrochemical industries. PMID:24881546
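The RMS%RE figure of merit quoted above can be computed as sketched below. The abstract does not spell out the formula, so this assumes the usual definition: the square root of the mean of the squared percent relative errors. The boiling-point values are invented for illustration.

```python
import math


def rms_percent_relative_error(actual, predicted):
    """Root-mean-square of the percent relative errors (assumed definition)."""
    squared = [
        (100.0 * (p - a) / a) ** 2  # percent relative error, squared
        for a, p in zip(actual, predicted)
    ]
    return math.sqrt(sum(squared) / len(squared))


# Invented boiling points (same units for both lists), for illustration only
actual_bp = [100.0, 200.0, 400.0]
predicted_bp = [110.0, 180.0, 440.0]

print(rms_percent_relative_error(actual_bp, predicted_bp))  # close to 10.0
```

Because each error is scaled by the true value, the measure weights low-boiling and high-boiling analytes equally in relative terms.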
Mokhtari, Mehdi; Miri, Mohammad; Nikoonahad, Ali; Jalilian, Ali; Naserifar, Razi; Ghaffari, Hamid Reza; Kazembeigi, Farogh
2016-11-01
The aim of this study was to investigate the impact of environmental factors on cutaneous leishmaniasis (CL) prevalence and morbidity in Ilam province, western Iran, a known endemic area for this disease. Accurate locations of 3237 CL patients diagnosed from 2013 to 2015, their demographic information, and data on 17 potentially predictive environmental variables (PPEVs) were prepared for use in Geographic Information System (GIS) and Land-Use Regression (LUR) analysis. The prevalence, risk, and predictive risk maps were produced using the Inverse Distance Weighting (IDW) model in GIS software. Regression analysis was used to determine how environmental variables affect CL prevalence. All maps and regression models were developed based on the annual and three-year average of the CL prevalence. The results showed that there was a statistically significant relationship (P value≤0.05) between CL prevalence and 11 (64%) PPEVs, which were elevation, population, rainfall, temperature, urban land use, poorland, dry farming, inceptisol and aridisol soils, and forest and irrigated lands. The highest probability of CL prevalence was predicted in the west of the study area, along the border with Iraq. An inverse relationship was found between CL prevalence and environmental factors including elevation, covering soil, rainfall, and agricultural irrigation, while the relationship was positive for temperature, urban land use, and population density. Environmental factors were found to be important predictive variables for CL prevalence and should be considered in management strategies for CL control. PMID:27496622
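The Inverse Distance Weighting interpolation named in this abstract can be sketched in a few lines. This is a generic textbook form, not the GIS software's implementation: the power parameter of 2 and the sample coordinates and prevalence values are assumptions made for illustration.

```python
def idw(x, y, samples, power=2.0):
    """Inverse Distance Weighting estimate at the point (x, y).

    samples is a list of (sample_x, sample_y, value) tuples; each sample is
    weighted by 1 / distance**power.
    """
    numerator = 0.0
    denominator = 0.0
    for sample_x, sample_y, value in samples:
        dist_sq = (x - sample_x) ** 2 + (y - sample_y) ** 2
        if dist_sq == 0.0:
            return value  # the query point coincides with a sample
        weight = 1.0 / dist_sq ** (power / 2.0)
        numerator += weight * value
        denominator += weight
    return numerator / denominator


# Invented sample locations and prevalence values, for illustration only
samples = [(0.0, 0.0, 10.0), (2.0, 0.0, 20.0), (0.0, 2.0, 30.0)]

print(idw(1.0, 0.0, samples))  # estimate at a point between the first two samples
```

Nearby samples dominate the estimate, and a larger power makes the interpolated surface follow the closest observations more tightly.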
iNOS signaling interacts with COX-2 pathway in colonic fibroblasts.
Zhu, Yingting; Zhu, Min; Lance, Peter
2012-10-01
COX-2 and iNOS are two major inflammatory mediators implicated in colorectal inflammation and cancer. Previously, the role of colorectal fibroblasts involved in regulation of COX-2 and iNOS expression was largely ignored. In addition, the combined interaction of COX-2 and iNOS signalings and their significance in the progression of colorectal inflammation and cancer within the fibroblasts have received little investigation. To address those issues, we investigated the role of colonic fibroblasts in the regulation of COX-2 and iNOS gene expression, and explored possible mechanisms of interaction between COX-2 and iNOS signalings using a colonic CCD-18Co fibroblast line and LPS, a potential stimulator of COX-2 and iNOS. Our results clearly demonstrated that LPS activated COX-2 gene expression and enhanced PGE(2) production, stimulated iNOS gene expression and promoted NO production in the fibroblasts. Interestingly, activation of COX-2 signaling by LPS was not involved in activation of iNOS signaling, while activation of iNOS signaling by LPS contributed in part to activation of COX-2 signaling. Further analysis indicated that PKC plays a major role in the activation and interaction of COX-2 and iNOS signalings induced by LPS in the fibroblasts. PMID:22683859
Ku80 cooperates with CBP to promote COX-2 expression and tumor growth
Qin, Yu; Xuan, Yang; Jia, Yunlu; Hu, Wenxian; Yu, Wendan; Dai, Meng; Li, Zhenglin; Yi, Canhui; Zhao, Shilei; Li, Mei; Du, Sha; Cheng, Wei; Xiao, Xiangsheng; Chen, Yiming; Wu, Taihua; Meng, Songshu; Yuan, Yuhui; Liu, Quentin; Huang, Wenlin; Guo, Wei; Wang, Shusen; Deng, Wuguo
2015-01-01
Cyclooxygenase-2 (COX-2) plays an important role in lung cancer development and progression. Using streptavidin-agarose pulldown and proteomics assay, we identified and validated Ku80, a dimer of Ku participating in the repair of broken DNA double strands, as a new binding protein of the COX-2 gene promoter. Overexpression of Ku80 up-regulated COX-2 promoter activation and COX-2 expression in lung cancer cells. Silencing of Ku80 by siRNA down-regulated COX-2 expression and inhibited tumor cell growth in vitro and in a xenograft mouse model. Ku80 knockdown suppressed phosphorylation of ERK, resulting in an inactivation of the MAPK pathway. Moreover, CBP, a transcription co-activator, interacted with and acetylated Ku80 to co-regulate the activation of COX-2 promoter. Overexpression of CBP increased Ku80 acetylation, thereby promoting COX-2 expression and cell growth. Suppression of CBP by a CBP-specific inhibitor or siRNA inhibited COX-2 expression as well as tumor cell growth. Tissue microarray immunohistochemical analysis of lung adenocarcinomas revealed a strong positive correlation between levels of Ku80 and COX-2 and clinicopathologic variables. Overexpression of Ku80 was associated with poor prognosis in patients with lung cancers. We conclude that Ku80 promotes COX-2 expression and tumor growth and is a potential therapeutic target in lung cancer. PMID:25797267
Chi, Peter; Aras, Radha; Martin, Katie; Favero, Carlita
2016-05-15
Fetal Alcohol Spectrum Disorders (FASD) collectively describes the constellation of effects resulting from human alcohol consumption during pregnancy. Even with public awareness, the incidence of FASD is estimated to be upwards of 5% in the general population and is becoming a global health problem. The physical, cognitive, and behavioral impairments of FASD are recapitulated in animal models. Recently rodent models utilizing voluntary drinking paradigms have been developed that accurately reflect moderate consumption, which makes up the majority of FASD cases. The range in severity of FASD characteristics reflects the frequency, dose, developmental timing, and individual susceptibility to alcohol exposure. As most rodent models of FASD use C57BL/6 mice, there is a need to expand the stocks of mice studied in order to more fully understand the complex neurobiology of this disorder. To that end, we allowed pregnant Swiss Webster mice to voluntarily drink ethanol via the drinking in the dark (DID) paradigm throughout their gestation period. Ethanol exposure did not alter gestational outcomes as determined by no significant differences in maternal weight gain, maternal liquid consumption, litter size, or pup weight at birth or weaning. Despite seemingly normal gestation, ethanol-exposed offspring exhibit significantly altered timing to achieve developmental milestones (surface righting, cliff aversion, and open field traversal), as analyzed through mixed-effects Cox proportional hazards models. These results confirm Swiss Webster mice as a viable option to study the incidence and causes of ethanol-induced neurobehavioral alterations during development. Future studies in our laboratory will investigate the brain regions and molecules responsible for these behavioral changes. PMID:26765502
Chen, Chau-Kuang; Bruce, Michelle; Tyler, Lauren; Brown, Claudine; Garrett, Angelica; Goggins, Susan; Lewis-Polite, Brandy; Weriwoh, Mirabel L; Juarez, Paul D.; Hood, Darryl B.; Skelton, Tyler
2014-01-01
The goal of this study was to analyze a 54-item instrument for assessment of perception of exposure to environmental contaminants within the context of the built environment, or exposome. This exposome was defined in five domains to include 1) home and hobby, 2) school, 3) community, 4) occupation, and 5) exposure history. Interviews were conducted with child-bearing-age minority women at Metro Nashville General Hospital at Meharry Medical College. Data were analyzed utilizing DTReg software for Support Vector Machine (SVM) modeling followed by an SPSS package for a logistic regression model. The target (outcome) variable of interest was respondent's residence by ZIP code. The results demonstrate that the rank order of important variables with respect to SVM modeling versus traditional logistic regression models is almost identical. This is the first study documenting that SVM analysis has discriminant power for determination of higher-ordered spatial relationships on an environmental exposure history questionnaire. PMID:23395953
Fenske, Nora; Burns, Jacob; Hothorn, Torsten; Rehfuess, Eva A.
2013-01-01
Background Most attempts to address undernutrition, responsible for one third of global child deaths, have fallen behind expectations. This suggests that the assumptions underlying current modelling and intervention practices should be revisited. Objective We undertook a comprehensive analysis of the determinants of child stunting in India, and explored whether the established focus on linear effects of single risks is appropriate. Design Using cross-sectional data for children aged 0–24 months from the Indian National Family Health Survey for 2005/2006, we populated an evidence-based diagram of immediate, intermediate and underlying determinants of stunting. We modelled linear, non-linear, spatial and age-varying effects of these determinants using additive quantile regression for four quantiles of the Z-score of standardized height-for-age and logistic regression for stunting and severe stunting. Results At least one variable within each of eleven groups of determinants was significantly associated with height-for-age in the 35% Z-score quantile regression. The non-modifiable risk factors child age and sex, and the protective factors household wealth, maternal education and BMI showed the largest effects. Being a twin or multiple birth was associated with dramatically decreased height-for-age. Maternal age, maternal BMI, birth order and number of antenatal visits influenced child stunting in non-linear ways. Findings across the four quantile and two logistic regression models were largely comparable. Conclusions Our analysis confirms the multifactorial nature of child stunting. It emphasizes the need to pursue a systems-based approach and to consider non-linear effects, and suggests that differential effects across the height-for-age distribution do not play a major role. PMID:24223839
Cardiovascular hazard of selective COX-2 inhibitors: myth or reality?
Chiolero, Arnaud; Maillard, Marc P; Burnier, Michel
2002-05-01
Since 1998, two selective inhibitors of COX-2 have been approved in many countries for the treatment of rheumatoid arthritis, osteoarthritis and acute pain. These new drugs have a significantly reduced gastrointestinal toxicity when compared with non-selective COX inhibitors. However, the results of two large clinical trials conducted in patients with osteoarthritis and rheumatoid arthritis have recently raised some concerns regarding the cardiovascular safety of these new drugs. The purpose of this paper is to review the potential mechanisms whereby selective COX-2 inhibitors could increase the cardiovascular risk of patients and to analyse the data indicating that this clinical risk indeed exists. The authors' analysis shows that even though there are pathophysiological mechanisms which could explain why selective COX-2 inhibition might increase the cardiovascular risk in patients, the actual level of evidence demonstrating that the risk is indeed increased is weak. Because of the importance of the issue, additional studies must be conducted with this class of agents. Meanwhile, it is crucial to emphasise that neither selective COX-2 inhibitors nor conventional NSAIDs replace aspirin in patients with a high cardiovascular risk. PMID:12904159
Schümberg, Katharina; Polyakova, Maryna; Steiner, Johann; Schroeter, Matthias L.
2016-01-01
S100B has been linked to glial pathology in several psychiatric disorders. Previous studies found higher S100B serum levels in patients with schizophrenia compared to healthy controls, and a number of covariates influencing the size of this effect have been proposed in the literature. Here, we conducted a meta-analysis and meta-regression analysis on alterations of serum S100B in schizophrenia in comparison with healthy control subjects. The meta-analysis followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement to guarantee a high quality and reproducibility. With strict inclusion criteria 19 original studies could be included in the quantitative meta-analysis, comprising a total of 766 patients and 607 healthy control subjects. The meta-analysis confirmed higher values of the glial serum marker S100B in schizophrenia if compared with control subjects. Meta-regression analyses revealed significant effects of illness duration and clinical symptomatology, in particular the total score of the Positive and Negative Syndrome Scale (PANSS), on serum S100B levels in schizophrenia. In sum, results confirm glial pathology in schizophrenia that is modulated by illness duration and related to clinical symptomatology. Further studies are needed to investigate mechanisms and mediating factors related to these findings. PMID:26941608
NASA Technical Reports Server (NTRS)
Smith, Timothy D.; Steffen, Christopher J., Jr.; Yungster, Shaye; Keller, Dennis J.
1998-01-01
The all rocket mode of operation is shown to be a critical factor in the overall performance of a rocket based combined cycle (RBCC) vehicle. An axisymmetric RBCC engine was used to determine specific impulse efficiency values based upon both full flow and gas generator configurations. Design of experiments methodology was used to construct a test matrix and multiple linear regression analysis was used to build parametric models. The main parameters investigated in this study were: rocket chamber pressure, rocket exit area ratio, injected secondary flow, mixer-ejector inlet area, mixer-ejector area ratio, and mixer-ejector length-to-inlet diameter ratio. A perfect gas computational fluid dynamics analysis, using both the Spalart-Allmaras and k-omega turbulence models, was performed with the NPARC code to obtain values of vacuum specific impulse. Results from the multiple linear regression analysis showed that for both the full flow and gas generator configurations increasing mixer-ejector area ratio and rocket area ratio increase performance, while increasing mixer-ejector inlet area ratio and mixer-ejector length-to-diameter ratio decrease performance. Increasing injected secondary flow increased performance for the gas generator analysis, but was not statistically significant for the full flow analysis. Chamber pressure was found to be not statistically significant.
Regression analysis of current-status data: an application to breast-feeding.
Grummer-strawn, L M
1993-09-01
"Although techniques for calculating mean survival time from current-status data are well known, their use in multiple regression models is somewhat troublesome. Using data on current breast-feeding behavior, this article considers a number of techniques that have been suggested in the literature, including parametric, nonparametric, and semiparametric models as well as the application of standard schedules. Models are tested in both proportional-odds and proportional-hazards frameworks....I fit [the] models to current status data on breast-feeding from the Demographic and Health Survey (DHS) in six countries: two African (Mali and Ondo State, Nigeria), two Asian (Indonesia and Sri Lanka), and two Latin American (Colombia and Peru)." PMID:12155396
Kang, Seung-Wan; Byun, Gukdo; Park, Hun-Joon
2014-12-01
This paper presents empirical research into the relationship between leader-follower value congruence in social responsibility and the level of ethical satisfaction for employees in the workplace. 163 dyads were analyzed, each consisting of a team leader and an employee working at a large manufacturing company in South Korea. Following current methodological recommendations for congruence research, polynomial regression and response surface modeling methodologies were used to determine the effects of value congruence. Results indicate that leader-follower value congruence in social responsibility was positively related to the ethical satisfaction of employees. Furthermore, employees' ethical satisfaction was stronger when aligned with a leader with high social responsibility. The theoretical and practical implications are discussed. PMID:25539173
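The abstract above refers to polynomial regression with response surface modeling but does not give the equation. A minimal sketch, assuming the quadratic form commonly used in congruence research, Z = b0 + b1*X + b2*Y + b3*X^2 + b4*X*Y + b5*Y^2 (with X and Y standing for the leader's and the follower's values), shows how the surface is summarized along the congruence line (Y = X) and the incongruence line (Y = -X). All coefficient values are hypothetical.

```python
def surface_parameters(b1, b2, b3, b4, b5):
    """Slopes and curvatures of the quadratic surface along two lines."""
    return {
        "congruence_slope": b1 + b2,             # along Y = X
        "congruence_curvature": b3 + b4 + b5,    # along Y = X
        "incongruence_slope": b1 - b2,           # along Y = -X
        "incongruence_curvature": b3 - b4 + b5,  # along Y = -X
    }

def predict(b0, b1, b2, b3, b4, b5, x, y):
    """Predicted outcome from the quadratic regression equation."""
    return b0 + b1 * x + b2 * y + b3 * x ** 2 + b4 * x * y + b5 * y ** 2

# Hypothetical coefficients (illustrative only, not estimates from the study).
coeffs = dict(b1=0.5, b2=0.25, b3=-0.125, b4=0.25, b5=-0.125)
surface = surface_parameters(**coeffs)
on_congruence_line = predict(b0=1.0, x=2.0, y=2.0, **coeffs)
```

With these illustrative numbers, the negative curvature along the incongruence line means the predicted outcome drops as the two values diverge, and the positive slope along the congruence line means agreement at high levels predicts a higher outcome than agreement at low levels.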
Cohen, Ira L; Liu, Xudong; Hudson, Melissa; Gillis, Jennifer; Cavalari, Rachel N S; Romanczyk, Raymond G; Karmel, Bernard Z; Gardner, Judith M
2016-09-01
In order to improve discrimination accuracy between Autism Spectrum Disorder (ASD) and similar neurodevelopmental disorders, a data mining procedure, Classification and Regression Trees (CART), was used on a large multi-site sample of PDD Behavior Inventory (PDDBI) forms on children with and without ASD. Discrimination accuracy exceeded 80 %, generalized to an independent validation set, and generalized across age groups and sites, and agreed well with ADOS classifications. Parent PDDBIs yielded better results than teacher PDDBIs but, when CART predictions agreed across informants, sensitivity increased. Results also revealed three subtypes of ASD: minimally verbal, verbal, and atypical; and two, relatively common subtypes of non-ASD children: social pragmatic problems and good social skills. These subgroups corresponded to differences in behavior profiles and associated bio-medical findings. PMID:27318809
Monsalve, Irene F.; Pérez, Alejandro; Molinaro, Nicola
2014-01-01
During language comprehension, semantic contextual information is used to generate expectations about upcoming items. This has been commonly studied through the N400 event-related potential (ERP), as a measure of facilitated lexical retrieval. However, the associative relationships in multi-word expressions (MWE) may enable the generation of a categorical expectation, leading to lexical retrieval before target word onset. Processing of the target word would thus reflect a target-identification mechanism, possibly indexed by a P3 ERP component. However, given their time overlap (200–500 ms post-stimulus onset), differentiating between N400/P3 ERP responses (averaged over multiple linguistically variable trials) is problematic. In the present study, we analyzed EEG data from a previous experiment, which compared ERP responses to highly expected words that were placed either in a MWE or a regular non-fixed compositional context, and to low predictability controls. We focused on oscillatory dynamics and regression analyses, in order to dissociate between the two contexts by modeling the electrophysiological response as a function of item-level parameters. A significant interaction between word position and condition was found in the regression model for power in a theta range (~7–9 Hz), providing evidence for the presence of qualitative differences between conditions. Power levels within this band were lower for MWE than compositional contexts when the target word appeared later on in the sentence, confirming that in the former lexical retrieval would have taken place before word onset. On the other hand, gamma-power (~50–70 Hz) was also modulated by predictability of the item in all conditions, which is interpreted as an index of a similar “matching” sub-step for both types of contexts, binding an expected representation and the external input. PMID:25161630
Wang, Chong; Sun, Qun; Wahab, Magd Abdel; Zhang, Xingyu; Xu, Limin
2015-09-01
Rotary cup brushes mounted on each side of a road sweeper undertake heavy debris removal tasks, but their characteristics were not well understood until recently. A Finite Element (FE) model that can analyze brush deformation and predict brush characteristics has been developed to investigate the sweeping efficiency and to assist the controller design. However, the FE model requires a large amount of CPU time to simulate each brush design and operating scenario, which may affect its applications in a real-time system. This study develops a mathematical regression model to summarize the FE modeled results. The complex brush load characteristic curves were statistically analyzed to quantify the effects of cross-section, length, mounting angle, displacement, rotational speed, etc. The data were then fitted by a multiple variable regression model using the maximum likelihood method. The fitted results showed good agreement with the FE analysis results and experimental results, suggesting that the mathematical regression model may be directly used in a real-time system to predict characteristics of different brushes under varying operating conditions. The methodology may also be used in the design and optimization of rotary brush tools. PMID:26123978
Strong, Mark; Oakley, Jeremy E; Brennan, Alan
2014-04-01
The partial expected value of perfect information (EVPI) quantifies the expected benefit of learning the values of uncertain parameters in a decision model. Partial EVPI is commonly estimated via a 2-level Monte Carlo procedure in which parameters of interest are sampled in an outer loop, and then conditional on these, the remaining parameters are sampled in an inner loop. This is computationally demanding and may be difficult if correlation between input parameters results in conditional distributions that are hard to sample from. We describe a novel nonparametric regression-based method for estimating partial EVPI that requires only the probabilistic sensitivity analysis sample (i.e., the set of samples drawn from the joint distribution of the parameters and the corresponding net benefits). The method is applicable in a model of any complexity and with any specification of input parameter distribution. We describe the implementation of the method via 2 nonparametric regression modeling approaches, the Generalized Additive Model and the Gaussian process. We demonstrate in 2 case studies the superior efficiency of the regression method over the 2-level Monte Carlo method. R code is made available to implement the method. PMID:24246566
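The regression idea in the abstract above can be sketched as follows: regress each option's net benefit on the parameter of interest using only the probabilistic sensitivity analysis sample, then compare the mean of the per-draw maximum of the fitted values with the maximum of the mean net benefits. The authors use a Generalized Additive Model and a Gaussian process; this sketch swaps in a straight-line least-squares fit purely for brevity, and the function names and data are hypothetical.

```python
from statistics import fmean

def fit_line(x, y):
    """Ordinary least-squares slope and intercept for one predictor."""
    mean_x, mean_y = fmean(x), fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def partial_evpi(phi, net_benefits):
    """Regression-based partial EVPI for a single parameter.

    phi: sampled values of the parameter of interest.
    net_benefits: one list of sampled net benefits per decision option.
    """
    fitted = []
    for nb in net_benefits:
        slope, intercept = fit_line(phi, nb)
        fitted.append([intercept + slope * p for p in phi])
    # Expected value when the best option is chosen after learning phi.
    with_information = fmean([max(column) for column in zip(*fitted)])
    # Value of the best option under current uncertainty.
    without_information = max(fmean(nb) for nb in net_benefits)
    return with_information - without_information

# Hypothetical sample: option A has zero net benefit, option B varies with phi.
phi_draws = [-2.0, -1.0, 0.0, 1.0, 2.0]
nb_option_a = [0.0, 0.0, 0.0, 0.0, 0.0]
nb_option_b = [-2.5, -0.5, 0.0, 0.5, 2.5]
evppi = partial_evpi(phi_draws, [nb_option_a, nb_option_b])
```

No inner sampling loop is needed, which is the source of the efficiency gain the abstract describes; a straight line would be too rigid for most real decision models, which is why flexible nonparametric fits are used in practice.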
MACRO FOR ESTIMATING THE BOX-COX POWER TRANSFORMATION
In their classic paper, Box and Cox (1964) demonstrated how a dependent variable could be transformed to satisfy simultaneously, assumptions implicit in the analysis of linear models. For the class of analyses in which the response of interest is positive and where no transformat...
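For orientation, the Box-Cox family transforms a positive response y to (y^λ - 1)/λ when λ is nonzero and to log y when λ = 0, and λ is usually chosen by maximizing a profile log-likelihood. The sketch below is not the macro this entry describes; it assumes a mean-only model and a simple grid search, and the data are constructed for illustration.

```python
import math

def box_cox(y, lam):
    """Box-Cox transform of positive values for a given lambda."""
    if lam == 0.0:
        return [math.log(v) for v in y]
    return [(v ** lam - 1.0) / lam for v in y]

def profile_log_likelihood(y, lam):
    """Profile log-likelihood of lambda for a mean-only normal model."""
    z = box_cox(y, lam)
    n = len(z)
    mean = sum(z) / n
    variance = sum((v - mean) ** 2 for v in z) / n  # maximum-likelihood variance
    jacobian = (lam - 1.0) * sum(math.log(v) for v in y)
    return -0.5 * n * math.log(variance) + jacobian

def best_lambda(y, grid):
    """Grid value of lambda with the highest profile log-likelihood."""
    return max(grid, key=lambda lam: profile_log_likelihood(y, lam))

# Illustrative data built to be symmetric on the log scale.
response = [math.exp(x) for x in (-2.0, -1.0, 0.0, 1.0, 2.0)]
chosen = best_lambda(response, [-1.0, -0.5, 0.0, 0.5, 1.0])
```

In a regression setting the variance term would come from the residuals of the fitted linear model rather than from deviations about a single mean.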
Nicoară, Simona D; Ștefănuţ, Anne C; Nascutzy, Constanta; Zaharie, Gabriela C; Toader, Laura E; Drugan, Tudor C
2016-01-01
BACKGROUND Retinopathy is a serious complication related to prematurity and a leading cause of childhood blindness. The aggressive posterior form of retinopathy of prematurity (APROP) has a worse anatomical and functional outcome following laser therapy, as compared with the classic form of the disease. The main outcome measures are the APROP regression rate, structural outcomes, and complications associated with intravitreal bevacizumab (IVB) versus laser photocoagulation in APROP. MATERIAL AND METHODS This is a retrospective case series that includes infants with APROP who received either IVB or laser photocoagulation and had a follow-up of at least 60 weeks (for the laser photocoagulation group) and 80 weeks (for the IVB group). In the first group, laser photocoagulation of the retina was carried out and in the second group, 1 bevacizumab injection was administered intravitreally. The following parameters were analyzed in each group: sex, gestational age, birth weight, postnatal age and postmenstrual age at treatment, APROP regression, sequelae, and complications. Statistical analysis was performed using Microsoft Excel and IBM SPSS (version 23.0). RESULTS The laser photocoagulation group consisted of 6 premature infants (12 eyes) and the IVB group consisted of 17 premature infants (34 eyes). Within the laser photocoagulation group, the evolution was favorable in 9 eyes (75%) and unfavorable in 3 eyes (25%). Within the IVB group, APROP regressed in 29 eyes (85.29%) and failed to regress in 5 eyes (14.71%). These differences are statistically significant, as proved by the McNemar test (P<0.001). CONCLUSIONS The IVB group had a statistically significant better outcome compared with the laser photocoagulation group, in APROP in our series. PMID:27062023
Exposure to diesel exhaust upregulates COX-2 expression in ApoE knockout mice
Bai, Ni; Tranfield, Erin M.; Kavanagh, Terrance J.; Kaufman, Joel D.; Rosenfeld, Michael E.; van Eeden, Stephan F.
2015-01-01
Introduction We have shown that diesel exhaust (DE) inhalation caused progression of atherosclerosis; however, the mechanisms are not fully understood. We hypothesize that exposure to DE upregulates cyclooxygenase (COX) expression and activity, which could play a role in DE-induced atherosclerosis. Methods ApoE knockout mice (30-week old) fed with regular chow were exposed to DE (at 200 μg/m3 of particulate matter) or filtered air (control) for 7 weeks (6 h/day, 5 days/week). The protein and mRNA expression of COX-1 and COX-2 were evaluated by immunohistochemistry analysis and quantitative real-time PCR, respectively. To examine COX activity, thoracic aortae were mounted in a wire myograph, and phenylephrine (PE)-stimulated vasoconstriction was measured with and without the presence of COX antagonists (indomethacin). COX-2 activity was further assessed by urine 2,3-dinor-6-keto PGF1α level, a major metabolite of prostacyclin I2 (PGI2). Results Immunohistochemistry analysis demonstrates that DE exposure enhanced COX-2 expression in both thoracic aorta (p < 0.01) and aortic root (p < 0.03), with no modification of COX-1 expression. The increased COX-2 expression was positively correlated with smooth muscle cell content in aortic lesions (R2 = 0.4081, p < 0.008). The fractional changes of maximal vasoconstriction in the presence of indomethacin was attenuated by 3-fold after DE exposure (p < 0.02). Urine 2,3-dinor-6-keto PGF1α level was 15-fold higher in DE group than the control (p < 0.007). The mRNA expression of COX-2 (p < 0.006) and PGI synthase (p < 0.02), but not COX-1, was significantly augmented after DE exposure. Conclusion We show that DE inhalation enhanced COX-2 expression, which is also associated with phenotypic changes of aortic lesion. PMID:22746401
Jamali, Jamshid; Roustaei, Narges; Ayatollahi, Seyyed Mohammad Taghi; Sadeghi, Erfan
2015-01-01
Background: Mental health is one of the most important dimensions of life and its quality. Minor Psychiatric Disorder as a type of mental health problem is prevalent among health workers. Nursing is considered to be one of the most stressful occupations. Objectives: This study aimed to evaluate the prevalence of minor psychiatric disorder and its associated factors among nurses in southern Iran. Patients and Methods: A cross-sectional study was carried out on 771 nurses working in 20 cities of Bushehr and Fars provinces in southern Iran. Participants were recruited through multi-stage sampling during 2014. The General Health Questionnaire (GHQ-12) was used for screening of minor psychiatric disorder in nurses. Latent Class Regression was used to analyze the data. Results: The prevalence of minor psychiatric disorder among nurses was estimated to be 27.5%. Gender and sleep disorders were significant factors in determining the level of minor psychiatric disorder (P Values of 0.04 and < 0.001, respectively). Female nurses were 20% more likely than males to be classified into the minor psychiatric disorder group. Conclusions: The results of this study provide information about the prevalence of minor psychiatric disorder among nurses, and factors, which affect the prevalence of such disorders. These findings can be used in strategic planning processes to improve nurses’ mental health. PMID:26339670
The effects of invertebrate herbivores on plant population growth: a meta-regression analysis.
Katz, Daniel S W
2016-09-01
Over the last two decades, an increasing number of studies have quantified the effects of herbivory on plant populations using stage-structured population models and integral projection models, allowing for the calculation of plant population growth rates (λ) with and without herbivory. In this paper, I assembled 29 studies and conducted a meta-regression to determine the importance of invertebrate herbivores to population growth rates (λ) while accounting for missing data. I found that invertebrate herbivory often induced important reductions in plant population growth rates (with herbivory, λ was 1.08 ± 0.36; without herbivory, λ was 1.28 ± 0.58). This relationship tended to be weaker for seed predation than for other types of herbivory, except when seed predation rates were very high. Even so, the amount by which studies reduced herbivory was a poor predictor of differences in population growth rates, which strongly cautions against using measured herbivory rates as a proxy for the impact of herbivores. Herbivory reduced plant population growth rates significantly more when potential growth rates were high, which helps to explain why there was less variation in actual population growth rates than in potential population growth rates. The synthesis of these studies also shows the need for future studies to report variance in estimates of λ and to quantify how λ varies as a function of plant density. PMID:27017603
Time-trend of melanoma screening practice by primary care physicians: A meta-regression analysis
Mauri, Davide; Karampoiki, Vassiliki; Polyzos, Nikolaos P; Cortinovis, Ivan; Koukourakis, Georgios; Zacharias, Georgios; Xilomenos, Apostolos; Tsappi, Maria; Casazza, Giovanni
2009-01-01
Objective To assess whether the proportion of primary care physicians implementing full body skin examination (FBSE) to screen for melanoma changed over time. Methods Meta-regression analyses of available data. Data Sources: MEDLINE, ISI, Cochrane Central Register of Controlled Trials. Results Fifteen studies surveying 10,336 physicians were included in the analyses. Overall, 15%–82% of them reported performing FBSE to screen for melanoma. The proportion of physicians using FBSE screening tended to decrease by 1.72% per year (P = 0.086). Corresponding annual changes in European, North American, and Australian settings were −0.68% (P = 0.494), −2.02% (P = 0.044), and +2.59% (P = 0.010), respectively. Changes were not influenced by national guidelines. Conclusions Considering the increasing incidence of melanoma and other skin malignancies, as well as their relative potential consequences, the FBSE implementation time-trend we retrieved should be considered a worrisome phenomenon. PMID:19242870
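A time-trend meta-regression of this kind can be pictured as a weighted regression of each study's reported proportion on its survey year. The sketch below assumes sample-size weights and a fixed-effect straight line, which is a simplification (published meta-regressions typically weight by inverse variance and may add a between-study variance term); the three studies shown are hypothetical and are not drawn from the fifteen included in the analysis.

```python
def weighted_trend(years, proportions, weights):
    """Weighted least-squares slope and intercept of proportion on year."""
    total = sum(weights)
    mean_year = sum(w * x for w, x in zip(weights, years)) / total
    mean_prop = sum(w * p for w, p in zip(weights, proportions)) / total
    sxx = sum(w * (x - mean_year) ** 2 for w, x in zip(weights, years))
    sxy = sum(
        w * (x - mean_year) * (p - mean_prop)
        for w, x, p in zip(weights, years, proportions)
    )
    slope = sxy / sxx  # change in percentage points per year
    return slope, mean_prop - slope * mean_year

# Hypothetical studies: survey year, percent performing screening, sample size.
survey_years = [1990, 2000, 2010]
percent_screening = [60.0, 50.0, 30.0]
sample_sizes = [100, 200, 100]
trend_slope, trend_intercept = weighted_trend(
    survey_years, percent_screening, sample_sizes
)
```

A negative slope corresponds to a declining share of physicians performing the examination over calendar time.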
Predictors of Asian American adolescents' suicide attempts: a latent class regression analysis.
Wong, Y Joel; Maffini, Cara S
2011-11-01
Although suicide-related outcomes among Asian American adolescents are a serious public health problem in the United States, research in this area has been relatively sparse. To address this gap in the empirical literature, this study examined subgroups of Asian American adolescents for whom family, school, and peer relationships exerted differential effects on suicide attempts. Data were drawn from Waves 1 and 2 of the National Longitudinal Study of Adolescent Health dataset and included responses from a national sample of 959 Asian American adolescents (48.0% girls; average age at Wave 2 = 16.43). A latent class regression was used to assess the optimal number of latent classes (i.e., subgroups of participants) that explained the associations between family, school, and peer relationships and subsequent suicide attempts. Three latent classes were identified. Most participants belonged to a latent class in which family, school, and peer relationships were protective factors. However, stronger school relationships and peer relationships were found to be risk factors in two other latent classes. The three latent classes also differed significantly in terms of suicide attempts, gender, and acculturation. The practical implications of this study, particularly for educators and mental health professionals, are discussed. PMID:21818685
Stevens, F. J.; Bobrovnik, S. A.; Biosciences Division; Palladin Inst. Biochemistry
2007-12-01
Physiological responses of the adaptive immune system are polyclonal in nature whether induced by a naturally occurring infection, by vaccination to prevent infection or, in the case of animals, by challenge with antigen to generate reagents of research or commercial significance. The composition of the polyclonal responses is distinct to each individual or animal and changes over time. Differences exist in the affinities of the constituents and their relative proportion of the responsive population. In addition, some of the antibodies bind to different sites on the antigen, whereas other pairs of antibodies are sterically restricted from concurrent interaction with the antigen. Even if generation of a monoclonal antibody is the ultimate goal of a project, the quality of the resulting reagent is ultimately related to the characteristics of the initial immune response. It is probably impossible to quantitatively parse the composition of a polyclonal response to antigen. However, molecular regression allows further parameterization of a polyclonal antiserum in the context of certain simplifying assumptions. The antiserum is described as consisting of two competing populations of high- and low-affinity and unknown relative proportions. This simple model allows the quantitative determination of representative affinities and proportions. These parameters may be of use in evaluating responses to vaccines, to evaluating continuity of antibody production whether in vaccine recipients or animals used for the production of antisera, or in optimizing selection of donors for the production of monoclonal antibodies.
Costa-Font, Joan; Fabbri, Daniele; Gil, Joan
2009-12-01
Wide cross-country variation in obesity rates has been reported between European Union member states. Although the existing cross-country differences have not been analyzed in depth, they contain important information on health production determinants. In this paper we apply a methodology for conducting standardized cross-country comparisons of body mass index (BMI). We draw on estimations of the marginal density function of BMI for Italy and Spain in 2003, two countries with similar GDP and socio-economic conditions. We produce different counterfactual distribution estimates using covariates (health production inputs) specified in a quantile regression. Our findings suggest that Spain-to-Italy BMI gaps among females are largely explained by cross-country variation in the returns to each covariate, especially for younger women. We find that adverse underlying determinants do not explain the gap observed in particular between younger Spanish females and their Italian counterfactuals; behavioural differences appear to be the key. We tentatively conclude that Spanish policy on obesity should target mainly younger females. PMID:19782010
Lewis, Kristin Nicole; Heckman, Bernadette Davantes; Himawan, Lina
2011-08-01
Growth mixture modeling (GMM) identified latent groups based on treatment outcome trajectories of headache disability measures in patients in headache subspecialty treatment clinics. Using a longitudinal design, 219 patients in headache subspecialty clinics in 4 large cities throughout Ohio provided data on their headache disability at pretreatment and 3 follow-up assessments. GMM identified 3 treatment outcome trajectory groups: (1) patients who initiated treatment with elevated disability levels and who reported statistically significant reductions in headache disability (high-disability improvers; 11%); (2) patients who initiated treatment with elevated disability but who reported no reductions in disability (high-disability nonimprovers; 34%); and (3) patients who initiated treatment with moderate disability and who reported statistically significant reductions in headache disability (moderate-disability improvers; 55%). Based on the final multinomial logistic regression model, a dichotomized treatment appointment attendance variable was a statistically significant predictor for differentiating high-disability improvers from high-disability nonimprovers. Three-fourths of patients who initiated treatment with elevated disability levels did not report reductions in disability after 5 months of treatment with new preventive pharmacotherapies. Preventive headache agents may be most efficacious for patients with moderate levels of disability and for patients with high disability levels who attend all treatment appointments. PMID:21420240
A Vector Approach to Regression Analysis and Its Implications to Heavy-Duty Diesel Emissions
McAdams, H.T.
2001-02-14
An alternative approach is presented for the regression of response data on predictor variables that are not logically or physically separable. The methodology is demonstrated by its application to a data set of heavy-duty diesel emissions. Because of the covariance of fuel properties, it is found advantageous to redefine the predictor variables as vectors, in which the original fuel properties are components, rather than as scalars each involving only a single fuel property. The fuel property vectors are defined in such a way that they are mathematically independent and statistically uncorrelated. Because the available data set does not allow definitive separation of vehicle and fuel effects, and because test fuels used in several of the studies may be unrealistically contrived to break the association of fuel variables, the data set is not considered adequate for development of a full-fledged emission model. Nevertheless, the data clearly show that only a few basic patterns of fuel-property variation affect emissions and that the number of these patterns is considerably less than the number of variables initially thought to be involved. These basic patterns, referred to as "eigenfuels," may reflect blending practice in accordance with their relative weighting in specific circumstances. The methodology is believed to be widely applicable in a variety of contexts. It promises an end to the threat of collinearity and the frustration of attempting, often unrealistically, to separate variables that are inseparable.
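The idea of replacing collinear predictors with uncorrelated vectors can be illustrated with a minimal principal-component regression sketch. This is not the author's procedure or data: the three "fuel properties", the single latent blend factor, and the response are all synthetic assumptions, and the eigenvectors of the predictor covariance matrix merely stand in for what the abstract calls eigenfuels.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic (hypothetical) data: three fuel properties that all move with
# one latent blending factor, so they are strongly collinear.
n = 200
latent = rng.normal(size=n)
X = np.column_stack([
    latent + 0.05 * rng.normal(size=n),
    2.0 * latent + 0.05 * rng.normal(size=n),
    -latent + 0.05 * rng.normal(size=n),
])
y = 3.0 * latent + 0.1 * rng.normal(size=n)  # synthetic emission response

# Centre the predictors and diagonalise their covariance matrix.
Xc = X - X.mean(axis=0)
cov = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)       # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]            # reorder to descending
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Scores along the eigenvectors are mutually uncorrelated by construction.
scores = Xc @ eigvecs
explained = eigvals / eigvals.sum()

# Regress the response on the leading component only.
k = 1
Z = np.column_stack([np.ones(n), scores[:, :k]])
coef, *_ = np.linalg.lstsq(Z, y, rcond=None)
fitted = Z @ coef
r2 = 1.0 - np.sum((y - fitted) ** 2) / np.sum((y - y.mean()) ** 2)

print("variance explained per component:", np.round(explained, 4))
print("R^2 using one component:", round(float(r2), 4))
```

In this synthetic setting one component carries nearly all of the predictor variance, which mirrors the abstract's point that the number of effective patterns can be much smaller than the number of measured variables.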
Kang, Gumin; Lee, Kwangchil; Park, Haesung; Lee, Jinho; Jung, Youngjean; Kim, Kyoungsik; Son, Boongho; Park, Hyoungkuk
2010-06-15
Mixed hydrofluoric and nitric acids are widely used as a good etchant for the pickling process of stainless steels. The cost reduction and the procedure optimization in the manufacturing process can be facilitated by optically detecting the concentration of the mixed acids. In this work, we developed a novel method which allows us to obtain the concentrations of hydrofluoric acid (HF) and nitric acid (HNO3) mixture samples with high accuracy. The experiments were carried out for the mixed acids which consist of the HF (0.5-3 wt%) and the HNO3 (2-12 wt%) at room temperature. Fourier Transform Raman spectroscopy has been utilized to measure the concentration of the mixed acids HF and HNO3, because the mixture sample has several strong Raman bands caused by the vibrational mode of each acid in this spectrum. The calibration of spectral data has been performed using the partial least squares regression method which is ideal for local range data treatment. Several figures of merit (FOM) were calculated using the concept of net analyte signal (NAS) to evaluate performance of our methodology. PMID:20441916
NASA Technical Reports Server (NTRS)
Alston, D. W.
1981-01-01
The considered research had the objective to design a statistical model that could perform an error analysis of curve fits of wind tunnel test data using analysis of variance and regression analysis techniques. Four related subproblems were defined, and by solving each of these a solution to the general research problem was obtained. The capabilities of the evolved true statistical model are considered. The least squares fit is used to determine the nature of the force, moment, and pressure data. The order of the curve fit is increased in order to delete the quadratic effect in the residuals. The analysis of variance is used to determine the magnitude and effect of the error factor associated with the experimental data.
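The step of raising the curve-fit order until the quadratic effect disappears from the residuals can be sketched as follows. The data are synthetic (a hypothetical force coefficient against angle of attack), and the extra-sum-of-squares F statistic is one common analysis-of-variance comparison, assumed here for illustration rather than taken from the report.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic wind-tunnel-like data: a mildly curved response plus noise.
x = np.linspace(-4.0, 4.0, 41)  # hypothetical angle of attack, degrees
y = 0.1 + 0.08 * x + 0.01 * x**2 + rng.normal(scale=0.005, size=x.size)

def fit_rss(order):
    """Least-squares polynomial fit; return residual sum of squares and residuals."""
    coeffs = np.polyfit(x, y, order)
    resid = y - np.polyval(coeffs, x)
    return float(resid @ resid), resid

rss1, resid1 = fit_rss(1)  # straight line
rss2, resid2 = fit_rss(2)  # quadratic

# Extra-sum-of-squares F statistic for adding the single quadratic term.
df_resid = x.size - 3
f_stat = (rss1 - rss2) / (rss2 / df_resid)

# Curvature left in each set of residuals (leading coefficient of a
# quadratic fitted to the residuals themselves).
curv1 = np.polyfit(x, resid1, 2)[0]
curv2 = np.polyfit(x, resid2, 2)[0]

print("F statistic for the quadratic term:", round(f_stat, 1))
print("residual curvature, linear fit:   ", curv1)
print("residual curvature, quadratic fit:", curv2)
```

A large F statistic together with vanishing residual curvature after the second-order fit indicates that the higher-order term has absorbed the systematic pattern.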
Gerber, Samuel; Rübel, Oliver; Bremer, Peer-Timo; Pascucci, Valerio; Whitaker, Ross T.
2012-01-01
This paper introduces a novel partition-based regression approach that incorporates topological information. Partition-based regression typically introduces a quality-of-fit-driven decomposition of the domain. The emphasis in this work is on a topologically meaningful segmentation. Thus, the proposed regression approach is based on a segmentation induced by a discrete approximation of the Morse-Smale complex. This yields a segmentation with partitions corresponding to regions of the function with a single minimum and maximum that are often well approximated by a linear model. This approach yields regression models that are amenable to interpretation and have good predictive capacity. Typically, regression estimates are quantified by their geometrical accuracy. For the proposed regression, an important aspect is the quality of the segmentation itself. Thus, this paper introduces a new criterion that measures the topological accuracy of the estimate. The topological accuracy provides a complementary measure to the classical geometrical error measures and is very sensitive to over-fitting. The Morse-Smale regression is compared to state-of-the-art approaches in terms of geometry and topology and yields comparable or improved fits in many cases. Finally, a detailed study on climate-simulation data demonstrates the application of the Morse-Smale regression. Supplementary materials are available online and contain an implementation of the proposed approach in the R package msr, an analysis and simulations on the stability of the Morse-Smale complex approximation, and additional tables for the climate-simulation study. PMID:23687424
Gerber, Samuel; Rubel, Oliver; Bremer, Peer -Timo; Pascucci, Valerio; Whitaker, Ross T.
2012-01-19
This paper introduces a novel partition-based regression approach that incorporates topological information. Partition-based regression typically introduces a quality-of-fit-driven decomposition of the domain. The emphasis in this work is on a topologically meaningful segmentation. Thus, the proposed regression approach is based on a segmentation induced by a discrete approximation of the Morse–Smale complex. This yields a segmentation with partitions corresponding to regions of the function with a single minimum and maximum that are often well approximated by a linear model. This approach yields regression models that are amenable to interpretation and have good predictive capacity. Typically, regression estimates are quantified by their geometrical accuracy. For the proposed regression, an important aspect is the quality of the segmentation itself. Thus, this article introduces a new criterion that measures the topological accuracy of the estimate. The topological accuracy provides a complementary measure to the classical geometrical error measures and is very sensitive to overfitting. The Morse–Smale regression is compared to state-of-the-art approaches in terms of geometry and topology and yields comparable or improved fits in many cases. Finally, a detailed study on climate-simulation data demonstrates the application of the Morse–Smale regression. Supplementary Materials are available online and contain an implementation of the proposed approach in the R package msr, an analysis and simulations on the stability of the Morse–Smale complex approximation, and additional tables for the climate-simulation study.
Regression Analysis of Long-Term Profile Ozone Data Set from BUV Instruments
NASA Technical Reports Server (NTRS)
Stolarski, Richard S.
2005-01-01
We have produced a profile merged ozone data set (MOD) based on the SBUV/SBUV2 series of nadir-viewing satellite backscatter instruments, covering the period from November 1978 to December 2003. In 2004, data from the Nimbus 7 SBUV and NOAA 9, 11, and 16 SBUV/2 instruments were reprocessed using the Version 8 (V8) algorithm and most recent calibrations. More recently, data from the Nimbus 4 BUV instrument, which was operational from 1970 to 1977, were also reprocessed using the V8 algorithm. As part of the V8 profile calibration, the Nimbus 7 and NOAA 9 (1993-1997 only) instrument calibrations have been adjusted to match the NOAA 11 calibration, which was established based on comparisons with SSBUV shuttle flight data. Differences between NOAA 11, Nimbus 7 and NOAA 9 profile zonal means are within ±5% at all levels when averaged over the respective periods of data overlap. NOAA 16 SBUV/2 data have insufficient overlap with NOAA 11, so its calibration is based on pre-flight information. Mean differences over 4 months of overlap are within ±7%. Given the level of agreement between the data sets, we simply average the ozone values during periods of instrument overlap to produce the MOD profile data set. Initial comparisons of coincident matches of N4 BUV and Arosa Umkehr data show mean differences of 0.5 (0.5)% at 30 km; 7.5 (0.5)% at 35 km; and 11 (0.7)% at 40 km, where the number in parentheses is the standard error of the mean. In this study, we use the MOD profile data set (1978-2003) to estimate the change in profile ozone due to changing stratospheric chlorine levels. We use a standard linear regression model with proxies for the seasonal cycle, solar cycle, QBO, and ozone trend. To account for the non-linearity of stratospheric chlorine levels since the late 1990s, we use a time series of Effective Chlorine, defined as the global average of Chlorine + 50 * Bromine at 1 hPa, as the trend proxy. The Effective Chlorine data are taken from
ERIC Educational Resources Information Center
Berenson, Mark L.
2013-01-01
There is consensus in the statistical literature that severe departures from its assumptions invalidate the use of regression modeling for purposes of inference. The assumptions of regression modeling are usually evaluated subjectively through visual, graphic displays in a residual analysis but such an approach, taken alone, may be insufficient…
Landslide susceptibility analysis with logistic regression model based on FCM sampling strategy
NASA Astrophysics Data System (ADS)
Wang, Liang-Jie; Sawada, Kazuhide; Moriguchi, Shuji
2013-08-01
Several mathematical models are used to predict the spatial distribution characteristics of landslides to mitigate damage caused by landslide disasters. Although some studies have achieved excellent results around the world, few studies take the inter-relationship of the selected points (training points) into account. In this paper, we present the Fuzzy c-means (FCM) algorithm as an optimal method for choosing the appropriate input landslide points as training data. Based on different combinations of the Fuzzy exponent (m) and the number of clusters (c), five groups of sampling points were derived from formal seed cells points and applied to analyze the landslide susceptibility in Mizunami City, Gifu Prefecture, Japan. A logistic regression model is applied to create the models of the relationships between landslide-conditioning factors and landslide occurrence. The pre-existing landslide bodies and the area under the relative operating characteristic (ROC) curve were used to evaluate the performance of all the models with different m and c. The results revealed that Model no. 4 (m=1.9, c=4) and Model no. 5 (m=1.9, c=5) have significantly high classification accuracies, i.e., 90.0%. Moreover, over 30% of the landslide bodies were grouped under the very high susceptibility zone. In addition, Model no. 4 and Model no. 5 had higher area under the ROC curve (AUC) values, which were 0.78 and 0.79, respectively. Therefore, Model no. 4 and Model no. 5 offer better model results for landslide susceptibility mapping. Maps derived from Model no. 4 and Model no. 5 would offer the local authorities crucial information for city planning and development.
Kitsantas, Panagiota
2009-01-01
Objective to be addressed The purpose of this study was to investigate the structural and organizational factors that contribute to the availability and increased capacity for substance abuse treatment programs in correctional settings. We used Classification and Regression Tree statistical procedures to identify how multi-level data can explain the variability in availability and capacity of substance abuse treatment programs in jails and probation/parole offices. Methods The data for this study combined the National Criminal Justice Treatment Practices survey (NCJTP) and the 2000 Census. The NCJTP survey was a nationally representative sample of correctional administrators for jails and probation/parole agencies. The sample size included 295 substance abuse treatment programs that were classified according to the intensity of their services: high, medium, and low. The independent variables included jurisdictional-level structural variables, attributes of the correctional administrators, and program and service delivery characteristics of the correctional agency. Results The two most important variables in predicting the availability of all three types of services were stronger working relationships with other organizations and the adoption of a standardized substance abuse screening tool by correctional agencies. For high and medium intensive programs, the capacity increased when an organizational learning strategy was used by administrators and the organization used a substance abuse screening tool. Implications on advancing treatment practices in correctional settings are discussed, including further work to test theories on how to better understand access to intensive treatment services. This study presents the first phase of understanding capacity-related issues regarding treatment programs offered in correctional settings. PMID:19395204
A regression approach to the analysis of serial peak flow among fuel oil ash exposed workers.
Hauser, R; Daskalakis, C; Christiani, D C
1996-10-01
We investigated the association between exposure to fuel oil ash and acute airway obstruction in 31 boilermakers and 31 utility workers during the overhaul of a large oil-fired boiler. Air flow was assessed with self-recorded serial peak expiratory flow rate measurements (PEFR) using a mini-Wright meter. Exposure to thoracic particulates with an aerodynamic diameter of 10 µm or smaller (PM10) was assessed using personal sampling devices and detailed work diaries. All subjects were male, with an average age of 43 yr, and an average of 18 yr at their current trade. Average PM10 exposure on work days was 2.75 mg/m3 for boilermakers and 0.57 mg/m3 for utility workers. Three daily PEFR measurements (start-of-shift, end-of-shift, and bed-time) were analyzed simultaneously, using Huber linear regression. After adjustment for job title, welder status, age, height, smoking, and weld-years, for each mg/m3 increase in PM10, the estimated decline in PEFR was 13.2 L/min (p = 0.008) for end-of-shift, 9.9 L/min (p = 0.045) for bed-time, and 6.6 L/min (p = 0.26) for start-of-shift of the following day. This decline of the exposure effect over the 24-h period that follows was statistically significant (p = 0.004). No other factors were found to significantly modify the effect of exposure. Our results suggest that occupational exposure to fuel oil ash is associated with significant acute decrements in peak flow. PMID:8887594
A non-linear regression method for CT brain perfusion analysis
NASA Astrophysics Data System (ADS)
Bennink, E.; Oosterbroek, J.; Viergever, M. A.; Velthuis, B. K.; de Jong, H. W. A. M.
2015-03-01
CT perfusion (CTP) imaging allows for rapid diagnosis of ischemic stroke. Generation of perfusion maps from CTP data usually involves deconvolution algorithms providing estimates for the impulse response function in the tissue. We propose the use of a fast non-linear regression (NLR) method that we postulate has similar performance to the current academic state-of-art method (bSVD), but that has some important advantages, including the estimation of vascular permeability, improved robustness to tracer-delay, and very few tuning parameters, that are all important in stroke assessment. The aim of this study is to evaluate the fast NLR method against bSVD and a commercial clinical state-of-art method. The three methods were tested against a published digital perfusion phantom earlier used to illustrate the superiority of bSVD. In addition, the NLR and clinical methods were also tested against bSVD on 20 clinical scans. Pearson correlation coefficients were calculated for each of the tested methods. All three methods showed high correlation coefficients (>0.9) with the ground truth in the phantom. With respect to the clinical scans, the NLR perfusion maps showed higher correlation with bSVD than the perfusion maps from the clinical method. Furthermore, the perfusion maps showed that the fast NLR estimates are robust to tracer-delay. In conclusion, the proposed fast NLR method provides a simple and flexible way of estimating perfusion parameters from CT perfusion scans, with high correlation coefficients. This suggests that it could be a better alternative to the current clinical and academic state-of-art methods.
NASA Astrophysics Data System (ADS)
Kügler, S. D.; Polsterer, K.; Hoecker, M.
2015-04-01
Context. In astronomy, new approaches to process and analyze the exponentially increasing amount of data are inevitable. For spectra, such as in the Sloan Digital Sky Survey spectral database, usually templates of well-known classes are used for classification. In case the fitting of a template fails, wrong spectral properties (e.g. redshift) are derived. Validation of the derived properties is the key to understand the caveats of the template-based method. Aims: In this paper we present a method for statistically computing the redshift z based on a similarity approach. This allows us to determine redshifts in spectra for emission and absorption features without using any predefined model. Additionally, we show how to determine the redshift based on single features. As a consequence we are, for example, able to filter objects that show multiple redshift components. Methods: The redshift calculation is performed by comparing predefined regions in the spectra and individually applying a nearest neighbor regression model to each predefined emission and absorption region. Results: The choice of the model parameters controls the quality and the completeness of the redshifts. For ≈90% of the analyzed 16 000 spectra of our reference and test sample, a certain redshift can be computed that is comparable to the completeness of SDSS (96%). The redshift calculation yields a precision for every individually tested feature that is comparable to the overall precision of the redshifts of SDSS. Using the new method to compute redshifts, we could also identify 14 spectra with a significant shift between emission and absorption or between emission and emission lines. The results already show the immense power of this simple machine-learning approach for investigating huge databases such as the SDSS.
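A nearest-neighbor regression on one predefined spectral region can be sketched as below. Everything here is a toy assumption: the pixel grid, the single Gaussian feature whose position shifts with redshift, the noise level, and the helper names `make_region` and `knn_predict` are hypothetical and are not the authors' pipeline or SDSS data.

```python
import numpy as np

rng = np.random.default_rng(3)
wave = np.linspace(0.0, 1.0, 200)  # arbitrary pixel grid for one spectral region

def make_region(z):
    """Toy spectral region: one Gaussian feature whose centre moves with z."""
    centre = 0.2 + 0.5 * z         # hypothetical mapping from redshift to position
    return np.exp(-0.5 * ((wave - centre) / 0.02) ** 2)

# Reference sample with known redshifts (synthetic).
z_ref = rng.uniform(0.0, 1.0, size=500)
ref = np.array([make_region(z) for z in z_ref])
ref += 0.02 * rng.normal(size=ref.shape)

def knn_predict(query, ref_spectra, targets, k=5):
    """Average the targets of the k reference regions closest to the query."""
    dist = np.linalg.norm(ref_spectra - query, axis=1)
    nearest = np.argsort(dist)[:k]
    return float(targets[nearest].mean()), float(targets[nearest].std())

z_true = 0.4
query = make_region(z_true) + 0.02 * rng.normal(size=wave.size)
z_hat, z_spread = knn_predict(query, ref, z_ref)

print("estimated z:", round(z_hat, 3), "neighbour spread:", round(z_spread, 3))
```

Applying the same function separately to several regions gives one estimate per feature; disagreement between those estimates is the kind of signal that could flag spectra with more than one redshift component.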
Flexible regression models for ROC and risk analysis, with or without a gold standard.
Branscum, Adam J; Johnson, Wesley O; Hanson, Timothy E; Baron, Andre T
2015-12-30
A novel semiparametric regression model is developed for evaluating the covariate-specific accuracy of a continuous medical test or biomarker. Ideally, studies designed to estimate or compare medical test accuracy will use a separate, flawless gold-standard procedure to determine the true disease status of sampled individuals. We treat this as a special case of the more complicated and increasingly common scenario in which disease status is unknown because a gold-standard procedure does not exist or is too costly or invasive for widespread use. To compensate for missing data on disease status, covariate information is used to discriminate between diseased and healthy units. We thus model the probability of disease as a function of 'disease covariates'. In addition, we model test/biomarker outcome data to depend on 'test covariates', which provides researchers the opportunity to quantify the impact of covariates on the accuracy of a medical test. We further model the distributions of test outcomes using flexible semiparametric classes. An important new theoretical result demonstrating model identifiability under mild conditions is presented. The modeling framework can be used to obtain inferences about covariate-specific test accuracy and the probability of disease based on subject-specific disease and test covariate information. The value of the model is illustrated using multiple simulation studies and data on the age-adjusted ability of soluble epidermal growth factor receptor - a ubiquitous serum protein - to serve as a biomarker of lung cancer in men. SAS code for fitting the model is provided. Copyright © 2015 John Wiley & Sons, Ltd. PMID:26239173
LOGISTIC NETWORK REGRESSION FOR SCALABLE ANALYSIS OF NETWORKS WITH JOINT EDGE/VERTEX DYNAMICS
Almquist, Zack W.; Butts, Carter T.
2015-01-01
Change in group size and composition has long been an important area of research in the social sciences. Similarly, interest in interaction dynamics has a long history in sociology and social psychology. However, the effects of endogenous group change on interaction dynamics are a surprisingly understudied area. One way to explore these relationships is through social network models. Network dynamics may be viewed as a process of change in the edge structure of a network, in the vertex set on which edges are defined, or in both simultaneously. Although early studies of such processes were primarily descriptive, recent work on this topic has increasingly turned to formal statistical models. Although showing great promise, many of these modern dynamic models are computationally intensive and scale very poorly in the size of the network under study and/or the number of time points considered. Likewise, currently used models focus on edge dynamics, with little support for endogenously changing vertex sets. Here, the authors show how an existing approach based on logistic network regression can be extended to serve as a highly scalable framework for modeling large networks with dynamic vertex sets. The authors place this approach within a general dynamic exponential family (exponential-family random graph modeling) context, clarifying the assumptions underlying the framework (and providing a clear path for extensions), and they show how model assessment methods for cross-sectional networks can be extended to the dynamic case. Finally, the authors illustrate this approach on a classic data set involving interactions among windsurfers on a California beach. PMID:26120218
POLINSAR Coherence-Based Regression Analysis of Forest Biomass Using RADARSAT-2 Datasets
NASA Astrophysics Data System (ADS)
Singh, J.; Kumar, S.; Kushwaha, S. P. S.
2014-11-01
Forests play a pivotal role in synchronizing earth's carbon cycle by absorbing carbon from the atmosphere and storing it in the form of biomass. Researchers today are trying to understand the climatic variations, especially those occurring due to destruction of forest and its corresponding biomass loss. Hence, quantification of various forest parameters such as biomass is imperative for evaluating the carbon. The objective of this research was to exploit the potential of C-band Radarsat-2 Polarimetric Interferometric Synthetic Aperture Radar (PolInSAR) technique for analysing the relationship between complex coherence and field-estimated aboveground biomass. Association between the backscatter and the aboveground biomass was also established in the process. To serve our objective, Radarsat-2 interferometric pair dated 4th March, 2013 (master image) and 28th March, 2013 (slave image) were procured for the Barkot Reserve Forest region of Dehradun, India. Field sampling was done for 30 plots (31.62 m x 31.62 m) and stem diameter and tree height were measured in each plot. The study emphasized on the application of POLINSAR coherence instead of using conventional method of relying on backscatter values for retrieving forest biomass. Coherence matrices were utilized for generating complex coherence values for different polarization channels and were regressed against field estimated aboveground biomass. Results indicated a negative linear relationship between complex coherence and aboveground biomass with the cross - polarized coherence showing the highest R2 value of 0.71. Further, the backscatter mechanism when studied with respect to aboveground biomass indicated a positive linear relationship between backscatter values and field estimated aboveground biomass with R2 value of 0.45 and 0.61 for slave and master image respectively. The results suggest that PolInSAR technique, in combination with different modelling approaches, can be adopted for estimating forest
Diepgen, T L; Blettner, M
1996-05-01
In order to determine the relative importance of genetics and the environment on the occurrence of atopic diseases, we investigated the familial aggregation of atopic eczema, allergic rhinitis, and allergic asthma in the relatives of 426 patients with atopic eczema and 628 subjects with no history of eczema (5,136 family members in total). Analyses were performed by regression models for odds ratios (OR) allowing us to estimate OR for the familial aggregation and simultaneously to adjust for other covariates. Three models were analyzed assuming that the OR i) is the same among any two members of a family, ii) depends on different familial constellations, i.e., whether the pairs are siblings, parents, or parent/sibling pairs, and iii) is not the same between the father and the children and between the mother and the children. The OR of familial aggregation for atopic eczema was 2.16 (95% confidence interval (95%-CI) 1.58-2.96) if no distinction was made between the degree of relationship. Further analyses within the members of the family showed a high OR among siblings (OR = 3.86; 95%-CI 2.10-7.09), while the OR between parents and siblings was only 1.90 (95%-CI 1.31-2.97). Only for atopic eczema was the familial aggregation stronger between mothers and siblings than between fathers and siblings (ms: OR = 2.66; fs: OR = 1.29). This can be explained by stronger maternal heritability, shared physical environment of mother and child, or environmental events that affect the fetus in utero. Since for all atopic diseases a stronger correlation was found between siblings than between siblings and parents, our study indicates that environmental factors, especially during childhood, seem to explain the recently observed increased frequencies of atopic diseases. PMID:8618061
Binary logistic regression analysis of hard palate dimensions for sexing human crania
Asif, Muhammed; Shetty, Radhakrishna; Avadhani, Ramakrishna
2016-01-01
Sex determination is the preliminary step in every forensic investigation and the hard palate assumes significance in cranial sexing in cases involving burns and explosions due to its resistant nature and secluded location. This study analyzes the sexing potential of incisive foramen to posterior nasal spine length, palatine process of maxilla length, horizontal plate of palatine bone length and transverse length between the greater palatine foramina. The study deviates from the conventional method of measuring the maxillo-alveolar length and breadth as the dimensions considered in this study are more heat resistant and useful in situations with damaged alveolar margins. The study involves 50 male and 50 female adult dry skulls of Indian ethnic group. The dimensions measured were statistically analyzed using Student's t test, binary logistic regression and receiver operating characteristic curve. It was observed that the incisive foramen to posterior nasal spine length is a definite sex marker with sex predictability of 87.2%. The palatine process of maxilla length with 66.8% sex predictability and the horizontal plate of palatine bone length with 71.9% sex predictability cannot be relied upon as definite sex markers. The transverse length between the greater palatine foramina is statistically insignificant in sexing crania (P=0.318). Considering a significant overlap of values in both the sexes the palatal dimensions singularly cannot be relied upon for sexing. Nevertheless, considering the high sex predictability of incisive foramen to posterior nasal spine length this dimension can definitely be used to supplement other sexing evidence available to precisely conclude the cranial sex. PMID:27382518
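The receiver operating characteristic analysis mentioned above can be summarised by the area under the curve (AUC). The sketch below computes the AUC for a single measurement as the probability that a value from one group exceeds a value from the other. This is a related but different summary from the sex predictability percentages reported in the abstract, and the measurements are invented for illustration.

```python
def auc(pos, neg):
    """Area under the ROC curve as the Mann-Whitney probability that a
    value from `pos` exceeds a value from `neg` (ties count one half)."""
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Hypothetical lengths in mm, invented for illustration only.
male = [52.0, 50.5, 49.0, 53.5]
female = [47.0, 49.0, 46.5, 50.0]
print(auc(male, female))
```

An AUC of 0.5 corresponds to a dimension with no discriminating ability, and 1.0 to complete separation of the two groups.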
Geneletti, Sara; O’Keeffe, Aidan G.; Sharples, Linda D.; Richardson, Sylvia; Baio, Gianluca
2016-01-01
The regression discontinuity (RD) design is a quasi-experimental design that estimates the causal effects of a treatment by exploiting naturally occurring treatment rules. It can be applied in any context where a particular treatment or intervention is administered according to a pre-specified rule linked to a continuous variable. Such thresholds are common in primary care drug prescription where the RD design can be used to estimate the causal effect of medication in the general population. Such results can then be contrasted to those obtained from randomised controlled trials (RCTs) and inform prescription policy and guidelines based on a more realistic and less expensive context. In this paper, we focus on statins, a class of cholesterol-lowering drugs; however, the methodology can be applied to many other drugs provided these are prescribed in accordance with pre-determined guidelines. Current guidelines in the UK state that statins should be prescribed to patients with 10-year cardiovascular disease risk scores in excess of 20%. If we consider patients whose risk scores are close to the 20% risk score threshold, we find that there is an element of random variation in both the risk score itself and its measurement. We can therefore consider the threshold as a randomising device that assigns statin prescription to individuals just above the threshold and withholds it from those just below. Thus, we are effectively replicating the conditions of an RCT in the area around the threshold, removing or at least mitigating confounding. We frame the RD design in the language of conditional independence, which clarifies the assumptions necessary to apply an RD design to data, and which makes the links with instrumental variables clear. We also have context-specific knowledge about the expected sizes of the effects of statin prescription and are thus able to incorporate this into Bayesian models by formulating informative priors on our causal parameters. PMID:25809691
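The threshold idea can be sketched as a sharp RD estimate: fit a straight line on each side of the threshold within a bandwidth and take the difference of the two fits at the threshold. This is a simplified frequentist sketch, not the Bayesian models the authors describe, and it assumes every patient above the threshold is treated. The function names, the bandwidth and the synthetic data are all assumptions.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope

def sharp_rd_effect(xs, ys, threshold, bandwidth):
    """Jump in the fitted outcome at the threshold (above minus below)."""
    below = [(x, y) for x, y in zip(xs, ys)
             if threshold - bandwidth <= x < threshold]
    above = [(x, y) for x, y in zip(xs, ys)
             if threshold <= x <= threshold + bandwidth]
    b0, b1 = fit_line(*zip(*below))
    a0, a1 = fit_line(*zip(*above))
    return (a0 + a1 * threshold) - (b0 + b1 * threshold)

# Synthetic, noise-free data: the outcome drops by 2 units for scores >= 20.
scores = [15 + 0.5 * i for i in range(21)]  # 15.0, 15.5, ..., 25.0
outcome = [1 + 0.5 * s - (2 if s >= 20 else 0) for s in scores]
effect = sharp_rd_effect(scores, outcome, threshold=20, bandwidth=5)
print(round(effect, 6))
```

With real data the estimate depends on the bandwidth, so it is usual to report results for several bandwidths.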
Schmid, Matthias; Wickler, Florian; Maloney, Kelly O.; Mitchell, Richard; Fenske, Nora; Mayr, Andreas
2013-01-01
Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1). Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression, estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures. PMID:23626706
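For orientation, the likelihood that classical beta regression maximises can be written down in a few lines. The sketch below assumes the common mean-precision parameterisation, y ~ Beta(mu * phi, (1 - mu) * phi), with a logit link for the mean. It shows only the log-likelihood, not the boosting algorithm proposed in the paper; the function names and data are hypothetical.

```python
import math

def beta_logpdf(y, mu, phi):
    """Log density of Beta(mu * phi, (1 - mu) * phi) at y in (0, 1)."""
    a, b = mu * phi, (1.0 - mu) * phi
    return (math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
            + (a - 1.0) * math.log(y) + (b - 1.0) * math.log(1.0 - y))

def beta_regression_loglik(beta, phi, X, y):
    """Log-likelihood with a logit link: mu = 1 / (1 + exp(-x . beta))."""
    total = 0.0
    for row, yi in zip(X, y):
        eta = sum(b * x for b, x in zip(beta, row))
        mu = 1.0 / (1.0 + math.exp(-eta))
        total += beta_logpdf(yi, mu, phi)
    return total

# Hypothetical data: an intercept column plus one covariate.
X = [[1.0, 0.2], [1.0, 1.5], [1.0, -0.7]]
y = [0.35, 0.80, 0.20]
print(beta_regression_loglik([0.0, 1.0], 5.0, X, y))
```

A maximum likelihood fit would pass this function to a numerical optimiser over the coefficients and phi; letting phi depend on covariates as well is what allows the variance to be modelled alongside the mean.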
Stark, David T.; Bazan, Nicolas G.
2011-01-01
Stimulation of synaptic NMDA receptors (NMDARs) induces neuroprotection, while extrasynaptic NMDARs promote excitotoxic cell death. Neuronal expression of cyclooxygenase-2 (COX-2) is enhanced by synaptic NMDARs, and although this enzyme mediates neuronal functions, COX-2 is also regarded as a key modulator of neuroinflammation and is thought to exacerbate excitotoxicity via overproduction of prostaglandins. This raises an apparent paradox: synaptic NMDARs are pro-survival yet are essential for robust neuronal COX-2 expression. We hypothesized that stimulation of extrasynaptic NMDARs converts COX-2 signaling from a physiological to a potentially pathological process. We combined HPLC-ESI-MS/MS-based mediator lipidomics and unbiased image analysis in mouse dissociated and organotypic cortical cultures to uncover that synaptic and extrasynaptic NMDARs differentially modulate neuronal COX-2 expression and activity. We show that synaptic NMDARs enhance neuronal COX-2 expression, while sustained synaptic stimulation limits COX-2 activity by suppressing cellular levels of the primary COX-2 substrate, arachidonic acid (AA). In contrast, extrasynaptic NMDARs suppress COX-2 expression while activating phospholipase A2 (PLA2), which enhances AA levels by hydrolysis of membrane phospholipids. Thus, sequential activation of synaptic then extrasynaptic NMDARs maximizes COX-2-dependent prostaglandin synthesis. We also show that excitotoxic events only drive induction of COX-2 expression through abnormal synaptic network excitability. Finally, we show that non-enzymatic lipid peroxidation of arachidonic and other polyunsaturated fatty acids is a function of network activity history. A new paradigm emerges from our results suggesting that pathological COX-2 signaling associated with models of stroke, epilepsy, and neurodegeneration requires specific spatio-temporal NMDAR stimulation. PMID:21957234
Genetic analysis of longevity in Dutch dairy cattle using random regression.
van Pelt, M L; Meuwissen, T H E; de Jong, G; Veerkamp, R F
2015-06-01
Longevity, productive life, or lifespan of dairy cattle is an important trait for dairy farmers, and it is defined as the time from first calving to the last test date for milk production. Methods for genetic evaluations need to account for censored data; that is, records from cows that are still alive. The aim of this study was to investigate whether these methods also need to take account of survival being genetically a different trait across the entire lifespan of a cow. The data set comprised 112,000 cows with a total of 3,964,449 observations for survival per month from first calving until 72 mo in productive life. A random regression model with second-order Legendre polynomials was fitted for the additive genetic effect. Alternative parameterizations were (1) different trait definitions for the length of time interval for survival after first calving (1, 3, 6, and 12 mo); (2) linear or threshold model; and (3) differing the order of the Legendre polynomial. The partial derivatives of a profit function were used to transform variance components on the survival scale to those for lifespan. Survival rates were higher in early life than later in life (99 vs. 95%). When survival was defined over 12-mo intervals survival curves were smooth compared with curves when 1-, 3-, or 6-mo intervals were used. Heritabilities in each interval were very low and ranged from 0.002 to 0.031, but the heritability for lifespan over the entire period of 72 mo after first calving ranged from 0.115 to 0.149. Genetic correlations between time intervals ranged from 0.25 to 1.00. Genetic parameters and breeding values for the genetic effect were more sensitive to the trait definition than to whether a linear or threshold model was used or to the order of Legendre polynomial used. Cumulative survival up to the first 6 mo predicted lifespan with an accuracy of only 0.79 to 0.85; that is, reliability of breeding value with many daughters in the first 6 mo can be, at most, 0.62 to 0.72, and
Hu, Meng; Clark, Kelsey L; Gong, Xiajing; Noudoost, Behrad; Li, Mingyao; Moore, Tirin; Liang, Hualou
2015-06-10
Inferotemporal (IT) neurons are known to exhibit persistent, stimulus-selective activity during the delay period of object-based working memory tasks. Frontal eye field (FEF) neurons show robust, spatially selective delay period activity during memory-guided saccade tasks. We present a copula regression paradigm to examine neural interaction of these two types of signals between areas IT and FEF of the monkey during a working memory task. This paradigm is based on copula models that can account for both marginal distribution over spiking activity of individual neurons within each area and joint distribution over ensemble activity of neurons between areas. Considering the popular GLMs as marginal models, we developed a general and flexible likelihood framework that uses the copula to integrate separate GLMs into a joint regression analysis. Such joint analysis essentially leads to a multivariate analog of the marginal GLM theory and hence efficient model estimation. In addition, we show that Granger causality between spike trains can be readily assessed via the likelihood ratio statistic. The performance of this method is validated by extensive simulations, and compared favorably to the widely used GLMs. When applied to spiking activity of simultaneously recorded FEF and IT neurons during working memory task, we observed significant Granger causality influence from FEF to IT, but not in the opposite direction, suggesting the role of the FEF in the selection and retention of visual information during working memory. The copula model has the potential to provide unique neurophysiological insights about network properties of the brain. PMID:26063909
Lamm, Steven H.; Ferdosi, Hamid; Dissen, Elisabeth K.; Li, Ji; Ahn, Jaeil
2015-01-01
High levels (> 200 µg/L) of inorganic arsenic in drinking water are known to be a cause of human lung cancer, but the evidence at lower levels is uncertain. We have sought the epidemiological studies that have examined the dose-response relationship between arsenic levels in drinking water and the risk of lung cancer over a range that includes both high and low levels of arsenic. Regression analysis, based on six studies identified from an electronic search, examined the relationship between the log of the relative risk and the log of the arsenic exposure over a range of 1–1000 µg/L. The best-fitting continuous meta-regression model was sought and found to be a no-constant linear-quadratic analysis where both the risk and the exposure had been logarithmically transformed. This yielded both a statistically significant positive coefficient for the quadratic term and a statistically significant negative coefficient for the linear term. Sub-analyses by study design yielded results that were similar for both ecological studies and non-ecological studies. Statistically significant X-intercepts consistently found no increased level of risk at approximately 100–150 µg/L arsenic. PMID:26690190
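The X-intercept of a no-constant linear-quadratic model on the log-log scale follows directly from its form. Writing ln RR = b1 ln x + b2 (ln x)^2, the modelled risk equals the baseline when ln x = 0 or when ln x = -b1 / b2, so a negative linear and a positive quadratic coefficient give an intercept above 1 µg/L. The sketch below uses made-up coefficients, not the fitted values from the meta-regression.

```python
import math

def log_rr(exposure, b_linear, b_quadratic):
    """No-constant linear-quadratic model on the log-log scale."""
    lx = math.log(exposure)
    return b_linear * lx + b_quadratic * lx ** 2

def x_intercept(b_linear, b_quadratic):
    """Non-trivial exposure at which the modelled log relative risk is zero."""
    return math.exp(-b_linear / b_quadratic)

# Hypothetical coefficients chosen for illustration, not the fitted values.
b1, b2 = -0.6, 0.15
x0 = x_intercept(b1, b2)
print(f"X-intercept: {x0:.1f} ug/L, log RR there: {log_rr(x0, b1, b2):.3f}")
```

Below the intercept the modelled log relative risk is negative and above it positive, which is how such a model can show no increased risk at lower exposures.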
Lin, Lixin; Wang, Yunjia; Teng, Jiyao; Wang, Xuchen
2016-02-01
Hyperspectral estimation of soil organic matter (SOM) in coal mining regions is an important tool for enhancing fertilization in soil restoration programs. The correlation-partial least squares regression (PLSR) method effectively solves the information loss problem of correlation-multiple linear stepwise regression, but results of the correlation analysis must be optimized to improve precision. This study considers the relationship between spectral reflectance and SOM based on spectral reflectance curves of soil samples collected from coal mining regions. Based on the major absorption troughs in the 400-1006 nm spectral range, PLSR analysis was performed using 289 independent bands of the second derivative (SDR) with three levels and measured SOM values. A wavelet-correlation-PLSR (W-C-PLSR) model was then constructed. By amplifying useful information that was previously obscured by noise, the W-C-PLSR model was optimal for estimating SOM content, with smaller prediction errors in both calibration (R² = 0.970, root mean square error (RMSEC) = 3.10, and mean relative error (MREC) = 8.75) and validation (RMSEV = 5.85 and MREV = 14.32) analyses, as compared with other models. Results indicate that W-C-PLSR has great potential to estimate SOM in coal mining regions. PMID:26780416
El-Ansary, Afaf
2016-06-01
This work presents data from multiple regression analysis of nine biomarkers related to glutamate excitotoxicity and impaired detoxification, two mechanisms recently recorded as autism phenotypes. The presented data were obtained by measuring a panel of markers in 20 autistic patients aged 3-15 years and 20 age- and gender-matched healthy controls. Levels of GSH, glutathione status (GSH/GSSG), glutathione reductase (GR), glutathione-s-transferase (GST), thioredoxin (Trx), thioredoxin reductase (TrxR) and peroxidoxins (Prxs I and III), glutamate, glutamine, glutamate/glutamine ratio, and glutamate dehydrogenase (GDH) in plasma and mercury (Hg) in red blood cells were determined in both groups. In multiple regression analysis, R² values, which describe the proportion or percentage of variance in the dependent variable attributed to the variance in the independent variables together, were calculated. Moreover, β coefficient values, which show the direction (positive or negative) and the contribution of each independent variable relative to the other independent variables in explaining the variation of the dependent variable, were determined. A panel of inter-related markers was recorded. This paper contains data related to and supporting research articles currently published entitled "Mechanism of nitrogen metabolism-related parameters and enzyme activities in the pathophysiology of autism" [1], "Novel metabolic biomarkers related to sulfur-dependent detoxification pathways in autistic patients of Saudi Arabia" [2], and "A key role for an impaired detoxification mechanism in the etiology and severity of autism spectrum disorders" [3]. PMID:26933667
Fiumera, Heather L.; Dunham, Maitreya J.; Saracco, Scott A.; Butler, Christine A.; Kelly, Jessica A.; Fox, Thomas D.
2009-01-01
Members of the Oxa1/YidC/Alb3 family of protein translocases are essential for assembly of energy-transducing membrane complexes. In Saccharomyces cerevisiae, Oxa1 and its paralog, Cox18, are required for assembly of Cox2, a mitochondrially encoded subunit of cytochrome c oxidase. Oxa1 is known to be required for cotranslational export of the Cox2 N-terminal domain across the inner mitochondrial membrane, while Cox18 is known to be required for post-translational export of the Cox2 C-tail domain. We find that overexpression of Oxa1 does not compensate for the absence of Cox18 at the level of respiratory growth. However, it does promote some translocation of the Cox2 C-tail domain across the inner membrane and causes increased accumulation of Cox2, which remains unassembled. This result suggests that Cox18 not only translocates the C-tail, but also must deliver it in a distinct state competent for cytochrome oxidase assembly. We identified respiring mutants from a cox18Δ strain overexpressing OXA1, whose respiratory growth requires overexpression of OXA1. The recessive nuclear mutations allow some assembly of Cox2 into cytochrome c oxidase. After failing to identify these mutations by methods based on transformation, we successfully located them to MGR1 and MGR3 by comparative hybridization to whole-genome tiling arrays and microarray-assisted bulk segregant analysis followed by linkage mapping. While Mgr1 and Mgr3 are known to associate with the Yme1 mitochondrial inner membrane i-AAA protease and to participate in membrane protein degradation, their absence does not appear to stabilize Cox2 under these conditions. Instead, Yme1 probably chaperones the folding and/or assembly of Oxa1-exported Cox2 in the absence of Mgr1 or Mgr3, since respiratory growth and cytochrome c oxidase assembly in a cox18 mgr3 double-mutant strain overexpressing OXA1 is YME1 dependent. PMID:19307606
NASA Astrophysics Data System (ADS)
Kozubek, M.; Rozanov, E.; Krizan, P.
2014-09-01
The stratosphere is influenced by many external forcings (natural or anthropogenic). Many studies have addressed this problem, which allows us to compare our results with theirs. This study focuses on the variability and trends of temperature and circulation characteristics (zonal and meridional wind components) in connection with the variation of different phenomena in the stratosphere and lower mesosphere. We consider the interactions between the troposphere-stratosphere-lower mesosphere system and external and internal phenomena, e.g. solar cycle, QBO, NAO or ENSO, using multiple linear techniques. The analysis was applied to the period 1979-2012 based on current reanalysis data, mainly the MERRA reanalysis dataset (Modern Era Retrospective-analysis for Research and Applications), for pressure levels 1000-0.1 hPa. We do not find a strong temperature signal for solar flux over the tropics at about 30 hPa (ERA-40 results), but a strong positive signal is observed near the stratopause over almost the whole analyzed area. This could indicate that solar forcing is not represented well at the higher pressure levels in MERRA. The analysis of ENSO and ENSO Modoki shows that more than one ENSO index should be taken into account in similar analyses. Previous studies show that volcanic activity is an important parameter. The signal of volcanic activity in MERRA is very weak and insignificant.
UNIPALS: SOFTWARE FOR PRINCIPAL COMPONENTS ANALYSIS AND PARTIAL LEAST SQUARES REGRESSION
Software for the analysis of multivariate chemical data by principal components and partial least squares methods is included on disk. The methods extract latent variables from the chemical data with the UNIversal PArtial Least Squares or UNIPALS algorithm. The software is written ...
2013-01-01
Background In recent years, there has been growing interest in measuring the efficiency of hospitals in Iran and several studies have been conducted on the topic. The main objective of this paper was to review studies in the field of hospital efficiency and examine the estimated technical efficiency (TE) of Iranian hospitals. Methods Persian and English databases were searched for studies related to measuring hospital efficiency in Iran. Ordinary least squares (OLS) regression models were applied for statistical analysis. The PRISMA guidelines were followed in the search process. Results A total of 43 efficiency scores from 29 studies were retrieved and used to approach the research question. Data envelopment analysis was the principal frontier efficiency method in the estimation of efficiency scores. The pooled estimate of mean TE was 0.846 (±0.134). There was a considerable variation in the efficiency scores between the different studies performed in Iran. There were no differences in efficiency scores between data envelopment analysis (DEA) and stochastic frontier analysis (SFA) techniques. The reviewed studies are generally similar and suffer from similar methodological deficiencies, such as no adjustment for case mix and quality of care differences. The results of OLS regression revealed that studies that included more variables and more heterogeneous hospitals generally reported higher TE. Larger sample size was associated with reporting lower TE. Conclusions The features of frontier-based techniques had a profound impact on the efficiency scores among Iranian hospital studies. These studies suffer from major methodological deficiencies and were of sub-optimal quality, limiting their validity and reliability. It is suggested that improving data collection and processing in Iranian hospital databases may have a substantial impact on promoting the quality of research in this field. PMID:23945011
Stepwise Regression Analysis of MDOE Balance Calibration Data Acquired at DNW
NASA Technical Reports Server (NTRS)
DeLoach, Richard; Philipsen, Iwan
2007-01-01
This paper reports a comparison of two experiment design methods applied in the calibration of a strain-gage balance. One features a 734-point test matrix in which loads are varied systematically according to a method commonly applied in aerospace research and known in the literature of experiment design as One Factor At a Time (OFAT) testing. Two variations of an alternative experiment design were also executed on the same balance, each with different features of an MDOE experiment design. The Modern Design of Experiments (MDOE) is an integrated process of experiment design, execution, and analysis applied at NASA's Langley Research Center to achieve significant reductions in cycle time, direct operating cost, and experimental uncertainty in aerospace research generally and in balance calibration experiments specifically. Personnel in the Instrumentation and Controls Department of the German Dutch Wind Tunnels (DNW) have applied MDOE methods to evaluate them in the calibration of a balance using an automated calibration machine. The data have been sent to Langley Research Center for analysis and comparison. This paper reports key findings from this analysis. The chief result is that a 100-point calibration exploiting MDOE principles delivered quality comparable to a 700+ point OFAT calibration with significantly reduced cycle time and attendant savings in direct and indirect costs. While the DNW test matrices implemented key MDOE principles and produced excellent results, additional MDOE concepts implemented in balance calibrations at Langley Research Center are also identified and described.
Decomposition of Variance for Spatial Cox Processes
Jalilian, Abdollah; Guan, Yongtao; Waagepetersen, Rasmus
2012-01-01
Spatial Cox point processes are a natural framework for quantifying the various sources of variation governing the spatial distribution of rain forest trees. We introduce a general criterion for variance decomposition for spatial Cox processes and apply it to specific Cox process models with additive or log linear random intensity functions. We moreover consider a new and flexible class of pair correlation function models given in terms of normal variance mixture covariance functions. The proposed methodology is applied to point pattern data sets of locations of tropical rain forest trees. PMID:23599558
NASA Astrophysics Data System (ADS)
Liu, Bilan; Qiu, Xing; Zhu, Tong; Tian, Wei; Hu, Rui; Ekholm, Sven; Schifitto, Giovanni; Zhong, Jianhui
2016-03-01
Subject-specific longitudinal DTI study is vital for investigation of pathological changes of lesions and disease evolution. Spatial Regression Analysis of Diffusion tensor imaging (SPREAD) is a non-parametric permutation-based statistical framework that combines spatial regression and resampling techniques to achieve effective detection of localized longitudinal diffusion changes within the whole brain at individual level without a priori hypotheses. However, boundary blurring and dislocation limit its sensitivity, especially towards detecting lesions of irregular shapes. In the present study, we propose an improved SPREAD (dubbed improved SPREAD, or iSPREAD) method by incorporating a three-dimensional (3D) nonlinear anisotropic diffusion filtering method, which provides edge-preserving image smoothing through a nonlinear scale space approach. The statistical inference based on iSPREAD was evaluated and compared with the original SPREAD method using both simulated and in vivo human brain data. Results demonstrated that the sensitivity and accuracy of the SPREAD method has been improved substantially by adapting nonlinear anisotropic filtering. iSPREAD identifies subject-specific longitudinal changes in the brain with improved sensitivity, accuracy, and enhanced statistical power, especially when the spatial correlation is heterogeneous among neighboring image pixels in DTI.
Juhasz, Albert L; Weber, John; Smith, Euan
2011-12-15
A number of in vitro assays are available for the determination of arsenic (As) bioaccessibility and prediction of As relative bioavailability (RBA) to quantify exposure for site-specific risk assessment. These data are usually considered in isolation; however, meta analysis may provide predictive capabilities for source-specific As bioaccessibility and RBA. The objectives of this study were to predict As RBA using previously published in vivo/in vitro correlations and to assess the influence of As sources on As RBA independent of geographical location. Data representing 351 soils (classified based on As source) and 514 independent bioaccessibility values were retrieved from the literature for comparison. Arsenic RBA was predicted using published in vivo/in vitro regression models, and 90th and 95th percentiles were determined for each As source classification and in vitro methodology. Differences in predicted mean As RBA were observed among soils contaminated from different As sources and within source materials when various in vitro methodologies were utilized. However, when in vitro data were standardized by transforming SBRC intestinal, IVG, and PBET data to SBRC gastric phase values (through linear regression models), predicted As RBA values for As sources followed the order CCA posts ≥ herbicide/pesticide > mining/smelting > gossan soils with 95th percentiles for predicted As RBA of 78.0, 78.4, 67.0, and 23.7%, respectively. PMID:22059522
Liu, Bilan; Qiu, Xing; Zhu, Tong; Tian, Wei; Hu, Rui; Ekholm, Sven; Schifitto, Giovanni; Zhong, Jianhui
2016-03-21
Subject-specific longitudinal DTI study is vital for investigation of pathological changes of lesions and disease evolution. Spatial Regression Analysis of Diffusion tensor imaging (SPREAD) is a non-parametric permutation-based statistical framework that combines spatial regression and resampling techniques to achieve effective detection of localized longitudinal diffusion changes within the whole brain at individual level without a priori hypotheses. However, boundary blurring and dislocation limit its sensitivity, especially towards detecting lesions of irregular shapes. In the present study, we propose an improved SPREAD (dubbed improved SPREAD, or iSPREAD) method by incorporating a three-dimensional (3D) nonlinear anisotropic diffusion filtering method, which provides edge-preserving image smoothing through a nonlinear scale space approach. The statistical inference based on iSPREAD was evaluated and compared with the original SPREAD method using both simulated and in vivo human brain data. Results demonstrated that the sensitivity and accuracy of the SPREAD method has been improved substantially by adapting nonlinear anisotropic filtering. iSPREAD identifies subject-specific longitudinal changes in the brain with improved sensitivity, accuracy, and enhanced statistical power, especially when the spatial correlation is heterogeneous among neighboring image pixels in DTI. PMID:26948513
Silva, Ana Elisa Pereira; Freitas, Corina da Costa; Dutra, Luciano Vieira; Molento, Marcelo Beltrão
2016-02-15
Fasciola hepatica is the causative agent of fasciolosis, a disease that triggers a chronic inflammatory process in the liver affecting mainly ruminants and other animals including humans. In Brazil, F. hepatica occurs in larger numbers in the southernmost state of Rio Grande do Sul. The objective of this study was to estimate areas at risk using an eight-year (2002-2010) time series of climatic and environmental variables that best relate to the disease, using a linear regression method applied to municipalities in the state of Rio Grande do Sul. The positivity index of the disease, which is the number of infected animals per slaughtered animal, was divided into three risk classes: low, medium and high. The accuracy of the known sample classification on the confusion matrix for the low, medium and high rates produced by the estimated model presented values between 39 and 88% depending on the year. The regression analysis showed the importance of the time-based data for the construction of the model, considering the two variables of the previous year of the event (positivity index and maximum temperature). The generated data are important for epidemiological and parasite control studies mainly because F. hepatica is an infection that can last from months to years. PMID:26827853
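The classification accuracy read from a confusion matrix is the share of samples that fall on its diagonal. The sketch below shows that calculation for three risk classes. The counts are hypothetical, and the study may have computed accuracy per class or per year in a different way.

```python
def overall_accuracy(confusion):
    """Share of samples on the diagonal of a square confusion matrix
    (rows: observed class, columns: predicted class)."""
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total

# Hypothetical counts for the low, medium and high risk classes.
confusion = [
    [30, 8, 2],
    [6, 20, 4],
    [1, 5, 24],
]
print(f"{overall_accuracy(confusion):.2%}")
```

Per-class accuracy can be obtained the same way by dividing each diagonal entry by its row total.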
Jansson, Bruce S; Nyamathi, Adeline; Heidemann, Gretchen; Duan, Lei; Kaplan, Charles
2015-01-01
Although literature documents the need for hospital social workers, nurses, and medical residents to engage in patient advocacy, little information exists about what predicts the extent they do so. This study aims to identify predictors of health professionals' patient advocacy engagement with respect to a broad range of patients' problems. A cross-sectional research design was employed with a sample of 94 social workers, 97 nurses, and 104 medical residents recruited from eight hospitals in Los Angeles. Bivariate correlations explored whether seven scales (Patient Advocacy Eagerness, Ethical Commitment, Skills, Tangible Support, Organizational Receptivity, Belief Other Professionals Engage, and Belief the Hospital Empowers Patients) were associated with patient advocacy engagement, measured by the validated Patient Advocacy Engagement Scale. Regression analysis examined whether these scales, when controlling for sociodemographic and setting variables, predicted patient advocacy engagement. While all seven predictor scales were significantly associated with patient advocacy engagement in correlational analyses, only Eagerness, Skills, and Belief the Hospital Empowers Patients predicted patient advocacy engagement in regression analyses. Additionally, younger professionals engaged in higher levels of patient advocacy than older professionals, and social workers engaged in greater patient advocacy than nurses. Limitations and the utility of these findings for acute-care hospitals are discussed. PMID:26317762
Zhu, Haogang; Russell, Richard A; Saunders, Luke J; Ceccon, Stefano; Garway-Heath, David F; Crabb, David P
2014-01-01
Visual fields measured with standard automated perimetry are a benchmark test for determining retinal function in ocular pathologies such as glaucoma. Their monitoring over time is crucial in detecting change in disease course and, therefore, in prompting clinical intervention and defining endpoints in clinical trials of new therapies. However, conventional change detection methods do not take into account non-stationary measurement variability or spatial correlation present in these measures. An inferential statistical model, denoted 'Analysis with Non-Stationary Weibull Error Regression and Spatial enhancement' (ANSWERS), was proposed. In contrast to commonly used ordinary linear regression models, which assume normally distributed errors, ANSWERS incorporates non-stationary variability modelled as a mixture of Weibull distributions. Spatial correlation of measurements was also included into the model using a Bayesian framework. It was evaluated using a large dataset of visual field measurements acquired from electronic health records, and was compared with other widely used methods for detecting deterioration in retinal function. ANSWERS was able to detect deterioration significantly earlier than conventional methods, at matched false positive rates. Statistical sensitivity in detecting deterioration was also significantly better, especially in short time series. Furthermore, the spatial correlation utilised in ANSWERS was shown to improve the ability to detect deterioration, compared to equivalent models without spatial correlation, especially in short follow-up series. ANSWERS is a new efficient method for detecting changes in retinal function. It allows for better detection of change, more efficient endpoints and can potentially shorten the time in clinical trials for new therapies. PMID:24465636
NASA Technical Reports Server (NTRS)
Gohil, B. S.; Hariharan, T. A.; Sharma, A. K.; Pandey, P. C.
1982-01-01
The 19.35 GHz and 22.235 GHz passive microwave radiometers (SAMIR) on board the Indian satellite Bhaskara have provided very useful data. These data have demonstrated the feasibility of deriving atmospheric and ocean surface parameters such as water vapor content, liquid water content, rainfall rate and ocean surface winds. Different approaches have been tried for deriving the atmospheric water content. Statistical and empirical methods have been used by others for the analysis of the Nimbus data. A simulation technique has been attempted for the first time for 19.35 GHz and 22.235 GHz radiometer data. The results obtained from three different methods are compared with radiosonde data. A case study of a tropical depression has been undertaken to demonstrate the capability of Bhaskara SAMIR data to show the variation of total water vapor and liquid water contents.
Lesterhuis, W. Joost; Rinaldi, Catherine; Jones, Anya; Rozali, Esdy N.; Dick, Ian M.; Khong, Andrea; Boon, Louis; Robinson, Bruce W.; Nowak, Anna K.; Bosco, Anthony; Lake, Richard A.
2015-01-01
Cancer immunotherapy has shown impressive results, but most patients do not respond. We hypothesized that the effector response in the tumour could be visualized as a complex network of interacting gene products and that by mapping this network we could predict effective pharmacological interventions. Here, we provide proof of concept for the validity of this approach in a murine mesothelioma model, which displays a dichotomous response to anti-CTLA4 immune checkpoint blockade. Network analysis of gene expression profiling data from responding versus non-responding tumours was employed to identify modules associated with response. Targeting the modules via selective modulation of hub genes or alternatively by using repurposed pharmaceuticals selected on the basis of their expression perturbation signatures dramatically enhanced the efficacy of CTLA4 blockade in this model. Our approach provides a powerful platform to repurpose drugs, and define contextually relevant novel therapeutic targets. PMID:26193793
Observational Studies: Matching or Regression?
Brazauskas, Ruta; Logan, Brent R
2016-03-01
In observational studies with an aim of assessing treatment effect or comparing groups of patients, several approaches could be used. Often, baseline characteristics of patients may be imbalanced between groups, and adjustments are needed to account for this. It can be accomplished either via appropriate regression modeling or, alternatively, by conducting a matched pairs study. The latter is often chosen because it makes groups appear to be comparable. In this article we considered these 2 options in terms of their ability to detect a treatment effect in time-to-event studies. Our investigation shows that a Cox regression model applied to the entire cohort is often a more powerful tool in detecting treatment effect as compared with a matched study. Real data from a hematopoietic cell transplantation study is used as an example. PMID:26712591
De la Cruz, Rolando; Meza, Cristian; Arribas-Gil, Ana; Carroll, Raymond J.
2016-01-01
Joint models for a wide class of response variables and longitudinal measurements consist of a mixed-effects model to fit longitudinal trajectories whose random effects enter as covariates in a generalized linear model for the primary response. They provide a useful way to assess association between these two kinds of data, which in clinical studies are often collected jointly on a series of individuals and may help in understanding, for instance, the mechanisms of recovery from a certain disease or the efficacy of a given therapy. When a nonlinear mixed-effects model is used to fit the longitudinal trajectories, the existing estimation strategies based on likelihood approximations have been shown to exhibit some computational efficiency problems (De la Cruz et al., 2011). In this article we consider a Bayesian estimation procedure for the joint model with a nonlinear mixed-effects model for the longitudinal data and a generalized linear model for the primary response. The proposed prior structure allows for the implementation of an MCMC sampler. Moreover, we consider that the errors in the longitudinal model may be correlated. We apply our method to the analysis of hormone levels measured at the early stages of pregnancy that can be used to predict normal versus abnormal pregnancy outcomes. We also conduct a simulation study to assess the importance of modelling correlated errors and quantify the consequences of model misspecification. PMID:27274601
Regression Analysis of Long-term Profile Ozone Data Set from BUV Instruments
NASA Technical Reports Server (NTRS)
Frith, Stacey; Taylor, Steve; DeLand, Matt; Ahn, Chang-Woo; Stolarski, Richard S.
2005-01-01
We have produced a profile merged ozone data set (MOD) based on the SBUV/SBUV2 series of nadir-viewing satellite backscatter instruments, covering the period from November 1978 to December 2003. In 2004, data from the Nimbus 7 SBUV and NOAA 9, 11, and 16 SBUV/2 instruments were reprocessed using the Version 8 (V8) algorithm and most recent calibrations. More recently, data from the Nimbus 4 BUV instrument, which operated from 1970 to 1977, were also reprocessed using the V8 algorithm. As part of the V8 profile calibration, the Nimbus 7 and NOAA 9 (1993-1997 only) instrument calibrations have been adjusted to match the NOAA 11 calibration, which was established from comparisons with SSBUV shuttle flight data. Given the level of agreement between the data sets, we simply average the ozone values during periods of instrument overlap to produce the MOD profile data set. We use statistical time-series analysis of the MOD profile data set (1978-2003) to estimate the change in profile ozone due to changing stratospheric chlorine levels. The Nimbus 4 BUV data offer an opportunity to test the physical properties of our statistical model. We extrapolate our statistical model fit backwards in time and compare to the Nimbus 4 data. We compare the statistics of the residuals from the fit for the Nimbus 4 period to those obtained from the 1978-2003 period over which the statistical model coefficients were estimated.
Cyclooxygenase (COX) Inhibitors and the Newborn Kidney
Smith, Francine G.; Wade, Andrew W.; Lewis, Megan L.; Qi, Wei
2012-01-01
This review summarizes our current understanding of the role of cyclo-oxygenase inhibitors (COXI) in influencing the structural development as well as the function of the developing kidney. COXI administered either during pregnancy or after birth can influence kidney development including nephronogenesis, and can decrease renal perfusion and ultrafiltration, potentially leading to acute kidney injury in the newborn period. To date, which COX isoform (COX-1 or COX-2) plays a more important role during fetal development and influences kidney function early in life is not known, though evidence points to a predominant role for COX-2. Clinical implications of the use of COXI in pregnancy and in the newborn infant are also evaluated herein, with specific reference to the potential effects of COXI on nephronogenesis as well as newborn kidney function. PMID:24281306
Genetic Deletion of COX-2 Diminishes VEGF Production in Mouse Retinal Müller Cells
Yanni, Susan E.; McCollum, Gary W.; Penn, John S.
2010-01-01
Non-steroidal anti-inflammatory drugs (NSAIDs), which inhibit COX activity, reduce the production of retinal VEGF and neovascularization in relevant models of ocular disease. We hypothesized that COX-2 mediates VEGF production in retinal Müller cells, one of its primary sources in retinal neovascular disease. The purpose of this study was to determine the role of COX-2 and its products in VEGF expression and secretion. These studies have more clearly defined the role of COX-2 and COX-2-derived prostanoids in retinal angiogenesis. Müller cells derived from wild-type and COX-2 null mice were exposed to hypoxia for 0–24 hours. COX-2 protein and activity were assessed by western blot analysis and GC-MS, respectively. VEGF production was assessed by ELISA. Wild-type mouse Müller cells were treated with vehicle (0.1% DMSO), 10 µM PGE2, or PGE2 + 5 µM H-89 (a PKA inhibitor), for 12 hours. VEGF production was assessed by ELISA. Hypoxia significantly increased COX-2 protein (p ≤ 0.05) and activity (p ≤ 0.05), and VEGF production (p ≤ 0.0003). COX-2 null Müller cells produced significantly less VEGF in response to hypoxia (p ≤ 0.05). Of the prostanoids, PGE2 was significantly increased by hypoxia (p ≤ 0.02). Exogenous PGE2 significantly increased VEGF production by Müller cells (p ≤ 0.0039), and this effect was inhibited by H-89 (p ≤ 0.055). These data demonstrate that hypoxia induces COX-2, prostanoid production, and VEGF synthesis in Müller cells, and that VEGF production is at least partially COX-2-dependent. Our study suggests that PGE2, signaling through the EP2 and/or EP4 receptor and PKA, mediates the VEGF response of Müller cells. PMID:20398651
Statistical methods for astronomical data with upper limits. II - Correlation and regression
NASA Technical Reports Server (NTRS)
Isobe, T.; Feigelson, E. D.; Nelson, P. I.
1986-01-01
Statistical methods for calculating correlations and regressions in bivariate censored data where the dependent variable can have upper or lower limits are presented. Cox's regression and the generalization of Kendall's rank correlation coefficient provide significant levels of correlations, and the EM algorithm, under the assumption of normally distributed errors, and its nonparametric analog using the Kaplan-Meier estimator, give estimates for the slope of a regression line. Monte Carlo simulations demonstrate that survival analysis is reliable in determining correlations between luminosities at different bands. Survival analysis is applied to CO emission in infrared galaxies, X-ray emission in radio galaxies, H-alpha emission in cooling cluster cores, and radio emission in Seyfert galaxies.
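The Kaplan-Meier estimator named in the abstract above can be illustrated with a minimal sketch. The function name `kaplan_meier` and the toy data are hypothetical and do not come from the paper. The sketch uses the right-censored convention; for upper limits (left-censored values) a common approach is to flip the sign of the measurements first, which is an assumption about the workflow rather than a description of the authors' code.

```python
from itertools import groupby


def kaplan_meier(times, observed):
    """Return [(t, S(t))] at each distinct event time.

    times    -- measured values (right-censored convention)
    observed -- True for a detection (event), False for a censored value
    """
    data = sorted(zip(times, observed))
    at_risk = len(data)
    survival = 1.0
    curve = []
    for t, group in groupby(data, key=lambda pair: pair[0]):
        group = list(group)
        events = sum(1 for _, seen in group if seen)
        if events:
            # Product-limit step: multiply by the fraction surviving at t.
            survival *= 1.0 - events / at_risk
            curve.append((t, survival))
        # Events and censored values both leave the risk set after t.
        at_risk -= len(group)
    return curve


# Hypothetical toy sample: five values, two of them censored.
curve = kaplan_meier([1, 2, 2, 3, 4], [True, True, False, True, False])
```

Censored values never produce a step in the curve; they only shrink the risk set used at later event times.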
Sander, R.K.; Quagliano, J.R.; Fry, H.
1997-08-01
Until recently, the use of lasers for long path absorption measurements has relied on using differential absorption at two wavelengths to look for one species at a time in the atmosphere. With the advent of multi-line CO2 lasers it is now feasible to generate 30 to 40 lines in a rapid burst to look for spectra of all the chemical species that may be present. Measurements have been made under relatively constant meteorological conditions in a summertime desert environment with a multi-line tunable laser. Multivariate regression analysis of these data shows that the spectra can be accurately fit using a small number of spectral factors or eigenvectors of the time dependent spectral data matrix. The factors can be rationalized in terms of lidar system effects and atmospheric composition changes.
Schut, Christina; Weik, Ulrike; Tews, Natalia; Gieler, Uwe; Deinzer, Renate; Kupfer, Jörg
2015-02-01
Even though it has been shown that stress and itch are associated in patients with atopic dermatitis (AD), it remains unclear whether this relationship occurs due to certain coping strategies being activated under stress. Therefore, this study investigates the role of coping as a possible mediating factor between stress and itch in 31 patients with AD. Coping and itch were assessed by self-reported measures, while stress was measured both by a validated questionnaire and by a physiological stress marker, the postawakening cortisol. Using a regression and a mediation analysis, this study showed a relationship between perceived stress and itch (corrected R2 = 0.21), which was fully mediated by negative itch-related cognitions. Negative itch-related cognitions explained 62.3% of the variance of itch intensity. This finding helps to explain the positive effects of cognitive restructuring in the treatment of chronic itch. PMID:25363422
NASA Astrophysics Data System (ADS)
Ivanov, A.; Voynikova, D.; Gocheva-Ilieva, S.; Kulina, H.; Iliev, I.
2015-10-01
The monitoring and control of air quality in urban areas is an important problem in many European countries. The main air pollutants are monitored, and a huge amount of data has been collected in recent years. In Bulgaria, air quality is surveyed by the official environmental agency, and exceedances of harmful pollutants are detected in many towns. The aim of this study is to investigate the pollution from 9 air pollutants in the town of Dimitrovgrad, Bulgaria, over a period of 5 years based on hourly data. Principal Component Analysis (PCA) is used to discover the patterns in the overall pollution and the contribution of the 9 pollutants. In addition, the Generalized Path Seeker (GPS) regularized regression method is applied to find the dependence of CO (carbon monoxide) on the other pollutants and 8 meteorological parameters. It is reported that CO concentrations occur at continuously repeated low levels that are very harmful to human health.
Brabant, Marie-Eve; Hébert, Martine; Chagnon, François
2013-01-01
This study explored the clinical profiles of 77 female teenage survivors of sexual abuse and examined the association of abuse-related and personal variables with suicidal ideations. Analyses revealed that 64% of participants experienced suicidal ideations. Findings from classification and regression tree analysis indicated that depression, posttraumatic stress symptoms, and hopelessness discriminated profiles of suicidal and nonsuicidal survivors. The elevated prevalence of suicidal ideations among adolescent survivors of sexual abuse underscores the importance of investigating the presence of suicidal ideations in sexual abuse survivors. However, suicidal ideation is not the sole variable that needs to be investigated; depression, hopelessness and posttraumatic stress symptoms are also related to suicidal ideations in survivors and could therefore guide interventions. PMID:23428149
NASA Technical Reports Server (NTRS)
Barrett, C. A.
1985-01-01
Multiple linear regression analysis was used to determine an equation for estimating hot corrosion attack for a series of Ni base cast turbine alloys. The U transform (i.e., 1/sin (% A/100) to the 1/2) was shown to give the best estimate of the dependent variable, y. A complete second degree equation is described for the "centered" weight chemistries for the elements Cr, Al, Ti, Mo, W, Cb, Ta, and Co. In addition, linear terms for the minor elements C, B, and Zr were added for a basic 47 term equation. The best reduced equation, with essentially 13 terms, was determined by the stepwise selection method. The Cr term was found to be the most important, accounting for 60 percent of the explained variability in hot corrosion attack.
2014-01-01
Sales forecasting plays an important role in operating a business since it can be used to determine the required inventory level to meet consumer demand and avoid the problem of under/overstocking. Improving the accuracy of sales forecasting has therefore become an important issue in operating a business. This study proposes a hybrid sales forecasting scheme by combining independent component analysis (ICA) with K-means clustering and support vector regression (SVR). The proposed scheme first uses ICA to extract hidden information from the observed sales data. The extracted features are then passed to the K-means algorithm to cluster the sales data into several disjoint clusters. Finally, SVR forecasting models are applied to each cluster to generate the final forecasting results. Experimental results from information technology (IT) product agent sales data reveal that the proposed sales forecasting scheme outperforms the three comparison models and hence provides an efficient alternative for sales forecasting. PMID:25045738
Albek, E.
1999-12-01
Chloride-discharge relationships at several stations on Turkish streams are investigated, both qualitatively and quantitatively, to identify natural and anthropogenic sources of chloride. Simple expressions are used to distinguish among sources. Linear regression analysis is conducted to estimate parameters of the models. Five groups of stations are distinguished respective to different sources of chloride and change of chloride concentration with stream discharge. Emphasis is placed on the identification of anthropogenic sources of chloride to aid in water pollution control strategies. The polluted Sakarya River and its primary tributary, the Porsuk Stream, are studied in detail to trace chloride behavior along the waterway and to assess the level of pollution from cities discharging to the streams. Among natural sources of chloride, evaporite sediment sources are examined in detail.
NASA Astrophysics Data System (ADS)
Mehrjoo, Saeed; Bashiri, Mahdi
2013-05-01
Production planning and control (PPC) systems have to deal with rising complexity and dynamics. The complexity of planning tasks is due to multiple variables and dynamic factors derived from the uncertainties surrounding the PPC. Although the literature on exact scheduling algorithms, simulation approaches, and heuristic methods in production planning is extensive, these methods seem to be inefficient because of daily fluctuations in real factories. Decision support systems can provide productive tools that help production planners reach feasible and prompt decisions for effective and robust production planning. In this paper, we propose a robust decision support tool for detailed production planning based on statistical multivariate methods including principal component analysis and logistic regression. The proposed approach has been used in a real case in the Iranian automotive industry. In the presence of existing multisource uncertainties, the results of applying the proposed method in the selected case show that the accuracy of daily production planning increases in comparison with the existing method.
Şenel, Talat; Cengiz, Mehmet Ali
2016-01-01
In today's world, public expenditures on health are one of the most important issues for governments. These increased expenditures are putting pressure on public budgets. Therefore, health policy makers have focused on the performance of their health systems, and many countries have introduced reforms to improve it. This study investigates the most important determinants of healthcare efficiency for OECD countries using a second-stage approach for Bayesian Stochastic Frontier Analysis (BSFA). There are two steps in this study. First, we measure the healthcare efficiency of 29 OECD countries by BSFA using data from the OECD Health Database. In the second stage, we examine the multiple relationships between healthcare efficiency and the characteristics of healthcare systems across OECD countries using Bayesian beta regression. PMID:27118987
Li, Zhongwei; Xin, Yuezhen; Wang, Xun; Sun, Beibei; Xia, Shengyu; Li, Hui; Zhu, Hu
2016-01-01
Phellinus is a kind of fungus known as one of the elemental components in drugs used to prevent cancers. With the purpose of finding optimized culture conditions for Phellinus production in the laboratory, many single-factor experiments were carried out and a large volume of experimental data was generated. In this work, we use the data collected from these experiments for regression analysis, and obtain a mathematical model for predicting Phellinus production. Subsequently, a gene-set based genetic algorithm is developed to optimize the values of parameters involved in culture conditions, including inoculum size, pH value, initial liquid volume, temperature, seed age, fermentation time, and rotation speed. The optimized parameter values are in accordance with biological experimental results, which indicates that our method has good predictive power for culture condition optimization. PMID:27610365
NASA Astrophysics Data System (ADS)
Sethuramalingam, Prabhu; Vinayagam, Babu Kupusamy
2016-05-01
A carbon nanotube mixed grinding wheel is used in the grinding process to analyze the surface characteristics of AISI D2 tool steel material. Until now, no work has been carried out using a carbon nanotube based grinding wheel. A carbon nanotube based grinding wheel has excellent thermal conductivity and good mechanical properties, which are used to improve the surface finish of the workpiece. In the present study, the multi response optimization of process parameters like surface roughness and metal removal rate of the grinding process of single wall carbon nanotube (CNT) in mixed cutting fluids is undertaken using an orthogonal array with grey relational analysis. Experiments are performed with designated grinding conditions obtained using the L9 orthogonal array. Based on the results of the grey relational analysis, a set of optimum grinding parameters is obtained. Using the analysis of variance approach, the significant machining parameters are found. An empirical model for the prediction of output parameters has been developed using regression analysis, and the results are compared empirically for conditions with and without the CNT grinding wheel in the grinding process.
NASA Astrophysics Data System (ADS)
Sethuramalingam, Prabhu; Vinayagam, Babu Kupusamy
2016-07-01
A carbon nanotube mixed grinding wheel is used in the grinding process to analyze the surface characteristics of AISI D2 tool steel material. Until now, no work has been carried out using a carbon nanotube based grinding wheel. A carbon nanotube based grinding wheel has excellent thermal conductivity and good mechanical properties, which are used to improve the surface finish of the workpiece. In the present study, the multi response optimization of process parameters like surface roughness and metal removal rate of the grinding process of single wall carbon nanotube (CNT) in mixed cutting fluids is undertaken using an orthogonal array with grey relational analysis. Experiments are performed with designated grinding conditions obtained using the L9 orthogonal array. Based on the results of the grey relational analysis, a set of optimum grinding parameters is obtained. Using the analysis of variance approach, the significant machining parameters are found. An empirical model for the prediction of output parameters has been developed using regression analysis, and the results are compared empirically for conditions with and without the CNT grinding wheel in the grinding process.
COX-2 gene dosage-dependent defects in kidney development.
Slattery, Patrick; Frölich, Stefanie; Schreiber, Yannik; Nüsing, Rolf M
2016-05-15
Deletion of cyclooxygenase (COX)-2 causes impairment of kidney development, including hypotrophic glomeruli and cortical thinning. A critical role for COX-2 is seen 4-8 days postnatally. The present study was aimed at answering whether different COX-2 gene dosage and partial pharmacological COX-2 inhibition impair kidney development. We studied kidney development in COX-2(+/+), COX-2(+/-), and COX-2(-/-) mice as well as in C57Bl6 mice treated postnatally with low (5 mg·kg(-1)·day(-1)) and high (10 mg·kg(-1)·day(-1)) doses of the selective COX-2 inhibitor SC-236. COX-2(+/-) mice exhibit impaired kidney development leading to reduced glomerular size but, in contrast to COX-2(-/-) mice, only marginal cortical thinning. Moreover, in COX-2(+/-) and COX-2(-/-) kidneys, juxtamedullary glomeruli, which develop in the very early stages of nephrogenesis, also showed a size reduction. In COX-2(+/-) kidneys at the age of 8 days, we observed significantly less expression of COX-2 mRNA and protein and less PGE2 and PGI2 synthetic activity compared with COX-2(+/+) kidneys. The renal defects in COX-2(-/-) and COX-2(+/-) kidneys could be mimicked by high and low doses of SC-236, respectively. In aged COX-2(+/-) kidneys, glomerulosclerosis was observed; however, in contrast to COX-2(-/-) kidneys, periglomerular fibrosis was absent. COX-2(+/-) mice showed signs of kidney insufficiency, demonstrated by enhanced serum creatinine levels, quite similar to COX-2(-/-) mice, but, in contrast, serum urea remained at the control level. In summary, function of both COX-2 gene alleles is absolutely necessary to ensure physiological development of the mouse kidney. Loss of one copy of the COX-2 gene or partial COX-2 inhibition is associated with distinct renal damage and reduced kidney function. PMID:26984955
Long, Nguyen Phuoc; Huy, Nguyen Tien; Trang, Nguyen Thi Huyen; Luan, Nguyen Thien; Anh, Nguyen Hoang; Nghi, Tran Diem; Hieu, Mai Van; Hirayama, Kenji; Karbwang, Juntra
2014-01-01
BACKGROUND: Ethics is one of the main pillars in the development of science. We performed a JoinPoint regression analysis to analyze the trends of ethical issue research over the past half century. The question is whether ethical issues are neglected despite their importance in modern research. METHOD: The PubMed electronic library was used to retrieve publications of all fields and ethical issues. JoinPoint regression analysis was used to identify the significant time trends of publications of all fields and ethical issues, as well as the proportion of publications on ethical issues to all fields over the past half century. Annual percent changes (APC) were computed with their 95% confidence intervals, and a p-value < 0.05 was considered statistically significant. RESULTS: We found that publications on ethical issues increased during the period of 1965–1996 but slightly fell in recent years (from 1996 to 2013). When comparing the absolute number of ethics related articles (APEI) to all publications of all fields (APAF) on PubMed, the results showed that the proportion of APEI to APAF statistically increased during the periods of 1965–1974, 1974–1986, and 1986–1993, with APCs of 11.0, 2.1, and 8.8, respectively. However, the trend has gradually dropped since 1993 and shown a marked decrease from 2002 to 2013 with an annual percent change of –7.4%. CONCLUSIONS: Scientific productivity in ethical issues research over the past half century rapidly increased during the first 30-year period but has recently been in decline. Since ethics is an important aspect of scientific research, we suggest that greater attention is needed in order to emphasize the role of ethics in modern research. PMID:25324690
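The annual percent change (APC) reported in the abstract above is conventionally derived from the slope of a log-linear fit within one trend segment: if ln(y) = a + b·year, then APC = 100·(exp(b) − 1). The sketch below shows only that step, under stated assumptions: the function name `annual_percent_change` is hypothetical, the counts are synthetic (not PubMed data), and the breakpoint search that joinpoint software performs is not reproduced.

```python
import math


def annual_percent_change(years, counts):
    """APC for a single segment from a least-squares fit of ln(count) on year."""
    n = len(years)
    logs = [math.log(c) for c in counts]
    mean_x = sum(years) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, logs))
    slope = sxy / sxx
    # A slope b on the log scale corresponds to a yearly factor of exp(b).
    return 100.0 * (math.exp(slope) - 1.0)


# Synthetic series growing by exactly 11% per year (illustrative only).
years = list(range(1965, 1975))
counts = [100.0 * 1.11 ** (y - 1965) for y in years]
apc = annual_percent_change(years, counts)
```

On this synthetic series the function recovers the 11% yearly growth that was built into the data.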
Batson, Sarah; Sutton, Alex; Abrams, Keith
2016-01-01
Background: Patients with atrial fibrillation are at a greater risk of stroke, and therefore the main goal of treatment for patients with atrial fibrillation is to prevent stroke from occurring. There are a number of different stroke prevention treatments available, including warfarin and novel oral anticoagulants. Previous network meta-analyses of novel oral anticoagulants for stroke prevention in atrial fibrillation acknowledge the limitation of heterogeneity across the included trials but have not explored the impact of potentially important treatment modifying covariates. Objectives: To explore potentially important treatment modifying covariates using network meta-regression analyses for stroke prevention in atrial fibrillation. Methods: We performed a network meta-analysis for the outcome of ischaemic stroke and conducted an exploratory regression analysis considering potentially important treatment modifying covariates. These covariates included the proportion of patients with a previous stroke, proportion of males, mean age, the duration of study follow-up and the patients' underlying risk of ischaemic stroke. Results: None of the covariates explored impacted relative treatment effects relative to placebo. Notably, the exploration of 'study follow-up' as a covariate supported the assumption that difference in trial durations is unimportant in this indication despite the variation across trials in the network. Conclusion: This study is limited by the quantity of data available. Further investigation is warranted, and, as justifying further trials may be difficult, it would be desirable to obtain individual patient level data (IPD) to facilitate an effort to relate treatment effects to IPD covariates in order to investigate heterogeneity. Observational data could also be examined to establish if there are potential trends elsewhere. The approach and methods presented have potentially wide applications within any indication as to highlight the potential benefit
NASA Astrophysics Data System (ADS)
Bell, A. L.; Moore, J. N.; Greenwood, M. C.
2007-12-01
The Flathead River in Northwestern Montana drains the relatively pristine, high-mountain watersheds of Glacier-Waterton national parks and large wilderness areas, making it an excellent test-bed for hydrologic response to climate change. Flows in the North Fork and Middle Fork of the Flathead River are relatively unmodified by humans, whereas the South Fork has a large hydroelectric reservoir (Hungry Horse) in the lower end of the basin. USGS stream gage data for the North, Middle and South forks from 1940 to 2006 were analyzed for significant trends in the timing of quantiles of flow to examine climate forcing vs. direct modification of flow from the dam. The trends in timing were analyzed for climate change influences using the PRISM model output for 1940 to 2006 for the respective basin. The analysis of trends in timing employed two linear regression methods, typical least squares estimation and robust estimation using weighted least squares. Least squares estimation is the standard method employed when performing regression analysis. The power of this method is sensitive to the violation of the assumptions of normally distributed errors with constant variance (homoscedasticity). Considering that violations of these assumptions are common in hydrologic data, robust estimation was used to preserve the desired statistical power because it is not significantly affected by non-normality or heteroscedasticity. Trends that were found to be significant with least squares estimation, using a 10% significance level, were typically not significant using a robust estimation method. This could have implications for interpreting the meaning of significant trends found using the least squares estimator. Utilizing robust estimation methods for analyzing hydrologic data may allow investigators to more accurately summarize any trends.
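The contrast drawn above between ordinary least squares and robust estimation by weighted least squares can be sketched as follows. The abstract does not say which robust estimator was used, so the Huber weights, the tuning constant 1.345, the function names, and the synthetic data with one outlier are all assumptions chosen for illustration.

```python
import statistics


def weighted_line_fit(x, y, w):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return my - b * mx, b


def huber_line_fit(x, y, k=1.345, iterations=50):
    """Robust fit by iteratively reweighted least squares with Huber weights."""
    w = [1.0] * len(x)
    a, b = weighted_line_fit(x, y, w)
    for _ in range(iterations):
        resid = [yi - (a + b * xi) for xi, yi in zip(x, y)]
        # Robust scale: median absolute residual, rescaled for normal errors.
        scale = statistics.median(abs(r) for r in resid) / 0.6745
        if scale == 0:
            break
        # Residuals beyond k*scale are down-weighted; the rest keep weight 1.
        w = [1.0 if abs(r) <= k * scale else k * scale / abs(r) for r in resid]
        a, b = weighted_line_fit(x, y, w)
    return a, b


# Synthetic trend with true slope 0.5 and a single gross outlier at the end.
x = list(range(10))
y = [2.0 + 0.5 * xi for xi in x]
y[9] += 20.0
ols_slope = weighted_line_fit(x, y, [1.0] * len(x))[1]
robust_slope = huber_line_fit(x, y)[1]
```

In this toy case the single outlier pulls the ordinary least squares slope far from 0.5, while the reweighted fit stays close to it, which is the kind of disagreement between the two estimators that the abstract describes.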
Quality Reporting of Multivariable Regression Models in Observational Studies
Real, Jordi; Forné, Carles; Roso-Llorach, Albert; Martínez-Sánchez, Jose M.
2016-01-01
Abstract Controlling for confounders is a crucial step in analytical observational studies, and multivariable models are widely used as statistical adjustment techniques. However, the validation of the assumptions of the multivariable regression models (MRMs) should be made clear in scientific reporting. The objective of this study is to review the quality of statistical reporting of the most commonly used MRMs (logistic, linear, and Cox regression) that were applied in analytical observational studies published between 2003 and 2014 by journals indexed in MEDLINE. Review of a representative sample of articles indexed in MEDLINE (n = 428) with observational design and use of MRMs (logistic, linear, and Cox regression). We assessed the quality of reporting about: model assumptions and goodness-of-fit, interactions, sensitivity analysis, crude and adjusted effect estimate, and specification of more than 1 adjusted model. The tests of underlying assumptions or goodness-of-fit of the MRMs used were described in 26.2% (95% CI: 22.0–30.3) of the articles and 18.5% (95% CI: 14.8–22.1) reported the interaction analysis. Reporting of all items assessed was higher in articles published in journals with a higher impact factor. A low percentage of articles indexed in MEDLINE that used multivariable techniques provided information demonstrating rigorous application of the model selected as an adjustment method. Given the importance of these methods to the final results and conclusions of observational studies, greater rigor is required in reporting the use of MRMs in the scientific literature. PMID:27196467
Viscum album-Mediated COX-2 Inhibition Implicates Destabilization of COX-2 mRNA
Saha, Chaitrali; Hegde, Pushpa; Friboulet, Alain; Bayry, Jagadeesh; Kaveri, Srinivas V.
2015-01-01
Extensive use of Viscum album (VA) preparations in the complementary therapy of cancer and in several other human pathologies has led to an increasing number of cellular and molecular approaches to explore the mechanisms of action of VA. We have recently demonstrated that VA preparations exert a potent anti-inflammatory effect by selectively down-regulating the COX-2-mediated cytokine-induced secretion of prostaglandin E2 (PGE2), one of the important molecular signatures of inflammatory reactions. In this study, we observed a significant down-regulation of COX-2 protein expression in VA-treated A549 cells; however, COX-2 mRNA levels were unaltered. Therefore, we hypothesized that VA induces destabilisation of COX-2 mRNA, thereby depleting the available functional COX-2 mRNA for protein synthesis and for the subsequent secretion of PGE2. To address this question, we analyzed the molecular degradation of COX-2 protein and its corresponding mRNA in the A549 cell line. Using a cycloheximide pulse-chase experiment, we demonstrate that COX-2 protein degradation is not affected by treatment with VA, whereas experiments on transcriptional blockade with actinomycin D revealed a marked reduction in the half-life of COX-2 mRNA due to its rapid degradation in the cells treated with VA compared to that in IL-1β-stimulated cells. These results thus demonstrate that VA-mediated inhibition of PGE2 implicates destabilization of COX-2 mRNA. PMID:25664986
NASA Astrophysics Data System (ADS)
Belashchenko, Kirill; Antropov, Vladimir
2015-03-01
We describe a first-principles code and a set of tools providing detailed information about the mechanisms of the magnetocrystalline anisotropy (MCA) in alloys. The spin-orbit coupling (SOC) is included in the Green's function-based linear muffin-tin orbital (LMTO) method combined with the coherent potential approximation. Third-order correspondence with the LMTO Hamiltonian is formally demonstrated. The analysis tools include the identification of contributions from different spin channels, single-ion and two-ion terms and alloy components by computing the SOC energy with scaled SOC parameters, as well as a full reciprocal-space resolution of MCA in the Brillouin zone. Application of these tools is illustrated for the (Fe1-xCox)2B system, where the complicated non-monotonic concentration dependence of MCA is attributed to the combination of band filling and SOC selection rules. For Li3-xFexN we demonstrate the interplay between chemical disorder, orbital polarization, and correlation effects in a doubly degenerate impurity band. Work at UNL supported by NSF Grant DMR-1308751.
Sharon Falcone Miller; Bruce G. Miller
2007-12-15
This paper compares the emissions factors for a suite of liquid biofuels (three animal fats, waste restaurant grease, pressed soybean oil, and a biodiesel produced from soybean oil) and four fossil fuels (i.e., natural gas, No. 2 fuel oil, No. 6 fuel oil, and pulverized coal) in Penn State's commercial water-tube boiler to assess their viability as fuels for green heat applications. The data were broken into two subsets, i.e., fossil fuels and biofuels. The regression model for the liquid biofuels (as a subset) did not perform well for all of the gases. In addition, the coefficient in the models showed the EPA method underestimating CO and NOx emissions. No relation could be studied for SO2 for the liquid biofuels as they contain no sulfur; however, the model showed a good relationship between the two methods for SO2 in the fossil fuels. AP-42 emissions factors for the fossil fuels were also compared to the mass balance emissions factors and EPA CFR Title 40 emissions factors. Overall, the AP-42 emissions factors for the fossil fuels did not compare well with the mass balance emissions factors or the EPA CFR Title 40 emissions factors. Regression analysis of the AP-42, EPA, and mass balance emissions factors for the fossil fuels showed a significant relationship only for CO2 and SO2. However, the regression models underestimate the SO2 emissions by 33%. These tests illustrate the importance of performing material balances around boilers to obtain the most accurate emissions levels, especially when dealing with biofuels. The EPA emissions factors were very good at predicting the mass balance emissions factors for the fossil fuels and to a lesser degree the biofuels. While the AP-42 emissions factors and EPA CFR Title 40 emissions factors are easier to perform, especially in large, full-scale systems, this study illustrated the shortcomings of estimation techniques. 23 refs., 3 figs., 8 tabs.
NASA Astrophysics Data System (ADS)
Morandi, Maria T.; Daisey, Joan M.; Lioy, Paul J.
A modified factor analysis/multiple regression (FA/MR) receptor-oriented source apportionment model has been developed which permits application of FA/MR statistical methods when some of the tracers are not unique to an individual source type. The new method uses factor and regression analyses to apportion non-unique tracer ambient concentrations in situations where there are unique tracers for all sources contributing to the non-unique tracer except one, and ascribes the residual concentration to that source. This value is then used as the source tracer in the final FA/MR apportionment model for ambient particulate matter. In addition, factor analyses results are complemented with examination of regression residuals in order to optimize the number of identifiable sources. The new method has been applied to identify and apportion the sources of inhalable particulate matter (IPM; D50 ≤ 15 μm), Pb and Fe at a site in Newark, NJ. The model indicated that sulfate/secondary aerosol contributed an average of 25.8 μg m⁻³ (48%) to IPM concentrations, followed by soil resuspension (8.2 μg m⁻³ or 15%), paint spraying/paint pigment (6.7 μg m⁻³ or 13%), fuel oil burning/space heating (4.3 μg m⁻³ or 8%), industrial emissions (3.6 μg m⁻³ or 7%) and motor vehicle exhaust (2.7 μg m⁻³ or 15%). Contributions to ambient Pb concentrations were: motor vehicle exhaust (0.16 μg m⁻³ or 36%), soil resuspension (0.10 μg m⁻³ or 24%), fuel oil burning/space heating (0.08 μg m⁻³ or 18%), industrial emissions (0.07 μg m⁻³ or 17%), paint spraying/paint pigment (0.036 μg m⁻³ or 9%) and zinc related sources (0.022 μg m⁻³ or 5%). Contributions to ambient Fe concentrations were: soil resuspension (0.43 μg m⁻³ or 51%), paint spraying/paint pigment (0.28 μg m⁻³ or 33%) and industrial emissions (0.15 μg m⁻³ or 18%). The models were validated by comparing partial source profiles calculated from modeling results with the corresponding published source emissions composition.
Levy, Jonathan I; Clougherty, Jane E; Baxter, Lisa K; Houseman, E Andres; Paciorek, Christopher J
2010-12-01
Previous studies have identified associations between traffic exposures and a variety of adverse health effects, but many of these studies relied on proximity measures rather than measured or modeled concentrations of specific air pollutants, complicating interpretability of the findings. An increasing number of studies have used land-use regression (LUR) or other techniques to model small-scale variability in concentrations of specific air pollutants. However, these studies have generally considered a limited number of pollutants, focused on outdoor concentrations (or indoor concentrations of ambient origin) when indoor concentrations are better proxies for personal exposures, and have not taken full advantage of statistical methods for source apportionment that may have provided insight about the structure of the LUR models and the interpretability of model results. Given these issues, the primary objective of our study was to determine predictors of indoor and outdoor residential concentrations of multiple traffic-related air pollutants within an urban area, based on a combination of central site monitoring data; geographic information system (GIS) covariates reflecting traffic and other outdoor sources; questionnaire data reflecting indoor sources and activities that affect ventilation rates; and factor-analytic methods to better infer source contributions. As part of a prospective birth cohort study assessing asthma etiology in urban Boston, we collected indoor and/or outdoor 3-to-4 day samples of nitrogen dioxide (NO2) and fine particulate matter with an aerodynamic diameter ≤ 2.5 μm (PM2.5) at 44 residences during multiple seasons of the year from 2003 through 2005. We performed reflectance analysis, x-ray fluorescence spectroscopy (XRF), and high-resolution inductively coupled plasma-mass spectrometry (ICP-MS) on particle filters to estimate the concentrations of elemental carbon (EC), trace elements, and water-soluble metals, respectively. We derived
Zhu, Haogang; Russell, Richard A.; Saunders, Luke J.; Ceccon, Stefano; Garway-Heath, David F.; Crabb, David P.
2014-01-01
Visual fields measured with standard automated perimetry are a benchmark test for determining retinal function in ocular pathologies such as glaucoma. Their monitoring over time is crucial in detecting change in disease course and, therefore, in prompting clinical intervention and defining endpoints in clinical trials of new therapies. However, conventional change detection methods do not take into account non-stationary measurement variability or spatial correlation present in these measures. An inferential statistical model, denoted ‘Analysis with Non-Stationary Weibull Error Regression and Spatial enhancement’ (ANSWERS), was proposed. In contrast to commonly used ordinary linear regression models, which assume normally distributed errors, ANSWERS incorporates non-stationary variability modelled as a mixture of Weibull distributions. Spatial correlation of measurements was also included into the model using a Bayesian framework. It was evaluated using a large dataset of visual field measurements acquired from electronic health records, and was compared with other widely used methods for detecting deterioration in retinal function. ANSWERS was able to detect deterioration significantly earlier than conventional methods, at matched false positive rates. Statistical sensitivity in detecting deterioration was also significantly better, especially in short time series. Furthermore, the spatial correlation utilised in ANSWERS was shown to improve the ability to detect deterioration, compared to equivalent models without spatial correlation, especially in short follow-up series. ANSWERS is a new efficient method for detecting changes in retinal function. It allows for better detection of change, more efficient endpoints and can potentially shorten the time in clinical trials for new therapies. PMID:24465636
Jamshidi, S.; Yadollahi, A.; Ahmadi, H.; Arab, M. M.; Eftekhari, M.
2016-01-01
Two modeling techniques [artificial neural network-genetic algorithm (ANN-GA) and stepwise regression analysis] were used to predict the effect of medium macro-nutrients on in vitro performance of pear rootstocks (OHF and Pyrodwarf). The ANN-GA described associations between investigating eight macronutrients (NO3-, NH4+, Ca2+, K+, Mg2+, PO42-, SO42-, and Cl−) and explant growth parameters [proliferation rate (PR), shoot length (SL), shoot tip necrosis (STN), chlorosis (Chl), and vitrification (Vitri)]. ANN-GA revealed a substantially higher accuracy of prediction than for regression models. According to the ANN-GA results, among the input variables concentrations (mM), NH4+ (301.7), and NO3-, NH4+ (64), SO42- (54.1), K+ (40.4), and NO3- (35.1) in OHF and Ca2+ (23.7), NH4+ (10.7), NO3- (9.1), NH4+ (317.6), and NH4+ (79.6) in Pyrodwarf had the highest values of VSR in data set, respectively, for PR, SL, STN, Chl, and Vitri. The ANN-GA showed that media containing (mM) 62.5 NO3-, 5.7 NH4+, 2.7 Ca2+, 31.5 K+, 3.3 Mg2+, 2.6 PO42-, 5.6 SO42-, and 3.5 Cl− could lead to optimal PR for OHF and optimal PR for Pyrodwarf may be obtained with media containing 25.6 NO3-, 13.1 NH4+, 5.5 Ca2+, 35.7 K+, 1.5 Mg2+, 2.1 PO42-, 3.6 SO42-, and 3 Cl−. PMID:27066013
Strong, Mark; Oakley, Jeremy E; Brennan, Alan; Breeze, Penny
2015-07-01
Health economic decision-analytic models are used to estimate the expected net benefits of competing decision options. The true values of the input parameters of such models are rarely known with certainty, and it is often useful to quantify the value to the decision maker of reducing uncertainty through collecting new data. In the context of a particular decision problem, the value of a proposed research design can be quantified by its expected value of sample information (EVSI). EVSI is commonly estimated via a 2-level Monte Carlo procedure in which plausible data sets are generated in an outer loop, and then, conditional on these, the parameters of the decision model are updated via Bayes rule and sampled in an inner loop. At each iteration of the inner loop, the decision model is evaluated. This is computationally demanding and may be difficult if the posterior distribution of the model parameters conditional on sampled data is hard to sample from. We describe a fast nonparametric regression-based method for estimating per-patient EVSI that requires only the probabilistic sensitivity analysis sample (i.e., the set of samples drawn from the joint distribution of the parameters and the corresponding net benefits). The method avoids the need to sample from the posterior distributions of the parameters and avoids the need to rerun the model. The only requirement is that sample data sets can be generated. The method is applicable with a model of any complexity and with any specification of model parameter distribution. We demonstrate in a case study the superior efficiency of the regression method over the 2-level Monte Carlo method. PMID:25810269
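The regression-based idea described above can be sketched in a few lines: simulate one data set per probabilistic sensitivity analysis (PSA) draw, reduce it to a summary statistic, regress each option's net benefit on that statistic, and average the maximum of the fitted values. The sketch below is illustrative only. The one-parameter decision model, the study design, and the cubic polynomial (standing in for the nonparametric regression the abstract mentions) are all assumptions, not the authors' case study.

```python
import numpy as np

rng = np.random.default_rng(1)
n_psa = 20000

# PSA sample for a hypothetical one-parameter model:
# theta is an incremental effect; option 0 is the comparator
theta = rng.normal(0.1, 0.5, size=n_psa)
nb = np.column_stack([np.zeros(n_psa), 10000.0 * theta])  # net benefit per option

# One simulated data set per PSA draw, reduced to its sample mean
# (hypothetical study: 50 observations with standard deviation 2)
n_study, sd_obs = 50, 2.0
xbar = rng.normal(theta, sd_obs / np.sqrt(n_study))

# Regress each option's net benefit on the summary statistic.
# np.vander(xbar, 4) gives columns xbar^3, xbar^2, xbar, 1.
design = np.vander(xbar, 4)
fitted = np.empty_like(nb)
for d in range(nb.shape[1]):
    coef, *_ = np.linalg.lstsq(design, nb[:, d], rcond=None)
    fitted[:, d] = design @ coef

# EVSI: expected value of deciding after seeing the data, minus deciding now
evsi = fitted.max(axis=1).mean() - nb.mean(axis=0).max()
# Upper bound for comparison: value of learning theta exactly
evpi = nb.max(axis=1).mean() - nb.mean(axis=0).max()
print(f"EVSI estimate: {evsi:.0f}, EVPI: {evpi:.0f}")
```

No posterior sampling and no rerun of the decision model is needed here; only the PSA sample and the ability to simulate data sets are used, which matches the requirement stated in the abstract.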
Bosanquet, David C.; Ansell, James; Abdelrahman, Tarig; Cornish, Julie; Harries, Rhiannon; Stimpson, Amy; Davies, Llion; Glasbey, James C. D.; Frewer, Kathryn A.; Frewer, Natasha C.; Russell, Daphne; Russell, Ian; Torkington, Jared
2015-01-01
Background The incidence of incisional hernias (IHs) following midline abdominal incisions is difficult to estimate. Furthermore, recent analyses have reported inconsistent findings on the superiority of absorbable versus non-absorbable sutures. Objective To estimate the mean IH rate following midline laparotomy from the published literature, to identify variables that predict IH rates and to analyse whether the type of suture (absorbable versus non-absorbable) affects IH rates. Methods We undertook a systematic review according to PRISMA guidelines. We sought randomised trials and observational studies including patients undergoing midline incisions with standard suture closure. Papers describing two or more arms suitable for inclusion had data abstracted independently for each arm. Results Fifty-six papers, describing 83 separate groups comprising 14 618 patients, met the inclusion criteria. The prevalence of IHs after midline incision was 12.8% (range: 0 to 35.6%) at a weighted mean of 23.7 months. The estimated risk of undergoing IH repair after midline laparotomy was 5.2%. Two meta-regression analyses (A and B) each identified seven characteristics associated with increased IH rate: one patient variable (higher age), two surgical variables (surgery for AAA and either surgery for obesity (model A) or using an upper midline incision (model B)), two inclusion criteria (including patients with previous laparotomies and those with previous IHs), and two circumstantial variables (later year of publication and specifying an exact significance level). There was no significant difference in IH rate between absorbable and non-absorbable sutures either alone or in conjunction with either regression analysis. Conclusions The IH rate estimated by pooling the published literature is 12.8% after about two years. Seven factors account for the large variation in IH rates across groups. However, there is no evidence that suture type has an intrinsic effect on IH rates
NASA Technical Reports Server (NTRS)
Scarpace, F. L.; Voss, A. W.
1973-01-01
Dye densities of multi-layered films are determined by applying a regression analysis to the spectral response of the composite transparency. The amount of dye in each layer is determined by fitting the sum of the individual dye layer densities to the measured dye densities. From this, dye content constants are calculated. Methods of calculating equivalent exposures are discussed. Equivalent exposures are a constant amount of energy over a limited band-width that will give the same dye content constants as the real incident energy. Methods of using these equivalent exposures for analysis of photographic data are presented.
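The fitting step described above, in which the sum of the individual dye layer densities is matched to the measured composite density, is a linear least squares problem. The sketch below assumes Gaussian-shaped unit spectral densities for three layers purely for illustration; the peak wavelengths, widths, and amounts are hypothetical and are not taken from the report.

```python
import numpy as np

wavelengths = np.linspace(400.0, 700.0, 31)  # nm

def unit_density(peak_nm, width_nm):
    """Hypothetical Gaussian-shaped spectral density of one dye layer at unit amount."""
    return np.exp(-0.5 * ((wavelengths - peak_nm) / width_nm) ** 2)

# Columns: yellow, magenta and cyan layers (illustrative shapes, not measured film data)
basis = np.column_stack([
    unit_density(450.0, 40.0),
    unit_density(550.0, 40.0),
    unit_density(650.0, 40.0),
])

# Synthetic "measured" composite density: weighted sum of layers plus small noise
true_amounts = np.array([0.8, 1.2, 0.5])
rng = np.random.default_rng(2)
measured = basis @ true_amounts + rng.normal(0.0, 0.01, size=wavelengths.size)

# Least squares estimate of the amount of dye in each layer
amounts, *_ = np.linalg.lstsq(basis, measured, rcond=None)
print(np.round(amounts, 3))
```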
Mocking, R J T; Harmsen, I; Assies, J; Koeter, M W J; Ruhé, H G; Schene, A H
2016-01-01
Omega-3 polyunsaturated fatty acid (PUFA) supplementation has been proposed as (adjuvant) treatment for major depressive disorder (MDD). In the present meta-analysis, we pooled randomized placebo-controlled trials assessing the effects of omega-3 PUFA supplementation on depressive symptoms in MDD. Moreover, we performed meta-regression to test whether supplementation effects depended on eicosapentaenoic acid (EPA) or docosahexaenoic acid dose, their ratio, study duration, participants' age, percentage antidepressant users, baseline MDD symptom severity, publication year and study quality. To limit heterogeneity, we only included studies in adult patients with MDD assessed using standardized clinical interviews, and excluded studies that specifically studied perinatal/perimenopausal or comorbid MDD. Our PubMED/EMBASE search resulted in 1955 articles, from which we included 13 studies providing 1233 participants. After taking potential publication bias into account, meta-analysis showed an overall beneficial effect of omega-3 PUFAs on depressive symptoms in MDD (standardized mean difference=0.398 (0.114–0.682), P=0.006, random-effects model). As an explanation for significant heterogeneity (I2=73.36, P<0.001), meta-regression showed that higher EPA dose (β=0.00037 (0.00009–0.00065), P=0.009), higher percentage antidepressant users (β=0.0058 (0.00017–0.01144), P=0.044) and earlier publication year (β=−0.0735 (−0.143 to 0.004), P=0.04) were significantly associated with better outcome for PUFA supplementation. Additional sensitivity analyses were performed. In conclusion, present meta-analysis suggested a beneficial overall effect of omega-3 PUFA supplementation in MDD patients, especially for higher doses of EPA and in participants taking antidepressants. Future precision medicine trials should establish whether possible interactions between EPA and antidepressants could provide targets to improve antidepressant response and its prediction. Furthermore
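For readers unfamiliar with the quantities reported above, a random-effects pooled estimate and the I² heterogeneity statistic can be computed as in the following sketch. It uses the DerSimonian-Laird estimator, which is an assumption since the abstract does not name the estimator, and the study effects and variances are made up for illustration.

```python
import numpy as np

def dersimonian_laird(effects, variances):
    """Random-effects pooling with the DerSimonian-Laird between-study variance.
    Returns (pooled effect, standard error, tau^2, I^2 in percent)."""
    effects = np.asarray(effects, dtype=float)
    variances = np.asarray(variances, dtype=float)
    w = 1.0 / variances                              # fixed-effect weights
    fixed = np.sum(w * effects) / np.sum(w)
    q = np.sum(w * (effects - fixed) ** 2)           # Cochran's Q
    df = effects.size - 1
    c = np.sum(w) - np.sum(w ** 2) / np.sum(w)
    tau2 = max(0.0, (q - df) / c)                    # between-study variance
    w_star = 1.0 / (variances + tau2)                # random-effects weights
    pooled = np.sum(w_star * effects) / np.sum(w_star)
    se = np.sqrt(1.0 / np.sum(w_star))
    i2 = 100.0 * max(0.0, (q - df) / q) if q > 0 else 0.0
    return pooled, se, tau2, i2

# Hypothetical standardized mean differences and their variances
smd = [0.10, 0.60, 0.30, 0.90, -0.10]
var = [0.04, 0.05, 0.03, 0.06, 0.04]
pooled, se, tau2, i2 = dersimonian_laird(smd, var)
print(f"pooled SMD = {pooled:.3f} "
      f"(95% CI {pooled - 1.96 * se:.3f} to {pooled + 1.96 * se:.3f}), I2 = {i2:.1f}%")
```

A meta-regression extends this by replacing the single pooled mean with a weighted regression of the study effects on covariates such as dose or publication year.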
A Skew-t space-varying regression model for the spectral analysis of resting state brain activity.
Ismail, Salimah; Sun, Wenqi; Nathoo, Farouk S; Babul, Arif; Moiseev, Alexader; Beg, Mirza Faisal; Virji-Babul, Naznin
2013-08-01
It is known that in many neurological disorders such as Down syndrome, main brain rhythms shift their frequencies slightly, and characterizing the spatial distribution of these shifts is of interest. This article reports on the development of a Skew-t mixed model for the spatial analysis of resting state brain activity in healthy controls and individuals with Down syndrome. Time series of oscillatory brain activity are recorded using magnetoencephalography, and spectral summaries are examined at multiple sensor locations across the scalp. We focus on the mean frequency of the power spectral density, and use space-varying regression to examine associations with age, gender and Down syndrome across several scalp regions. Spatial smoothing priors are incorporated based on a multivariate Markov random field, and the markedly non-Gaussian nature of the spectral response variable is accommodated by the use of a Skew-t distribution. A range of models representing different assumptions on the association structure and response distribution are examined, and we conduct model selection using the deviance information criterion. Our analysis suggests region-specific differences between healthy controls and individuals with Down syndrome, particularly in the left and right temporal regions, and produces smoothed maps indicating the scalp topography of the estimated differences. PMID:22614763
Tvete, Ingunn Fride; Natvig, Bent; Gåsemyr, Jørund; Meland, Nils; Røine, Marianne; Klemp, Marianne
2015-01-01
Rheumatoid arthritis patients have been treated with disease modifying anti-rheumatic drugs (DMARDs) and the newer biologic drugs. We sought to compare and rank the biologics with respect to efficacy. We performed a literature search identifying 54 publications encompassing 9 biologics. We conducted a multiple treatment comparison regression analysis letting the number experiencing a 50% improvement on the ACR score be dependent upon dose level and disease duration for assessing the comparable relative effect between biologics and placebo or DMARD. The analysis embraced all treatment and comparator arms over all publications. Hence, all measured effects of any biologic agent contributed to the comparison of all biologic agents relative to each other either given alone or combined with DMARD. We found the drug effect to be dependent on dose level, but not on disease duration, and the impact of a high versus low dose level was the same for all drugs (higher doses indicated a higher frequency of ACR50 scores). The ranking of the drugs when given without DMARD was certolizumab (ranked highest), etanercept, tocilizumab/ abatacept and adalimumab. The ranking of the drugs when given with DMARD was certolizumab (ranked highest), tocilizumab, anakinra, rituximab, golimumab/ infliximab/ abatacept, adalimumab/ etanercept. Still, all drugs were effective. All biologic agents were effective compared to placebo, with certolizumab the most effective and adalimumab (without DMARD treatment) and adalimumab/ etanercept (combined with DMARD treatment) the least effective. The drugs were in general more effective, except for etanercept, when given together with DMARDs. PMID:26356639
Wadsworth, Sally J; Olson, Richard K; Willcutt, Erik G; DeFries, John C
2012-02-01
The augmented multiple regression model for the analysis of data from selected twin pairs was extended to facilitate analyses of data from twin pairs and nontwin siblings. Fitting this extended model to data from both selected twin pairs and siblings yields direct estimates of heritability (h²) and the difference between environmental influences shared by members of twin pairs and those of sib or twin-sib pairs (i.e., c²(t) - c²(s)). When this model was fitted to reading performance data from 293 monozygotic and 436 dizygotic pairs selected for reading difficulties, and 291 of their nontwin siblings, h² = .48 ± .22, p = .03, and c²(t) - c²(s) = .22 ± .12, p = .06. Although the test for differential shared environmental influences is only marginally significant, the results of this analysis suggest that environmental influences on reading performance that are shared by members of twin pairs (.36) may be substantially greater than those for less contemporaneous twin-sibling pairs (.14). PMID:22784461
Lee, Sungyoung; Kwon, Min-Seok; Park, Taesung
2014-01-01
In genome-wide association studies (GWAS), regression analysis has been most commonly used to establish an association between a phenotype and genetic variants, such as single nucleotide polymorphism (SNP). However, most applications of regression analysis have been restricted to the investigation of single marker because of the large computational burden. Thus, there have been limited applications of regression analysis to multiple SNPs, including gene–gene interaction (GGI) in large-scale GWAS data. In order to overcome this limitation, we propose CARAT-GxG, a GPU computing system-oriented toolkit, for performing regression analysis with GGI using CUDA (compute unified device architecture). Compared to other methods, CARAT-GxG achieved almost 700-fold execution speed and delivered highly reliable results through our GPU-specific optimization techniques. In addition, it was possible to achieve almost-linear speed acceleration with the application of a GPU computing system, which is implemented by the TORQUE Resource Manager. We expect that CARAT-GxG will enable large-scale regression analysis with GGI for GWAS data. PMID:25574130
Logistic Regression: Concept and Application
ERIC Educational Resources Information Center
Cokluk, Omay
2010-01-01
The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…
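A minimal sketch of binary logistic regression used for group classification follows. The data are simulated, the coefficient values are arbitrary, and the Newton-Raphson fitting routine is one common way to obtain the maximum likelihood estimates; none of this is taken from the article itself.

```python
import numpy as np

def fit_logistic(X, y, n_iter=25):
    """Fit binary logistic regression by Newton-Raphson.
    X must already contain an intercept column; y holds 0/1 group labels."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))          # predicted probabilities
        grad = X.T @ (y - p)                         # score vector
        hess = X.T @ (X * (p * (1.0 - p))[:, None])  # observed information
        beta = beta + np.linalg.solve(hess, grad)
    return beta

# Simulated data: two independent variables and a dichotomous outcome
rng = np.random.default_rng(3)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n), rng.normal(size=n)])
true_beta = np.array([-0.5, 1.0, -2.0])  # arbitrary illustrative coefficients
y = (rng.random(n) < 1.0 / (1.0 + np.exp(-X @ true_beta))).astype(float)

beta_hat = fit_logistic(X, y)

# Classify each individual into the group with predicted probability >= 0.5
predicted_group = (1.0 / (1.0 + np.exp(-X @ beta_hat)) >= 0.5).astype(float)
accuracy = (predicted_group == y).mean()
print(np.round(beta_hat, 2), f"accuracy = {accuracy:.2f}")
```

Exponentiating a fitted coefficient gives the odds ratio for a one-unit increase in that independent variable, which is how such models are usually interpreted.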
Augsburger, J.J.; Gamel, J.W.; Shields, J.A.; Markoe, A.M.; Brady, L.W.
1987-09-01
To determine the prognostic value of the regression rate of choroidal melanomas after cobalt-60 plaque radiotherapy, the authors performed a multivariate analysis on 159 patients treated with a cobalt plaque during the interval from 1976 through 1980. Thirty-three of the 159 patients had died as of the survey date: 29 of metastatic melanoma and 4 of other causes. Multivariate Cox proportional hazards modeling identified a two-term regression incorporating maximal basal tumor diameter at treatment and tumor thickness at 12 months posttreatment as the best model (P < 0.005 for both parameters) for predicting length of tumor-free survival. These results are consistent with the hypothesis that rapid regression of a choroidal melanoma after cobalt-60 plaque radiotherapy is an unfavorable prognostic sign for prolonged metastasis-free survival.
Effects of dietary fat on fertility of dairy cattle: A meta-analysis and meta-regression.
Rodney, R M; Celi, P; Scott, W; Breinhild, K; Lean, I J
2015-08-01
Evidence is increasing of positive effects of feeding fats during transition on fertility and the adaptation to lactation. This study used meta-analytic methods to explore the effects of including fats in the transition diet on the risk of pregnancy to service (proportion pregnant) and calving to pregnancy interval. Meta-analysis was used to integrate smaller studies and increase the statistical power over that of any single study and explore new hypotheses. We explored the effect of fats and diet composition on fertility using meta-regression methods. Relatively few highly controlled studies are available providing detailed descriptions of the diets used that examined interactions between fat nutrition and reproductive outcomes. Only 17 studies containing 26 comparisons were suitable for inclusion in statistical evaluations. Reproductive variables evaluated were risk of pregnancy (proportion pregnant), primarily to first service, and calving to pregnancy interval. Production variables examined were milk yield, milk composition, and body weight. The sources of heterogeneity in these studies were also explored. A 27% overall increase in pregnancy to service was observed (relative risk=1.27; 95% Knapp-Hartung confidence interval 1.09 to 1.45), and results were relatively consistent (I²=19.9%). A strong indication of a reduction in calving to pregnancy interval was also identified, which was consistent across studies (I²=0.0%), supporting a conclusion that, overall, the inclusion of fats does improve fertility. Further exploration of the factors contributing to proportion pregnant using bivariate meta-regression identified variables that reflected changes in diet composition or animal response resulting from inclusion of the fat interventions in the experimental diets fed. Increased fermentable neutral detergent fiber and soluble fiber intakes increased the proportion pregnant, whereas increased milk yield of the treatment group decreased this measure
Select Dietary Phytochemicals Function as Inhibitors of COX-1 but Not COX-2
Li, Haitao; Zhu, Feng; Sun, Yanwen; Li, Bing; Oi, Naomi; Chen, Hanyong; Lubet, Ronald A.; Bode, Ann M.; Dong, Zigang
2013-01-01
Recent clinical trials raised concerns regarding the cardiovascular toxicity of selective cyclooxygenase-2 (COX-2) inhibitors. Many active dietary factors are reported to suppress carcinogenesis by targeting COX-2. A major question was accordingly raised: why has the lifelong use of phytochemicals that likely inhibit COX-2 presumably not been associated with adverse cardiovascular side effects? To answer this question, we selected a library of dietary-derived phytochemicals and evaluated their potential cardiovascular toxicity in human umbilical vein endothelial cells. Our data indicated that the possibility of cardiovascular toxicity of these dietary phytochemicals was low. Further mechanistic studies revealed that the actions of these phytochemicals were similar to aspirin in that they mainly inhibited COX-1 rather than COX-2, especially at low doses. PMID:24098505
NASA Astrophysics Data System (ADS)
Zheng, Xiaoming; Kim, Myeongsoo; Yang, Sook
2016-03-01
The purposes of this work were to determine the optimal peak voltage for chest computed radiography (CR) using visual grading scores and to compare visual grading characteristics (VGC) and ordinal regression in visual grading analysis. An Agfa CR system was used to acquire images of an anthropomorphic chest phantom. Both entrance surface dose and detector surface dose were measured using the Piranha 657 dosimeter. The images were acquired under various voltages from 80 to 120 kVp and exposures from 0.5 to 12.5 mAs. The image qualities were evaluated by 5 experienced radiologists/radiographers based on modified European imaging criteria using a 1-5 visual grading scale. The VGC, ordinal regression, and conventional visual grading analysis (VGA) were employed for the image quality analysis. VGC and ordinal regression yielded the same results, with both 100 kVp and 120 kVp producing the best image quality. The image quality at 120 kVp was slightly higher than at 100 kVp, but its dose was also higher. On balancing image quality with dose, 100 kVp should be the optimal kVp for chest imaging using the Agfa CR system. Ordinal regression is a powerful tool in the analysis of image quality using visual grading scores, and the VGC can be handled by ordinal regression.
Association Between COX-2 Polymorphisms and Lung Cancer Risk
Wang, Weiwei; Fan, Xinyun; Zhang, Yong; Yang, Yi; Yang, Siyuan; Li, Gaofeng
2015-01-01
Background: Multiple relevant risk factors for lung cancer have been reported in different populations, but results of previous studies were not consistent. Therefore, a meta-analysis is necessary to summarize these outcomes and reach a relatively comprehensive conclusion. Material/Methods: STATA 12.0 software was used for all statistical analyses of the relationship between COX-2 polymorphisms and lung cancer risk. Inter-study heterogeneity was examined with the Q statistic (significance level at P<0.1). The publication bias among studies in the meta-analysis was analyzed with Begg’s funnel plot and Egger’s test. Hardy-Weinberg equilibrium was tested in all controls of the studies. Results: COX-2 rs20417 polymorphism had a significant association with reduced risk of lung cancer under homozygous and recessive models, and similar results were observed in white and population-based subgroups under 2 and 3 contrasts, respectively. Additionally, rs2066826 polymorphism manifested a strong correlation with increased risk of lung cancer under 5 genetic models. Conclusions: In the COX-2 gene, rs20417 may have a certain relationship with reduced risk of lung cancer, while rs2066826 may increase the risk of lung cancer. PMID:26624903
Huang, Jian; Zhang, Di; Xie, Fuqiang; Lin, Degui
2015-01-01
Increasing evidence suggests that cancer stem cells (CSCs) are responsible for tumor initiation and maintenance. Additionally, it is becoming apparent that cyclooxygenase (COX) signaling is associated with canine mammary tumor development. The goals of the present study were to investigate COX-2 expression patterns and their effect on CSC-mediated tumor initiation in primary canine mammary tissues and tumorsphere models using immunohistochemistry. Patterns of COX-2, CD44, octamer-binding transcription factor (Oct)-3/4, and epidermal growth factor receptor (EGFR) expression were examined in malignant mammary tumor (MMT) samples and analyzed in terms of clinicopathological characteristics. COX-2 and Oct-3/4 expression was higher in MMTs compared to other histological samples with heterogeneous patterns. In MMTs, COX-2 expression correlated with tumor malignancy features. Significant associations between COX-2, CD44, and EGFR were observed in low-differentiated MMTs. Comparative analysis showed that the levels of COX-2, CD44, and Oct-3/4 expression varied significantly among TSs of three histological grades. Enhanced COX-2 staining was consistently observed in TSs. Similar levels of staining intensity were found for CD44 and Oct-3/4, but EGFR expression was weak. Our findings indicate the potential role of COX-2 in CSC-mediated tumor initiation, and suggest that COX-2 inhibition may help treat canine mammary tumors by targeting CSCs. PMID:26124697
Boulet, Sebastien; Boudot, Elsa; Houel, Nicolas
2016-05-01
Back pain is a common reason for consultation in primary healthcare clinical practice, and has effects on daily activities and posture. Relationships between the whole spine and upright posture, however, remain unknown. The aim of this study was to identify the relationship between each spinal curve and centre of pressure position as well as velocity for healthy subjects. Twenty-one male subjects performed quiet stance in natural position. Each upright posture was then recorded using an optoelectronics system (Vicon Nexus) synchronized with two force plates. At each moment, polynomial interpolations of markers attached on the spine segment were used to compute cervical lordosis, thoracic kyphosis and lumbar lordosis angle curves. Mean of centre of pressure position and velocity was then computed. Multiple stepwise linear regression analysis showed that the position and velocity of centre of pressure associated with each part of the spinal curves were defined as best predictors of the lumbar lordosis angle (R² = 0.45; p = 1.65 × 10⁻¹⁰) and the thoracic kyphosis angle (R² = 0.54; p = 4.89 × 10⁻¹³) of healthy subjects in quiet stance. This study showed the relationships between each of cervical, thoracic, lumbar curvatures, and centre of pressure's fluctuation during free quiet standing using non-invasive full spinal curve exploration. PMID:26970888
Pauli-Pott, Ursula; Becker, Katja
2015-08-01
Normative development of neuropsychological functions that are assumed to underlie attention deficit/hyperactivity disorder (ADHD) may show transition periods, i.e., periods of heightened developmental discontinuity and reduced differential continuity. During such periods differences between ADHD cases and controls in these functions might be obscured because assessments probably not only reflect individual differences in the ADHD-related deviation but also individual differences in speed/onset of the transition. Our review focuses on executive inhibitory control (IC) and delay aversion/discounting (DA) because normative developmental processes of these characteristics are relatively well described. For complex IC performance a transition period can be assumed in preschool years, for DA around puberty. Published meta-analyses on neuropsychological IC tasks and a meta-regression analysis of 23 case-control comparisons in DA tasks comprising 1395 individuals with ADHD and 1195 controls confirmed our assumption. Effect sizes of case-control comparisons were significantly larger outside transition periods, i.e., in age-periods of relative developmental continuity. An increasingly precise identification of such time windows could contribute to the understanding of the etiological pathways of ADHD. PMID:25956255
Boy-Roura, M; Cameron, K C; Di, H J
2016-02-01
This study presents a meta-analysis of 12 experiments that quantify nitrate-N leaching losses from grazed pasture systems in alluvial sedimentary soils in Canterbury (New Zealand). Mean measured nitrate-N leached (kg N/ha × 100 mm drainage) losses were 2.7 when no urine was applied, 8.4 at the urine rate of 300 kg N/ha, 9.8 at 500 kg N/ha, 24.5 at 700 kg N/ha and 51.4 at 1000 kg N/ha. Lismore soils presented significantly higher nitrate-N losses compared to Templeton soils. Moreover, a multiple linear regression (MLR) model was developed to determine the key factors that influence nitrate-N leaching and to predict nitrate-N leaching losses. The MLR analysis was calibrated and validated using 82 average values of nitrate-N leached and 48 explanatory variables representative of nitrogen inputs and outputs, transport, attenuation of nitrogen and farm management practices. The MLR model (R² = 0.81) showed that nitrate-N leaching losses were greater at higher urine application rates and when there was more drainage from rainfall and irrigation. On the other hand, nitrate leaching decreased when nitrification inhibitors (e.g. dicyandiamide (DCD)) were applied. Predicted nitrate-N leaching losses at the paddock scale were calculated using the MLR equation, and they varied largely depending on the urine application rate and urine patch coverage. PMID:26498804
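As a rough illustration of how a multiple linear regression and its R² are obtained, the sketch below fits ordinary least squares through the normal equations in plain Python. It is not the authors' model: the two predictors (urine rate and drainage), the data values, and the helper names `solve` and `fit_mlr` are assumptions made for the example.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: move the largest entry in this column up
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_mlr(rows, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y.

    Returns the coefficients (intercept first) and R^2.
    """
    x = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    k = len(x[0])
    xtx = [[sum(xi[i] * xi[j] for xi in x) for j in range(k)] for i in range(k)]
    xty = [sum(xi[i] * yi for xi, yi in zip(x, y)) for i in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, xi)) for xi in x]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return beta, 1.0 - ss_res / ss_tot

# Hypothetical observations: (urine rate in kg N/ha, drainage in mm)
predictors = [(0, 100), (300, 200), (500, 150), (700, 300), (1000, 250), (300, 350)]
leached = [3.0, 20.0, 22.0, 48.0, 55.0, 31.0]  # hypothetical kg N/ha

coefficients, r_squared = fit_mlr(predictors, leached)
print("intercept and slopes:", [round(c, 4) for c in coefficients])
print("R^2:", round(r_squared, 3))
```

R² here is one minus the ratio of residual to total sum of squares, so a value such as the 0.81 quoted above means the fitted equation accounts for about 81% of the variance in the calibration data.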
NASA Astrophysics Data System (ADS)
Toth-Tascau, Mirela; Balanean, Flavia; Krepelka, Mircea
2013-10-01
Musculoskeletal impairment of the upper limb can cause difficulties in performing basic daily activities. Three dimensional motion analyses can provide valuable data of arm movement in order to precisely determine arm movement and inter-joint coordination. The purpose of this study was to develop a method to evaluate the degree of impairment based on the influence of shoulder movements in the amplitude of elbow flexion and extension based on the assumption that a lack of motion of the elbow joint will be compensated by an increased shoulder activity. In order to develop and validate a statistical model, one healthy young volunteer has been involved in the study. The activity of choice simulated blowing the nose, starting from a slight flexion of the elbow and raising the hand until the middle finger touches the tip of the nose and return to the start position. Inter-joint coordination between the elbow and shoulder movements showed significant correlation. Statistical regression was used to fit an equation model describing the influence of shoulder movements on the elbow mobility. The study provides a brief description of the kinematic analysis protocol and statistical models that may be useful in describing the relation between inter-joint movements of daily activities.
Shabri, Ani; Samsudin, Ruhaidah
2014-01-01
Crude oil prices play a significant role in the global economy and are a key input into option pricing formulas, portfolio allocation, and risk measurement. In this paper, a hybrid model integrating wavelet and multiple linear regressions (MLR) is proposed for crude oil price forecasting. In this model, the Mallat wavelet transform is first selected to decompose an original time series into several subseries with different scales. Then, principal component analysis (PCA) is used in processing subseries data in MLR for crude oil price forecasting. Particle swarm optimization (PSO) is used to adopt the optimal parameters of the MLR model. To assess the effectiveness of this model, the daily West Texas Intermediate (WTI) crude oil market has been used as the case study. The time series prediction performance of the WMLR model is compared with the MLR, ARIMA, and GARCH models using various statistical measures. The experimental results show that the proposed model outperforms the individual models in forecasting the crude oil price series. PMID:24895666
Lytras, Theodore; Kopsachilis, Frixos; Mouratidou, Elisavet; Papamichail, Dimitris; Bonovas, Stefanos
2016-03-01
Influenza vaccination is recommended for healthcare workers (HCWs), but coverage is often low. We reviewed studies evaluating interventions to increase seasonal influenza vaccination coverage in HCWs, including a meta-regression analysis to quantify the effect of each component. Forty-six eligible studies were identified. Domains conferring a high risk of bias were identified in most studies. Mandatory vaccination was the most effective intervention component (Risk Ratio of being unvaccinated [RRunvacc] = 0.18, 95% CI: 0.08-0.45), followed by "soft" mandates such as declination statements (RRunvacc = 0.64, 95% CI: 0.45-0.92), increased awareness (RRunvacc = 0.83, 95% CI: 0.71-0.97) and increased access (RRunvacc = 0.88, 95% CI: 0.78-1.00). For incentives the difference was not significant, while for education no effect was observed. Heterogeneity was substantial (τ² = 0.083). These results indicate that effective alternatives to mandatory HCW influenza vaccination do exist, and need to be further explored in future studies. PMID:26619125
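For readers unfamiliar with the "risk ratio of being unvaccinated" and its confidence interval, a minimal sketch for a single two-group comparison follows. The abstract's estimates come from a meta-regression across studies, which this does not reproduce; the counts and the function name `risk_ratio_ci` are hypothetical, and the interval uses the standard large-sample formula on the log scale.

```python
import math

def risk_ratio_ci(events_a, total_a, events_b, total_b, z=1.96):
    """Risk ratio of group A versus group B with a Wald interval on the log scale."""
    rr = (events_a / total_a) / (events_b / total_b)
    # Standard error of log(RR) for two independent binomial proportions
    se = math.sqrt(1 / events_a - 1 / total_a + 1 / events_b - 1 / total_b)
    lower = math.exp(math.log(rr) - z * se)
    upper = math.exp(math.log(rr) + z * se)
    return rr, lower, upper

# Hypothetical counts of unvaccinated staff: 36 of 200 with the
# intervention versus 160 of 200 without it
rr, lower, upper = risk_ratio_ci(36, 200, 160, 200)
print(f"RR of being unvaccinated = {rr:.3f} (95% CI {lower:.3f} to {upper:.3f})")
```

A ratio below 1 with an upper limit below 1 indicates fewer unvaccinated workers under the intervention; an upper limit reaching 1.00, as for increased access above, marks a borderline result.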
Perez, Ivan; Chavez, Allison K.; Ponce, Dario
2016-01-01
Background: The Ricketts' posteroanterior (PA) cephalometry seems to be the most widely used and it has not been tested by multivariate statistics for sex determination. Objective: The objective was to determine the applicability of Ricketts' PA cephalometry for sex determination using logistic regression analysis. Materials and Methods: The logistic models were estimated at distinct age cutoffs (all ages, 11 years, 13 years, and 15 years) in a database from 1,296 Hispano American Peruvians between 5 years and 44 years of age. Results: The logistic models were composed of six cephalometric measurements; the accuracy achieved by resubstitution varied between 60% and 70% and all the variables, with one exception, exhibited a direct relationship with the probability of being classified as male; the nasal width exhibited an indirect relationship. Conclusion: The maxillary and facial widths were present in all models and may represent a sexual dimorphism indicator. The accuracy found was lower than that reported in the literature, and the Ricketts' PA cephalometry may not be adequate for sex determination. The indirect relationship of the nasal width in models with data from patients of 12 years of age or less may be a trait related to age or a characteristic in the studied population, which could be better studied and confirmed. PMID:27555732
Data fusion of CO2 retrieved from GOSAT and AIRS using regression analysis and fixed rank kriging
NASA Astrophysics Data System (ADS)
Zhou, Cong; Shi, Runhe; Gao, Wei
2015-09-01
This paper proposes an improved statistical method for fusing carbon dioxide (CO2) data retrieved from two major instruments, the Greenhouse gases Observing SATellite (GOSAT) and the Atmospheric Infrared Sounder (AIRS). These two datasets were fused to obtain CO2 concentrations near the surface, which is a region that is especially important for studies on carbon sources and sinks. Overall, the CO2 monthly average values from GOSAT are all lower than those from AIRS from 2010 to 2012. The datasets show similar seasonal cycles of carbon dioxide and an increasing trend with a determination coefficient of 0.45. A strong correlation was determined by adding the climatic factors as independent variables for regression analysis. The correlation coefficients between the CO2 values from AIRS and GOSAT significantly increased in response. The true CO2 data processes were then predicted using the fixed rank kriging method. This showed that the data-fusion CO2 product provides more reasonable information and that the corresponding mean squared prediction errors are smaller than those from the single GOSAT CO2 dataset.
Partial covariate adjusted regression
Şentürk, Damla; Nguyen, Danh V.
2008-01-01
Covariate adjusted regression (CAR) is a recently proposed adjustment method for regression analysis where both the response and predictors are not directly observed (Şentürk and Müller, 2005). The available data has been distorted by unknown functions of an observable confounding covariate. CAR provides consistent estimators for the coefficients of the regression between the variables of interest, adjusted for the confounder. We develop a broader class of partial covariate adjusted regression (PCAR) models to accommodate both distorted and undistorted (adjusted/unadjusted) predictors. The PCAR model allows for unadjusted predictors, such as age, gender and demographic variables, which are common in the analysis of biomedical and epidemiological data. The available estimation and inference procedures for CAR are shown to be invalid for the proposed PCAR model. We propose new estimators and develop new inference tools for the more general PCAR setting. In particular, we establish the asymptotic normality of the proposed estimators and propose consistent estimators of their asymptotic variances. Finite sample properties of the proposed estimators are investigated using simulation studies and the method is also illustrated with a Pima Indians diabetes data set. PMID:20126296
Jamshidi, S; Yadollahi, A; Ahmadi, H; Arab, M M; Eftekhari, M
2016-01-01
Two modeling techniques [artificial neural network-genetic algorithm (ANN-GA) and stepwise regression analysis] were used to predict the effect of medium macro-nutrients on in vitro performance of pear rootstocks (OHF and Pyrodwarf). The ANN-GA described associations between investigating eight macronutrients (NO[Formula: see text], NH[Formula: see text], Ca(2+), K(+), Mg(2+), PO[Formula: see text], SO[Formula: see text], and Cl(-)) and explant growth parameters [proliferation rate (PR), shoot length (SL), shoot tip necrosis (STN), chlorosis (Chl), and vitrification (Vitri)]. ANN-GA revealed a substantially higher accuracy of prediction than for regression models. According to the ANN-GA results, among the input variables concentrations (mM), NH[Formula: see text] (301.7), and NO[Formula: see text], NH[Formula: see text] (64), SO[Formula: see text] (54.1), K(+) (40.4), and NO[Formula: see text] (35.1) in OHF and Ca(2+) (23.7), NH[Formula: see text] (10.7), NO[Formula: see text] (9.1), NH[Formula: see text] (317.6), and NH[Formula: see text] (79.6) in Pyrodwarf had the highest values of VSR in data set, respectively, for PR, SL, STN, Chl, and Vitri. The ANN-GA showed that media containing (mM) 62.5 NO[Formula: see text], 5.7 NH[Formula: see text], 2.7 Ca(2+), 31.5 K(+), 3.3 Mg(2+), 2.6 PO[Formula: see text], 5.6 SO[Formula: see text], and 3.5 Cl(-) could lead to optimal PR for OHF and optimal PR for Pyrodwarf may be obtained with media containing 25.6 NO[Formula: see text], 13.1 NH[Formula: see text], 5.5 Ca(2+), 35.7 K(+), 1.5 Mg(2+), 2.1 PO[Formula: see text], 3.6 SO[Formula: see text], and 3 Cl(-). PMID:27066013
ERIC Educational Resources Information Center
Pedrini, D. T.; Pedrini, Bonnie C.
Regression, another mechanism studied by Sigmund Freud, has had much research, e.g., hypnotic regression, frustration regression, schizophrenic regression, and infra-human-animal regression (often directly related to fixation). Many investigators worked with hypnotic age regression, which has a long history, going back to Russian reflexologists.…