Variable Selection in Logistic Regression.
1987-06-01
23 %. AUTIOR(.) S. CONTRACT OR GRANT NUMBE Rf.i %Z. D. Bai, P. R. Krishnaiah and . C. Zhao F49620-85- C-0008 " PERFORMING ORGANIZATION NAME AND AOORESS...d I7 IOK-TK- d 7 -I0 7’ VARIABLE SELECTION IN LOGISTIC REGRESSION Z. D. Bai, P. R. Krishnaiah and L. C. Zhao Center for Multivariate Analysis...University of Pittsburgh Center for Multivariate Analysis University of Pittsburgh Y !I VARIABLE SELECTION IN LOGISTIC REGRESSION Z- 0. Bai, P. R. Krishnaiah
Nagraj, Nandini; Slocik, Joseph M; Phillips, David M; Kelley-Loughnane, Nancy; Naik, Rajesh R; Potyrailo, Radislav A
2013-08-07
Peptide-capped AYSSGAPPMPPF gold nanoparticles were demonstrated for highly selective chemical vapor sensing using individual multivariable inductor-capacitor-resistor (LCR) resonators. Their multivariable response was achieved by measuring their resonance impedance spectra followed by multivariate spectral analysis. Detection of model toxic vapors and chemical agent simulants, such as acetonitrile, dichloromethane and methyl salicylate, was performed. Dichloromethane (dielectric constant εr = 9.1) and methyl salicylate (εr = 9.0) were discriminated using a single sensor. These sensing materials coupled to multivariable transducers can provide numerous opportunities for tailoring the vapor response selectivity based on the diversity of the amino acid composition of the peptides, and by the modulation of the nature of peptide-nanoparticle interactions through designed combinations of hydrophobic and hydrophilic amino acids.
Multivariate Methods for Meta-Analysis of Genetic Association Studies.
Dimou, Niki L; Pantavou, Katerina G; Braliou, Georgia G; Bagos, Pantelis G
2018-01-01
Multivariate meta-analysis of genetic association studies and genome-wide association studies has received a remarkable attention as it improves the precision of the analysis. Here, we review, summarize and present in a unified framework methods for multivariate meta-analysis of genetic association studies and genome-wide association studies. Starting with the statistical methods used for robust analysis and genetic model selection, we present in brief univariate methods for meta-analysis and we then scrutinize multivariate methodologies. Multivariate models of meta-analysis for a single gene-disease association studies, including models for haplotype association studies, multiple linked polymorphisms and multiple outcomes are discussed. The popular Mendelian randomization approach and special cases of meta-analysis addressing issues such as the assumption of the mode of inheritance, deviation from Hardy-Weinberg Equilibrium and gene-environment interactions are also presented. All available methods are enriched with practical applications and methodologies that could be developed in the future are discussed. Links for all available software implementing multivariate meta-analysis methods are also provided.
A survey of variable selection methods in two Chinese epidemiology journals
2010-01-01
Background Although much has been written on developing better procedures for variable selection, there is little research on how it is practiced in actual studies. This review surveys the variable selection methods reported in two high-ranking Chinese epidemiology journals. Methods Articles published in 2004, 2006, and 2008 in the Chinese Journal of Epidemiology and the Chinese Journal of Preventive Medicine were reviewed. Five categories of methods were identified whereby variables were selected using: A - bivariate analyses; B - multivariable analysis; e.g. stepwise or individual significance testing of model coefficients; C - first bivariate analyses, followed by multivariable analysis; D - bivariate analyses or multivariable analysis; and E - other criteria like prior knowledge or personal judgment. Results Among the 287 articles that reported using variable selection methods, 6%, 26%, 30%, 21%, and 17% were in categories A through E, respectively. One hundred sixty-three studies selected variables using bivariate analyses, 80% (130/163) via multiple significance testing at the 5% alpha-level. Of the 219 multivariable analyses, 97 (44%) used stepwise procedures, 89 (41%) tested individual regression coefficients, but 33 (15%) did not mention how variables were selected. Sixty percent (58/97) of the stepwise routines also did not specify the algorithm and/or significance levels. Conclusions The variable selection methods reported in the two journals were limited in variety, and details were often missing. Many studies still relied on problematic techniques like stepwise procedures and/or multiple testing of bivariate associations at the 0.05 alpha-level. These deficiencies should be rectified to safeguard the scientific validity of articles published in Chinese epidemiology journals. PMID:20920252
Multivariate Meta-Analysis Using Individual Participant Data
ERIC Educational Resources Information Center
Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.
2015-01-01
When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is…
Brito Lopes, Fernando; da Silva, Marcelo Corrêa; Magnabosco, Cláudio Ulhôa; Goncalves Narciso, Marcelo; Sainz, Roberto Daniel
2016-01-01
This research evaluated a multivariate approach as an alternative tool for the purpose of selection regarding expected progeny differences (EPDs). Data were fitted using a multi-trait model and consisted of growth traits (birth weight and weights at 120, 210, 365 and 450 days of age) and carcass traits (longissimus muscle area (LMA), back-fat thickness (BF), and rump fat thickness (RF)), registered over 21 years in extensive breeding systems of Polled Nellore cattle in Brazil. Multivariate analyses were performed using standardized (zero mean and unit variance) EPDs. The k mean method revealed that the best fit of data occurred using three clusters (k = 3) (P < 0.001). Estimates of genetic correlation among growth and carcass traits and the estimates of heritability were moderate to high, suggesting that a correlated response approach is suitable for practical decision making. Estimates of correlation between selection indices and the multivariate index (LD1) were moderate to high, ranging from 0.48 to 0.97. This reveals that both types of indices give similar results and that the multivariate approach is reliable for the purpose of selection. The alternative tool seems very handy when economic weights are not available or in cases where more rapid identification of the best animals is desired. Interestingly, multivariate analysis allowed forecasting information based on the relationships among breeding values (EPDs). Also, it enabled fine discrimination, rapid data summarization after genetic evaluation, and permitted accounting for maternal ability and the genetic direct potential of the animals. In addition, we recommend the use of longissimus muscle area and subcutaneous fat thickness as selection criteria, to allow estimation of breeding values before the first mating season in order to accelerate the response to individual selection. PMID:26789008
Brito Lopes, Fernando; da Silva, Marcelo Corrêa; Magnabosco, Cláudio Ulhôa; Goncalves Narciso, Marcelo; Sainz, Roberto Daniel
2016-01-01
This research evaluated a multivariate approach as an alternative tool for the purpose of selection regarding expected progeny differences (EPDs). Data were fitted using a multi-trait model and consisted of growth traits (birth weight and weights at 120, 210, 365 and 450 days of age) and carcass traits (longissimus muscle area (LMA), back-fat thickness (BF), and rump fat thickness (RF)), registered over 21 years in extensive breeding systems of Polled Nellore cattle in Brazil. Multivariate analyses were performed using standardized (zero mean and unit variance) EPDs. The k mean method revealed that the best fit of data occurred using three clusters (k = 3) (P < 0.001). Estimates of genetic correlation among growth and carcass traits and the estimates of heritability were moderate to high, suggesting that a correlated response approach is suitable for practical decision making. Estimates of correlation between selection indices and the multivariate index (LD1) were moderate to high, ranging from 0.48 to 0.97. This reveals that both types of indices give similar results and that the multivariate approach is reliable for the purpose of selection. The alternative tool seems very handy when economic weights are not available or in cases where more rapid identification of the best animals is desired. Interestingly, multivariate analysis allowed forecasting information based on the relationships among breeding values (EPDs). Also, it enabled fine discrimination, rapid data summarization after genetic evaluation, and permitted accounting for maternal ability and the genetic direct potential of the animals. In addition, we recommend the use of longissimus muscle area and subcutaneous fat thickness as selection criteria, to allow estimation of breeding values before the first mating season in order to accelerate the response to individual selection.
Microenvironmental and biological/personal monitoring information were collected during the National Human Exposure Assessment Survey (NHEXAS), conducted in the six states comprising U.S. EPA Region Five. They have been analyzed by multivariate analysis techniques with general ...
Hebart, Martin N.; Görgen, Kai; Haynes, John-Dylan
2015-01-01
The multivariate analysis of brain signals has recently sparked a great amount of interest, yet accessible and versatile tools to carry out decoding analyses are scarce. Here we introduce The Decoding Toolbox (TDT) which represents a user-friendly, powerful and flexible package for multivariate analysis of functional brain imaging data. TDT is written in Matlab and equipped with an interface to the widely used brain data analysis package SPM. The toolbox allows running fast whole-brain analyses, region-of-interest analyses and searchlight analyses, using machine learning classifiers, pattern correlation analysis, or representational similarity analysis. It offers automatic creation and visualization of diverse cross-validation schemes, feature scaling, nested parameter selection, a variety of feature selection methods, multiclass capabilities, and pattern reconstruction from classifier weights. While basic users can implement a generic analysis in one line of code, advanced users can extend the toolbox to their needs or exploit the structure to combine it with external high-performance classification toolboxes. The toolbox comes with an example data set which can be used to try out the various analysis methods. Taken together, TDT offers a promising option for researchers who want to employ multivariate analyses of brain activity patterns. PMID:25610393
Ramdani, Sofiane; Bonnet, Vincent; Tallon, Guillaume; Lagarde, Julien; Bernard, Pierre Louis; Blain, Hubert
2016-08-01
Entropy measures are often used to quantify the regularity of postural sway time series. Recent methodological developments provided both multivariate and multiscale approaches allowing the extraction of complexity features from physiological signals; see "Dynamical complexity of human responses: A multivariate data-adaptive framework," in Bulletin of Polish Academy of Science and Technology, vol. 60, p. 433, 2012. The resulting entropy measures are good candidates for the analysis of bivariate postural sway signals exhibiting nonstationarity and multiscale properties. These methods are dependant on several input parameters such as embedding parameters. Using two data sets collected from institutionalized frail older adults, we numerically investigate the behavior of a recent multivariate and multiscale entropy estimator; see "Multivariate multiscale entropy: A tool for complexity analysis of multichannel data," Physics Review E, vol. 84, p. 061918, 2011. We propose criteria for the selection of the input parameters. Using these optimal parameters, we statistically compare the multivariate and multiscale entropy values of postural sway data of non-faller subjects to those of fallers. These two groups are discriminated by the resulting measures over multiple time scales. We also demonstrate that the typical parameter settings proposed in the literature lead to entropy measures that do not distinguish the two groups. This last result confirms the importance of the selection of appropriate input parameters.
Gerhardt, H Carl; Brooks, Robert
2009-10-01
Even simple biological signals vary in several measurable dimensions. Understanding their evolution requires, therefore, a multivariate understanding of selection, including how different properties interact to determine the effectiveness of the signal. We combined experimental manipulation with multivariate selection analysis to assess female mate choice on the simple trilled calls of male gray treefrogs. We independently and randomly varied five behaviorally relevant acoustic properties in 154 synthetic calls. We compared response times of each of 154 females to one of these calls with its response to a standard call that had mean values of the five properties. We found directional and quadratic selection on two properties indicative of the amount of signaling, pulse number, and call rate. Canonical rotation of the fitness surface showed that these properties, along with pulse rate, contributed heavily to a major axis of stabilizing selection, a result consistent with univariate studies showing diminishing effects of increasing pulse number well beyond the mean. Spectral properties contributed to a second major axis of stabilizing selection. The single major axis of disruptive selection suggested that a combination of two temporal and two spectral properties with values differing from the mean should be especially attractive.
FREQ: A computational package for multivariable system loop-shaping procedures
NASA Technical Reports Server (NTRS)
Giesy, Daniel P.; Armstrong, Ernest S.
1989-01-01
Many approaches in the field of linear, multivariable time-invariant systems analysis and controller synthesis employ loop-sharing procedures wherein design parameters are chosen to shape frequency-response singular value plots of selected transfer matrices. A software package, FREQ, is documented for computing within on unified framework many of the most used multivariable transfer matrices for both continuous and discrete systems. The matrices are evaluated at user-selected frequency-response values, and singular values against frequency. Example computations are presented to demonstrate the use of the FREQ code.
DigOut: viewing differential expression genes as outliers.
Yu, Hui; Tu, Kang; Xie, Lu; Li, Yuan-Yuan
2010-12-01
With regards to well-replicated two-conditional microarray datasets, the selection of differentially expressed (DE) genes is a well-studied computational topic, but for multi-conditional microarray datasets with limited or no replication, the same task is not properly addressed by previous studies. This paper adopts multivariate outlier analysis to analyze replication-lacking multi-conditional microarray datasets, finding that it performs significantly better than the widely used limit fold change (LFC) model in a simulated comparative experiment. Compared with the LFC model, the multivariate outlier analysis also demonstrates improved stability against sample variations in a series of manipulated real expression datasets. The reanalysis of a real non-replicated multi-conditional expression dataset series leads to satisfactory results. In conclusion, a multivariate outlier analysis algorithm, like DigOut, is particularly useful for selecting DE genes from non-replicated multi-conditional gene expression dataset.
Authentication of Trappist beers by LC-MS fingerprints and multivariate data analysis.
Mattarucchi, Elia; Stocchero, Matteo; Moreno-Rojas, José Manuel; Giordano, Giuseppe; Reniero, Fabiano; Guillou, Claude
2010-12-08
The aim of this study was to asses the applicability of LC-MS profiling to authenticate a selected Trappist beer as part of a program on traceability funded by the European Commission. A total of 232 beers were fingerprinted and classified through multivariate data analysis. The selected beer was clearly distinguished from beers of different brands, while only 3 samples (3.5% of the test set) were wrongly classified when compared with other types of beer of the same Trappist brewery. The fingerprints were further analyzed to extract the most discriminating variables, which proved to be sufficient for classification, even using a simplified unsupervised model. This reduced fingerprint allowed us to study the influence of batch-to-batch variability on the classification model. Our results can easily be applied to different matrices and they confirmed the effectiveness of LC-MS profiling in combination with multivariate data analysis for the characterization of food products.
ERIC Educational Resources Information Center
Brusco, Michael J.; Singh, Renu; Steinley, Douglas
2009-01-01
The selection of a subset of variables from a pool of candidates is an important problem in several areas of multivariate statistics. Within the context of principal component analysis (PCA), a number of authors have argued that subset selection is crucial for identifying those variables that are required for correct interpretation of the…
Yan, Zhengbing; Kuang, Te-Hui; Yao, Yuan
2017-09-01
In recent years, multivariate statistical monitoring of batch processes has become a popular research topic, wherein multivariate fault isolation is an important step aiming at the identification of the faulty variables contributing most to the detected process abnormality. Although contribution plots have been commonly used in statistical fault isolation, such methods suffer from the smearing effect between correlated variables. In particular, in batch process monitoring, the high autocorrelations and cross-correlations that exist in variable trajectories make the smearing effect unavoidable. To address such a problem, a variable selection-based fault isolation method is proposed in this research, which transforms the fault isolation problem into a variable selection problem in partial least squares discriminant analysis and solves it by calculating a sparse partial least squares model. As different from the traditional methods, the proposed method emphasizes the relative importance of each process variable. Such information may help process engineers in conducting root-cause diagnosis. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.
Cho, Hwui-Dong; Kim, Ki-Hun; Hwang, Shin; Ahn, Chul-Soo; Moon, Deok-Bog; Ha, Tae-Yong; Song, Gi-Won; Jung, Dong-Hwan; Park, Gil-Chun; Lee, Sung-Gyu
2018-02-01
To compare the outcomes of pure laparoscopic left hemihepatectomy (LLH) versus open left hemihepatectomy (OLH) for benign and malignant conditions using multivariate analysis. All consecutive cases of LLH and OLH between October 2007 and December 2013 in a tertiary referral hospital were enrolled in this retrospective cohort study. All surgical procedures were performed by one surgeon. The LLH and OLH groups were compared in terms of patient demographics, preoperative data, clinical perioperative outcomes, and tumor characteristics in patients with malignancy. Multivariate analysis of the prognostic factors associated with severe complications was then performed. The LLH group (n = 62) had a significantly shorter postoperative hospital stay than the OLH group (n = 118) (9.53 ± 3.30 vs 14.88 ± 11.36 days, p < 0.001). Multivariate analysis revealed that the OLH group had >4 times the risk of the LLH group in terms of developing severe complications (Clavien-Dindo grade ≥III) (odds ratio 4.294, 95% confidence intervals 1.165-15.832, p = 0.029). LLH was a safe and feasible procedure for selected patients. LLH required shorter hospital stay and resulted in less operative blood loss. Multivariate analysis revealed that LLH was associated with a lower risk of severe complications compared to OLH. The authors suggest that LLH could be a reasonable treatment option for selected patients.
Griswold, Cortland K
2015-12-21
Epistatic gene action occurs when mutations or alleles interact to produce a phenotype. Theoretically and empirically it is of interest to know whether gene interactions can facilitate the evolution of diversity. In this paper, we explore how epistatic gene action affects the additive genetic component or heritable component of multivariate trait variation, as well as how epistatic gene action affects the evolvability of multivariate traits. The analysis involves a sexually reproducing and recombining population. Our results indicate that under stabilizing selection conditions a population with a mixed additive and epistatic genetic architecture can have greater multivariate additive genetic variation and evolvability than a population with a purely additive genetic architecture. That greater multivariate additive genetic variation can occur with epistasis is in contrast to previous theory that indicated univariate additive genetic variation is decreased with epistasis under stabilizing selection conditions. In a multivariate setting, epistasis leads to less relative covariance among individuals in their genotypic, as well as their breeding values, which facilitates the maintenance of additive genetic variation and increases a population׳s evolvability. Our analysis involves linking the combinatorial nature of epistatic genetic effects to the ancestral graph structure of a population to provide insight into the consequences of epistasis on multivariate trait variation and evolution. Copyright © 2015 Elsevier Ltd. All rights reserved.
Metric Selection for Evaluation of Human Supervisory Control Systems
2009-12-01
finding a significant effect when there is none becomes more likely. The inflation of type I error due to multiple dependent variables can be handled...with multivariate analysis techniques, such as Multivariate Analysis of Variance (MANOVA) (Johnson & Wichern, 2002). However, it should be noted that...the few significant differences among many insignificant ones. The best way to avoid failure to identify significant differences is to design an
NASA Astrophysics Data System (ADS)
Sheykhizadeh, Saheleh; Naseri, Abdolhossein
2018-04-01
Variable selection plays a key role in classification and multivariate calibration. Variable selection methods are aimed at choosing a set of variables, from a large pool of available predictors, relevant to the analyte concentrations estimation, or to achieve better classification results. Many variable selection techniques have now been introduced among which, those which are based on the methodologies of swarm intelligence optimization have been more respected during a few last decades since they are mainly inspired by nature. In this work, a simple and new variable selection algorithm is proposed according to the invasive weed optimization (IWO) concept. IWO is considered a bio-inspired metaheuristic mimicking the weeds ecological behavior in colonizing as well as finding an appropriate place for growth and reproduction; it has been shown to be very adaptive and powerful to environmental changes. In this paper, the first application of IWO, as a very simple and powerful method, to variable selection is reported using different experimental datasets including FTIR and NIR data, so as to undertake classification and multivariate calibration tasks. Accordingly, invasive weed optimization - linear discrimination analysis (IWO-LDA) and invasive weed optimization- partial least squares (IWO-PLS) are introduced for multivariate classification and calibration, respectively.
Sheykhizadeh, Saheleh; Naseri, Abdolhossein
2018-04-05
Variable selection plays a key role in classification and multivariate calibration. Variable selection methods are aimed at choosing a set of variables, from a large pool of available predictors, relevant to the analyte concentrations estimation, or to achieve better classification results. Many variable selection techniques have now been introduced among which, those which are based on the methodologies of swarm intelligence optimization have been more respected during a few last decades since they are mainly inspired by nature. In this work, a simple and new variable selection algorithm is proposed according to the invasive weed optimization (IWO) concept. IWO is considered a bio-inspired metaheuristic mimicking the weeds ecological behavior in colonizing as well as finding an appropriate place for growth and reproduction; it has been shown to be very adaptive and powerful to environmental changes. In this paper, the first application of IWO, as a very simple and powerful method, to variable selection is reported using different experimental datasets including FTIR and NIR data, so as to undertake classification and multivariate calibration tasks. Accordingly, invasive weed optimization - linear discrimination analysis (IWO-LDA) and invasive weed optimization- partial least squares (IWO-PLS) are introduced for multivariate classification and calibration, respectively. Copyright © 2018 Elsevier B.V. All rights reserved.
Settle, Steven; Vickery, Lillian; Nemirovskiy, Olga; Vidmar, Tom; Bendele, Alison; Messing, Dean; Ruminski, Peter; Schnute, Mark; Sunyer, Teresa
2010-10-01
To demonstrate that the novel highly selective matrix metalloproteinase 13 (MMP-13) inhibitor PF152 reduces joint lesions in adult dogs with osteoarthritis (OA) and decreases biomarkers of cartilage degradation. The potency and selectivity of PF152 were evaluated in vitro using 16 MMPs, TACE, and ADAMTS-4 and ADAMTS-5, as well as ex vivo in human cartilage explants. In vivo effects were evaluated at 3 concentrations in mature beagles with partial medial meniscectomy. Gross and histologic changes in the femorotibial joints were evaluated using various measures of cartilage degeneration. Biomarkers of cartilage turnover were examined in serum, urine, or synovial fluid. Results were analyzed individually and in combination using multivariate analysis. The potent and selective MMP-13 inhibitor PF152 decreased human cartilage degradation ex vivo in a dose-dependent manner. PF152 treatment of dogs with OA reduced cartilage lesions and decreased biomarkers of type II collagen (type II collagen neoepitope) and aggrecan (peptides ending in ARGN or AGEG) degradation. The dose required for significant inhibition varied with the measure used, but multivariate analysis of 6 gross and histologic measures indicated that all doses differed significantly from vehicle but not from each other. Combined analysis of cartilage degradation markers showed similar results. This highly selective MMP-13 inhibitor exhibits chondroprotective effects in mature animals. Biomarkers of cartilage degradation, when evaluated in combination, parallel the joint structural changes induced by the MMP-13 inhibitor. These data support the potential therapeutic value of selective MMP-13 inhibitors and the use of a set of appropriate biomarkers to predict efficacy in OA clinical trials.
Fast classification of hazelnut cultivars through portable infrared spectroscopy and chemometrics
NASA Astrophysics Data System (ADS)
Manfredi, Marcello; Robotti, Elisa; Quasso, Fabio; Mazzucco, Eleonora; Calabrese, Giorgio; Marengo, Emilio
2018-01-01
The authentication and traceability of hazelnuts is very important for both the consumer and the food industry, to safeguard the protected varieties and the food quality. This study investigates the use of a portable FTIR spectrometer coupled to multivariate statistical analysis for the classification of raw hazelnuts. The method discriminates hazelnuts from different origins/cultivars based on differences of the signal intensities of their IR spectra. The multivariate classification methods, namely principal component analysis (PCA) followed by linear discriminant analysis (LDA) and partial least square discriminant analysis (PLS-DA), with or without variable selection, allowed a very good discrimination among the groups, with PLS-DA coupled to variable selection providing the best results. Due to the fast analysis, high sensitivity, simplicity and no sample preparation, the proposed analytical methodology could be successfully used to verify the cultivar of hazelnuts, and the analysis can be performed quickly and directly on site.
Prat, Chantal; Besalú, Emili; Bañeras, Lluís; Anticó, Enriqueta
2011-06-15
The volatile fraction of aqueous cork macerates of tainted and non-tainted agglomerate cork stoppers was analysed by headspace solid-phase microextraction (HS-SPME)/gas chromatography. Twenty compounds containing terpenoids, aliphatic alcohols, lignin-related compounds and others were selected and analysed in individual corks. Cork stoppers were previously classified in six different classes according to sensory descriptions including, 2,4,6-trichloroanisole taint and other frequent, non-characteristic odours found in cork. A multivariate analysis of the chromatographic data of 20 selected chemical compounds using linear discriminant analysis models helped in the differentiation of the a priori made groups. The discriminant model selected five compounds as the best combination. Selected compounds appear in the model in the following order; 2,4,6 TCA, fenchyl alcohol, 1-octen-3-ol, benzyl alcohol and benzothiazole. Unfortunately, not all six a priori differentiated sensory classes were clearly discriminated in the model, probably indicating that no measurable differences exist in the chromatographic data for some categories. The predictive analyses of a refined model in which two sensory classes were fused together resulted in a good classification. Prediction rates of control (non-tainted), TCA, musty-earthy-vegetative, vegetative and chemical descriptions were 100%, 100%, 85%, 67.3% and 100%, respectively, when the modified model was used. The multivariate analysis of chromatographic data will help in the classification of stoppers and provide a perfect complement to sensory analyses. Copyright © 2010 Elsevier Ltd. All rights reserved.
Application of multivariable statistical techniques in plant-wide WWTP control strategies analysis.
Flores, X; Comas, J; Roda, I R; Jiménez, L; Gernaey, K V
2007-01-01
The main objective of this paper is to present the application of selected multivariable statistical techniques in plant-wide wastewater treatment plant (WWTP) control strategies analysis. In this study, cluster analysis (CA), principal component analysis/factor analysis (PCA/FA) and discriminant analysis (DA) are applied to the evaluation matrix data set obtained by simulation of several control strategies applied to the plant-wide IWA Benchmark Simulation Model No 2 (BSM2). These techniques allow i) to determine natural groups or clusters of control strategies with a similar behaviour, ii) to find and interpret hidden, complex and casual relation features in the data set and iii) to identify important discriminant variables within the groups found by the cluster analysis. This study illustrates the usefulness of multivariable statistical techniques for both analysis and interpretation of the complex multicriteria data sets and allows an improved use of information for effective evaluation of control strategies.
Yokoyama, Kazuhiko; Itoman, Moritoshi; Uchino, Masataka; Fukushima, Kensuke; Nitta, Hiroshi; Kojima, Yoshiaki
2008-10-01
The purpose of this study was to evaluate contributing factors affecting deep infection and fracture healing of open tibia fractures treated with locked intramedullary nailing (IMN) by multivariate analysis. We examined 99 open tibial fractures (98 patients) treated with immediate or delayed locked IMN in static fashion from 1991 to 2002. Multivariate analyses following univariate analyses were derived to determine predictors of deep infection, nonunion, and healing time to union. The following predictive variables of deep infection were selected for analysis: age, sex, Gustilo type, fracture grade by AO type, fracture location, timing or method of IMN, reamed or unreamed nailing, debridement time (< or =6 h or >6 h), method of soft-tissue management, skin closure time (< or =1 week or >1 week), existence of polytrauma (ISS< 18 or ISS> or =18), existence of floating knee injury, and existence of superficial/pin site infection. The predictive variables of nonunion selected for analysis was the same as those for deep infection, with the addition of deep infection for exchange of pin site infection. The predictive variables of union time selected for analysis was the same as those for nonunion, excluding of location, debridement time, and existence of floating knee and superficial infection. Six (6.1%; type II Gustilo n=1, type IIIB Gustilo n=5) of the 99 open tibial fractures developed deep infections. Multivariate analysis revealed that timing or method of IMN, debridement time, method of soft-tissue management, and existence of superficial or pin site infection significantly correlated with the occurrence of deep infection (P< 0.0001). In the immediate nailing group alone, the deep infection rate in type IIIB + IIIC was significantly higher than those in type I + II and IIIA (P = 0.016). Nonunion occurred in 17 fractures (20.3%, 17/84). Multivariate analysis revealed that Gustilo type, skin closure time, and existence of deep infection significantly correlated with occurrence of nonunion (P < 0.05). Gustilo type and existence of deep infection were significantly correlated with healing time to union on multivariate analysis (r(2) = 0.263, P = 0.0001). Multivariate analyses for open tibial fractures treated with IMN showed that IMN after EF (especially in existence of pin site infection) was at high risk of deep infection, and that debridement within 6 h and appropriate soft-tissue managements were also important factor in preventing deep infections. These analyses postulated that both the Gustilo type and the existence of deep infection is related with fracture healing in open fractures treated with IMN. In addition, immediate IMN for type IIIB and IIIC is potentially risky, and canal reaming did not increase the risk of complication for open tibial fractures treated with IMN.
Multivariate Profiles of Selected versus Non-Selected Elite Youth Brazilian Soccer Players
Alves, Isabella S.; Padilha, Maickel B.; Casanova, Filipe; Puggina, Enrico F.; Maia, José
2017-01-01
Abstract This study determined whether a multivariate profile more effectively discriminated selected than non-selected elite youth Brazilian soccer players. This examination was carried out on 66 youth soccer players (selected, n = 28, mean age 16.3 ± 0.1; non-selected, n = 38, mean age 16.7 ± 0.4) using objective instruments. Multivariate profiles were assessed through anthropometric characteristics, biological maturation, tactical-technical skills, and motor performance. The Student’s t-test identified that selected players exhibited significantly higher values for height (t = 2.331, p = 0.02), lean body mass (t = 2.441, p = 0.01), and maturity offset (t = 4.559, p < 0.001), as well as performed better in declarative tactical knowledge (t = 10.484, p < 0.001), shooting (t = 2.188, p = 0.03), dribbling (t = 5.914, p < 0.001), speed – 30 m (t = 8.304, p < 0.001), countermovement jump (t = 2.718, p = 0.008), and peak power tests (t = 2.454, p = 0.01). Forward stepwise discriminant function analysis showed that declarative tactical knowledge, running speed –30 m, maturity offset, dribbling, height, and peak power correctly classified 97% of the selected players. These findings may have implications for a highly efficient selection process with objective measures of youth players in soccer clubs. PMID:29339991
Multivariate meta-analysis using individual participant data
Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.
2016-01-01
When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment–covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. PMID:26099484
Teodoro, P E; Rodrigues, E V; Peixoto, L A; Silva, L A; Laviola, B G; Bhering, L L
2017-03-22
Jatropha is research target worldwide aimed at large-scale oil production for biodiesel and bio-kerosene. Its production potential is among 1200 and 1500 kg/ha of oil after the 4th year. This study aimed to estimate combining ability of Jatropha genotypes by multivariate diallel analysis to select parents and crosses that allow gains in important agronomic traits. We performed crosses in diallel complete genetic design (3 x 3) arranged in blocks with five replications and three plants per plot. The following traits were evaluated: plant height, stem diameter, canopy projection between rows, canopy projection on the line, number of branches, mass of hundred grains, and grain yield. Data were submitted to univariate and multivariate diallel analysis. Genotypes 107 and 190 can be used in crosses for establishing a base population of Jatropha, since it has favorable alleles for increasing the mass of hundred grains and grain yield and reducing the plant height. The cross 190 x 107 is the most promising to perform the selection of superior genotypes for the simultaneous breeding of these traits.
Multivariable passive RFID vapor sensors: roll-to-roll fabrication on a flexible substrate.
Potyrailo, Radislav A; Burns, Andrew; Surman, Cheryl; Lee, D J; McGinniss, Edward
2012-06-21
We demonstrate roll-to-roll (R2R) fabrication of highly selective, battery-free radio frequency identification (RFID) sensors on a flexible polyethylene terephthalate (PET) polymeric substrate. Selectivity of our developed RFID sensors is provided by measurements of their resonance impedance spectra, followed by the multivariate analysis of spectral features, and correlation of these spectral features to the concentrations of vapors of interest. The multivariate analysis of spectral features also provides the ability for the rejection of ambient interferences. As a demonstration of our R2R fabrication process, we employed polyetherurethane (PEUT) as a "classic" sensing material, extruded this sensing material as 25, 75, and 125-μm thick films, and thermally laminated the films onto RFID inlays, rapidly producing approximately 5000 vapor sensors. We further tested these RFID vapor sensors for their response selectivity toward several model vapors such as toluene, acetone, and ethanol as well as water vapor as an abundant interferent. Our RFID sensing concept features 16-bit resolution provided by the sensor reader, granting a highly desired independence from costly proprietary RFID memory chips with a low-resolution analog input. Future steps are being planned for field-testing of these sensors in numerous conditions.
NASA Astrophysics Data System (ADS)
Ding, Hao; Cao, Ming; DuPont, Andrew W.; Scott, Larry D.; Guha, Sushovan; Singhal, Shashideep; Younes, Mamoun; Pence, Isaac; Herline, Alan; Schwartz, David; Xu, Hua; Mahadevan-Jansen, Anita; Bi, Xiaohong
2016-03-01
Inflammatory bowel disease (IBD) is an idiopathic disease that is typically characterized by chronic inflammation of the gastrointestinal tract. Recently much effort has been devoted to the development of novel diagnostic tools that can assist physicians for fast, accurate, and automated diagnosis of the disease. Previous research based on Raman spectroscopy has shown promising results in differentiating IBD patients from normal screening cases. In the current study, we examined IBD patients in vivo through a colonoscope-coupled Raman system. Optical diagnosis for IBD discrimination was conducted based on full-range spectra using multivariate statistical methods. Further, we incorporated several feature selection methods in machine learning into the classification model. The diagnostic performance for disease differentiation was significantly improved after feature selection. Our results showed that improved IBD diagnosis can be achieved using Raman spectroscopy in combination with multivariate analysis and feature selection.
Multivariate meta-analysis using individual participant data.
Riley, R D; Price, M J; Jackson, D; Wardle, M; Gueyffier, F; Wang, J; Staessen, J A; White, I R
2015-06-01
When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment-covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. © 2014 The Authors. Research Synthesis Methods published by John Wiley & Sons, Ltd.
mESAdb: microRNA Expression and Sequence Analysis Database
Kaya, Koray D.; Karakülah, Gökhan; Yakıcıer, Cengiz M.; Acar, Aybar C.; Konu, Özlen
2011-01-01
microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data. PMID:21177657
mESAdb: microRNA expression and sequence analysis database.
Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen
2011-01-01
microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data.
Zhi, Ruicong; Zhao, Lei; Xie, Nan; Wang, Houyin; Shi, Bolin; Shi, Jingye
2016-01-13
A framework of establishing standard reference scale (texture) is proposed by multivariate statistical analysis according to instrumental measurement and sensory evaluation. Multivariate statistical analysis is conducted to rapidly select typical reference samples with characteristics of universality, representativeness, stability, substitutability, and traceability. The reasonableness of the framework method is verified by establishing standard reference scale of texture attribute (hardness) with Chinese well-known food. More than 100 food products in 16 categories were tested using instrumental measurement (TPA test), and the result was analyzed with clustering analysis, principal component analysis, relative standard deviation, and analysis of variance. As a result, nine kinds of foods were determined to construct the hardness standard reference scale. The results indicate that the regression coefficient between the estimated sensory value and the instrumentally measured value is significant (R(2) = 0.9765), which fits well with Stevens's theory. The research provides reliable a theoretical basis and practical guide for quantitative standard reference scale establishment on food texture characteristics.
Jiao, Shengwu; Guo, Yumin; Huettmann, Falk; Lei, Guangchun
2014-07-01
Avian nest-site selection is an important research and management subject. The hooded crane (Grus monacha) is a vulnerable (VU) species according to the IUCN Red List. Here, we present the first long-term Chinese legacy nest data for this species (1993-2010) with publicly available metadata. Further, we provide the first study that reports findings on multivariate nest habitat preference using such long-term field data for this species. Our work was carried out in Northeastern China, where we found and measured 24 nests and 81 randomly selected control plots and their environmental parameters in a vast landscape. We used machine learning (stochastic boosted regression trees) to quantify nest selection. Our analysis further included varclust (R Hmisc) and (TreenNet) to address statistical correlations and two-way interactions. We found that from an initial list of 14 measured field variables, water area (+), water depth (+) and shrub coverage (-) were the main explanatory variables that contributed to hooded crane nest-site selection. Agricultural sites played a smaller role in the selection of these nests. Our results are important for the conservation management of cranes all over East Asia and constitute a defensible and quantitative basis for predictive models.
Systematic wavelength selection for improved multivariate spectral analysis
Thomas, Edward V.; Robinson, Mark R.; Haaland, David M.
1995-01-01
Methods and apparatus for determining in a biological material one or more unknown values of at least one known characteristic (e.g. the concentration of an analyte such as glucose in blood or the concentration of one or more blood gas parameters) with a model based on a set of samples with known values of the known characteristics and a multivariate algorithm using several wavelength subsets. The method includes selecting multiple wavelength subsets, from the electromagnetic spectral region appropriate for determining the known characteristic, for use by an algorithm wherein the selection of wavelength subsets improves the model's fitness of the determination for the unknown values of the known characteristic. The selection process utilizes multivariate search methods that select both predictive and synergistic wavelengths within the range of wavelengths utilized. The fitness of the wavelength subsets is determined by the fitness function F=.function.(cost, performance). The method includes the steps of: (1) using one or more applications of a genetic algorithm to produce one or more count spectra, with multiple count spectra then combined to produce a combined count spectrum; (2) smoothing the count spectrum; (3) selecting a threshold count from a count spectrum to select these wavelength subsets which optimize the fitness function; and (4) eliminating a portion of the selected wavelength subsets. The determination of the unknown values can be made: (1) noninvasively and in vivo; (2) invasively and in vivo; or (3) in vitro.
Improving Cluster Analysis with Automatic Variable Selection Based on Trees
2014-12-01
regression trees Daisy DISsimilAritY PAM partitioning around medoids PMA penalized multivariate analysis SPC sparse principal components UPGMA unweighted...unweighted pair-group average method ( UPGMA ). This method measures dissimilarities between all objects in two clusters and takes the average value
Bello, Alessandra; Bianchi, Federica; Careri, Maria; Giannetto, Marco; Mori, Giovanni; Musci, Marilena
2007-11-05
A new NIR method based on multivariate calibration for determination of ethanol in industrially packed wholemeal bread was developed and validated. GC-FID was used as reference method for the determination of actual ethanol concentration of different samples of wholemeal bread with proper content of added ethanol, ranging from 0 to 3.5% (w/w). Stepwise discriminant analysis was carried out on the NIR dataset, in order to reduce the number of original variables by selecting those that were able to discriminate between the samples of different ethanol concentrations. With the so selected variables a multivariate calibration model was then obtained by multiple linear regression. The prediction power of the linear model was optimized by a new "leave one out" method, so that the number of original variables resulted further reduced.
Ponsoda, Vicente; Martínez, Kenia; Pineda-Pardo, José A; Abad, Francisco J; Olea, Julio; Román, Francisco J; Barbey, Aron K; Colom, Roberto
2017-02-01
Neuroimaging research involves analyses of huge amounts of biological data that might or might not be related with cognition. This relationship is usually approached using univariate methods, and, therefore, correction methods are mandatory for reducing false positives. Nevertheless, the probability of false negatives is also increased. Multivariate frameworks have been proposed for helping to alleviate this balance. Here we apply multivariate distance matrix regression for the simultaneous analysis of biological and cognitive data, namely, structural connections among 82 brain regions and several latent factors estimating cognitive performance. We tested whether cognitive differences predict distances among individuals regarding their connectivity pattern. Beginning with 3,321 connections among regions, the 36 edges better predicted by the individuals' cognitive scores were selected. Cognitive scores were related to connectivity distances in both the full (3,321) and reduced (36) connectivity patterns. The selected edges connect regions distributed across the entire brain and the network defined by these edges supports high-order cognitive processes such as (a) (fluid) executive control, (b) (crystallized) recognition, learning, and language processing, and (c) visuospatial processing. This multivariate study suggests that one widespread, but limited number, of regions in the human brain, supports high-level cognitive ability differences. Hum Brain Mapp 38:803-816, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Multivariate analysis of longitudinal rates of change.
Bryan, Matthew; Heagerty, Patrick J
2016-12-10
Longitudinal data allow direct comparison of the change in patient outcomes associated with treatment or exposure. Frequently, several longitudinal measures are collected that either reflect a common underlying health status, or characterize processes that are influenced in a similar way by covariates such as exposure or demographic characteristics. Statistical methods that can combine multivariate response variables into common measures of covariate effects have been proposed in the literature. Current methods for characterizing the relationship between covariates and the rate of change in multivariate outcomes are limited to select models. For example, 'accelerated time' methods have been developed which assume that covariates rescale time in longitudinal models for disease progression. In this manuscript, we detail an alternative multivariate model formulation that directly structures longitudinal rates of change and that permits a common covariate effect across multiple outcomes. We detail maximum likelihood estimation for a multivariate longitudinal mixed model. We show via asymptotic calculations the potential gain in power that may be achieved with a common analysis of multiple outcomes. We apply the proposed methods to the analysis of a trivariate outcome for infant growth and compare rates of change for HIV infected and uninfected infants. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Constrained evolution of the sex comb in Drosophila simulans.
Maraqa, M S; Griffin, R; Sharma, M D; Wilson, A J; Hunt, J; Hosken, D J; House, C M
2017-02-01
Male fitness is dependent on sexual traits that influence mate acquisition (precopulatory sexual selection) and paternity (post-copulatory sexual selection), and although many studies have documented the form of selection in one or the other of these arenas, fewer have done it for both. Nonetheless, it appears that the dominant form of sexual selection is directional, although theoretically, populations should converge on peaks in the fitness surface, where selection is stabilizing. Many factors, however, can prevent populations from reaching adaptive peaks. Genetic constraints can be important if they prevent the development of highest fitness phenotypes, as can the direction of selection if it reverses across episodes of selection. In this study, we examine the evidence that these processes influence the evolution of the multivariate sex comb morphology of male Drosophila simulans. To do this, we conduct a quantitative genetic study together with a multivariate selection analysis to infer how the genetic architecture and selection interact. We find abundant genetic variance and covariance in elements of the sex comb. However, there was little evidence for directional selection in either arena. Significant nonlinear selection was detected prior to copulation when males were mated to nonvirgin females, and post-copulation during sperm offence (again with males mated to nonvirgins). Thus, contrary to our predictions, the evolution of the D. simulans sex comb is limited neither by genetic constraints nor by antagonistic selection between pre- and post-copulatory arenas, but nonlinear selection on the multivariate phenotype may prevent sex combs from evolving to reach some fitness maximizing optima. © 2016 The Authors. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.
Descriptor selection for banana accessions based on univariate and multivariate analysis.
Brandão, L P; Souza, C P F; Pereira, V M; Silva, S O; Santos-Serejo, J A; Ledo, C A S; Amorim, E P
2013-05-14
Our objective was to establish a minimum number of morphological descriptors for the characterization of banana germplasm and evaluate the efficiency of removal of redundant characters, based on univariate and multivariate statistical analyses. Phenotypic characterization was made of 77 accessions from Bahia, Brazil, using 92 descriptors. The selection of the descriptors was carried out by principal components analysis (quantitative) and by entropy (multi-category). Efficiency of elimination was analyzed by a comparative study between the clusters formed, taking into consideration all 92 descriptors and smaller groups. The selected descriptors were analyzed with the Ward-MLM procedure and a combined matrix formed by the Gower algorithm. We were able to reduce the number of descriptors used for characterizing the banana germplasm (42%). The correlation between the matrices considering the 92 descriptors and the selected ones was 0.82, showing that the reduction in the number of descriptors did not influence estimation of genetic variability between the banana accessions. We conclude that removing these descriptors caused no loss of information, considering the groups formed from pre-established criteria, including subgroup/subspecies.
Genome-Wide Association Analysis of Adaptation Using Environmentally Predicted Traits.
van Heerwaarden, Joost; van Zanten, Martijn; Kruijer, Willem
2015-10-01
Current methods for studying the genetic basis of adaptation evaluate genetic associations with ecologically relevant traits or single environmental variables, under the implicit assumption that natural selection imposes correlations between phenotypes, environments and genotypes. In practice, observed trait and environmental data are manifestations of unknown selective forces and are only indirectly associated with adaptive genetic variation. In theory, improved estimation of these forces could enable more powerful detection of loci under selection. Here we present an approach in which we approximate adaptive variation by modeling phenotypes as a function of the environment and using the predicted trait in multivariate and univariate genome-wide association analysis (GWAS). Based on computer simulations and published flowering time data from the model plant Arabidopsis thaliana, we find that environmentally predicted traits lead to higher recovery of functional loci in multivariate GWAS and are more strongly correlated to allele frequencies at adaptive loci than individual environmental variables. Our results provide an example of the use of environmental data to obtain independent and meaningful information on adaptive genetic variation.
An Analysis of Methods Used to Examine Gender Differences in Computer-Related Behavior.
ERIC Educational Resources Information Center
Kay, Robin
1992-01-01
Review of research investigating gender differences in computer-related behavior examines statistical and methodological flaws. Issues addressed include sample selection, sample size, scale development, scale quality, the use of univariate and multivariate analyses, regressional analysis, construct definition, construct testing, and the…
A Note on Asymptotic Joint Distribution of the Eigenvalues of a Noncentral Multivariate F Matrix.
1984-11-01
Krishnaiah (1982). Now, let us consider the samples drawn from the k multivariate normal popuiejons. Let (Xlt....Xpt) denote the mean vector of the t...to maltivariate problems. Sankh-ya, 4, 381-39(s. (71 KRISHNAIAH , P. R. (1982). Selection of variables in discrimlnant analysis. In Handbook of...Statistics, Volume 2 (P. R. Krishnaiah , editor), 805-820. North-Holland Publishing Company. 6. Unclassifie INSTRUCTIONS REPORT DOCUMENTATION PAGE
Input-output oriented computation algorithms for the control of large flexible structures
NASA Technical Reports Server (NTRS)
Minto, K. D.
1989-01-01
An overview is given of work in progress aimed at developing computational algorithms addressing two important aspects in the control of large flexible space structures; namely, the selection and placement of sensors and actuators, and the resulting multivariable control law design problem. The issue of sensor/actuator set selection is particularly crucial to obtaining a satisfactory control design, as clearly a poor choice will inherently limit the degree to which good control can be achieved. With regard to control law design, the researchers are driven by concerns stemming from the practical issues associated with eventual implementation of multivariable control laws, such as reliability, limit protection, multimode operation, sampling rate selection, processor throughput, etc. Naturally, the burden imposed by dealing with these aspects of the problem can be reduced by ensuring that the complexity of the compensator is minimized. Our approach to these problems is based on extensions to input/output oriented techniques that have proven useful in the design of multivariable control systems for aircraft engines. In particular, researchers are exploring the use of relative gain analysis and the condition number as a means of quantifying the process of sensor/actuator selection and placement for shape control of a large space platform.
Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G.; Shah, Arvind K.; Lin, Jianxin
2013-01-01
In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data (IPD) in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the Deviance Information Criterion (DIC) is used to select the best transformation model. Since the model is quite complex, a novel Monte Carlo Markov chain (MCMC) sampling scheme is developed to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol lowering drugs where the goal is to jointly model the three dimensional response consisting of Low Density Lipoprotein Cholesterol (LDL-C), High Density Lipoprotein Cholesterol (HDL-C), and Triglycerides (TG) (LDL-C, HDL-C, TG). Since the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately: however, a multivariate approach would be more appropriate since these variables are correlated with each other. A detailed analysis of these data is carried out using the proposed methodology. PMID:23580436
Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G; Shah, Arvind K; Lin, Jianxin
2013-10-15
In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the deviance information criterion is used to select the best transformation model. Because the model is quite complex, we develop a novel Monte Carlo Markov chain sampling scheme to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol-lowering drugs where the goal is to jointly model the three-dimensional response consisting of low density lipoprotein cholesterol (LDL-C), high density lipoprotein cholesterol (HDL-C), and triglycerides (TG) (LDL-C, HDL-C, TG). Because the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately; however, a multivariate approach would be more appropriate because these variables are correlated with each other. We carry out a detailed analysis of these data by using the proposed methodology. Copyright © 2013 John Wiley & Sons, Ltd.
Diversity pattern in Sesamum mutants selected for a semi-arid cropping system.
Murty, B R; Oropeza, F
1989-02-01
Due to the complex requirements of moisture stress, substantial genetic diversity with a wide array of character combinations and effective simultaneous selection for several variables is necessary for improving the productivity and adaptation of a component crop in order for it to fit into a cropping system under semi-arid tropical conditions. Sesamum indicum L. is grown in Venezuela after rice/sorghum/or maize under such conditions. A mutation breeding program was undertaken using six locally adapted varieties to develop genotypes suitable for the above system. The diversity pattern for nine variables was assessed by multivariate analysis in 301 M4 progenies. Analysis of the characteristic roots and principal components in three methods of selection, i.e., M2 bulks (A), individual plant selection throughout (B), and selection in M3 for single variable (C), revealed differences in the pattern of variation between varieties, selection methods, and varieties x methods interactions. Method B was superior to the others and gave 17 of the 21 best M5 progenies. 'Piritu' and 'CF' varieties yielded the most productive progenies in M5 and M6. Diversity was large and selection was effective for such developmental traits as earliness and synchrony, combined with multiple disease resistance, which could be related to their importance by multivariate analyses. Considerable differences in the variety of character combinations among the high yielding. M5 progenies of 'CF' and 'Piritu' suggested possible further yield improvement. The superior response of 'Piritu' and 'CF' over other varieties in yield and adaptation was due to major changes in plant type and character associations. Multilocation testing of M5 generations revealed that the mutant progenies had a 40%-100% yield superiority over the parents; this was combined with earliness, synchrony, and multiple disease resistance, and was confirmed in the M6 generation grown on a commercial scale. This study showed that multivariate analysis is an effective tool for assessing diversity patterns, choice of appropriate variety, and selection methodology in order to make rapid progress in meeting the complex requirements of semi-arid cropping systems.
An Interinstitutional Analysis of Faculty Teaching Load.
ERIC Educational Resources Information Center
Ahrens, Stephen W.
A two-year interinstitutional study among 15 cooperating universities was conducted to determine whether significant differences exist in teaching loads among the selected universities as measured by student credit hours produced by full-time equivalent faculty. The statistical model was a multivariate analysis of variance with fixed effects and…
2015-12-01
of squares more difficult. There are, however, solutions to these issues. A weighted mean can be used in place of the grand mean1 and the STATA ...multivariate test of means provided in STATA 12.1. This test checks whether or not population variances and covariances of both DVs (PALT and CPARS
2015-12-01
however, solutions to these issues. A weighted mean can be used in place of the grand mean1 and the STATA software automatically handles the assignment of...covariance matrices between groups (i.e., sphericity) using the multivariate test of means provided in STATA 12.1. This test checks whether or not
Multivariate Analysis of Longitudinal Rates of Change
Bryan, Matthew; Heagerty, Patrick J.
2016-01-01
Longitudinal data allow direct comparison of the change in patient outcomes associated with treatment or exposure. Frequently, several longitudinal measures are collected that either reflect a common underlying health status, or characterize processes that are influenced in a similar way by covariates such as exposure or demographic characteristics. Statistical methods that can combine multivariate response variables into common measures of covariate effects have been proposed by Roy and Lin [1]; Proust-Lima, Letenneur and Jacqmin-Gadda [2]; and Gray and Brookmeyer [3] among others. Current methods for characterizing the relationship between covariates and the rate of change in multivariate outcomes are limited to select models. For example, Gray and Brookmeyer [3] introduce an “accelerated time” method which assumes that covariates rescale time in longitudinal models for disease progression. In this manuscript we detail an alternative multivariate model formulation that directly structures longitudinal rates of change, and that permits a common covariate effect across multiple outcomes. We detail maximum likelihood estimation for a multivariate longitudinal mixed model. We show via asymptotic calculations the potential gain in power that may be achieved with a common analysis of multiple outcomes. We apply the proposed methods to the analysis of a trivariate outcome for infant growth and compare rates of change for HIV infected and uninfected infants. PMID:27417129
Multi-country health surveys: are the analyses misleading?
Masood, Mohd; Reidpath, Daniel D
2014-05-01
The aim of this paper was to review the types of approaches currently utilized in the analysis of multi-country survey data, specifically focusing on design and modeling issues with a focus on analyses of significant multi-country surveys published in 2010. A systematic search strategy was used to identify the 10 multi-country surveys and the articles published from them in 2010. The surveys were selected to reflect diverse topics and foci; and provide an insight into analytic approaches across research themes. The search identified 159 articles appropriate for full text review and data extraction. The analyses adopted in the multi-country surveys can be broadly classified as: univariate/bivariate analyses, and multivariate/multivariable analyses. Multivariate/multivariable analyses may be further divided into design- and model-based analyses. Of the 159 articles reviewed, 129 articles used model-based analysis, 30 articles used design-based analyses. Similar patterns could be seen in all the individual surveys. While there is general agreement among survey statisticians that complex surveys are most appropriately analyzed using design-based analyses, most researchers continued to use the more common model-based approaches. Recent developments in design-based multi-level analysis may be one approach to include all the survey design characteristics. This is a relatively new area, however, and there remains statistical, as well as applied analytic research required. An important limitation of this study relates to the selection of the surveys used and the choice of year for the analysis, i.e., year 2010 only. There is, however, no strong reason to believe that analytic strategies have changed radically in the past few years, and 2010 provides a credible snapshot of current practice.
Rosswog, Carolina; Schmidt, Rene; Oberthuer, André; Juraeva, Dilafruz; Brors, Benedikt; Engesser, Anne; Kahlert, Yvonne; Volland, Ruth; Bartenhagen, Christoph; Simon, Thorsten; Berthold, Frank; Hero, Barbara; Faldum, Andreas; Fischer, Matthias
2017-12-01
Current risk stratification systems for neuroblastoma patients consider clinical, histopathological, and genetic variables, and additional prognostic markers have been proposed in recent years. We here sought to select highly informative covariates in a multistep strategy based on consecutive Cox regression models, resulting in a risk score that integrates hazard ratios of prognostic variables. A cohort of 695 neuroblastoma patients was divided into a discovery set (n=75) for multigene predictor generation, a training set (n=411) for risk score development, and a validation set (n=209). Relevant prognostic variables were identified by stepwise multivariable L1-penalized least absolute shrinkage and selection operator (LASSO) Cox regression, followed by backward selection in multivariable Cox regression, and then integrated into a novel risk score. The variables stage, age, MYCN status, and two multigene predictors, NB-th24 and NB-th44, were selected as independent prognostic markers by LASSO Cox regression analysis. Following backward selection, only the multigene predictors were retained in the final model. Integration of these classifiers in a risk scoring system distinguished three patient subgroups that differed substantially in their outcome. The scoring system discriminated patients with diverging outcome in the validation cohort (5-year event-free survival, 84.9±3.4 vs 63.6±14.5 vs 31.0±5.4; P<.001), and its prognostic value was validated by multivariable analysis. We here propose a translational strategy for developing risk assessment systems based on hazard ratios of relevant prognostic variables. Our final neuroblastoma risk score comprised two multigene predictors only, supporting the notion that molecular properties of the tumor cells strongly impact clinical courses of neuroblastoma patients. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Multivariate analysis of early and late nest sites of Abert's Towhees
Deborah M. Finch
1985-01-01
Seasonal variation in nest site selection by the Abert's towhee (Pipilo aberti) was studied in honey mesquite (Prosopis glandulosa) habitat along the lower Colorado River from March to July, 1981. Stepwise discriminant function analysis identified nest vegetation type, nest direction, and nest height as the three most important variables that characterized the...
ERIC Educational Resources Information Center
Allegrante, John P.; And Others
The purpose of this study was to identify and analyze interaction effects of selected psychosocial variables on the development of subsequent smoking behavior among youth who had originally identified themselves on a survey as never having smoked. The subjects were seventh grade students who had participated in a total of three surveys over a two…
Lepre, Jorge; Rice, J Jeremy; Tu, Yuhai; Stolovitzky, Gustavo
2004-05-01
Despite the growing literature devoted to finding differentially expressed genes in assays probing different tissues types, little attention has been paid to the combinatorial nature of feature selection inherent to large, high-dimensional gene expression datasets. New flexible data analysis approaches capable of searching relevant subgroups of genes and experiments are needed to understand multivariate associations of gene expression patterns with observed phenotypes. We present in detail a deterministic algorithm to discover patterns of multivariate gene associations in gene expression data. The patterns discovered are differential with respect to a control dataset. The algorithm is exhaustive and efficient, reporting all existent patterns that fit a given input parameter set while avoiding enumeration of the entire pattern space. The value of the pattern discovery approach is demonstrated by finding a set of genes that differentiate between two types of lymphoma. Moreover, these genes are found to behave consistently in an independent dataset produced in a different laboratory using different arrays, thus validating the genes selected using our algorithm. We show that the genes deemed significant in terms of their multivariate statistics will be missed using other methods. Our set of pattern discovery algorithms including a user interface is distributed as a package called Genes@Work. This package is freely available to non-commercial users and can be downloaded from our website (http://www.research.ibm.com/FunGen).
Genome-Wide Association Analysis of Adaptation Using Environmentally Predicted Traits
van Zanten, Martijn
2015-01-01
Current methods for studying the genetic basis of adaptation evaluate genetic associations with ecologically relevant traits or single environmental variables, under the implicit assumption that natural selection imposes correlations between phenotypes, environments and genotypes. In practice, observed trait and environmental data are manifestations of unknown selective forces and are only indirectly associated with adaptive genetic variation. In theory, improved estimation of these forces could enable more powerful detection of loci under selection. Here we present an approach in which we approximate adaptive variation by modeling phenotypes as a function of the environment and using the predicted trait in multivariate and univariate genome-wide association analysis (GWAS). Based on computer simulations and published flowering time data from the model plant Arabidopsis thaliana, we find that environmentally predicted traits lead to higher recovery of functional loci in multivariate GWAS and are more strongly correlated to allele frequencies at adaptive loci than individual environmental variables. Our results provide an example of the use of environmental data to obtain independent and meaningful information on adaptive genetic variation. PMID:26496492
Mercuri, A; Pagliari, M; Baxevanis, F; Fares, R; Fotaki, N
2017-02-25
In this study the selection of in vivo predictive in vitro dissolution experimental set-ups using a multivariate analysis approach, in line with the Quality by Design (QbD) principles, is explored. The dissolution variables selected using a design of experiments (DoE) were the dissolution apparatus [USP1 apparatus (basket) and USP2 apparatus (paddle)], the rotational speed of the basket/or paddle, the operator conditions (dissolution apparatus brand and operator), the volume, the pH, and the ethanol content of the dissolution medium. The dissolution profiles of two nifedipine capsules (poorly soluble compound), under conditions mimicking the intake of the capsules with i. water, ii. orange juice and iii. an alcoholic drink (orange juice and ethanol) were analysed using multiple linear regression (MLR). Optimised dissolution set-ups, generated based on the mathematical model obtained via MLR, were used to build predicted in vitro-in vivo correlations (IVIVC). IVIVC could be achieved using physiologically relevant in vitro conditions mimicking the intake of the capsules with an alcoholic drink (orange juice and ethanol). The multivariate analysis revealed that the concentration of ethanol used in the in vitro dissolution experiments (47% v/v) can be lowered to less than 20% v/v, reflecting recently found physiological conditions. Copyright © 2016 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Hunt, Dennis; And Others
Sixty-four 8-year-old children were divided into fast and slow learner groups and trained on a tactile simultaneous discrimination task. Selective attention was measured in terms of percentage contact time per trial to the relevant dimension. Inter- and intracouplings per trial were also recorded. A multivariate analysis was carried out to examine…
Peikert, Tobias; Duan, Fenghai; Rajagopalan, Srinivasan; Karwoski, Ronald A; Clay, Ryan; Robb, Richard A; Qin, Ziling; Sicks, JoRean; Bartholmai, Brian J; Maldonado, Fabien
2018-01-01
Optimization of the clinical management of screen-detected lung nodules is needed to avoid unnecessary diagnostic interventions. Herein we demonstrate the potential value of a novel radiomics-based approach for the classification of screen-detected indeterminate nodules. Independent quantitative variables assessing various radiologic nodule features such as sphericity, flatness, elongation, spiculation, lobulation and curvature were developed from the NLST dataset using 726 indeterminate nodules (all ≥ 7 mm, benign, n = 318 and malignant, n = 408). Multivariate analysis was performed using least absolute shrinkage and selection operator (LASSO) method for variable selection and regularization in order to enhance the prediction accuracy and interpretability of the multivariate model. The bootstrapping method was then applied for the internal validation and the optimism-corrected AUC was reported for the final model. Eight of the originally considered 57 quantitative radiologic features were selected by LASSO multivariate modeling. These 8 features include variables capturing Location: vertical location (Offset carina centroid z), Size: volume estimate (Minimum enclosing brick), Shape: flatness, Density: texture analysis (Score Indicative of Lesion/Lung Aggression/Abnormality (SILA) texture), and surface characteristics: surface complexity (Maximum shape index and Average shape index), and estimates of surface curvature (Average positive mean curvature and Minimum mean curvature), all with P<0.01. The optimism-corrected AUC for these 8 features is 0.939. Our novel radiomic LDCT-based approach for indeterminate screen-detected nodule characterization appears extremely promising however independent external validation is needed.
Macpherson, Ignacio; Roqué-Sánchez, María V; Legget Bn, Finola O; Fuertes, Ferran; Segarra, Ignacio
2016-10-01
personalised support provided to women by health professionals is one of the prime factors attaining women's satisfaction during pregnancy and childbirth. However the multifactorial nature of 'satisfaction' makes difficult to assess it. Statistical multivariate analysis may be an effective technique to obtain in depth quantitative evidence of the importance of this factor and its interaction with the other factors involved. This technique allows us to estimate the importance of overall satisfaction in its context and suggest actions for healthcare services. systematic review of studies that quantitatively measure the personal relationship between women and healthcare professionals (gynecologists, obstetricians, nurse, midwifes, etc.) regarding maternity care satisfaction. The literature search focused on studies carried out between 1970 and 2014 that used multivariate analyses and included the woman-caregiver relationship as a factor of their analysis. twenty-four studies which applied various multivariate analysis tools to different periods of maternity care (antenatal, perinatal, post partum) were selected. The studies included discrete scale scores and questionnaires from women with low-risk pregnancies. The "personal relationship" factor appeared under various names: care received, personalised treatment, professional support, amongst others. The most common multivariate techniques used to assess the percentage of variance explained and the odds ratio of each factor were principal component analysis and logistic regression. the data, variables and factor analysis suggest that continuous, personalised care provided by the usual midwife and delivered within a family or a specialised setting, generates the highest level of satisfaction. In addition, these factors foster the woman's psychological and physiological recovery, often surpassing clinical action (e.g. medicalization and hospital organization) and/or physiological determinants (e.g. pain, pathologies, etc.). Copyright © 2016 Elsevier Ltd. All rights reserved.
Sexual selection on female ornaments in the sex-role-reversed Gulf pipefish (Syngnathus scovelli).
Flanagan, S P; Johnson, J B; Rose, E; Jones, A G
2014-11-01
Understanding how selection acts on traits individually and in combination is an important step in deciphering the mechanisms driving evolutionary change, but for most species, and especially those in which sexual selection acts more strongly on females than on males, we have no estimates of selection coefficients pertaining to the multivariate sexually selected phenotype. Here, we use a laboratory-based mesocosm experiment to quantify pre- and post-mating selection on female secondary sexual traits in the Gulf pipefish (Syngnathus scovelli), a sexually dimorphic, sex-role-reversed species in which ornamented females compete for access to choosy males. We calculate selection differentials and gradients on female traits, including ornament area, ornament number and body size for three episodes of selection related to female reproductive success (number of mates, number of eggs transferred and number of surviving embryos). Selection is strong on both ornament area and ornament size, and the majority of selection occurs during the premating episode of selection. Interestingly, selection on female body size, which has been detected in previous studies of Gulf pipefish, appears to be indirect, as evidenced by a multivariate analysis of selection gradients. Our results show that sexual selection favours either many bands or larger bands in female Gulf pipefish. © 2014 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.
Cox, R M; Costello, R A; Camber, B E; McGlothlin, J W
2017-07-01
Darwin viewed the ornamentation of females as an indirect consequence of sexual selection on males and the transmission of male phenotypes to females via the 'laws of inheritance'. Although a number of studies have supported this view by demonstrating substantial between-sex genetic covariance for ornament expression, the majority of this work has focused on avian plumage. Moreover, few studies have considered the genetic basis of ornaments from a multivariate perspective, which may be crucial for understanding the evolution of sex differences in general, and of complex ornaments in particular. Here, we provide a multivariate, quantitative-genetic analysis of a sexually dimorphic ornament that has figured prominently in studies of sexual selection: the brightly coloured dewlap of Anolis lizards. Using data from a paternal half-sibling breeding experiment in brown anoles (Anolis sagrei), we show that multiple aspects of dewlap size and colour exhibit significant heritability and a genetic variance-covariance structure (G) that is broadly similar in males (G m ) and females (G f ). Whereas sexually monomorphic aspects of the dewlap, such as hue, exhibit significant between-sex genetic correlations (r mf ), sexually dimorphic features, such as area and brightness, exhibit reduced r mf values that do not differ from zero. Using a modified random skewers analysis, we show that the between-sex genetic variance-covariance matrix (B) should not strongly constrain the independent responses of males and females to sexually antagonistic selection. Our microevolutionary analysis is in broad agreement with macroevolutionary perspectives indicating considerable scope for the independent evolution of coloration and ornamentation in males and females. © 2017 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2017 European Society For Evolutionary Biology.
Naccarato, Attilio; Furia, Emilia; Sindona, Giovanni; Tagarelli, Antonio
2016-09-01
Four class-modeling techniques (soft independent modeling of class analogy (SIMCA), unequal dispersed classes (UNEQ), potential functions (PF), and multivariate range modeling (MRM)) were applied to multielement distribution to build chemometric models able to authenticate chili pepper samples grown in Calabria respect to those grown outside of Calabria. The multivariate techniques were applied by considering both all the variables (32 elements, Al, As, Ba, Ca, Cd, Ce, Co, Cr, Cs, Cu, Dy, Fe, Ga, La, Li, Mg, Mn, Na, Nd, Ni, Pb, Pr, Rb, Sc, Se, Sr, Tl, Tm, V, Y, Yb, Zn) and variables selected by means of stepwise linear discriminant analysis (S-LDA). In the first case, satisfactory and comparable results in terms of CV efficiency are obtained with the use of SIMCA and MRM (82.3 and 83.2% respectively), whereas MRM performs better than SIMCA in terms of forced model efficiency (96.5%). The selection of variables by S-LDA permitted to build models characterized, in general, by a higher efficiency. MRM provided again the best results for CV efficiency (87.7% with an effective balance of sensitivity and specificity) as well as forced model efficiency (96.5%). Copyright © 2016 Elsevier Ltd. All rights reserved.
Rapid and Simultaneous Prediction of Eight Diesel Quality Parameters through ATR-FTIR Analysis.
Nespeca, Maurilio Gustavo; Hatanaka, Rafael Rodrigues; Flumignan, Danilo Luiz; de Oliveira, José Eduardo
2018-01-01
Quality assessment of diesel fuel is highly necessary for society, but the costs and time spent are very high while using standard methods. Therefore, this study aimed to develop an analytical method capable of simultaneously determining eight diesel quality parameters (density; flash point; total sulfur content; distillation temperatures at 10% (T10), 50% (T50), and 85% (T85) recovery; cetane index; and biodiesel content) through attenuated total reflection Fourier transform infrared (ATR-FTIR) spectroscopy and the multivariate regression method, partial least square (PLS). For this purpose, the quality parameters of 409 samples were determined using standard methods, and their spectra were acquired in ranges of 4000-650 cm -1 . The use of the multivariate filters, generalized least squares weighting (GLSW) and orthogonal signal correction (OSC), was evaluated to improve the signal-to-noise ratio of the models. Likewise, four variable selection approaches were tested: manual exclusion, forward interval PLS (FiPLS), backward interval PLS (BiPLS), and genetic algorithm (GA). The multivariate filters and variables selection algorithms generated more fitted and accurate PLS models. According to the validation, the FTIR/PLS models presented accuracy comparable to the reference methods and, therefore, the proposed method can be applied in the diesel routine monitoring to significantly reduce costs and analysis time.
Rapid and Simultaneous Prediction of Eight Diesel Quality Parameters through ATR-FTIR Analysis
Hatanaka, Rafael Rodrigues; Flumignan, Danilo Luiz; de Oliveira, José Eduardo
2018-01-01
Quality assessment of diesel fuel is highly necessary for society, but the costs and time spent are very high while using standard methods. Therefore, this study aimed to develop an analytical method capable of simultaneously determining eight diesel quality parameters (density; flash point; total sulfur content; distillation temperatures at 10% (T10), 50% (T50), and 85% (T85) recovery; cetane index; and biodiesel content) through attenuated total reflection Fourier transform infrared (ATR-FTIR) spectroscopy and the multivariate regression method, partial least square (PLS). For this purpose, the quality parameters of 409 samples were determined using standard methods, and their spectra were acquired in ranges of 4000–650 cm−1. The use of the multivariate filters, generalized least squares weighting (GLSW) and orthogonal signal correction (OSC), was evaluated to improve the signal-to-noise ratio of the models. Likewise, four variable selection approaches were tested: manual exclusion, forward interval PLS (FiPLS), backward interval PLS (BiPLS), and genetic algorithm (GA). The multivariate filters and variables selection algorithms generated more fitted and accurate PLS models. According to the validation, the FTIR/PLS models presented accuracy comparable to the reference methods and, therefore, the proposed method can be applied in the diesel routine monitoring to significantly reduce costs and analysis time. PMID:29629209
Gershman, Susan N.; Mitchell, Christopher; Sakaluk, Scott K.; Hunt, John
2012-01-01
Nuptial food gifts function to enhance male fertilization success, but their consumption is not always beneficial to females. In decorated crickets, the spermatophore transferred at mating includes a gelatinous mass, the spermatophylax, which is consumed by females after mating. However, females often discard spermatophylaxes shortly after mating, whereupon they terminate sperm transfer. We hypothesized that females discard gifts based on their assessment of the gift itself, and specifically the composition of free amino acids. We tested this hypothesis by comparing spermatophylaxes discarded by females after mating with those that were destined to be fully consumed, and employed multivariate selection analysis to quantify the strength and form of multivariate sexual selection operating on the free amino acid composition of gifts. The analysis yielded a saddle-shaped fitness surface with two local peaks. Different amino acid profiles appear to elicit continued feeding on the spermatophylax either because they offer the same level of gustatory appeal, or because they differentially affect both the gustatory appeal and texture of the spermatophylax. We conclude that the gustatory response of females to males' nuptial food gifts represents an important avenue of post-copulatory mate choice, imposing significant sexual selection on the free amino acid composition of the spermatophylax. PMID:22357263
Skill Assessment for Coupled Biological/Physical Models of Marine Systems
2009-01-01
cluster analysis e.g., Clark and Corley, 2006) and shown that the dimensions of the problem can be reduced and multivariate and univariate goodness...information; a follow-up analysis (Arhonditsis et al., 2006) reported no relationship between the level of skill assessment presented or the accuracy of the...uncertainty analysis (Beck, 1987), model selection (Kass and Raftery, 1995), model averaging (Hoeting et al., 1999), and scores for probabilistic
Ecological prediction with nonlinear multivariate time-frequency functional data models
Yang, Wen-Hsi; Wikle, Christopher K.; Holan, Scott H.; Wildhaber, Mark L.
2013-01-01
Time-frequency analysis has become a fundamental component of many scientific inquiries. Due to improvements in technology, the amount of high-frequency signals that are collected for ecological and other scientific processes is increasing at a dramatic rate. In order to facilitate the use of these data in ecological prediction, we introduce a class of nonlinear multivariate time-frequency functional models that can identify important features of each signal as well as the interaction of signals corresponding to the response variable of interest. Our methodology is of independent interest and utilizes stochastic search variable selection to improve model selection and performs model averaging to enhance prediction. We illustrate the effectiveness of our approach through simulation and by application to predicting spawning success of shovelnose sturgeon in the Lower Missouri River.
Performance of the disease risk score in a cohort study with policy-induced selection bias.
Tadrous, Mina; Mamdani, Muhammad M; Juurlink, David N; Krahn, Murray D; Lévesque, Linda E; Cadarette, Suzanne M
2015-11-01
To examine the performance of the disease risk score (DRS) in a cohort study with evidence of policy-induced selection bias. We examined two cohorts of new users of bisphosphonates. Estimates for 1-year hip fracture rates between agents using DRS, exposure propensity scores and traditional multivariable analysis were compared. The results for the cohort with no evidence of policy-induced selection bias showed little variation across analyses (-4.1-2.0%). Analysis of the cohort with evidence of policy-induced selection bias showed greater variation (-13.5-8.1%), with the greatest difference seen with DRS analyses. Our findings suggest that caution may be warranted when using DRS methods in cohort studies with policy-induced selection bias, further research is needed.
Natural selection. VII. History and interpretation of kin selection theory.
Frank, S A
2013-06-01
Kin selection theory is a kind of causal analysis. The initial form of kin selection ascribed cause to costs, benefits and genetic relatedness. The theory then slowly developed a deeper and more sophisticated approach to partitioning the causes of social evolution. Controversy followed because causal analysis inevitably attracts opposing views. It is always possible to separate total effects into different component causes. Alternative causal schemes emphasize different aspects of a problem, reflecting the distinct goals, interests and biases of different perspectives. For example, group selection is a particular causal scheme with certain advantages and significant limitations. Ultimately, to use kin selection theory to analyse natural patterns and to understand the history of debates over different approaches, one must follow the underlying history of causal analysis. This article describes the history of kin selection theory, with emphasis on how the causal perspective improved through the study of key patterns of natural history, such as dispersal and sex ratio, and through a unified approach to demographic and social processes. Independent historical developments in the multivariate analysis of quantitative traits merged with the causal analysis of social evolution by kin selection. © 2013 The Author. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.
ERIC Educational Resources Information Center
Brennan, Tim
1980-01-01
A review of prior classification systems of runaways is followed by a descriptive taxonomy of runaways developed using cluster-analytic methods. The empirical types illustrate patterns of weakness in bonds between runaways and families, schools, or peer relationships. (Author)
NASA Astrophysics Data System (ADS)
Braga, Jez Willian Batista; Trevizan, Lilian Cristina; Nunes, Lidiane Cristina; Rufini, Iolanda Aparecida; Santos, Dário, Jr.; Krug, Francisco José
2010-01-01
The application of laser induced breakdown spectrometry (LIBS) aiming the direct analysis of plant materials is a great challenge that still needs efforts for its development and validation. In this way, a series of experimental approaches has been carried out in order to show that LIBS can be used as an alternative method to wet acid digestions based methods for analysis of agricultural and environmental samples. The large amount of information provided by LIBS spectra for these complex samples increases the difficulties for selecting the most appropriated wavelengths for each analyte. Some applications have suggested that improvements in both accuracy and precision can be achieved by the application of multivariate calibration in LIBS data when compared to the univariate regression developed with line emission intensities. In the present work, the performance of univariate and multivariate calibration, based on partial least squares regression (PLSR), was compared for analysis of pellets of plant materials made from an appropriate mixture of cryogenically ground samples with cellulose as the binding agent. The development of a specific PLSR model for each analyte and the selection of spectral regions containing only lines of the analyte of interest were the best conditions for the analysis. In this particular application, these models showed a similar performance, but PLSR seemed to be more robust due to a lower occurrence of outliers in comparison to the univariate method. Data suggests that efforts dealing with sample presentation and fitness of standards for LIBS analysis must be done in order to fulfill the boundary conditions for matrix independent development and validation.
Vasconcelos, A G; Almeida, R M; Nobre, F F
2001-08-01
This paper introduces an approach that includes non-quantitative factors for the selection and assessment of multivariate complex models in health. A goodness-of-fit based methodology combined with fuzzy multi-criteria decision-making approach is proposed for model selection. Models were obtained using the Path Analysis (PA) methodology in order to explain the interrelationship between health determinants and the post-neonatal component of infant mortality in 59 municipalities of Brazil in the year 1991. Socioeconomic and demographic factors were used as exogenous variables, and environmental, health service and agglomeration as endogenous variables. Five PA models were developed and accepted by statistical criteria of goodness-of fit. These models were then submitted to a group of experts, seeking to characterize their preferences, according to predefined criteria that tried to evaluate model relevance and plausibility. Fuzzy set techniques were used to rank the alternative models according to the number of times a model was superior to ("dominated") the others. The best-ranked model explained above 90% of the endogenous variables variation, and showed the favorable influences of income and education levels on post-neonatal mortality. It also showed the unfavorable effect on mortality of fast population growth, through precarious dwelling conditions and decreased access to sanitation. It was possible to aggregate expert opinions in model evaluation. The proposed procedure for model selection allowed the inclusion of subjective information in a clear and systematic manner.
Schreier, Christina; Kremer, Werner; Huber, Fritz; Neumann, Sindy; Pagel, Philipp; Lienemann, Kai; Pestel, Sabine
2013-01-01
Introduction. Spectroscopic analysis of urine samples from laboratory animals can be used to predict the efficacy and side effects of drugs. This employs methods combining 1H NMR spectroscopy with quantification of biomarkers or with multivariate data analysis. The most critical steps in data evaluation are analytical reproducibility of NMR data (collection, storage, and processing) and the health status of the animals, which may influence urine pH and osmolarity. Methods. We treated rats with a solvent, a diuretic, or a nephrotoxicant and collected urine samples. Samples were titrated to pH 3 to 9, or salt concentrations increased up to 20-fold. The effects of storage conditions and freeze-thaw cycles were monitored. Selected metabolites and multivariate data analysis were evaluated after 1H NMR spectroscopy. Results. We showed that variation of pH from 3 to 9 and increases in osmolarity up to 6-fold had no effect on the quantification of the metabolites or on multivariate data analysis. Storage led to changes after 14 days at 4°C or after 12 months at −20°C, independent of sample composition. Multiple freeze-thaw cycles did not affect data analysis. Conclusion. Reproducibility of NMR measurements is not dependent on sample composition under physiological or pathological conditions. PMID:23865070
Schreier, Christina; Kremer, Werner; Huber, Fritz; Neumann, Sindy; Pagel, Philipp; Lienemann, Kai; Pestel, Sabine
2013-01-01
Spectroscopic analysis of urine samples from laboratory animals can be used to predict the efficacy and side effects of drugs. This employs methods combining (1)H NMR spectroscopy with quantification of biomarkers or with multivariate data analysis. The most critical steps in data evaluation are analytical reproducibility of NMR data (collection, storage, and processing) and the health status of the animals, which may influence urine pH and osmolarity. We treated rats with a solvent, a diuretic, or a nephrotoxicant and collected urine samples. Samples were titrated to pH 3 to 9, or salt concentrations increased up to 20-fold. The effects of storage conditions and freeze-thaw cycles were monitored. Selected metabolites and multivariate data analysis were evaluated after (1)H NMR spectroscopy. We showed that variation of pH from 3 to 9 and increases in osmolarity up to 6-fold had no effect on the quantification of the metabolites or on multivariate data analysis. Storage led to changes after 14 days at 4°C or after 12 months at -20°C, independent of sample composition. Multiple freeze-thaw cycles did not affect data analysis. Reproducibility of NMR measurements is not dependent on sample composition under physiological or pathological conditions.
Ahlinder, Linnea; Ekstrand-Hammarström, Barbro; Geladi, Paul; Österlund, Lars
2013-01-01
It is a challenging task to characterize the biodistribution of nanoparticles in cells and tissue on a subcellular level. Conventional methods to study the interaction of nanoparticles with living cells rely on labeling techniques that either selectively stain the particles or selectively tag them with tracer molecules. In this work, Raman imaging, a label-free technique that requires no extensive sample preparation, was combined with multivariate classification to quantify the spatial distribution of oxide nanoparticles inside living lung epithelial cells (A549). Cells were exposed to TiO2 (titania) and/or α-FeO(OH) (goethite) nanoparticles at various incubation times (4 or 48 h). Using multivariate classification of hyperspectral Raman data with partial least-squares discriminant analysis, we show that a surprisingly large fraction of spectra, classified as belonging to the cell nucleus, show Raman bands associated with nanoparticles. Up to 40% of spectra from the cell nucleus show Raman bands associated with nanoparticles. Complementary transmission electron microscopy data for thin cell sections qualitatively support the conclusions. PMID:23870252
Kia, Seyed Mostafa; Vega Pons, Sandro; Weisz, Nathan; Passerini, Andrea
2016-01-01
Brain decoding is a popular multivariate approach for hypothesis testing in neuroimaging. Linear classifiers are widely employed in the brain decoding paradigm to discriminate among experimental conditions. Then, the derived linear weights are visualized in the form of multivariate brain maps to further study spatio-temporal patterns of underlying neural activities. It is well known that the brain maps derived from weights of linear classifiers are hard to interpret because of high correlations between predictors, low signal to noise ratios, and the high dimensionality of neuroimaging data. Therefore, improving the interpretability of brain decoding approaches is of primary interest in many neuroimaging studies. Despite extensive studies of this type, at present, there is no formal definition for interpretability of multivariate brain maps. As a consequence, there is no quantitative measure for evaluating the interpretability of different brain decoding methods. In this paper, first, we present a theoretical definition of interpretability in brain decoding; we show that the interpretability of multivariate brain maps can be decomposed into their reproducibility and representativeness. Second, as an application of the proposed definition, we exemplify a heuristic for approximating the interpretability in multivariate analysis of evoked magnetoencephalography (MEG) responses. Third, we propose to combine the approximated interpretability and the generalization performance of the brain decoding into a new multi-objective criterion for model selection. Our results, for the simulated and real MEG data, show that optimizing the hyper-parameters of the regularized linear classifier based on the proposed criterion results in more informative multivariate brain maps. More importantly, the presented definition provides the theoretical background for quantitative evaluation of interpretability, and hence, facilitates the development of more effective brain decoding algorithms in the future.
Kia, Seyed Mostafa; Vega Pons, Sandro; Weisz, Nathan; Passerini, Andrea
2017-01-01
Brain decoding is a popular multivariate approach for hypothesis testing in neuroimaging. Linear classifiers are widely employed in the brain decoding paradigm to discriminate among experimental conditions. Then, the derived linear weights are visualized in the form of multivariate brain maps to further study spatio-temporal patterns of underlying neural activities. It is well known that the brain maps derived from weights of linear classifiers are hard to interpret because of high correlations between predictors, low signal to noise ratios, and the high dimensionality of neuroimaging data. Therefore, improving the interpretability of brain decoding approaches is of primary interest in many neuroimaging studies. Despite extensive studies of this type, at present, there is no formal definition for interpretability of multivariate brain maps. As a consequence, there is no quantitative measure for evaluating the interpretability of different brain decoding methods. In this paper, first, we present a theoretical definition of interpretability in brain decoding; we show that the interpretability of multivariate brain maps can be decomposed into their reproducibility and representativeness. Second, as an application of the proposed definition, we exemplify a heuristic for approximating the interpretability in multivariate analysis of evoked magnetoencephalography (MEG) responses. Third, we propose to combine the approximated interpretability and the generalization performance of the brain decoding into a new multi-objective criterion for model selection. Our results, for the simulated and real MEG data, show that optimizing the hyper-parameters of the regularized linear classifier based on the proposed criterion results in more informative multivariate brain maps. More importantly, the presented definition provides the theoretical background for quantitative evaluation of interpretability, and hence, facilitates the development of more effective brain decoding algorithms in the future. PMID:28167896
Falahati, Farshad; Westman, Eric; Simmons, Andrew
2014-01-01
Machine learning algorithms and multivariate data analysis methods have been widely utilized in the field of Alzheimer's disease (AD) research in recent years. Advances in medical imaging and medical image analysis have provided a means to generate and extract valuable neuroimaging information. Automatic classification techniques provide tools to analyze this information and observe inherent disease-related patterns in the data. In particular, these classifiers have been used to discriminate AD patients from healthy control subjects and to predict conversion from mild cognitive impairment to AD. In this paper, recent studies are reviewed that have used machine learning and multivariate analysis in the field of AD research. The main focus is on studies that used structural magnetic resonance imaging (MRI), but studies that included positron emission tomography and cerebrospinal fluid biomarkers in addition to MRI are also considered. A wide variety of materials and methods has been employed in different studies, resulting in a range of different outcomes. Influential factors such as classifiers, feature extraction algorithms, feature selection methods, validation approaches, and cohort properties are reviewed, as well as key MRI-based and multi-modal based studies. Current and future trends are discussed.
Classification of adulterated honeys by multivariate analysis.
Amiry, Saber; Esmaiili, Mohsen; Alizadeh, Mohammad
2017-06-01
In this research, honey samples were adulterated with date syrup (DS) and invert sugar syrup (IS) at three concentrations (7%, 15% and 30%). 102 adulterated samples were prepared in six batches with 17 replications for each batch. For each sample, 32 parameters including color indices, rheological, physical, and chemical parameters were determined. To classify the samples, based on type and concentrations of adulterant, a multivariate analysis was applied using principal component analysis (PCA) followed by a linear discriminant analysis (LDA). Then, 21 principal components (PCs) were selected in five sets. Approximately two-thirds were identified correctly using color indices (62.75%) or rheological properties (67.65%). A power discrimination was obtained using physical properties (97.06%), and the best separations were achieved using two sets of chemical properties (set 1: lactone, diastase activity, sucrose - 100%) (set 2: free acidity, HMF, ash - 95%). Copyright © 2016 Elsevier Ltd. All rights reserved.
United States Marine Corps Basic Reconnaissance Course: Predictors of Success
2017-03-01
PAGE INTENTIONALLY LEFT BLANK 81 VI. CONCLUSIONS AND RECOMMENDATIONS A. CONCLUSIONS The objective of my research is to provide quantitative ...percent over the last three years, illustrating there is room for improvement. This study conducts a quantitative and qualitative analysis of the...criteria used to select candidates for the BRC. The research uses multi-variate logistic regression models and survival analysis to determine to what
[Biases in the study of prognostic factors].
Delgado-Rodríguez, M
1999-01-01
The main objective is to detail the main biases in the study of prognostic factors. Confounding bias is illustrated with social class, a prognostic factor still discussed. Within selection bias several cases are commented: response bias, specially frequent when the patients of a clinical trial are used; the shortcomings in the formation of an inception cohort; the fallacy of Neyman (bias due to the duration of disease) when the study begins with a cross-sectional study; the selection bias in the treatment of survivors for the different treatment opportunity of those living longer; the bias due to the inclusion of heterogeneous diagnostic groups; and the selection bias due to differential information losses and the use of statistical multivariate procedures. Within the biases during follow-up, an empiric rule to value the impact of the number of losses is given. In information bias the Will Rogers' phenomenon and the usefulness of clinical databases are discussed. Lastly, a recommendation against the use of cutoff points yielded by bivariate analyses to select the variable to be included in multivariate analysis is given.
Comparison of plant cover of river valley fragments by using GIS tools and multivariate analysis
NASA Astrophysics Data System (ADS)
Waldon-Rudzionek, Barbara
2017-11-01
Selected landscape registers and results of ecological analyses of flora used in studies of transformations of anthropogenic plant cover and river valley landscapes were presented. The results were shown pursuant to a comparison of fragments of two adjacent valleys in north-western Poland.
NASA Astrophysics Data System (ADS)
Attia, Khalid A. M.; Nassar, Mohammed W. I.; El-Zeiny, Mohamed B.; Serag, Ahmed
2016-04-01
Three simple, specific, accurate and precise spectrophotometric methods were developed for the determination of cefprozil (CZ) in the presence of its alkaline induced degradation product (DCZ). The first method was the bivariate method, while the two other multivariate methods were partial least squares (PLS) and spectral residual augmented classical least squares (SRACLS). The multivariate methods were applied with and without variable selection procedure (genetic algorithm GA). These methods were tested by analyzing laboratory prepared mixtures of the above drug with its alkaline induced degradation product and they were applied to its commercial pharmaceutical products.
Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A; Schwartz, Joel; Coull, Brent A
2017-03-01
The analysis of multiple outcomes is becoming increasingly common in modern biomedical studies. It is well-known that joint statistical models for multiple outcomes are more flexible and more powerful than fitting a separate model for each outcome; they yield more powerful tests of exposure or treatment effects by taking into account the dependence among outcomes and pooling evidence across outcomes. It is, however, unlikely that all outcomes are related to the same subset of covariates. Therefore, there is interest in identifying exposures or treatments associated with particular outcomes, which we term outcome-specific variable selection. In this work, we propose a variable selection approach for multivariate normal responses that incorporates not only information on the mean model, but also information on the variance-covariance structure of the outcomes. The approach effectively leverages evidence from all correlated outcomes to estimate the effect of a particular covariate on a given outcome. To implement this strategy, we develop a Bayesian method that builds a multivariate prior for the variable selection indicators based on the variance-covariance of the outcomes. We show via simulation that the proposed variable selection strategy can boost power to detect subtle effects without increasing the probability of false discoveries. We apply the approach to the Normative Aging Study (NAS) epigenetic data and identify a subset of five genes in the asthma pathway for which gene-specific DNA methylations are associated with exposures to either black carbon, a marker of traffic pollution, or sulfate, a marker of particles generated by power plants. © 2016, The International Biometric Society.
Spelt, Lidewij; Sasor, Agata; Ansari, Daniel; Hilmersson, Katarzyna Said; Andersson, Roland
2018-01-01
To assess the expression of cancer stem cell (CSC) markers CD44, CD133 and CD24 in colon cancer liver metastases and analyse their predictive value for overall survival (OS) and disease-free survival (DFS) after liver resection. Patients operated on for colon cancer liver metastases were included. CSC marker expression was determined through immunohistochemistry analysis. OS and DFS were compared between marker-positive and marker-negative patients. Multivariate analysis was performed to select predictive variables for OS and DFS. CD133-positive patients had a worse DFS than CD133-negative patients, with a median DFS of 12 and 25 months (p=0.051). Multivariate analysis selected CD133 expression as a significant predictor for DFS. CD44 and CD24 were not found to predict OS or DFS. CD133 expression in colonic liver metastases is a negative prognostic factor for DFS after liver resection. In the future, CD133 could be used as a biomarker for risk stratification, and possibly for developing novel targeted therapy. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
NASA Astrophysics Data System (ADS)
Jia, Xiaoliang; An, Haizhong; Sun, Xiaoqi; Huang, Xuan; Gao, Xiangyun
2016-04-01
The globalization and regionalization of crude oil trade inevitably give rise to the difference of crude oil prices. The understanding of the pattern of the crude oil prices' mutual propagation is essential for analyzing the development of global oil trade. Previous research has focused mainly on the fuzzy long- or short-term one-to-one propagation of bivariate oil prices, generally ignoring various patterns of periodical multivariate propagation. This study presents a wavelet-based network approach to help uncover the multipath propagation of multivariable crude oil prices in a joint time-frequency period. The weekly oil spot prices of the OPEC member states from June 1999 to March 2011 are adopted as the sample data. First, we used wavelet analysis to find different subseries based on an optimal decomposing scale to describe the periodical feature of the original oil price time series. Second, a complex network model was constructed based on an optimal threshold selection to describe the structural feature of multivariable oil prices. Third, Bayesian network analysis (BNA) was conducted to find the probability causal relationship based on periodical structural features to describe the various patterns of periodical multivariable propagation. Finally, the significance of the leading and intermediary oil prices is discussed. These findings are beneficial for the implementation of periodical target-oriented pricing policies and investment strategies.
Ting, Hui-Min; Chang, Liyun; Huang, Yu-Jie; Wu, Jia-Ming; Wang, Hung-Yu; Horng, Mong-Fong; Chang, Chun-Ming; Lan, Jen-Hong; Huang, Ya-Yu; Fang, Fu-Min; Leung, Stephen Wan
2014-01-01
Purpose The aim of this study was to develop a multivariate logistic regression model with least absolute shrinkage and selection operator (LASSO) to make valid predictions about the incidence of moderate-to-severe patient-rated xerostomia among head and neck cancer (HNC) patients treated with IMRT. Methods and Materials Quality of life questionnaire datasets from 206 patients with HNC were analyzed. The European Organization for Research and Treatment of Cancer QLQ-H&N35 and QLQ-C30 questionnaires were used as the endpoint evaluation. The primary endpoint (grade 3+ xerostomia) was defined as moderate-to-severe xerostomia at 3 (XER3m) and 12 months (XER12m) after the completion of IMRT. Normal tissue complication probability (NTCP) models were developed. The optimal and suboptimal numbers of prognostic factors for a multivariate logistic regression model were determined using the LASSO with bootstrapping technique. Statistical analysis was performed using the scaled Brier score, Nagelkerke R2, chi-squared test, Omnibus, Hosmer-Lemeshow test, and the AUC. Results Eight prognostic factors were selected by LASSO for the 3-month time point: Dmean-c, Dmean-i, age, financial status, T stage, AJCC stage, smoking, and education. Nine prognostic factors were selected for the 12-month time point: Dmean-i, education, Dmean-c, smoking, T stage, baseline xerostomia, alcohol abuse, family history, and node classification. In the selection of the suboptimal number of prognostic factors by LASSO, three suboptimal prognostic factors were fine-tuned by Hosmer-Lemeshow test and AUC, i.e., Dmean-c, Dmean-i, and age for the 3-month time point. Five suboptimal prognostic factors were also selected for the 12-month time point, i.e., Dmean-i, education, Dmean-c, smoking, and T stage. The overall performance for both time points of the NTCP model in terms of scaled Brier score, Omnibus, and Nagelkerke R2 was satisfactory and corresponded well with the expected values. Conclusions Multivariate NTCP models with LASSO can be used to predict patient-rated xerostomia after IMRT. PMID:24586971
Lee, Tsair-Fwu; Chao, Pei-Ju; Ting, Hui-Min; Chang, Liyun; Huang, Yu-Jie; Wu, Jia-Ming; Wang, Hung-Yu; Horng, Mong-Fong; Chang, Chun-Ming; Lan, Jen-Hong; Huang, Ya-Yu; Fang, Fu-Min; Leung, Stephen Wan
2014-01-01
The aim of this study was to develop a multivariate logistic regression model with least absolute shrinkage and selection operator (LASSO) to make valid predictions about the incidence of moderate-to-severe patient-rated xerostomia among head and neck cancer (HNC) patients treated with IMRT. Quality of life questionnaire datasets from 206 patients with HNC were analyzed. The European Organization for Research and Treatment of Cancer QLQ-H&N35 and QLQ-C30 questionnaires were used as the endpoint evaluation. The primary endpoint (grade 3(+) xerostomia) was defined as moderate-to-severe xerostomia at 3 (XER3m) and 12 months (XER12m) after the completion of IMRT. Normal tissue complication probability (NTCP) models were developed. The optimal and suboptimal numbers of prognostic factors for a multivariate logistic regression model were determined using the LASSO with bootstrapping technique. Statistical analysis was performed using the scaled Brier score, Nagelkerke R(2), chi-squared test, Omnibus, Hosmer-Lemeshow test, and the AUC. Eight prognostic factors were selected by LASSO for the 3-month time point: Dmean-c, Dmean-i, age, financial status, T stage, AJCC stage, smoking, and education. Nine prognostic factors were selected for the 12-month time point: Dmean-i, education, Dmean-c, smoking, T stage, baseline xerostomia, alcohol abuse, family history, and node classification. In the selection of the suboptimal number of prognostic factors by LASSO, three suboptimal prognostic factors were fine-tuned by Hosmer-Lemeshow test and AUC, i.e., Dmean-c, Dmean-i, and age for the 3-month time point. Five suboptimal prognostic factors were also selected for the 12-month time point, i.e., Dmean-i, education, Dmean-c, smoking, and T stage. The overall performance for both time points of the NTCP model in terms of scaled Brier score, Omnibus, and Nagelkerke R(2) was satisfactory and corresponded well with the expected values. Multivariate NTCP models with LASSO can be used to predict patient-rated xerostomia after IMRT.
NASA Astrophysics Data System (ADS)
Bressan, Lucas P.; do Nascimento, Paulo Cícero; Schmidt, Marcella E. P.; Faccin, Henrique; de Machado, Leandro Carvalho; Bohrer, Denise
2017-02-01
A novel method was developed to determine low molecular weight polycyclic aromatic hydrocarbons in aqueous leachates from soils and sediments using a salting-out assisted liquid-liquid extraction, synchronous fluorescence spectrometry and a multivariate calibration technique. Several experimental parameters were controlled and the optimum conditions were: sodium carbonate as the salting-out agent at concentration of 2 mol L- 1, 3 mL of acetonitrile as extraction solvent, 6 mL of aqueous leachate, vortexing for 5 min and centrifuging at 4000 rpm for 5 min. The partial least squares calibration was optimized to the lowest values of root mean squared error and five latent variables were chosen for each of the targeted compounds. The regression coefficients for the true versus predicted concentrations were higher than 0.99. Figures of merit for the multivariate method were calculated, namely sensitivity, multivariate detection limit and multivariate quantification limit. The selectivity was also evaluated and other polycyclic aromatic hydrocarbons did not interfere in the analysis. Likewise, high performance liquid chromatography was used as a comparative methodology, and the regression analysis between the methods showed no statistical difference (t-test). The proposed methodology was applied to soils and sediments of a Brazilian river and the recoveries ranged from 74.3% to 105.8%. Overall, the proposed methodology was suitable for the targeted compounds, showing that the extraction method can be applied to spectrofluorometric analysis and that the multivariate calibration is also suitable for these compounds in leachates from real samples.
Shi, Wenhao; Zhang, Silin; Zhao, Wanqiu; Xia, Xue; Wang, Min; Wang, Hui; Bai, Haiyan; Shi, Juanzi
2013-07-01
What factors does multivariate logistic regression show to be significantly associated with the likelihood of clinical pregnancy in vitrified-warmed embryo transfer (VET) cycles? Assisted hatching (AH) and if the reason to freeze embryos was to avoid the risk of ovarian hyperstimulation syndrome (OHSS) were significantly positively associated with a greater likelihood of clinical pregnancy. Single factor analysis has shown AH, number of embryos transferred and the reason of freezing for OHSS to be positively and damaged blastomere to be negatively significantly associated with the chance of clinical pregnancy after VET. It remains unclear what factors would be significant after multivariate analysis. The study was a retrospective analysis of 2313 VET cycles from 1481 patients performed between January 2008 and April 2012. A multivariate logistic regression analysis was performed to identify the factors to affect clinical pregnancy outcome of VET. There were 22 candidate variables selected based on clinical experiences and the literature. With the thresholds of α entry = α removal= 0.05 for both variable entry and variable removal, eight variables were chosen to contribute the multivariable model by the bootstrap stepwise variable selection algorithm (n = 1000). Eight variables were age at controlled ovarian hyperstimulation (COH), reason for freezing, AH, endometrial thickness, damaged blastomere, number of embryos transferred, number of good-quality embryos, and blood presence on transfer catheter. A descriptive comparison of the relative importance was accomplished by the proportion of explained variation (PEV). Among the reasons for freezing, the OHSS group showed a higher OR than the surplus embryo group when compared with other reasons for VET groups (OHSS versus Other, OR: 2.145; CI: 1.4-3.286; Surplus embryos versus Other, OR: 1.152; CI: 0.761-1.743) and high PEV (marginal 2.77%, P = 0.2911; partial 1.68%; CI of area under receptor operator characteristic curve (ROC): 0.5576-0.6000). AH also showed a high OR (OR: 2.105, CI: 1.554-2.85) and high PEV (marginal 1.97%; partial 1.02%; CI of area under ROC: 0.5344-0.5647). The number of good-quality embryos showed the highest marginal PEV and partial PEV (marginal 3.91%, partial 2.28%; CI of area under ROC: 0.5886-0.6343). This was a retrospective multivariate analysis of the data obtained in 5 years from a single IVF center. Repeated cycles in the same woman were treated as independent observations, which could introduce bias. Results are based on clinical pregnancy and not live births. Prospective analysis of a larger data set from a multicenter study based on live births is necessary to confirm the findings. Paying attention to the quality of embryos, the number of good embryos, AH and the reasons for freezing that are associated with clinical pregnancy after VET will assist the improvement of success rates.
Duffy, Sonia A; Ronis, David L; McLean, Scott; Fowler, Karen E; Gruber, Stephen B; Wolf, Gregory T; Terrell, Jeffrey E
2009-04-20
Our prior work has shown that the health behaviors of head and neck cancer patients are interrelated and are associated with quality of life; however, other than smoking, the relationship between health behaviors and survival is unclear. A prospective cohort study was conducted to determine the relationship between five pretreatment health behaviors (smoking, alcohol, diet, physical activity, and sleep) and all-cause survival among 504 head and neck cancer patients. Smoking status was the strongest predictor of survival, with both current smokers (hazard ratio [HR] = 2.4; 95% CI, 1.3 to 4.4) and former smokers (HR = 2.0; 95% CI, 1.2 to 3.5) showing significant associations with poor survival. Problem drinking was associated with survival in the univariate analysis (HR = 1.4; 95% CI, 1.0 to 2.0) but lost significance when controlling for other factors. Low fruit intake was negatively associated with survival in the univariate analysis only (HR = 1.6; 95% CI, 1.1 to 2.1), whereas vegetable intake was not significant in either univariate or multivariate analyses. Although physical activity was associated with survival in the univariate analysis (HR = 0.95; 95% CI, 0.93 to 0.97), it was not significant in the multivariate model. Sleep was not significantly associated with survival in either univariate or multivariate analysis. Control variables that were also independently associated with survival in the multivariate analysis were age, education, tumor site, cancer stage, and surgical treatment. Variation in selected pretreatment health behaviors (eg, smoking, fruit intake, and physical activity) in this population is associated with variation in survival.
USDA-ARS?s Scientific Manuscript database
Plants are attacked by pathogens representing diverse taxonomic groups, such that genes providing multiple disease resistance (MDR) would likely be under positive selection pressure. We examined the novel proposition that naturally occurring allelic variants may confer MDR. To do so, we applied a ...
USDA-ARS?s Scientific Manuscript database
SNP effects estimated in genomic selection programs allow for the prediction of direct genomic values (DGV) both at genome-wide and chromosomal level. As a consequence, genome-wide (G_GW) or chromosomal (G_CHR) correlation matrices between genomic predictions for different traits can be calculated. ...
Perceived constraints to art museum attendance
Jinhee Jun; Gerard Kyle; Joseph T. O' Leary
2007-01-01
We explored selected socio-demographic factors that influence the perception of constraints to art museum attendance among a sample of interested individuals who were currently not enjoying art museum visitation. Data from the Survey of Public Participation in the Arts (SPPA), a nationwide survey were used for this study. Using multivariate analysis of variance, we...
Farmers as Consumers of Agricultural Education Services: Willingness to Pay and Spend Time
ERIC Educational Resources Information Center
Charatsari, Chrysanthi; Papadaki-Klavdianou, Afroditi; Michailidis, Anastasios
2011-01-01
This study assessed farmers' willingness to pay for and spend time attending an Agricultural Educational Program (AEP). Primary data on the demographic and socio-economic variables of farmers were collected from 355 farmers selected randomly from Northern Greece. Descriptive statistics and multivariate analysis methods were used in order to meet…
Prediction of Gestational Diabetes through NMR Metabolomics of Maternal Blood.
Pinto, Joana; Almeida, Lara M; Martins, Ana S; Duarte, Daniela; Barros, António S; Galhano, Eulália; Pita, Cristina; Almeida, Maria do Céu; Carreira, Isabel M; Gil, Ana M
2015-06-05
Metabolic biomarkers of pre- and postdiagnosis gestational diabetes mellitus (GDM) were sought, using nuclear magnetic resonance (NMR) metabolomics of maternal plasma and corresponding lipid extracts. Metabolite differences between controls and disease were identified through multivariate analysis of variable selected (1)H NMR spectra. For postdiagnosis GDM, partial least squares regression identified metabolites with higher dependence on normal gestational age evolution. Variable selection of NMR spectra produced good classification models for both pre- and postdiagnostic GDM. Prediagnosis GDM was accompanied by cholesterol increase and minor increases in lipoproteins (plasma), fatty acids, and triglycerides (extracts). Small metabolite changes comprised variations in glucose (up regulated), amino acids, betaine, urea, creatine, and metabolites related to gut microflora. Most changes were enhanced upon GDM diagnosis, in addition to newly observed changes in low-Mw compounds. GDM prediction seems possible exploiting multivariate profile changes rather than a set of univariate changes. Postdiagnosis GDM is successfully classified using a 26-resonance plasma biomarker. Plasma and extracts display comparable classification performance, the former enabling direct and more rapid analysis. Results and putative biochemical hypotheses require further confirmation in larger cohorts of distinct ethnicities.
Comparative multivariate analysis of biometric traits of West African Dwarf and Red Sokoto goats.
Yakubu, Abdulmojeed; Salako, Adebowale E; Imumorin, Ikhide G
2011-03-01
The population structure of 302 randomly selected West African Dwarf (WAD) and Red Sokoto (RS) goats was examined using multivariate morphometric analyses. This was to make the case for conservation, rational management and genetic improvement of these two most important Nigerian goat breeds. Fifteen morphometric measurements were made on each individual animal. RS goats were superior (P<0.05) to the WAD for the body size and skeletal proportions investigated. The phenotypic variability between the two breeds was revealed by their mutual responses in the principal components. While four principal components were extracted for WAD goats, three components were obtained for their RS counterparts with variation in the loading traits of each component for each breed. The Mahalanobis distance of 72.28 indicated a high degree of spatial racial separation in morphology between the genotypes. The Ward's option of the cluster analysis consolidated the morphometric distinctness of the two breeds. Application of selective breeding to genetic improvement would benefit from the detected phenotypic differentiation. Other implications for management and conservation of the goats are highlighted.
Application of multivariate statistical techniques in microbial ecology
Paliy, O.; Shankar, V.
2016-01-01
Recent advances in high-throughput methods of molecular analyses have led to an explosion of studies generating large scale ecological datasets. Especially noticeable effect has been attained in the field of microbial ecology, where new experimental approaches provided in-depth assessments of the composition, functions, and dynamic changes of complex microbial communities. Because even a single high-throughput experiment produces large amounts of data, powerful statistical techniques of multivariate analysis are well suited to analyze and interpret these datasets. Many different multivariate techniques are available, and often it is not clear which method should be applied to a particular dataset. In this review we describe and compare the most widely used multivariate statistical techniques including exploratory, interpretive, and discriminatory procedures. We consider several important limitations and assumptions of these methods, and we present examples of how these approaches have been utilized in recent studies to provide insight into the ecology of the microbial world. Finally, we offer suggestions for the selection of appropriate methods based on the research question and dataset structure. PMID:26786791
Influence factors and forecast of carbon emission in China: structure adjustment for emission peak
NASA Astrophysics Data System (ADS)
Wang, B.; Cui, C. Q.; Li, Z. P.
2018-02-01
This paper introduced Principal Component Analysis and Multivariate Linear Regression Model to verify long-term balance relationships between Carbon Emissions and the impact factors. The integrated model of improved PCA and multivariate regression analysis model is attainable to figure out the pattern of carbon emission sources. Main empirical results indicate that among all selected variables, the role of energy consumption scale was largest. GDP and Population follow and also have significant impacts on carbon emission. Industrialization rate and fossil fuel proportion, which is the indicator of reflecting the economic structure and energy structure, have a higher importance than the factor of urbanization rate and the dweller consumption level of urban areas. In this way, some suggestions are put forward for government to achieve the peak of carbon emissions.
Longobardi, F; Ventrella, A; Bianco, A; Catucci, L; Cafagna, I; Gallo, V; Mastrorilli, P; Agostiano, A
2013-12-01
In this study, non-targeted (1)H NMR fingerprinting was used in combination with multivariate statistical techniques for the classification of Italian sweet cherries based on their different geographical origins (Emilia Romagna and Puglia). As classification techniques, Soft Independent Modelling of Class Analogy (SIMCA), Partial Least Squares Discriminant Analysis (PLS-DA), and Linear Discriminant Analysis (LDA) were carried out and the results were compared. For LDA, before performing a refined selection of the number/combination of variables, two different strategies for a preliminary reduction of the variable number were tested. The best average recognition and CV prediction abilities (both 100.0%) were obtained for all the LDA models, although PLS-DA also showed remarkable performances (94.6%). All the statistical models were validated by observing the prediction abilities with respect to an external set of cherry samples. The best result (94.9%) was obtained with LDA by performing a best subset selection procedure on a set of 30 principal components previously selected by a stepwise decorrelation. The metabolites that mostly contributed to the classification performances of such LDA model, were found to be malate, glucose, fructose, glutamine and succinate. Copyright © 2013 Elsevier Ltd. All rights reserved.
Bhatt, Chet R; Alfarraj, Bader; Ghany, Charles T; Yueh, Fang Y; Singh, Jagdish P
2017-04-01
In this study, the laser-induced breakdown spectroscopy (LIBS) technique was used to identify and compare the presence of major nutrient elements in organic and conventional vegetables. Different parts of cauliflowers and broccolis were used as working samples. Laser-induced breakdown spectra from these samples were acquired at optimum values of laser energy, gate delay, and gate width. Both univariate and multivariate analyses were performed for the comparison of these organic and conventional vegetable flowers. Principal component analysis (PCA) was taken into account for multivariate analysis while for univariate analysis, the intensity of selected atomic lines of different elements and their intensity ratio with some reference lines of organic cauliflower and broccoli samples were compared with those of conventional ones. In addition, different parts of the cauliflower and broccoli were compared in terms of intensity and intensity ratio of elemental lines.
Amirian, Mohammad-Elyas; Fazilat-Pour, Masoud
2016-08-01
The present study examined simple and multivariate relationships of spiritual intelligence with general health and happiness. The employed method was descriptive and correlational. King's Spiritual Quotient scales, GHQ-28 and Oxford Happiness Inventory, are filled out by a sample consisted of 384 students, which were selected using stratified random sampling from the students of Shahid Bahonar University of Kerman. Data are subjected to descriptive and inferential statistics including correlations and multivariate regressions. Bivariate correlations support positive and significant predictive value of spiritual intelligence toward general health and happiness. Further analysis showed that among the Spiritual Intelligence' subscales, Existential Critical Thinking Predicted General Health and Happiness, reversely. In addition, happiness was positively predicted by generation of personal meaning and transcendental awareness. The findings are discussed in line with the previous studies and the relevant theoretical background.
Hamchevici, Carmen; Udrea, Ion
2013-11-01
The concept of basin-wide Joint Danube Survey (JDS) was launched by the International Commission for the Protection of the Danube River (ICPDR) as a tool for investigative monitoring under the Water Framework Directive (WFD), with a frequency of 6 years. The first JDS was carried out in 2001 and its success in providing key information for characterisation of the Danube River Basin District as required by WFD lead to the organisation of the second JDS in 2007, which was the world's biggest river research expedition in that year. The present paper presents an approach for improving the survey strategy for the next planned survey JDS3 (2013) by means of several multivariate statistical techniques. In order to design the optimum structure in terms of parameters and sampling sites, principal component analysis (PCA), factor analysis (FA) and cluster analysis were applied on JDS2 data for 13 selected physico-chemical and one biological element measured in 78 sampling sites located on the main course of the Danube. Results from PCA/FA showed that most of the dataset variance (above 75%) was explained by five varifactors loaded with 8 out of 14 variables: physical (transparency and total suspended solids), relevant nutrients (N-nitrates and P-orthophosphates), feedback effects of primary production (pH, alkalinity and dissolved oxygen) and algal biomass. Taking into account the representation of the factor scores given by FA versus sampling sites and the major groups generated by the clustering procedure, the spatial network of the next survey could be carefully tailored, leading to a decreasing of sampling sites by more than 30%. The approach of target oriented sampling strategy based on the selected multivariate statistics can provide a strong reduction in dimensionality of the original data and corresponding costs as well, without any loss of information.
Fazeli, Bahare; Ravari, Hassan; Assadi, Reza
2012-08-01
The aim of this study was first to describe the natural history of Buerger's disease (BD) and then to discuss a clinical approach to this disease based on multivariate analysis. One hundred eight patients who corresponded with Shionoya's criteria were selected from 2000 to 2007 for this study. Major amputation was considered the ultimate adverse event. Survival analyses were performed by Kaplan-Meier curves. Independent variables including gender, duration of smoking, number of cigarettes smoked per day, minor amputation events and type of treatments, were determined by multivariate Cox regression analysis. The recorded data demonstrated that BD may present in four forms, including relapsing-remitting (75%), secondary progressive (4.6%), primary progressive (14.2%) and benign BD (6.2%). Most of the amputations occurred due to relapses within the six years after diagnosis of BD. In multivariate analysis, duration of smoking of more than 20 years had a significant relationship with further major amputation among patients with BD. Smoking cessation programs with experienced psychotherapists are strongly recommended for those areas in which Buerger's disease is common. Patients who have smoked for more than 20 years should be encouraged to quit smoking, but should also be recommended for more advanced treatment for limb salvage.
On Some Multiple Decision Problems
1976-08-01
parameter space. Some recent results in the area of subset selection formulation are Gnanadesikan and Gupta [28], Gupta and Studden [43], Gupta and...York, pp. 363-376. [27) Gnanadesikan , M. (1966). Some Selection and Ranking Procedures for Multivariate Normal Populations. Ph.D. Thesis. Dept. of...Statist., Purdue Univ., West Lafayette, Indiana 47907. [28) Gnanadesikan , M. and Gupta, S. S. (1970). Selection procedures for multivariate normal
de Paula, Lauro C. M.; Soares, Anderson S.; de Lima, Telma W.; Delbem, Alexandre C. B.; Coelho, Clarimar J.; Filho, Arlindo R. G.
2014-01-01
Several variable selection algorithms in multivariate calibration can be accelerated using Graphics Processing Units (GPU). Among these algorithms, the Firefly Algorithm (FA) is a recent proposed metaheuristic that may be used for variable selection. This paper presents a GPU-based FA (FA-MLR) with multiobjective formulation for variable selection in multivariate calibration problems and compares it with some traditional sequential algorithms in the literature. The advantage of the proposed implementation is demonstrated in an example involving a relatively large number of variables. The results showed that the FA-MLR, in comparison with the traditional algorithms is a more suitable choice and a relevant contribution for the variable selection problem. Additionally, the results also demonstrated that the FA-MLR performed in a GPU can be five times faster than its sequential implementation. PMID:25493625
de Paula, Lauro C M; Soares, Anderson S; de Lima, Telma W; Delbem, Alexandre C B; Coelho, Clarimar J; Filho, Arlindo R G
2014-01-01
Several variable selection algorithms in multivariate calibration can be accelerated using Graphics Processing Units (GPU). Among these algorithms, the Firefly Algorithm (FA) is a recent proposed metaheuristic that may be used for variable selection. This paper presents a GPU-based FA (FA-MLR) with multiobjective formulation for variable selection in multivariate calibration problems and compares it with some traditional sequential algorithms in the literature. The advantage of the proposed implementation is demonstrated in an example involving a relatively large number of variables. The results showed that the FA-MLR, in comparison with the traditional algorithms is a more suitable choice and a relevant contribution for the variable selection problem. Additionally, the results also demonstrated that the FA-MLR performed in a GPU can be five times faster than its sequential implementation.
NASA Astrophysics Data System (ADS)
Vittal, H.; Singh, Jitendra; Kumar, Pankaj; Karmakar, Subhankar
2015-06-01
In watershed management, flood frequency analysis (FFA) is performed to quantify the risk of flooding at different spatial locations and also to provide guidelines for determining the design periods of flood control structures. The traditional FFA was extensively performed by considering univariate scenario for both at-site and regional estimation of return periods. However, due to inherent mutual dependence of the flood variables or characteristics [i.e., peak flow (P), flood volume (V) and flood duration (D), which are random in nature], analysis has been further extended to multivariate scenario, with some restrictive assumptions. To overcome the assumption of same family of marginal density function for all flood variables, the concept of copula has been introduced. Although, the advancement from univariate to multivariate analyses drew formidable attention to the FFA research community, the basic limitation was that the analyses were performed with the implementation of only parametric family of distributions. The aim of the current study is to emphasize the importance of nonparametric approaches in the field of multivariate FFA; however, the nonparametric distribution may not always be a good-fit and capable of replacing well-implemented multivariate parametric and multivariate copula-based applications. Nevertheless, the potential of obtaining best-fit using nonparametric distributions might be improved because such distributions reproduce the sample's characteristics, resulting in more accurate estimations of the multivariate return period. Hence, the current study shows the importance of conjugating multivariate nonparametric approach with multivariate parametric and copula-based approaches, thereby results in a comprehensive framework for complete at-site FFA. Although the proposed framework is designed for at-site FFA, this approach can also be applied to regional FFA because regional estimations ideally include at-site estimations. The framework is based on the following steps: (i) comprehensive trend analysis to assess nonstationarity in the observed data; (ii) selection of the best-fit univariate marginal distribution with a comprehensive set of parametric and nonparametric distributions for the flood variables; (iii) multivariate frequency analyses with parametric, copula-based and nonparametric approaches; and (iv) estimation of joint and various conditional return periods. The proposed framework for frequency analysis is demonstrated using 110 years of observed data from Allegheny River at Salamanca, New York, USA. The results show that for both univariate and multivariate cases, the nonparametric Gaussian kernel provides the best estimate. Further, we perform FFA for twenty major rivers over continental USA, which shows for seven rivers, all the flood variables followed nonparametric Gaussian kernel; whereas for other rivers, parametric distributions provide the best-fit either for one or two flood variables. Thus the summary of results shows that the nonparametric method cannot substitute the parametric and copula-based approaches, but should be considered during any at-site FFA to provide the broadest choices for best estimation of the flood return periods.
Dort, Jonathan M; Trickey, Amber W; Kallies, Kara J; Joshi, Amit R T; Sidwell, Richard A; Jarman, Benjamin T
2015-01-01
This study evaluated characteristics of applicants selected for interview and ranked by independent general surgery residency programs and assessed independent program application volumes, interview selection, rank list formation, and match success. Demographic and academic information was analyzed for 2014-2015 applicants. Applicant characteristics were compared by ranking status using univariate and multivariable statistical techniques. Characteristics independently associated with whether or not an applicant was ranked were identified using multivariable logistic regression modeling with backward stepwise variable selection and cluster-correlated robust variance estimates to account for correlations among individuals who applied to multiple programs. The Electronic Residency Application Service was used to obtain applicant data and program match outcomes at 33 independent surgery programs. All applicants selected to interview at 33 participating independent general surgery residency programs were included in the study. Applicants were 60% male with median age of 26 years. Birthplace was well distributed. Most applicants (73%) had ≥1 academic publication. Median United States Medical Licensing Exams (USMLE) Step 1 score was 228 (interquartile range: 218-240), and median USMLE Step 2 clinical knowledge score was 241 (interquartile range: 231-250). Residency programs in some regions more often ranked applicants who attended medical school within the same region. On multivariable analysis, significant predictors of ranking by an independent residency program were: USMLE scores, medical school region, and birth region. Independent programs received an average of 764 applications (range: 307-1704). On average, 12% interviews, and 81% of interviewed applicants were ranked. Most programs (84%) matched at least 1 applicant ranked in their top 10. Participating independent programs attract a large volume of applicants and have high standards in the selection process. This information can be used by surgery residency applicants to gauge their candidacy at independent programs. Independent programs offer a select number of interviews, rank most applicants that they interview, and successfully match competitive applicants. Copyright © 2015 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
SPReM: Sparse Projection Regression Model For High-dimensional Linear Regression *
Sun, Qiang; Zhu, Hongtu; Liu, Yufeng; Ibrahim, Joseph G.
2014-01-01
The aim of this paper is to develop a sparse projection regression modeling (SPReM) framework to perform multivariate regression modeling with a large number of responses and a multivariate covariate of interest. We propose two novel heritability ratios to simultaneously perform dimension reduction, response selection, estimation, and testing, while explicitly accounting for correlations among multivariate responses. Our SPReM is devised to specifically address the low statistical power issue of many standard statistical approaches, such as the Hotelling’s T2 test statistic or a mass univariate analysis, for high-dimensional data. We formulate the estimation problem of SPREM as a novel sparse unit rank projection (SURP) problem and propose a fast optimization algorithm for SURP. Furthermore, we extend SURP to the sparse multi-rank projection (SMURP) by adopting a sequential SURP approximation. Theoretically, we have systematically investigated the convergence properties of SURP and the convergence rate of SURP estimates. Our simulation results and real data analysis have shown that SPReM out-performs other state-of-the-art methods. PMID:26527844
Stone loaches of Choman River system, Kurdistan, Iran (Teleostei: Cypriniformes: Nemacheilidae).
Kamangar, Barzan Bahrami; Prokofiev, Artem M; Ghaderi, Edris; Nalbant, Theodore T
2014-01-20
For the first time, we present data on species composition and distributions of nemacheilid loaches in the Choman River basin of Kurdistan province, Iran. Two genera and four species are recorded from the area, of which three species are new for science: Oxynoemacheilus kurdistanicus, O. zagrosensis, O. chomanicus spp. nov., and Turcinoemacheilus kosswigi Băn. et Nalb. Detailed and illustrated morphological descriptions and univariate and multivariate analysis of morphometric and meristic features are for each of these species. Forty morphometric and eleven meristic characters were used in multivariate analysis to select characters that could discriminate between the four loach species. Discriminant Function Analysis revealed that sixteen morphometric measures and five meristic characters have the most variability between the loach species. The dendrograms based on cluster analysis of Mahalanobis distances of morphometrics and a combination of both characters confirmed two distinct groups: Oxynoemacheilus spp. and T. kosswigi. Within Oxynoemacheilus, O. zagrosensis and O. chomanicus are more similar to one other rather to either is to O. kurdistanicus.
Multivariate selection and intersexual genetic constraints in a wild bird population.
Poissant, J; Morrissey, M B; Gosler, A G; Slate, J; Sheldon, B C
2016-10-01
When selection differs between the sexes for traits that are genetically correlated between the sexes, there is potential for the effect of selection in one sex to be altered by indirect selection in the other sex, a situation commonly referred to as intralocus sexual conflict (ISC). While potentially common, ISC has rarely been studied in wild populations. Here, we studied ISC over a set of morphological traits (wing length, tarsus length, bill depth and bill length) in a wild population of great tits (Parus major) from Wytham Woods, UK. Specifically, we quantified the microevolutionary impacts of ISC by combining intra- and intersex additive genetic (co)variances and sex-specific selection estimates in a multivariate framework. Large genetic correlations between homologous male and female traits combined with evidence for sex-specific multivariate survival selection suggested that ISC could play an appreciable role in the evolution of this population. Together, multivariate sex-specific selection and additive genetic (co)variance for the traits considered accounted for additive genetic variance in fitness that was uncorrelated between the sexes (cross-sex genetic correlation = -0.003, 95% CI = -0.83, 0.83). Gender load, defined as the reduction in a population's rate of adaptation due to sex-specific effects, was estimated at 50% (95% CI = 13%, 86%). This study provides novel insights into the evolution of sexual dimorphism in wild populations and illustrates how quantitative genetics and selection analyses can be combined in a multivariate framework to quantify the microevolutionary impacts of ISC. © 2016 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.
Shape variation in the human pelvis and limb skeleton: Implications for obstetric adaptation.
Kurki, Helen K; Decrausaz, Sarah-Louise
2016-04-01
Under the obstetrical dilemma (OD) hypothesis, selection acts on the human female pelvis to ensure a sufficiently sized obstetric canal for birthing a large-brained, broad shouldered neonate, while bipedal locomotion selects for a narrower and smaller pelvis. Despite this female-specific stabilizing selection, variability of linear dimensions of the pelvic canal and overall size are not reduced in females, suggesting shape may instead be variable among females of a population. Female canal shape has been shown to vary among populations, while male canal shape does not. Within this context, we examine within-population canal shape variation in comparison with that of noncanal aspects of the pelvis and the limbs. Nine skeletal samples (total female n = 101, male n = 117) representing diverse body sizes and shapes were included. Principal components analysis was applied to size-adjusted variables of each skeletal region. A multivariate variance was calculated using the weighted PC scores for all components in each model and F-ratios used to assess differences in within-population variances between sexes and skeletal regions. Within both sexes, multivariate canal shape variance is significantly greater than noncanal pelvis and limb variances, while limb variance is greater than noncanal pelvis variance in some populations. Multivariate shape variation is not consistently different between the sexes in any of the skeletal regions. Diverse selective pressures, including obstetrics, locomotion, load carrying, and others may act on canal shape, as well as genetic drift and plasticity, thus increasing variation in morphospace while protecting obstetric sufficiency. © 2015 Wiley Periodicals, Inc.
1983-06-16
has been advocated by Gnanadesikan and ilk (1969), and others in the literature. This suggests that, if we use the formal signficance test type...American Statistical Asso., 62, 1159-1178. Gnanadesikan , R., and Wilk, M..B. (1969). Data Analytic Methods in Multi- variate Statistical Analysis. In
ERIC Educational Resources Information Center
Chou, Yu-Chi; Wehmeyer, Michael L.; Palmer, Susan B.; Lee, Jaehoon
2017-01-01
This study examined differences in self-determination among students with autism spectrum disorders (ASD), students with intellectual disability (ID), and students with learning disabilities (LD). A total of 222 participants with an equal size group for each of the three disability categories were selected to participate in the comparison of total…
Liu, Xiaona; Zhang, Qiao; Wu, Zhisheng; Shi, Xinyuan; Zhao, Na; Qiao, Yanjiang
2015-01-01
Laser-induced breakdown spectroscopy (LIBS) was applied to perform a rapid elemental analysis and provenance study of Blumea balsamifera DC. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were implemented to exploit the multivariate nature of the LIBS data. Scores and loadings of computed principal components visually illustrated the differing spectral data. The PLS-DA algorithm showed good classification performance. The PLS-DA model using complete spectra as input variables had similar discrimination performance to using selected spectral lines as input variables. The down-selection of spectral lines was specifically focused on the major elements of B. balsamifera samples. Results indicated that LIBS could be used to rapidly analyze elements and to perform provenance study of B. balsamifera. PMID:25558999
Oliveira, M M; Sousa, L B; Reis, M C; Silva Junior, E G; Cardoso, D B O; Hamawaki, O T; Nogueira, A P O
2017-05-31
The genetic diversity study has paramount importance in breeding programs; hence, it allows selection and choice of the parental genetic divergence, which have the agronomic traits desired by the breeder. This study aimed to characterize the genetic divergence between 24 soybean genotypes through their agronomic traits, using multivariate clustering methods to select the potential genitors for the promising hybrid combinations. Six agronomic traits evaluated were number of days to flowering and maturity, plant height at flowering and maturity, insertion height of the first pod, and yield. The genetic divergence evaluated by multivariate analysis that esteemed first the Mahalanobis' generalized distance (D 2 ), then the clustering using Tocher's optimization methods, and then the unweighted pair group method with arithmetic average (UPGMA). Tocher's optimization method and the UPGMA agreed with the groups' constitution between each other, the formation of eight distinct groups according Tocher's method and seven distinct groups using UPGMA. The trait number of days for flowering (45.66%) was the most efficient to explain dissimilarity between genotypes, and must be one of the main traits considered by the breeder in the moment of genitors choice in soybean-breeding programs. The genetic variability allowed the identification of dissimilar genotypes and with superior performances. The hybridizations UFU 18 x UFUS CARAJÁS, UFU 15 x UFU 13, and UFU 13 x UFUS CARAJÁS are promising to obtain superior segregating populations, which enable the development of more productive genotypes.
Generating Virtual Patients by Multivariate and Discrete Re-Sampling Techniques.
Teutonico, D; Musuamba, F; Maas, H J; Facius, A; Yang, S; Danhof, M; Della Pasqua, O
2015-10-01
Clinical Trial Simulations (CTS) are a valuable tool for decision-making during drug development. However, to obtain realistic simulation scenarios, the patients included in the CTS must be representative of the target population. This is particularly important when covariate effects exist that may affect the outcome of a trial. The objective of our investigation was to evaluate and compare CTS results using re-sampling from a population pool and multivariate distributions to simulate patient covariates. COPD was selected as paradigm disease for the purposes of our analysis, FEV1 was used as response measure and the effects of a hypothetical intervention were evaluated in different populations in order to assess the predictive performance of the two methods. Our results show that the multivariate distribution method produces realistic covariate correlations, comparable to the real population. Moreover, it allows simulation of patient characteristics beyond the limits of inclusion and exclusion criteria in historical protocols. Both methods, discrete resampling and multivariate distribution generate realistic pools of virtual patients. However the use of a multivariate distribution enable more flexible simulation scenarios since it is not necessarily bound to the existing covariate combinations in the available clinical data sets.
Duffy, Sonia A.; Ronis, David L.; McLean, Scott; Fowler, Karen E.; Gruber, Stephen B.; Wolf, Gregory T.; Terrell, Jeffrey E.
2009-01-01
Purpose Our prior work has shown that the health behaviors of head and neck cancer patients are interrelated and are associated with quality of life; however, other than smoking, the relationship between health behaviors and survival is unclear. Patients and Methods A prospective cohort study was conducted to determine the relationship between five pretreatment health behaviors (smoking, alcohol, diet, physical activity, and sleep) and all-cause survival among 504 head and neck cancer patients. Results Smoking status was the strongest predictor of survival, with both current smokers (hazard ratio [HR] = 2.4; 95% CI, 1.3 to 4.4) and former smokers (HR = 2.0; 95% CI, 1.2 to 3.5) showing significant associations with poor survival. Problem drinking was associated with survival in the univariate analysis (HR = 1.4; 95% CI, 1.0 to 2.0) but lost significance when controlling for other factors. Low fruit intake was negatively associated with survival in the univariate analysis only (HR = 1.6; 95% CI, 1.1 to 2.1), whereas vegetable intake was not significant in either univariate or multivariate analyses. Although physical activity was associated with survival in the univariate analysis (HR = 0.95; 95% CI, 0.93 to 0.97), it was not significant in the multivariate model. Sleep was not significantly associated with survival in either univariate or multivariate analysis. Control variables that were also independently associated with survival in the multivariate analysis were age, education, tumor site, cancer stage, and surgical treatment. Conclusion Variation in selected pretreatment health behaviors (eg, smoking, fruit intake, and physical activity) in this population is associated with variation in survival. PMID:19289626
Riley, Richard D; Elia, Eleni G; Malin, Gemma; Hemming, Karla; Price, Malcolm P
2015-07-30
A prognostic factor is any measure that is associated with the risk of future health outcomes in those with existing disease. Often, the prognostic ability of a factor is evaluated in multiple studies. However, meta-analysis is difficult because primary studies often use different methods of measurement and/or different cut-points to dichotomise continuous factors into 'high' and 'low' groups; selective reporting is also common. We illustrate how multivariate random effects meta-analysis models can accommodate multiple prognostic effect estimates from the same study, relating to multiple cut-points and/or methods of measurement. The models account for within-study and between-study correlations, which utilises more information and reduces the impact of unreported cut-points and/or measurement methods in some studies. The applicability of the approach is improved with individual participant data and by assuming a functional relationship between prognostic effect and cut-point to reduce the number of unknown parameters. The models provide important inferential results for each cut-point and method of measurement, including the summary prognostic effect, the between-study variance and a 95% prediction interval for the prognostic effect in new populations. Two applications are presented. The first reveals that, in a multivariate meta-analysis using published results, the Apgar score is prognostic of neonatal mortality but effect sizes are smaller at most cut-points than previously thought. In the second, a multivariate meta-analysis of two methods of measurement provides weak evidence that microvessel density is prognostic of mortality in lung cancer, even when individual participant data are available so that a continuous prognostic trend is examined (rather than cut-points). © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
A Multivariate Solution of the Multivariate Ranking and Selection Problem
1980-02-01
Taneja (1972)), a ’a for a vector of constants c (Krishnaiah and Rizvi (1966)), the generalized variance ( Gnanadesikan and Gupta (1970)), iegier (1976...Olk-in, I. and Sobel, M. (1977). Selecting and Ordering Populations: A New Statistical Methodology, John Wiley & Sons, Inc., New York. Gnanadesikan
Fontes, Cristiano Hora; Budman, Hector
2017-11-01
A clustering problem involving multivariate time series (MTS) requires the selection of similarity metrics. This paper shows the limitations of the PCA similarity factor (SPCA) as a single metric in nonlinear problems where there are differences in magnitude of the same process variables due to expected changes in operation conditions. A novel method for clustering MTS based on a combination between SPCA and the average-based Euclidean distance (AED) within a fuzzy clustering approach is proposed. Case studies involving either simulated or real industrial data collected from a large scale gas turbine are used to illustrate that the hybrid approach enhances the ability to recognize normal and fault operating patterns. This paper also proposes an oversampling procedure to create synthetic multivariate time series that can be useful in commonly occurring situations involving unbalanced data sets. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.
Te Stroet, Martijn A J; Rijnen, Wim H C; Gardeniers, Jean W M; Schreurs, B Willem; Hannink, Gerjon
2016-09-29
Despite improvements in the technique of femoral impaction bone grafting, reconstruction failures still can occur. Therefore, the aim of our study was to determine risk factors for the endpoint re-revision for any reason. We used prospectively collected demographic, clinical and surgical data of all 202 patients who underwent 208 femoral revisions using the X-change Femoral Revision System (Stryker-Howmedica), fresh-frozen morcellised allograft and a cemented polished Exeter stem in our department from 1991 to 2007. Univariable and multivariable Cox regression analyses were performed to identify potential factors associated with re-revision. The mean follow-up was 10.6 (5-21) years. The cumulative re-revision rate was 6.3% (13/208). After univariable selection, sex, age, body mass index (BMI), American Association of Anesthesiologists (ASA) classification, type of removed femoral component, and mesh used for reconstruction were included in multivariable regression analysis.In the multivariable analysis, BMI was the only factor that was significantly associated with the risk of re-revision after bone impaction grafting (BMI ≥30 vs. BMI <30, HR = 6.54 [95% CI 1.89-22.65]; p = 0.003). BMI was the only factor associated with the risk of re-revision for any reason. Besides BMI also other factors, such as Endoklinik score and the type of removed femoral component, can provide guidance in the process of preclinical decision making. With the knowledge obtained from this study, preoperative patient selection, informed consent, and treatment protocols can be better adjusted to the individual patient who needs to undergo a femoral revision with impaction bone grafting.
Björklund, M; Gustafsson, L
2017-07-01
Understanding the magnitude and long-term patterns of selection in natural populations is of importance, for example, when analysing the evolutionary impact of climate change. We estimated univariate and multivariate directional, quadratic and correlational selection on four morphological traits (adult wing, tarsus and tail length, body mass) over a time period of 33 years (≈ 19 000 observations) in a nest-box breeding population of collared flycatchers (Ficedula albicollis). In general, selection was weak in both males and females over the years regardless of fitness measure (fledged young, recruits and survival) with only few cases with statistically significant selection. When data were analysed in a multivariate context and as time series, a number of patterns emerged; there was a consistent, but weak, selection for longer wings in both sexes, selection was stronger on females when the number of fledged young was used as a fitness measure, there were no indications of sexually antagonistic selection, and we found a negative correlation between selection on tarsus and wing length in both sexes but using different fitness measures. Uni- and multivariate selection gradients were correlated only for wing length and mass. Multivariate selection gradient vectors were longer than corresponding vector of univariate gradients and had more constrained direction. Correlational selection had little importance. Overall, the fitness surface was more or less flat with few cases of significant curvature, indicating that the adaptive peak with regard to body size in this species is broader than the phenotypic distribution, which has resulted in weak estimates of selection. © 2017 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2017 European Society For Evolutionary Biology.
Stekolnikov, Alexandr A; Klimov, Pavel B
2010-09-01
We revise chiggers belonging to the minuta-species group (genus Neotrombicula Hirst, 1925) from the Palaearctic using size-free multivariate morphometrics. This approach allowed us to resolve several diagnostic problems. We show that the widely distributed Neotrombicula scrupulosa Kudryashova, 1993 forms three spatially and ecologically isolated groups different from each other in size or shape (morphometric property) only: specimens from the Caucasus are distinct from those from Asia in shape, whereas the Asian specimens from plains and mountains are different from each other in size. We developed a multivariate classification model to separate three closely related species: N. scrupulosa, N. lubrica Kudryashova, 1993 and N. minuta Schluger, 1966. This model is based on five shape variables selected from an initial 17 variables by a best subset analysis using a custom size-correction subroutine. The variable selection procedure slightly improved the predictive power of the model, suggesting that it not only removed redundancy but also reduced 'noise' in the dataset. The overall classification accuracy of this model is 96.2, 96.2 and 95.5%, as estimated by internal validation, external validation and jackknife statistics, respectively. Our analyses resulted in one new synonymy: N. dimidiata Stekolnikov, 1995 is considered to be a synonym of N. lubrica. Both N. scrupulosa and N. lubrica are recorded from new localities. A key to species of the minuta-group incorporating results from our multivariate analyses is presented.
Li, Yan; Zhang, Ji; Zhao, Yanli; Liu, Honggao; Wang, Yuanzhong; Jin, Hang
2016-01-01
In this study the geographical differentiation of dried sclerotia of the medicinal mushroom Wolfiporia extensa, obtained from different regions in Yunnan Province, China, was explored using Fourier-transform infrared (FT-IR) spectroscopy coupled with multivariate data analysis. The FT-IR spectra of 97 samples were obtained for wave numbers ranging from 4000 to 400 cm-1. Then, the fingerprint region of 1800-600 cm-1 of the FT-IR spectrum, rather than the full spectrum, was analyzed. Different pretreatments were applied on the spectra, and a discriminant analysis model based on the Mahalanobis distance was developed to select an optimal pretreatment combination. Two unsupervised pattern recognition procedures- principal component analysis and hierarchical cluster analysis-were applied to enhance the authenticity of discrimination of the specimens. The results showed that excellent classification could be obtained after optimizing spectral pretreatment. The tested samples were successfully discriminated according to their geographical locations. The chemical properties of dried sclerotia of W. extensa were clearly dependent on the mushroom's geographical origins. Furthermore, an interesting finding implied that the elevations of collection areas may have effects on the chemical components of wild W. extensa sclerotia. Overall, this study highlights the feasibility of FT-IR spectroscopy combined with multivariate data analysis in particular for exploring the distinction of different regional W. extensa sclerotia samples. This research could also serve as a basis for the exploitation and utilization of medicinal mushrooms.
Assessing Multivariate Constraints to Evolution across Ten Long-Term Avian Studies
Teplitsky, Celine; Tarka, Maja; Møller, Anders P.; Nakagawa, Shinichi; Balbontín, Javier; Burke, Terry A.; Doutrelant, Claire; Gregoire, Arnaud; Hansson, Bengt; Hasselquist, Dennis; Gustafsson, Lars; de Lope, Florentino; Marzal, Alfonso; Mills, James A.; Wheelwright, Nathaniel T.; Yarrall, John W.; Charmantier, Anne
2014-01-01
Background In a rapidly changing world, it is of fundamental importance to understand processes constraining or facilitating adaptation through microevolution. As different traits of an organism covary, genetic correlations are expected to affect evolutionary trajectories. However, only limited empirical data are available. Methodology/Principal Findings We investigate the extent to which multivariate constraints affect the rate of adaptation, focusing on four morphological traits often shown to harbour large amounts of genetic variance and considered to be subject to limited evolutionary constraints. Our data set includes unique long-term data for seven bird species and a total of 10 populations. We estimate population-specific matrices of genetic correlations and multivariate selection coefficients to predict evolutionary responses to selection. Using Bayesian methods that facilitate the propagation of errors in estimates, we compare (1) the rate of adaptation based on predicted response to selection when including genetic correlations with predictions from models where these genetic correlations were set to zero and (2) the multivariate evolvability in the direction of current selection to the average evolvability in random directions of the phenotypic space. We show that genetic correlations on average decrease the predicted rate of adaptation by 28%. Multivariate evolvability in the direction of current selection was systematically lower than average evolvability in random directions of space. These significant reductions in the rate of adaptation and reduced evolvability were due to a general nonalignment of selection and genetic variance, notably orthogonality of directional selection with the size axis along which most (60%) of the genetic variance is found. Conclusions These results suggest that genetic correlations can impose significant constraints on the evolution of avian morphology in wild populations. This could have important impacts on evolutionary dynamics and hence population persistence in the face of rapid environmental change. PMID:24608111
The evolution of multivariate maternal effects.
Kuijper, Bram; Johnstone, Rufus A; Townley, Stuart
2014-04-01
There is a growing interest in predicting the social and ecological contexts that favor the evolution of maternal effects. Most predictions focus, however, on maternal effects that affect only a single character, whereas the evolution of maternal effects is poorly understood in the presence of suites of interacting traits. To overcome this, we simulate the evolution of multivariate maternal effects (captured by the matrix M) in a fluctuating environment. We find that the rate of environmental fluctuations has a substantial effect on the properties of M: in slowly changing environments, offspring are selected to have a multivariate phenotype roughly similar to the maternal phenotype, so that M is characterized by positive dominant eigenvalues; by contrast, rapidly changing environments favor Ms with dominant eigenvalues that are negative, as offspring favor a phenotype which substantially differs from the maternal phenotype. Moreover, when fluctuating selection on one maternal character is temporally delayed relative to selection on other traits, we find a striking pattern of cross-trait maternal effects in which maternal characters influence not only the same character in offspring, but also other offspring characters. Additionally, when selection on one character contains more stochastic noise relative to selection on other traits, large cross-trait maternal effects evolve from those maternal traits that experience the smallest amounts of noise. The presence of these cross-trait maternal effects shows that individual maternal effects cannot be studied in isolation, and that their study in a multivariate context may provide important insights about the nature of past selection. Our results call for more studies that measure multivariate maternal effects in wild populations.
The Evolution of Multivariate Maternal Effects
Kuijper, Bram; Johnstone, Rufus A.; Townley, Stuart
2014-01-01
There is a growing interest in predicting the social and ecological contexts that favor the evolution of maternal effects. Most predictions focus, however, on maternal effects that affect only a single character, whereas the evolution of maternal effects is poorly understood in the presence of suites of interacting traits. To overcome this, we simulate the evolution of multivariate maternal effects (captured by the matrix M) in a fluctuating environment. We find that the rate of environmental fluctuations has a substantial effect on the properties of M: in slowly changing environments, offspring are selected to have a multivariate phenotype roughly similar to the maternal phenotype, so that M is characterized by positive dominant eigenvalues; by contrast, rapidly changing environments favor Ms with dominant eigenvalues that are negative, as offspring favor a phenotype which substantially differs from the maternal phenotype. Moreover, when fluctuating selection on one maternal character is temporally delayed relative to selection on other traits, we find a striking pattern of cross-trait maternal effects in which maternal characters influence not only the same character in offspring, but also other offspring characters. Additionally, when selection on one character contains more stochastic noise relative to selection on other traits, large cross-trait maternal effects evolve from those maternal traits that experience the smallest amounts of noise. The presence of these cross-trait maternal effects shows that individual maternal effects cannot be studied in isolation, and that their study in a multivariate context may provide important insights about the nature of past selection. Our results call for more studies that measure multivariate maternal effects in wild populations. PMID:24722346
Selecting climate simulations for impact studies based on multivariate patterns of climate change.
Mendlik, Thomas; Gobiet, Andreas
In climate change impact research it is crucial to carefully select the meteorological input for impact models. We present a method for model selection that enables the user to shrink the ensemble to a few representative members, conserving the model spread and accounting for model similarity. This is done in three steps: First, using principal component analysis for a multitude of meteorological parameters, to find common patterns of climate change within the multi-model ensemble. Second, detecting model similarities with regard to these multivariate patterns using cluster analysis. And third, sampling models from each cluster, to generate a subset of representative simulations. We present an application based on the ENSEMBLES regional multi-model ensemble with the aim to provide input for a variety of climate impact studies. We find that the two most dominant patterns of climate change relate to temperature and humidity patterns. The ensemble can be reduced from 25 to 5 simulations while still maintaining its essential characteristics. Having such a representative subset of simulations reduces computational costs for climate impact modeling and enhances the quality of the ensemble at the same time, as it prevents double-counting of dependent simulations that would lead to biased statistics. The online version of this article (doi:10.1007/s10584-015-1582-0) contains supplementary material, which is available to authorized users.
Application of multivariate statistical techniques in microbial ecology.
Paliy, O; Shankar, V
2016-03-01
Recent advances in high-throughput methods of molecular analyses have led to an explosion of studies generating large-scale ecological data sets. In particular, noticeable effect has been attained in the field of microbial ecology, where new experimental approaches provided in-depth assessments of the composition, functions and dynamic changes of complex microbial communities. Because even a single high-throughput experiment produces large amount of data, powerful statistical techniques of multivariate analysis are well suited to analyse and interpret these data sets. Many different multivariate techniques are available, and often it is not clear which method should be applied to a particular data set. In this review, we describe and compare the most widely used multivariate statistical techniques including exploratory, interpretive and discriminatory procedures. We consider several important limitations and assumptions of these methods, and we present examples of how these approaches have been utilized in recent studies to provide insight into the ecology of the microbial world. Finally, we offer suggestions for the selection of appropriate methods based on the research question and data set structure. © 2016 John Wiley & Sons Ltd.
Analysis of Big Data in Gait Biomechanics: Current Trends and Future Directions.
Phinyomark, Angkoon; Petri, Giovanni; Ibáñez-Marcelo, Esther; Osis, Sean T; Ferber, Reed
2018-01-01
The increasing amount of data in biomechanics research has greatly increased the importance of developing advanced multivariate analysis and machine learning techniques, which are better able to handle "big data". Consequently, advances in data science methods will expand the knowledge for testing new hypotheses about biomechanical risk factors associated with walking and running gait-related musculoskeletal injury. This paper begins with a brief introduction to an automated three-dimensional (3D) biomechanical gait data collection system: 3D GAIT, followed by how the studies in the field of gait biomechanics fit the quantities in the 5 V's definition of big data: volume, velocity, variety, veracity, and value. Next, we provide a review of recent research and development in multivariate and machine learning methods-based gait analysis that can be applied to big data analytics. These modern biomechanical gait analysis methods include several main modules such as initial input features, dimensionality reduction (feature selection and extraction), and learning algorithms (classification and clustering). Finally, a promising big data exploration tool called "topological data analysis" and directions for future research are outlined and discussed.
Moon, Youngmin; Han, Jung Hyun; Shin, Sungho; Kim, Yong-Chul; Jeong, Sungho
2016-01-01
By laser induced breakdown spectroscopy (LIBS) analysis of epidermal lesion and dermis tissue pellets of hairless mouse, it is shown that Ca intensity in the epidermal lesion is higher than that in dermis, whereas Na and K intensities have an opposite tendency. It is demonstrated that epidermal lesion and normal dermis can be differentiated with high selectivity either by univariate or multivariate analysis of LIBS spectra with an intensity ratio difference by factor of 8 or classification accuracy over 0.995, respectively. PMID:27231610
2015-12-01
issues. A weighted mean can be used in place of the grand mean3 and the STATA software automatically handles the assignment of the sums of squares. Thus...between groups (i.e., sphericity) using the multivariate test of means provided in STATA 12.1. This test checks whether or not population variances and
Zafar, Raheel; Kamel, Nidal; Naufal, Mohamad; Malik, Aamir Saeed; Dass, Sarat C; Ahmad, Rana Fayyaz; Abdullah, Jafri M; Reza, Faruque
2017-01-01
Decoding of human brain activity has always been a primary goal in neuroscience especially with functional magnetic resonance imaging (fMRI) data. In recent years, Convolutional neural network (CNN) has become a popular method for the extraction of features due to its higher accuracy, however it needs a lot of computation and training data. In this study, an algorithm is developed using Multivariate pattern analysis (MVPA) and modified CNN to decode the behavior of brain for different images with limited data set. Selection of significant features is an important part of fMRI data analysis, since it reduces the computational burden and improves the prediction performance; significant features are selected using t-test. MVPA uses machine learning algorithms to classify different brain states and helps in prediction during the task. General linear model (GLM) is used to find the unknown parameters of every individual voxel and the classification is done using multi-class support vector machine (SVM). MVPA-CNN based proposed algorithm is compared with region of interest (ROI) based method and MVPA based estimated values. The proposed method showed better overall accuracy (68.6%) compared to ROI (61.88%) and estimation values (64.17%).
Enhancing e-waste estimates: Improving data quality by multivariate Input–Output Analysis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Feng, E-mail: fwang@unu.edu; Design for Sustainability Lab, Faculty of Industrial Design Engineering, Delft University of Technology, Landbergstraat 15, 2628CE Delft; Huisman, Jaco
2013-11-15
Highlights: • A multivariate Input–Output Analysis method for e-waste estimates is proposed. • Applying multivariate analysis to consolidate data can enhance e-waste estimates. • We examine the influence of model selection and data quality on e-waste estimates. • Datasets of all e-waste related variables in a Dutch case study have been provided. • Accurate modeling of time-variant lifespan distributions is critical for estimate. - Abstract: Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lackmore » of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input–Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e-waste estimation studies.« less
Masiá, M; Gutiérrez, F; Padilla, S; Soldán, B; Mirete, C; Shum, C; Hernández, I; Royo, G; Martin-Hidalgo, A
2007-02-01
The aim of this study was to characterise community-acquired pneumonia (CAP) caused by atypical pathogens by combining distinctive clinical and epidemiological features and novel biological markers. A population-based prospective study of consecutive patients with CAP included investigation of biomarkers of bacterial infection, e.g., procalcitonin, C-reactive protein and lipopolysaccharide-binding protein (LBP) levels. Clinical, radiological and laboratory data for patients with CAP caused by atypical pathogens were compared by univariate and multivariate analysis with data for patients with typical pathogens and patients from whom no organisms were identified. Two predictive scoring models were developed with the most discriminatory variables from multivariate analysis. Of 493 patients, 94 had CAP caused by atypical pathogens. According to multivariate analysis, patients with atypical pneumonia were more likely to have normal white blood cell counts, have repetitive air-conditioning exposure, be aged <65 years, have elevated aspartate aminotransferase levels, have been exposed to birds, and have lower serum levels of LBP. Two different scoring systems were developed that predicted atypical pathogens with sensitivities of 35.2% and 48.8%, and specificities of 93% and 91%, respectively. The combination of selected patient characteristics and laboratory data identified up to half of the cases of atypical pneumonia with high specificity, which should help clinicians to optimise initial empirical therapy for CAP.
Association between split selection instability and predictive error in survival trees.
Radespiel-Tröger, M; Gefeller, O; Rabenstein, T; Hothorn, T
2006-01-01
To evaluate split selection instability in six survival tree algorithms and its relationship with predictive error by means of a bootstrap study. We study the following algorithms: logrank statistic with multivariate p-value adjustment without pruning (LR), Kaplan-Meier distance of survival curves (KM), martingale residuals (MR), Poisson regression for censored data (PR), within-node impurity (WI), and exponential log-likelihood loss (XL). With the exception of LR, initial trees are pruned by using split-complexity, and final trees are selected by means of cross-validation. We employ a real dataset from a clinical study of patients with gallbladder stones. The predictive error is evaluated using the integrated Brier score for censored data. The relationship between split selection instability and predictive error is evaluated by means of box-percentile plots, covariate and cutpoint selection entropy, and cutpoint selection coefficients of variation, respectively, in the root node. We found a positive association between covariate selection instability and predictive error in the root node. LR yields the lowest predictive error, while KM and MR yield the highest predictive error. The predictive error of survival trees is related to split selection instability. Based on the low predictive error of LR, we recommend the use of this algorithm for the construction of survival trees. Unpruned survival trees with multivariate p-value adjustment can perform equally well compared to pruned trees. The analysis of split selection instability can be used to communicate the results of tree-based analyses to clinicians and to support the application of survival trees.
Cluster-based exposure variation analysis
2013-01-01
Background Static posture, repetitive movements and lack of physical variation are known risk factors for work-related musculoskeletal disorders, and thus needs to be properly assessed in occupational studies. The aims of this study were (i) to investigate the effectiveness of a conventional exposure variation analysis (EVA) in discriminating exposure time lines and (ii) to compare it with a new cluster-based method for analysis of exposure variation. Methods For this purpose, we simulated a repeated cyclic exposure varying within each cycle between “low” and “high” exposure levels in a “near” or “far” range, and with “low” or “high” velocities (exposure change rates). The duration of each cycle was also manipulated by selecting a “small” or “large” standard deviation of the cycle time. Theses parameters reflected three dimensions of exposure variation, i.e. range, frequency and temporal similarity. Each simulation trace included two realizations of 100 concatenated cycles with either low (ρ = 0.1), medium (ρ = 0.5) or high (ρ = 0.9) correlation between the realizations. These traces were analyzed by conventional EVA, and a novel cluster-based EVA (C-EVA). Principal component analysis (PCA) was applied on the marginal distributions of 1) the EVA of each of the realizations (univariate approach), 2) a combination of the EVA of both realizations (multivariate approach) and 3) C-EVA. The least number of principal components describing more than 90% of variability in each case was selected and the projection of marginal distributions along the selected principal component was calculated. A linear classifier was then applied to these projections to discriminate between the simulated exposure patterns, and the accuracy of classified realizations was determined. Results C-EVA classified exposures more correctly than univariate and multivariate EVA approaches; classification accuracy was 49%, 47% and 52% for EVA (univariate and multivariate), and C-EVA, respectively (p < 0.001). All three methods performed poorly in discriminating exposure patterns differing with respect to the variability in cycle time duration. Conclusion While C-EVA had a higher accuracy than conventional EVA, both failed to detect differences in temporal similarity. The data-driven optimality of data reduction and the capability of handling multiple exposure time lines in a single analysis are the advantages of the C-EVA. PMID:23557439
Fink, Herbert; Panne, Ulrich; Niessner, Reinhard
2002-09-01
An experimental setup for direct elemental analysis of recycled thermoplasts from consumer electronics by laser-induced plasma spectroscopy (LIPS, or laser-induced breakdown spectroscopy, LIBS) was realized. The combination of a echelle spectrograph, featuring a high resolution with a broad spectral coverage, with multivariate methods, such as PLS, PCR, and variable subset selection via a genetic algorithm, resulted in considerable improvements in selectivity and sensitivity for this complex matrix. With a normalization to carbon as internal standard, the limits of detection were in the ppm range. A preliminary pattern recognition study points to the possibility of polymer recognition via the line-rich echelle spectra. Several experiments at an extruder within a recycling plant demonstrated successfully the capability of LIPS for different kinds of routine on-line process analysis.
Optical assay for biotechnology and clinical diagnosis.
Moczko, Ewa; Cauchi, Michael; Turner, Claire; Meglinski, Igor; Piletsky, Sergey
2011-08-01
In this paper, we present an optical diagnostic assay consisting of a mixture of environmental-sensitive fluorescent dyes combined with multivariate data analysis for quantitative and qualitative examination of biological and clinical samples. The performance of the assay is based on the analysis of spectrum of the selected fluorescent dyes with the operational principle similar to electronic nose and electronic tongue systems. This approach has been successfully applied for monitoring of growing cell cultures and identification of gastrointestinal diseases in humans.
Time-varying nonstationary multivariate risk analysis using a dynamic Bayesian copula
NASA Astrophysics Data System (ADS)
Sarhadi, Ali; Burn, Donald H.; Concepción Ausín, María.; Wiper, Michael P.
2016-03-01
A time-varying risk analysis is proposed for an adaptive design framework in nonstationary conditions arising from climate change. A Bayesian, dynamic conditional copula is developed for modeling the time-varying dependence structure between mixed continuous and discrete multiattributes of multidimensional hydrometeorological phenomena. Joint Bayesian inference is carried out to fit the marginals and copula in an illustrative example using an adaptive, Gibbs Markov Chain Monte Carlo (MCMC) sampler. Posterior mean estimates and credible intervals are provided for the model parameters and the Deviance Information Criterion (DIC) is used to select the model that best captures different forms of nonstationarity over time. This study also introduces a fully Bayesian, time-varying joint return period for multivariate time-dependent risk analysis in nonstationary environments. The results demonstrate that the nature and the risk of extreme-climate multidimensional processes are changed over time under the impact of climate change, and accordingly the long-term decision making strategies should be updated based on the anomalies of the nonstationary environment.
Roldan-Valadez, Ernesto; Suarez-May, Marcela A; Favila, Rafael; Aguilar-Castañeda, Erika; Rios, Camilo
2015-07-01
Interest in the lateralization of the human brain is evident through a multidisciplinary number of scientific studies. Understanding volumetric brain asymmetries allows the distinction between normal development stages and behavior, as well as brain diseases. We aimed to evaluate volumetric asymmetries in order to select the best gyri able to classify right- versus left cerebral hemispheres. A cross-sectional study performed in 47 right-handed young-adults healthy volunteers. SPM-based software performed brain segmentation, automatic labeling and volumetric analyses for 54 regions involving the cerebral lobes, basal ganglia and cerebellum from each cerebral hemisphere. Multivariate discriminant analysis (DA) allowed the assembling of a predictive model. DA revealed one discriminant function that significantly differentiated left vs. right cerebral hemispheres: Wilks' λ = 0.008, χ(2) (9) = 238.837, P < 0.001. The model explained 99.20% of the variation in the grouping variable and depicted an overall predictive accuracy of 98.8%. With the influence of gender; the selected gyri able to discriminate between hemispheres were middle orbital frontal gyrus (g.), angular g., supramarginal g., middle cingulum g., inferior orbital frontal g., calcarine g., inferior parietal lobule and the pars triangularis inferior frontal g. Specific brain gyri are able to accurately classify left vs. right cerebral hemispheres by using a multivariate approach; the selected regions correspond to key brain areas involved in attention, internal thought, vision and language; our findings favored the concept that lateralization has been evolutionary favored by mental processes increasing cognitive efficiency and brain capacity. © 2015 Wiley Periodicals, Inc.
Coupling GIS and multivariate approaches to reference site selection for wadeable stream monitoring.
Collier, Kevin J; Haigh, Andy; Kelly, Johlene
2007-04-01
Geographic Information System (GIS) was used to identify potential reference sites for wadeable stream monitoring, and multivariate analyses were applied to test whether invertebrate communities reflected a priori spatial and stream type classifications. We identified potential reference sites in segments with unmodified vegetation cover adjacent to the stream and in >85% of the upstream catchment. We then used various landcover, amenity and environmental impact databases to eliminate sites that had potential anthropogenic influences upstream and that fell into a range of access classes. Each site identified by this process was coded by four dominant stream classes and seven zones, and 119 candidate sites were randomly selected for follow-up assessment. This process yielded 16 sites conforming to reference site criteria using a conditional-probabilistic design, and these were augmented by an additional 14 existing or special interest reference sites. Non-metric multidimensional scaling (NMS) analysis of percent abundance invertebrate data indicated significant differences in community composition among some of the zones and stream classes identified a priori providing qualified support for this framework in reference site selection. NMS analysis of a range standardised condition and diversity metrics derived from the invertebrate data indicated a core set of 26 closely related sites, and four outliers that were considered atypical of reference site conditions and subsequently dropped from the network. Use of GIS linked to stream typology, available spatial databases and aerial photography greatly enhanced the objectivity and efficiency of reference site selection. The multi-metric ordination approach reduced variability among stream types and bias associated with non-random site selection, and provided an effective way to identify representative reference sites.
Zhou, Fei; Zhao, Yajing; Peng, Jiyu; Jiang, Yirong; Li, Maiquan; Jiang, Yuan; Lu, Baiyi
2017-07-01
Osmanthus fragrans flowers are used as folk medicine and additives for teas, beverages and foods. The metabolites of O. fragrans flowers from different geographical origins were inconsistent in some extent. Chromatography and mass spectrometry combined with multivariable analysis methods provides an approach for discriminating the origin of O. fragrans flowers. To discriminate the Osmanthus fragrans var. thunbergii flowers from different origins with the identified metabolites. GC-MS and UPLC-PDA were conducted to analyse the metabolites in O. fragrans var. thunbergii flowers (in total 150 samples). Principal component analysis (PCA), soft independent modelling of class analogy analysis (SIMCA) and random forest (RF) analysis were applied to group the GC-MS and UPLC-PDA data. GC-MS identified 32 compounds common to all samples while UPLC-PDA/QTOF-MS identified 16 common compounds. PCA of the UPLC-PDA data generated a better clustering than PCA of the GC-MS data. Ten metabolites (six from GC-MS and four from UPLC-PDA) were selected as effective compounds for discrimination by PCA loadings. SIMCA and RF analysis were used to build classification models, and the RF model, based on the four effective compounds (caffeic acid derivative, acteoside, ligustroside and compound 15), yielded better results with the classification rate of 100% in the calibration set and 97.8% in the prediction set. GC-MS and UPLC-PDA combined with multivariable analysis methods can discriminate the origin of Osmanthus fragrans var. thunbergii flowers. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Yue, Yong; Osipov, Arsen; Fraass, Benedick; Sandler, Howard; Zhang, Xiao; Nissen, Nicholas; Hendifar, Andrew; Tuli, Richard
2017-02-01
To stratify risks of pancreatic adenocarcinoma (PA) patients using pre- and post-radiotherapy (RT) PET/CT images, and to assess the prognostic value of texture variations in predicting therapy response of patients. Twenty-six PA patients treated with RT from 2011-2013 with pre- and post-treatment 18F-FDG-PET/CT scans were identified. Tumor locoregional texture was calculated using 3D kernel-based approach, and texture variations were identified by fitting discrepancies of texture maps of pre- and post-treatment images. A total of 48 texture and clinical variables were identified and evaluated for association with overall survival (OS). The prognostic heterogeneity features were selected using lasso/elastic net regression, and further were evaluated by multivariate Cox analysis. Median age was 69 y (range, 46-86 y). The texture map and temporal variations between pre- and post-treatment were well characterized by histograms and statistical fitting. The lasso analysis identified seven predictors (age, node stage, post-RT SUVmax, variations of homogeneity, variance, sum mean, and cluster tendency). The multivariate Cox analysis identified five significant variables: age, node stage, variations of homogeneity, variance, and cluster tendency (with P=0.020, 0.040, 0.065, 0.078, and 0.081, respectively). The patients were stratified into two groups based on the risk score of multivariate analysis with log-rank P=0.001: a low risk group (n=11) with a longer mean OS (29.3 months) and higher texture variation (>30%), and a high risk group (n=15) with a shorter mean OS (17.7 months) and lower texture variation (<15%). Locoregional metabolic texture response provides a feasible approach for evaluating and predicting clinical outcomes following treatment of PA with RT. The proposed method can be used to stratify patient risk and help select appropriate treatment strategies for individual patients toward implementing response-driven adaptive RT.
NASA Astrophysics Data System (ADS)
Nikolić, G. S.; Žerajić, S.; Cakić, M.
2011-10-01
Multivariate calibration method is a powerful mathematical tool that can be applied in analytical chemistry when the analytical signals are highly overlapped. The method with regression by partial least squares is proposed for the simultaneous spectrophotometric determination of adrenergic vasoconstrictors in decongestive solution containing two active components: phenyleprine hydrochloride and trimazoline hydrochloride. These sympathomimetic agents are that frequently associated in pharmaceutical formulations against the common cold. The proposed method, which is, simple and rapid, offers the advantages of sensitivity and wide range of determinations without the need for extraction of the vasoconstrictors. In order to minimize the optimal factors necessary to obtain the calibration matrix by multivariate calibration, different parameters were evaluated. The adequate selection of the spectral regions proved to be important on the number of factors. In order to simultaneously quantify both hydrochlorides among excipients, the spectral region between 250 and 290 nm was selected. A recovery for the vasoconstrictor was 98-101%. The developed method was applied to assay of two decongestive pharmaceutical preparations.
Selecting an Informative/Discriminating Multivariate Response for Inverse Prediction
Thomas, Edward V.; Lewis, John R.; Anderson-Cook, Christine M.; ...
2017-11-21
nverse prediction is important in a wide variety of scientific and engineering contexts. One might use inverse prediction to predict fundamental properties/characteristics of an object using measurements obtained from it. This can be accomplished by “inverting” parameterized forward models that relate the measurements (responses) to the properties/characteristics of interest. Sometimes forward models are science based; but often, forward models are empirically based, using the results of experimentation. For empirically-based forward models, it is important that the experiments provide a sound basis to develop accurate forward models in terms of the properties/characteristics (factors). While nature dictates the causal relationship between factorsmore » and responses, experimenters can influence control of the type, accuracy, and precision of forward models that can be constructed via selection of factors, factor levels, and the set of trials that are performed. Whether the forward models are based on science, experiments or both, researchers can influence the ability to perform inverse prediction by selecting informative response variables. By using an errors-in-variables framework for inverse prediction, this paper shows via simple analysis and examples how the capability of a multivariate response (with respect to being informative and discriminating) can vary depending on how well the various responses complement one another over the range of the factor-space of interest. Insights derived from this analysis could be useful for selecting a set of response variables among candidates in cases where the number of response variables that can be acquired is limited by difficulty, expense, and/or availability of material.« less
Selecting an Informative/Discriminating Multivariate Response for Inverse Prediction
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thomas, Edward V.; Lewis, John R.; Anderson-Cook, Christine M.
nverse prediction is important in a wide variety of scientific and engineering contexts. One might use inverse prediction to predict fundamental properties/characteristics of an object using measurements obtained from it. This can be accomplished by “inverting” parameterized forward models that relate the measurements (responses) to the properties/characteristics of interest. Sometimes forward models are science based; but often, forward models are empirically based, using the results of experimentation. For empirically-based forward models, it is important that the experiments provide a sound basis to develop accurate forward models in terms of the properties/characteristics (factors). While nature dictates the causal relationship between factorsmore » and responses, experimenters can influence control of the type, accuracy, and precision of forward models that can be constructed via selection of factors, factor levels, and the set of trials that are performed. Whether the forward models are based on science, experiments or both, researchers can influence the ability to perform inverse prediction by selecting informative response variables. By using an errors-in-variables framework for inverse prediction, this paper shows via simple analysis and examples how the capability of a multivariate response (with respect to being informative and discriminating) can vary depending on how well the various responses complement one another over the range of the factor-space of interest. Insights derived from this analysis could be useful for selecting a set of response variables among candidates in cases where the number of response variables that can be acquired is limited by difficulty, expense, and/or availability of material.« less
Ji, Hong; Petro, Nathan M; Chen, Badong; Yuan, Zejian; Wang, Jianji; Zheng, Nanning; Keil, Andreas
2018-02-06
Over the past decade, the simultaneous recording of electroencephalogram (EEG) and functional magnetic resonance imaging (fMRI) data has garnered growing interest because it may provide an avenue towards combining the strengths of both imaging modalities. Given their pronounced differences in temporal and spatial statistics, the combination of EEG and fMRI data is however methodologically challenging. Here, we propose a novel screening approach that relies on a Cross Multivariate Correlation Coefficient (xMCC) framework. This approach accomplishes three tasks: (1) It provides a measure for testing multivariate correlation and multivariate uncorrelation of the two modalities; (2) it provides criterion for the selection of EEG features; (3) it performs a screening of relevant EEG information by grouping the EEG channels into clusters to improve efficiency and to reduce computational load when searching for the best predictors of the BOLD signal. The present report applies this approach to a data set with concurrent recordings of steady-state-visual evoked potentials (ssVEPs) and fMRI, recorded while observers viewed phase-reversing Gabor patches. We test the hypothesis that fluctuations in visuo-cortical mass potentials systematically covary with BOLD fluctuations not only in visual cortical, but also in anterior temporal and prefrontal areas. Results supported the hypothesis and showed that the xMCC-based analysis provides straightforward identification of neurophysiological plausible brain regions with EEG-fMRI covariance. Furthermore xMCC converged with other extant methods for EEG-fMRI analysis. © 2018 The Authors Journal of Neuroscience Research Published by Wiley Periodicals, Inc.
Chen, Yong; Luo, Sheng; Chu, Haitao; Wei, Peng
2013-05-01
Multivariate meta-analysis is useful in combining evidence from independent studies which involve several comparisons among groups based on a single outcome. For binary outcomes, the commonly used statistical models for multivariate meta-analysis are multivariate generalized linear mixed effects models which assume risks, after some transformation, follow a multivariate normal distribution with possible correlations. In this article, we consider an alternative model for multivariate meta-analysis where the risks are modeled by the multivariate beta distribution proposed by Sarmanov (1966). This model have several attractive features compared to the conventional multivariate generalized linear mixed effects models, including simplicity of likelihood function, no need to specify a link function, and has a closed-form expression of distribution functions for study-specific risk differences. We investigate the finite sample performance of this model by simulation studies and illustrate its use with an application to multivariate meta-analysis of adverse events of tricyclic antidepressants treatment in clinical trials.
ERIC Educational Resources Information Center
Taylor, Matthew J.; Merritt, Stephanie M.; Austin, Chammie C.
2013-01-01
A model of negative affect and alcohol use was replicated on a sample of African-American high school students. Participants (N = 5,086) were randomly selected from a previously collected data set and consisted of 2,253 males and 2,833 females residing in both rural and urban locations. Multivariate analysis of covariance and structural equation…
PNPLA3 I148M associations with liver carcinogenesis in Japanese chronic hepatitis C patients.
Nakaoka, Kazunori; Hashimoto, Senju; Kawabe, Naoto; Nitta, Yoshifumi; Murao, Michihito; Nakano, Takuji; Shimazaki, Hiroaki; Kan, Toshiki; Takagawa, Yuka; Ohki, Masashi; Kurashita, Takamitsu; Takamura, Tomoki; Nishikawa, Toru; Ichino, Naohiro; Osakabe, Keisuke; Yoshioka, Kentaro
2015-01-01
To investigate associations between patatin-like phospholipase domain-containing 3 (PNPLA3) genotypes and fibrosis and hepatocarcinogenesis in Japanese chronic hepatitis C (CHC) patients. Two hundred and thirty-one patients with CHC were examined for PNPLA3 genotypes, liver stiffness measurements (LSM), and hepatocellular carcinoma (HCC) from May 2010 to October 2012 at Fujita Health University Hospital. The rs738409 single nucleotide polymorphism (SNP) encoding for a functional PNPLA3 I148M protein variant was genotyped using a TaqMan predesigned SNP genotyping assay. LSM was determined as the velocity of a shear wave (Vs) with an acoustic radiation force impulse. Vs cut-off values for cirrhosis were set at 1.55 m/s. We excluded CHC patients with a sustained virological response or relapse after interferon treatment. PNPLA3 genotypes were CC, CG, and GG for 118, 72, and 41 patients, respectively. Multivariable logistic regression analysis selected older age (OR = 1.06; 95% CI: 1.03-1.09; p < 0.0001), higher body mass index (BMI) (OR= 1.12; 95% CI: 1.03-1.22; p = 0.0082), and PNPLA3 genotype GG (OR = 2.07; 95% CI: 0.97-4.42; p = 0.0599) as the factors independently associated with cirrhosis. When 137 patients without past history of interferon treatment were separately assessed, multivariable logistic regression analysis selected older age (OR = 1.05; 95% CI: 1.02-1.09; p = 0.0034), and PNPLA3 genotype GG (OR = 3.35; 95% CI: 1.13-9.91; p = 0.0291) as the factors independently associated with cirrhosis. Multivariable logistic regression analysis selected older age (OR = 1.12; 95% CI: 1.07-1.17; p < 0.0001), PNPLA3 genotype GG (OR = 2.62; 95% CI: 1.15-5.96; p = 0.0218), and male gender (OR = 1.83; 95% CI: 0.90-3.71); p = 0.0936) as the factors independently associated with HCC. PNPLA3 genotype I148M is one of risk factors for developing HCC in Japanese CHC patients, and is one of risk factors for progress to cirrhosis in the patients without past history of interferon treatment.
Brühlmann, David; Sokolov, Michael; Butté, Alessandro; Sauer, Markus; Hemberger, Jürgen; Souquet, Jonathan; Broly, Hervé; Jordan, Martin
2017-07-01
Rational and high-throughput optimization of mammalian cell culture media has a great potential to modulate recombinant protein product quality. We present a process design method based on parallel design-of-experiment (DoE) of CHO fed-batch cultures in 96-deepwell plates to modulate monoclonal antibody (mAb) glycosylation using medium supplements. To reduce the risk of losing valuable information in an intricate joint screening, 17 compounds were separated into five different groups, considering their mode of biological action. The concentration ranges of the medium supplements were defined according to information encountered in the literature and in-house experience. The screening experiments produced wide glycosylation pattern ranges. Multivariate analysis including principal component analysis and decision trees was used to select the best performing glycosylation modulators. Subsequent D-optimal quadratic design with four factors (three promising compounds and temperature shift) in shake tubes confirmed the outcome of the selection process and provided a solid basis for sequential process development at a larger scale. The glycosylation profile with respect to the specifications for biosimilarity was greatly improved in shake tube experiments: 75% of the conditions were equally close or closer to the specifications for biosimilarity than the best 25% in 96-deepwell plates. Biotechnol. Bioeng. 2017;114: 1448-1458. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Abbiati, Milena; Baroffio, Anne; Gerbase, Margaret W.
2016-01-01
Introduction A consistent body of literature highlights the importance of a broader approach to select medical school candidates both assessing cognitive capacity and individual characteristics. However, selection in a great number of medical schools worldwide is still based on knowledge exams, a procedure that might neglect students with needed personal characteristics for future medical practice. We investigated whether the personal profile of students selected through a knowledge-based exam differed from those not selected. Methods Students applying for medical school (N=311) completed questionnaires assessing motivations for becoming a doctor, learning approaches, personality traits, empathy, and coping styles. Selection was based on the results of MCQ tests. Principal component analysis was used to draw a profile of the students. Differences between selected and non-selected students were examined by Multivariate ANOVAs, and their impact on selection by logistic regression analysis. Results Students demonstrating a profile of diligence with higher conscientiousness, deep learning approach, and task-focused coping were more frequently selected (p=0.01). Other personal characteristics such as motivation, sociability, and empathy did not significantly differ, comparing selected and non-selected students. Conclusion Selection through a knowledge-based exam privileged diligent students. It did neither advantage nor preclude candidates with a more humane profile. PMID:27079886
Selecting minimum dataset soil variables using PLSR as a regressive multivariate method
NASA Astrophysics Data System (ADS)
Stellacci, Anna Maria; Armenise, Elena; Castellini, Mirko; Rossi, Roberta; Vitti, Carolina; Leogrande, Rita; De Benedetto, Daniela; Ferrara, Rossana M.; Vivaldi, Gaetano A.
2017-04-01
Long-term field experiments and science-based tools that characterize soil status (namely the soil quality indices, SQIs) assume a strategic role in assessing the effect of agronomic techniques and thus in improving soil management especially in marginal environments. Selecting key soil variables able to best represent soil status is a critical step for the calculation of SQIs. Current studies show the effectiveness of statistical methods for variable selection to extract relevant information deriving from multivariate datasets. Principal component analysis (PCA) has been mainly used, however supervised multivariate methods and regressive techniques are progressively being evaluated (Armenise et al., 2013; de Paul Obade et al., 2016; Pulido Moncada et al., 2014). The present study explores the effectiveness of partial least square regression (PLSR) in selecting critical soil variables, using a dataset comparing conventional tillage and sod-seeding on durum wheat. The results were compared to those obtained using PCA and stepwise discriminant analysis (SDA). The soil data derived from a long-term field experiment in Southern Italy. On samples collected in April 2015, the following set of variables was quantified: (i) chemical: total organic carbon and nitrogen (TOC and TN), alkali-extractable C (TEC and humic substances - HA-FA), water extractable N and organic C (WEN and WEOC), Olsen extractable P, exchangeable cations, pH and EC; (ii) physical: texture, dry bulk density (BD), macroporosity (Pmac), air capacity (AC), and relative field capacity (RFC); (iii) biological: carbon of the microbial biomass quantified with the fumigation-extraction method. PCA and SDA were previously applied to the multivariate dataset (Stellacci et al., 2016). PLSR was carried out on mean centered and variance scaled data of predictors (soil variables) and response (wheat yield) variables using the PLS procedure of SAS/STAT. In addition, variable importance for projection (VIP) statistics was used to quantitatively assess the predictors most relevant for response variable estimation and then for variable selection (Andersen and Bro, 2010). PCA and SDA returned TOC and RFC as influential variables both on the set of chemical and physical data analyzed separately as well as on the whole dataset (Stellacci et al., 2016). Highly weighted variables in PCA were also TEC, followed by K, and AC, followed by Pmac and BD, in the first PC (41.2% of total variance); Olsen P and HA-FA in the second PC (12.6%), Ca in the third (10.6%) component. Variables enabling maximum discrimination among treatments for SDA were WEOC, on the whole dataset, humic substances, followed by Olsen P, EC and clay, in the separate data analyses. The highest PLS-VIP statistics were recorded for Olsen P and Pmac, followed by TOC, TEC, pH and Mg for chemical variables and clay, RFC and AC for the physical variables. Results show that different methods may provide different ranking of the selected variables and the presence of a response variable, in regressive techniques, may affect variable selection. Further investigation with different response variables and with multi-year datasets would allow to better define advantages and limits of single or combined approaches. Acknowledgment The work was supported by the projects "BIOTILLAGE, approcci innovative per il miglioramento delle performances ambientali e produttive dei sistemi cerealicoli no-tillage", financed by PSR-Basilicata 2007-2013, and "DESERT, Low-cost water desalination and sensor technology compact module" financed by ERANET-WATERWORKS 2014. References Andersen C.M. and Bro R., 2010. Variable selection in regression - a tutorial. Journal of Chemometrics, 24 728-737. Armenise et al., 2013. Developing a soil quality index to compare soil fitness for agricultural use under different managements in the mediterranean environment. Soil and Tillage Research, 130:91-98. de Paul Obade et al., 2016. A standardized soil quality index for diverse field conditions. Sci. Total Env. 541:424-434. Pulido Moncada et al., 2014. Data-driven analysis of soil quality indicators using limited data. Geoderma, 235:271-278. Stellacci et al., 2016. Comparison of different multivariate methods to select key soil variables for soil quality indices computation. XLV Congress of the Italian Society of Agronomy (SIA), Sassari, 20-22 September 2016.
Reflectance of vegetation, soil, and water
NASA Technical Reports Server (NTRS)
Wiegand, C. L. (Principal Investigator)
1973-01-01
There are no author-identified significant results in this report. This report deals with the selection of the best channels from the 24-channel aircraft data to represent crop and soil conditions. A three-step procedure has been developed that involves using univariate statistics and an F-ratio test to indicate the best 14 channels. From the 14, the 10 best channels are selected by a multivariate stochastic process. The third step involves the pattern recognition procedures developed in the data analysis plan. Indications are that the procedures in use are satsifactory and will extract the desired information from the data.
Lu, Tsui-Shan; Longnecker, Matthew P.; Zhou, Haibo
2016-01-01
Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data and the general ODS design for a continuous response. While substantial work has been done for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome dependent sampling (Multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the Multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the Multivariate-ODS or the estimator from a simple random sample with the same sample size. The Multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of PCB exposure to hearing loss in children born to the Collaborative Perinatal Study. PMID:27966260
Liu, Xueqin; Li, Ning; Yuan, Shuai; Xu, Ning; Shi, Wenqin; Chen, Weibin
2015-12-15
As a random event, a natural disaster has the complex occurrence mechanism. The comprehensive analysis of multiple hazard factors is important in disaster risk assessment. In order to improve the accuracy of risk analysis and forecasting, the formation mechanism of a disaster should be considered in the analysis and calculation of multi-factors. Based on the consideration of the importance and deficiencies of multivariate analysis of dust storm disasters, 91 severe dust storm disasters in Inner Mongolia from 1990 to 2013 were selected as study cases in the paper. Main hazard factors from 500-hPa atmospheric circulation system, near-surface meteorological system, and underlying surface conditions were selected to simulate and calculate the multidimensional joint return periods. After comparing the simulation results with actual dust storm events in 54years, we found that the two-dimensional Frank Copula function showed the better fitting results at the lower tail of hazard factors and that three-dimensional Frank Copula function displayed the better fitting results at the middle and upper tails of hazard factors. However, for dust storm disasters with the short return period, three-dimensional joint return period simulation shows no obvious advantage. If the return period is longer than 10years, it shows significant advantages in extreme value fitting. Therefore, we suggest the multivariate analysis method may be adopted in forecasting and risk analysis of serious disasters with the longer return period, such as earthquake and tsunami. Furthermore, the exploration of this method laid the foundation for the prediction and warning of other nature disasters. Copyright © 2015 Elsevier B.V. All rights reserved.
Papillary type 2 versus clear cell renal cell carcinoma: Survival outcomes.
Simone, G; Tuderti, G; Ferriero, M; Papalia, R; Misuraca, L; Minisola, F; Costantini, M; Mastroianni, R; Sentinelli, S; Guaglianone, S; Gallucci, M
2016-11-01
To compare the cancer specific survival (CSS) between p2-RCC and a Propensity Score Matched (PSM) cohort of cc-RCC patients. Fifty-five (4.6%) patients with p2-RCC and 920 cc-RCC patients were identified within a prospectively maintained institutional dataset of 1205 histologically proved RCC patients treated with either RN or PN. Univariable and multivariable Cox regression analyses were used to identify predictors of CSS after surgical treatment. A 1:2 PSM analysis based on independent predictors of oncologic outcomes was employed and CSS was compared between PSM selected cc-RCC patients using Kaplan-Meier and Cox regression analysis. Overall, 55 (4.6%) p2-RCC and 920 (76.3%) cc-RCC patients were selected from the database; p2-RCC were significantly larger (p = 0.001), more frequently locally advanced (p < 0.001) and node positive (p < 0.001) and had significantly higher Fuhrman grade (p < 0.001) than cc-RCC. On multivariable Cox regression analysis age (p = 0.025), histologic subtype (p = 0.029), pN stage (p = 0.006), size, pT stage, cM stage, sarcomatoid features and Fuhrman grade (all p < 0.001) were independent predictors of CSS. After applying the PSM, 82 cc-RCC selected cases were comparable to 41 p2-RCC for age (p = 0.81), tumor size (p = 0.39), pT (p = 1.00) and pN (p = 0.62) stages, cM stage (p = 0.71) and Fuhrman grade (p = 1). In this PSM cohort, 5 yr CSS was significantly lower in the p2-RCC (63% vs 72.4%; p = 0.047). At multivariable Cox analysis p2 histology was an independent predictor of CSM (HR 2.46, 95% CI 1.04-5.83; p = 0.041). We confirmed the tendency of p2-RCC to present as locally advanced and metastatic disease more frequently than cc-RCC and demonstrated p2-RCC histology as an independent predictor of worse oncologic outcomes. Copyright © 2016 Elsevier Ltd, BASO ~ The Association for Cancer Surgery, and the European Society of Surgical Oncology. All rights reserved.
Using a Grocery List Is Associated With a Healthier Diet and Lower BMI Among Very High-Risk Adults.
Dubowitz, Tamara; Cohen, Deborah A; Huang, Christina Y; Beckman, Robin A; Collins, Rebecca L
2015-01-01
Examine whether use of a grocery list is associated with healthier diet and weight among food desert residents. Cross-sectional analysis of in-person interview data from randomly selected household food shoppers in 2 low-income, primarily African American urban neighborhoods in Pittsburgh, PA with limited access to healthy foods. Multivariate ordinary least-square regressions conducted among 1,372 participants and controlling for sociodemographic factors and other potential confounding variables indicated that although most of the sample (78%) was overweight or obese, consistently using a list was associated with lower body mass index (based on measured height and weight) (adjusted multivariant coefficient = 0.095) and higher dietary quality (based on the Healthy Eating Index-2005) (adjusted multivariant coefficient = 0.103) (P < .05). Shopping with a list may be a useful tool for low-income individuals to improve diet or decrease body mass index. Copyright © 2015 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Potyrailo, Radislav A
2016-10-12
Modern gas monitoring scenarios for medical diagnostics, environmental surveillance, industrial safety, and other applications demand new sensing capabilities. This Review provides analysis of development of new generation of gas sensors based on the multivariable response principles. Design criteria of these individual sensors involve a sensing material with multiresponse mechanisms to different gases and a multivariable transducer with independent outputs to recognize these different gas responses. These new sensors quantify individual components in mixtures, reject interferences, and offer more stable response over sensor arrays. Such performance is attractive when selectivity advantages of classic gas chromatography, ion mobility, and mass spectrometry instruments are canceled by requirements for no consumables, low power, low cost, and unobtrusive form factors for Internet of Things, Industrial Internet, and other applications. This Review is concluded with a perspective for future needs in fundamental and applied aspects of gas sensing and with the 2025 roadmap for ubiquitous gas monitoring.
Amodal processing in human prefrontal cortex.
Tamber-Rosenau, Benjamin J; Dux, Paul E; Tombu, Michael N; Asplund, Christopher L; Marois, René
2013-07-10
Information enters the cortex via modality-specific sensory regions, whereas actions are produced by modality-specific motor regions. Intervening central stages of information processing map sensation to behavior. Humans perform this central processing in a flexible, abstract manner such that sensory information in any modality can lead to response via any motor system. Cognitive theories account for such flexible behavior by positing amodal central information processing (e.g., "central executive," Baddeley and Hitch, 1974; "supervisory attentional system," Norman and Shallice, 1986; "response selection bottleneck," Pashler, 1994). However, the extent to which brain regions embodying central mechanisms of information processing are amodal remains unclear. Here we apply multivariate pattern analysis to functional magnetic resonance imaging (fMRI) data to compare response selection, a cognitive process widely believed to recruit an amodal central resource across sensory and motor modalities. We show that most frontal and parietal cortical areas known to activate across a wide variety of tasks code modality, casting doubt on the notion that these regions embody a central processor devoid of modality representation. Importantly, regions of anterior insula and dorsolateral prefrontal cortex consistently failed to code modality across four experiments. However, these areas code at least one other task dimension, process (instantiated as response selection vs response execution), ensuring that failure to find coding of modality is not driven by insensitivity of multivariate pattern analysis in these regions. We conclude that abstract encoding of information modality is primarily a property of subregions of the prefrontal cortex.
Lê Cao, Kim-Anh; Boitard, Simon; Besse, Philippe
2011-06-22
Variable selection on high throughput biological data, such as gene expression or single nucleotide polymorphisms (SNPs), becomes inevitable to select relevant information and, therefore, to better characterize diseases or assess genetic structure. There are different ways to perform variable selection in large data sets. Statistical tests are commonly used to identify differentially expressed features for explanatory purposes, whereas Machine Learning wrapper approaches can be used for predictive purposes. In the case of multiple highly correlated variables, another option is to use multivariate exploratory approaches to give more insight into cell biology, biological pathways or complex traits. A simple extension of a sparse PLS exploratory approach is proposed to perform variable selection in a multiclass classification framework. sPLS-DA has a classification performance similar to other wrapper or sparse discriminant analysis approaches on public microarray and SNP data sets. More importantly, sPLS-DA is clearly competitive in terms of computational efficiency and superior in terms of interpretability of the results via valuable graphical outputs. sPLS-DA is available in the R package mixOmics, which is dedicated to the analysis of large biological data sets.
Multivariate analysis in thoracic research.
Mengual-Macenlle, Noemí; Marcos, Pedro J; Golpe, Rafael; González-Rivas, Diego
2015-03-01
Multivariate analysis is based in observation and analysis of more than one statistical outcome variable at a time. In design and analysis, the technique is used to perform trade studies across multiple dimensions while taking into account the effects of all variables on the responses of interest. The development of multivariate methods emerged to analyze large databases and increasingly complex data. Since the best way to represent the knowledge of reality is the modeling, we should use multivariate statistical methods. Multivariate methods are designed to simultaneously analyze data sets, i.e., the analysis of different variables for each person or object studied. Keep in mind at all times that all variables must be treated accurately reflect the reality of the problem addressed. There are different types of multivariate analysis and each one should be employed according to the type of variables to analyze: dependent, interdependence and structural methods. In conclusion, multivariate methods are ideal for the analysis of large data sets and to find the cause and effect relationships between variables; there is a wide range of analysis types that we can use.
Spelt, Lidewij; Sasor, Agata; Ansari, Daniel; Andersson, Roland
2016-10-01
To identify significant predictive factors for overall survival (OS) and disease-free survival (DFS) after liver resection for colon cancer metastases, with special focus on features of the primary colon cancer, such as lymph node ratio (LNR), vascular invasion, and perineural invasion. Patients operated for colonic cancer liver metastases between 2006 and 2014 were included. Details on patient characteristics, the primary colon cancer operation and metastatic disease were collected. Multivariate analysis was performed to select predictive variables for OS and DFS. Median OS and DFS were 67 and 20 months, respectively. 1-, 3- and 5-year OS were 97, 76, and 52%. 1-, 3- and 5-year DFS were 65, 42, and 37%. Multivariate analysis showed LNR to be an independent predictive factor for DFS but not for OS. Other identified predictive factors were vascular and perineural invasion of the primary colon cancer, size of the largest metastasis and severe complications after liver surgery for OS, and perineural invasion, number of liver metastases and preoperative CEA-level for DFS. Traditional N-stage was also considered to be an independent predictive factor for DFS in a separate multivariate analysis. LNR and perineural invasion of the primary colon cancer can be used as a prognostic variable for DFS after a concomitant liver resection for colon cancer metastases. Vascular and perineural invasion of the primary colon cancer are predictive for OS.
Moseson, Heidi; Gerdts, Caitlin; Dehlendorf, Christine; Hiatt, Robert A; Vittinghoff, Eric
2017-12-21
The list experiment is a promising measurement tool for eliciting truthful responses to stigmatized or sensitive health behaviors. However, investigators may be hesitant to adopt the method due to previously untestable assumptions and the perceived inability to conduct multivariable analysis. With a recently developed statistical test that can detect the presence of a design effect - the absence of which is a central assumption of the list experiment method - we sought to test the validity of a list experiment conducted on self-reported abortion in Liberia. We also aim to introduce recently developed multivariable regression estimators for the analysis of list experiment data, to explore relationships between respondent characteristics and having had an abortion - an important component of understanding the experiences of women who have abortions. To test the null hypothesis of no design effect in the Liberian list experiment data, we calculated the percentage of each respondent "type," characterized by response to the control items, and compared these percentages across treatment and control groups with a Bonferroni-adjusted alpha criterion. We then implemented two least squares and two maximum likelihood models (four total), each representing different bias-variance trade-offs, to estimate the association between respondent characteristics and abortion. We find no clear evidence of a design effect in list experiment data from Liberia (p = 0.18), affirming the first key assumption of the method. Multivariable analyses suggest a negative association between education and history of abortion. The retrospective nature of measuring lifetime experience of abortion, however, complicates interpretation of results, as the timing and safety of a respondent's abortion may have influenced her ability to pursue an education. Our work demonstrates that multivariable analyses, as well as statistical testing of a key design assumption, are possible with list experiment data, although with important limitations when considering lifetime measures. We outline how to implement this methodology with list experiment data in future research.
Multivariate Radiological-Based Models for the Prediction of Future Knee Pain: Data from the OAI
Galván-Tejada, Jorge I.; Celaya-Padilla, José M.; Treviño, Victor; Tamez-Peña, José G.
2015-01-01
In this work, the potential of X-ray based multivariate prognostic models to predict the onset of chronic knee pain is presented. Using X-rays quantitative image assessments of joint-space-width (JSW) and paired semiquantitative central X-ray scores from the Osteoarthritis Initiative (OAI), a case-control study is presented. The pain assessments of the right knee at the baseline and the 60-month visits were used to screen for case/control subjects. Scores were analyzed at the time of pain incidence (T-0), the year prior incidence (T-1), and two years before pain incidence (T-2). Multivariate models were created by a cross validated elastic-net regularized generalized linear models feature selection tool. Univariate differences between cases and controls were reported by AUC, C-statistics, and ODDs ratios. Univariate analysis indicated that the medial osteophytes were significantly more prevalent in cases than controls: C-stat 0.62, 0.62, and 0.61, at T-0, T-1, and T-2, respectively. The multivariate JSW models significantly predicted pain: AUC = 0.695, 0.623, and 0.620, at T-0, T-1, and T-2, respectively. Semiquantitative multivariate models predicted paint with C-stat = 0.671, 0.648, and 0.645 at T-0, T-1, and T-2, respectively. Multivariate models derived from plain X-ray radiography assessments may be used to predict subjects that are at risk of developing knee pain. PMID:26504490
Vanderhaeghe, F; Smolders, A J P; Roelofs, J G M; Hoffmann, M
2012-03-01
Selecting an appropriate variable subset in linear multivariate methods is an important methodological issue for ecologists. Interest often exists in obtaining general predictive capacity or in finding causal inferences from predictor variables. Because of a lack of solid knowledge on a studied phenomenon, scientists explore predictor variables in order to find the most meaningful (i.e. discriminating) ones. As an example, we modelled the response of the amphibious softwater plant Eleocharis multicaulis using canonical discriminant function analysis. We asked how variables can be selected through comparison of several methods: univariate Pearson chi-square screening, principal components analysis (PCA) and step-wise analysis, as well as combinations of some methods. We expected PCA to perform best. The selected methods were evaluated through fit and stability of the resulting discriminant functions and through correlations between these functions and the predictor variables. The chi-square subset, at P < 0.05, followed by a step-wise sub-selection, gave the best results. In contrast to expectations, PCA performed poorly, as so did step-wise analysis. The different chi-square subset methods all yielded ecologically meaningful variables, while probable noise variables were also selected by PCA and step-wise analysis. We advise against the simple use of PCA or step-wise discriminant analysis to obtain an ecologically meaningful variable subset; the former because it does not take into account the response variable, the latter because noise variables are likely to be selected. We suggest that univariate screening techniques are a worthwhile alternative for variable selection in ecology. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.
Grootswagers, Tijl; Wardle, Susan G; Carlson, Thomas A
2017-04-01
Multivariate pattern analysis (MVPA) or brain decoding methods have become standard practice in analyzing fMRI data. Although decoding methods have been extensively applied in brain-computer interfaces, these methods have only recently been applied to time series neuroimaging data such as MEG and EEG to address experimental questions in cognitive neuroscience. In a tutorial style review, we describe a broad set of options to inform future time series decoding studies from a cognitive neuroscience perspective. Using example MEG data, we illustrate the effects that different options in the decoding analysis pipeline can have on experimental results where the aim is to "decode" different perceptual stimuli or cognitive states over time from dynamic brain activation patterns. We show that decisions made at both preprocessing (e.g., dimensionality reduction, subsampling, trial averaging) and decoding (e.g., classifier selection, cross-validation design) stages of the analysis can significantly affect the results. In addition to standard decoding, we describe extensions to MVPA for time-varying neuroimaging data including representational similarity analysis, temporal generalization, and the interpretation of classifier weight maps. Finally, we outline important caveats in the design and interpretation of time series decoding experiments.
Linear regression analysis: part 14 of a series on evaluation of scientific publications.
Schneider, Astrid; Hommel, Gerhard; Blettner, Maria
2010-11-01
Regression analysis is an important statistical method for the analysis of medical data. It enables the identification and characterization of relationships among multiple factors. It also enables the identification of prognostically relevant risk factors and the calculation of risk scores for individual prognostication. This article is based on selected textbooks of statistics, a selective review of the literature, and our own experience. After a brief introduction of the uni- and multivariable regression models, illustrative examples are given to explain what the important considerations are before a regression analysis is performed, and how the results should be interpreted. The reader should then be able to judge whether the method has been used correctly and interpret the results appropriately. The performance and interpretation of linear regression analysis are subject to a variety of pitfalls, which are discussed here in detail. The reader is made aware of common errors of interpretation through practical examples. Both the opportunities for applying linear regression analysis and its limitations are presented.
Correlative and multivariate analysis of increased radon concentration in underground laboratory.
Maletić, Dimitrije M; Udovičić, Vladimir I; Banjanac, Radomir M; Joković, Dejan R; Dragić, Aleksandar L; Veselinović, Nikola B; Filipović, Jelena
2014-11-01
The results of analysis using correlative and multivariate methods, as developed for data analysis in high-energy physics and implemented in the Toolkit for Multivariate Analysis software package, of the relations of the variation of increased radon concentration with climate variables in shallow underground laboratory is presented. Multivariate regression analysis identified a number of multivariate methods which can give a good evaluation of increased radon concentrations based on climate variables. The use of the multivariate regression methods will enable the investigation of the relations of specific climate variable with increased radon concentrations by analysis of regression methods resulting in 'mapped' underlying functional behaviour of radon concentrations depending on a wide spectrum of climate variables. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Thiagarajah, Shankar; Wilkinson, J. Mark; Panoutsopoulou, Kalliope; Day‐Williams, Aaron G.; Cootes, Timothy F.; Wallis, Gillian A.; Loughlin, John; Arden, Nigel; Birrell, Fraser; Carr, Andrew; Chapman, Kay; Deloukas, Panos; Doherty, Michael; McCaskie, Andrew; Ollier, William E. R.; Rai, Ashok; Ralston, Stuart H.; Spector, Timothy D.; Valdes, Ana M.; Wallis, Gillian A.; Mark Wilkinson, J.; Zeggini, Eleftheria
2015-01-01
Objective To test whether previously reported hip morphology or osteoarthritis (OA) susceptibility loci are associated with proximal femur shape as represented by statistical shape model (SSM) modes and as univariate or multivariate quantitative traits. Methods We used pelvic radiographs and genotype data from 929 subjects with unilateral hip OA who had been recruited previously for the Arthritis Research UK Osteoarthritis Genetics Consortium genome‐wide association study. We built 3 SSMs capturing the shape variation of the OA‐unaffected proximal femur in the entire mixed‐sex cohort and for male/female‐stratified cohorts. We selected 41 candidate single‐nucleotide polymorphisms (SNPs) previously reported as being associated with hip morphology (for replication analysis) or OA (for discovery analysis) and for which genotype data were available. We performed 2 types of analysis for genotype–phenotype associations between these SNPs and the modes of the SSMs: 1) a univariate analysis using individual SSM modes and 2) a multivariate analysis using combinations of SSM modes. Results The univariate analysis identified association between rs4836732 (within the ASTN2 gene) and mode 5 of the female SSM (P = 0.0016) and between rs6976 (within the GLT8D1 gene) and mode 7 of the mixed‐sex SSM (P = 0.0003). The multivariate analysis identified association between rs5009270 (near the IFRD1 gene) and a combination of modes 3, 4, and 9 of the mixed‐sex SSM (P = 0.0004). Evidence of associations remained significant following adjustment for multiple testing. All 3 SNPs had previously been associated with hip OA. Conclusion These de novo findings suggest that rs4836732, rs6976, and rs5009270 may contribute to hip OA susceptibility by altering proximal femur shape. PMID:25939412
NASA Technical Reports Server (NTRS)
Hague, D. S.; Woodbury, N. W.
1975-01-01
The Mars system is a tool for rapid prediction of aircraft or engine characteristics based on correlation-regression analysis of past designs stored in the data bases. An example of output obtained from the MARS system, which involves derivation of an expression for gross weight of subsonic transport aircraft in terms of nine independent variables is given. The need is illustrated for careful selection of correlation variables and for continual review of the resulting estimation equations. For Vol. 1, see N76-10089.
ERIC Educational Resources Information Center
Grochowalski, Joseph H.
2015-01-01
Component Universe Score Profile analysis (CUSP) is introduced in this paper as a psychometric alternative to multivariate profile analysis. The theoretical foundations of CUSP analysis are reviewed, which include multivariate generalizability theory and constrained principal components analysis. Because CUSP is a combination of generalizability…
A direct-gradient multivariate index of biotic condition
Miranda, Leandro E.; Aycock, J.N.; Killgore, K. J.
2012-01-01
Multimetric indexes constructed by summing metric scores have been criticized despite many of their merits. A leading criticism is the potential for investigator bias involved in metric selection and scoring. Often there is a large number of competing metrics equally well correlated with environmental stressors, requiring a judgment call by the investigator to select the most suitable metrics to include in the index and how to score them. Data-driven procedures for multimetric index formulation published during the last decade have reduced this limitation, yet apprehension remains. Multivariate approaches that select metrics with statistical algorithms may reduce the level of investigator bias and alleviate a weakness of multimetric indexes. We investigated the suitability of a direct-gradient multivariate procedure to derive an index of biotic condition for fish assemblages in oxbow lakes in the Lower Mississippi Alluvial Valley. Although this multivariate procedure also requires that the investigator identify a set of suitable metrics potentially associated with a set of environmental stressors, it is different from multimetric procedures because it limits investigator judgment in selecting a subset of biotic metrics to include in the index and because it produces metric weights suitable for computation of index scores. The procedure, applied to a sample of 35 competing biotic metrics measured at 50 oxbow lakes distributed over a wide geographical region in the Lower Mississippi Alluvial Valley, selected 11 metrics that adequately indexed the biotic condition of five test lakes. Because the multivariate index includes only metrics that explain the maximum variability in the stressor variables rather than a balanced set of metrics chosen to reflect various fish assemblage attributes, it is fundamentally different from multimetric indexes of biotic integrity with advantages and disadvantages. As such, it provides an alternative to multimetric procedures.
2011-01-01
Introduction Necrotizing fasciitis (NF) is a life threatening infectious disease with a high mortality rate. We carried out a microbiological characterization of the causative pathogens. We investigated the correlation of mortality in NF with bloodstream infection and with the presence of co-morbidities. Methods In this retrospective study, we analyzed 323 patients who presented with necrotizing fasciitis at two different institutions. Bloodstream infection (BSI) was defined as a positive blood culture result. The patients were categorized as survivors and non-survivors. Eleven clinically important variables which were statistically significant by univariate analysis were selected for multivariate regression analysis and a stepwise logistic regression model was developed to determine the association between BSI and mortality. Results Univariate logistic regression analysis showed that patients with hypotension, heart disease, liver disease, presence of Vibrio spp. in wound cultures, presence of fungus in wound cultures, and presence of Streptococcus group A, Aeromonas spp. or Vibrio spp. in blood cultures, had a significantly higher risk of in-hospital mortality. Our multivariate logistic regression analysis showed a higher risk of mortality in patients with pre-existing conditions like hypotension, heart disease, and liver disease. Multivariate logistic regression analysis also showed that presence of Vibrio spp in wound cultures, and presence of Streptococcus Group A in blood cultures were associated with a high risk of mortality while debridement > = 3 was associated with improved survival. Conclusions Mortality in patients with necrotizing fasciitis was significantly associated with the presence of Vibrio in wound cultures and Streptococcus group A in blood cultures. PMID:21693053
van Mierlo, Pieter; Lie, Octavian; Staljanssens, Willeke; Coito, Ana; Vulliémoz, Serge
2018-04-26
We investigated the influence of processing steps in the estimation of multivariate directed functional connectivity during seizures recorded with intracranial EEG (iEEG) on seizure-onset zone (SOZ) localization. We studied the effect of (i) the number of nodes, (ii) time-series normalization, (iii) the choice of multivariate time-varying connectivity measure: Adaptive Directed Transfer Function (ADTF) or Adaptive Partial Directed Coherence (APDC) and (iv) graph theory measure: outdegree or shortest path length. First, simulations were performed to quantify the influence of the various processing steps on the accuracy to localize the SOZ. Afterwards, the SOZ was estimated from a 113-electrodes iEEG seizure recording and compared with the resection that rendered the patient seizure-free. The simulations revealed that ADTF is preferred over APDC to localize the SOZ from ictal iEEG recordings. Normalizing the time series before analysis resulted in an increase of 25-35% of correctly localized SOZ, while adding more nodes to the connectivity analysis led to a moderate decrease of 10%, when comparing 128 with 32 input nodes. The real-seizure connectivity estimates localized the SOZ inside the resection area using the ADTF coupled to outdegree or shortest path length. Our study showed that normalizing the time-series is an important pre-processing step, while adding nodes to the analysis did only marginally affect the SOZ localization. The study shows that directed multivariate Granger-based connectivity analysis is feasible with many input nodes (> 100) and that normalization of the time-series before connectivity analysis is preferred.
Multivariable PID controller design tuning using bat algorithm for activated sludge process
NASA Astrophysics Data System (ADS)
Atikah Nor’Azlan, Nur; Asmiza Selamat, Nur; Mat Yahya, Nafrizuan
2018-04-01
The designing of a multivariable PID control for multi input multi output is being concerned with this project by applying four multivariable PID control tuning which is Davison, Penttinen-Koivo, Maciejowski and Proposed Combined method. The determination of this study is to investigate the performance of selected optimization technique to tune the parameter of MPID controller. The selected optimization technique is Bat Algorithm (BA). All the MPID-BA tuning result will be compared and analyzed. Later, the best MPID-BA will be chosen in order to determine which techniques are better based on the system performances in terms of transient response.
Chen, Xiaohong; Fan, Yanqin; Pouzo, Demian; Ying, Zhiliang
2010-07-01
We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root- n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided.
Chen, Xiaohong; Fan, Yanqin; Pouzo, Demian; Ying, Zhiliang
2013-01-01
We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root-n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided. PMID:24790286
Lo, Kenneth
2011-01-01
Cluster analysis is the automated search for groups of homogeneous observations in a data set. A popular modeling approach for clustering is based on finite normal mixture models, which assume that each cluster is modeled as a multivariate normal distribution. However, the normality assumption that each component is symmetric is often unrealistic. Furthermore, normal mixture models are not robust against outliers; they often require extra components for modeling outliers and/or give a poor representation of the data. To address these issues, we propose a new class of distributions, multivariate t distributions with the Box-Cox transformation, for mixture modeling. This class of distributions generalizes the normal distribution with the more heavy-tailed t distribution, and introduces skewness via the Box-Cox transformation. As a result, this provides a unified framework to simultaneously handle outlier identification and data transformation, two interrelated issues. We describe an Expectation-Maximization algorithm for parameter estimation along with transformation selection. We demonstrate the proposed methodology with three real data sets and simulation studies. Compared with a wealth of approaches including the skew-t mixture model, the proposed t mixture model with the Box-Cox transformation performs favorably in terms of accuracy in the assignment of observations, robustness against model misspecification, and selection of the number of components. PMID:22125375
Lo, Kenneth; Gottardo, Raphael
2012-01-01
Cluster analysis is the automated search for groups of homogeneous observations in a data set. A popular modeling approach for clustering is based on finite normal mixture models, which assume that each cluster is modeled as a multivariate normal distribution. However, the normality assumption that each component is symmetric is often unrealistic. Furthermore, normal mixture models are not robust against outliers; they often require extra components for modeling outliers and/or give a poor representation of the data. To address these issues, we propose a new class of distributions, multivariate t distributions with the Box-Cox transformation, for mixture modeling. This class of distributions generalizes the normal distribution with the more heavy-tailed t distribution, and introduces skewness via the Box-Cox transformation. As a result, this provides a unified framework to simultaneously handle outlier identification and data transformation, two interrelated issues. We describe an Expectation-Maximization algorithm for parameter estimation along with transformation selection. We demonstrate the proposed methodology with three real data sets and simulation studies. Compared with a wealth of approaches including the skew-t mixture model, the proposed t mixture model with the Box-Cox transformation performs favorably in terms of accuracy in the assignment of observations, robustness against model misspecification, and selection of the number of components.
He, Jie; Zhao, Yunfeng; Zhao, Jingli; Gao, Jin; Han, Dandan; Xu, Pao; Yang, Runqing
2017-11-02
Because of their high economic importance, growth traits in fish are under continuous improvement. For growth traits that are recorded at multiple time-points in life, the use of univariate and multivariate animal models is limited because of the variable and irregular timing of these measures. Thus, the univariate random regression model (RRM) was introduced for the genetic analysis of dynamic growth traits in fish breeding. We used a multivariate random regression model (MRRM) to analyze genetic changes in growth traits recorded at multiple time-point of genetically-improved farmed tilapia. Legendre polynomials of different orders were applied to characterize the influences of fixed and random effects on growth trajectories. The final MRRM was determined by optimizing the univariate RRM for the analyzed traits separately via penalizing adaptively the likelihood statistical criterion, which is superior to both the Akaike information criterion and the Bayesian information criterion. In the selected MRRM, the additive genetic effects were modeled by Legendre polynomials of three orders for body weight (BWE) and body length (BL) and of two orders for body depth (BD). By using the covariance functions of the MRRM, estimated heritabilities were between 0.086 and 0.628 for BWE, 0.155 and 0.556 for BL, and 0.056 and 0.607 for BD. Only heritabilities for BD measured from 60 to 140 days of age were consistently higher than those estimated by the univariate RRM. All genetic correlations between growth time-points exceeded 0.5 for either single or pairwise time-points. Moreover, correlations between early and late growth time-points were lower. Thus, for phenotypes that are measured repeatedly in aquaculture, an MRRM can enhance the efficiency of the comprehensive selection for BWE and the main morphological traits.
Cao, Hongbao; Duan, Junbo; Lin, Dongdong; Shugart, Yin Yao; Calhoun, Vince; Wang, Yu-Ping
2014-11-15
Integrative analysis of multiple data types can take advantage of their complementary information and therefore may provide higher power to identify potential biomarkers that would be missed using individual data analysis. Due to different natures of diverse data modality, data integration is challenging. Here we address the data integration problem by developing a generalized sparse model (GSM) using weighting factors to integrate multi-modality data for biomarker selection. As an example, we applied the GSM model to a joint analysis of two types of schizophrenia data sets: 759,075 SNPs and 153,594 functional magnetic resonance imaging (fMRI) voxels in 208 subjects (92 cases/116 controls). To solve this small-sample-large-variable problem, we developed a novel sparse representation based variable selection (SRVS) algorithm, with the primary aim to identify biomarkers associated with schizophrenia. To validate the effectiveness of the selected variables, we performed multivariate classification followed by a ten-fold cross validation. We compared our proposed SRVS algorithm with an earlier sparse model based variable selection algorithm for integrated analysis. In addition, we compared with the traditional statistics method for uni-variant data analysis (Chi-squared test for SNP data and ANOVA for fMRI data). Results showed that our proposed SRVS method can identify novel biomarkers that show stronger capability in distinguishing schizophrenia patients from healthy controls. Moreover, better classification ratios were achieved using biomarkers from both types of data, suggesting the importance of integrative analysis. Copyright © 2014 Elsevier Inc. All rights reserved.
Spatial assessment of air quality patterns in Malaysia using multivariate analysis
NASA Astrophysics Data System (ADS)
Dominick, Doreena; Juahir, Hafizan; Latif, Mohd Talib; Zain, Sharifuddin M.; Aris, Ahmad Zaharin
2012-12-01
This study aims to investigate possible sources of air pollutants and the spatial patterns within the eight selected Malaysian air monitoring stations based on a two-year database (2008-2009). The multivariate analysis was applied on the dataset. It incorporated Hierarchical Agglomerative Cluster Analysis (HACA) to access the spatial patterns, Principal Component Analysis (PCA) to determine the major sources of the air pollution and Multiple Linear Regression (MLR) to assess the percentage contribution of each air pollutant. The HACA results grouped the eight monitoring stations into three different clusters, based on the characteristics of the air pollutants and meteorological parameters. The PCA analysis showed that the major sources of air pollution were emissions from motor vehicles, aircraft, industries and areas of high population density. The MLR analysis demonstrated that the main pollutant contributing to variability in the Air Pollutant Index (API) at all stations was particulate matter with a diameter of less than 10 μm (PM10). Further MLR analysis showed that the main air pollutant influencing the high concentration of PM10 was carbon monoxide (CO). This was due to combustion processes, particularly originating from motor vehicles. Meteorological factors such as ambient temperature, wind speed and humidity were also noted to influence the concentration of PM10.
NASA Astrophysics Data System (ADS)
Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi
2018-04-01
Synchronous fluorescence spectra, combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models’ performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and kohonen-self organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.
Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi
2018-03-13
Synchronous fluorescence spectra, combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models' performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and kohonen-self organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.
Factors related to choosing an academic career track among spine fellowship applicants.
Park, Daniel K; Rhee, John M; Wu, Baohua; Easley, Kirk
2013-03-01
Retrospective review. To identify factors associated with the likelihood of spine surgery fellowship applicants choosing an academic job upon fellowship completion. Training academic spine surgeons is an important goal of many spine fellowships. However, there are no established criteria associated with academic job choice to guide selection committees. Two hundred three consecutive applications of candidates who were granted an interview to a single spine surgical fellowship from 2005 to 2010 were analyzed. Factors investigated included the following: membership in honor societies; number of publications, presentations, and book chapters; age; completion of an additional degree; completion of a research fellowship; teaching experience; marital status; graduation from a top-20 school; attendance in a residency with a spine fellowship; and comments made in personal statements and letters of recommendation. The job taken upon graduation from fellowship was determined. The χ2 test or Fisher exact test was used to estimate the strength of the association between the covariates and response. Significant variables were selected for further multivariate analysis. The following were significantly associated in a univariable analysis with academia: 5 or more national presentations; completion of a research fellowship; attendance in a top-20 medical school; stated desire in the personal statement to become an academic surgeon; and letters of reference stating likelihood of pursuing academics on hiring the applicant. When significant variables were selected for multivariable analysis, completion of a research fellowship, graduation from a top-20 medical school, and stated desire in the personal statement to become an academic surgeon were most strongly associated with choice of academia. Although job choice is multifactorial, the present study demonstrates that there are objective factors listed on spine fellowship applications associated with a significantly higher likelihood of academic job choice. Analyzing these factors may help selection committees evaluate spine fellowship applicants consistent with the academic missions of their programs.
Multivariate meta-analysis: potential and promise.
Jackson, Dan; Riley, Richard; White, Ian R
2011-09-10
The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day 'Multivariate meta-analysis' event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd.
Dyes assay for measuring physicochemical parameters.
Moczko, Ewa; Meglinski, Igor V; Bessant, Conrad; Piletsky, Sergey A
2009-03-15
A combination of selective fluorescent dyes has been developed for simultaneous quantitative measurements of several physicochemical parameters. The operating principle of the assay is similar to electronic nose and tongue systems, which combine nonspecific or semispecific elements for the determination of diverse analytes and chemometric techniques for multivariate data analysis. The analytical capability of the proposed mixture is engendered by changes in fluorescence signal in response to changes in environment such as pH, temperature, ionic strength, and presence of oxygen. The signal is detected by a three-dimensional spectrofluorimeter, and the acquired data are processed using an artificial neural network (ANN) for multivariate calibration. The fluorescence spectrum of a solution of selected dyes allows discreet reading of emission maxima of all dyes composing the mixture. The variations in peaks intensities caused by environmental changes provide distinctive fluorescence patterns which can be handled in the same way as the signals collected from nose/tongue electrochemical or piezoelectric devices. This optical system opens possibilities for rapid, inexpensive, real-time detection of a multitude of physicochemical parameters and analytes of complex samples.
Lifshits, A M
1979-01-01
General characteristics of the multivariate statistical analysis (MSA) is given. Methodical premises and criteria for the selection of an adequate MSA method applicable to pathoanatomic investigations of the epidemiology of multicausal diseases are presented. The experience of using MSA with computors and standard computing programs in studies of coronary arteries aterosclerosis on the materials of 2060 autopsies is described. The combined use of 4 MSA methods: sequential, correlational, regressional, and discriminant permitted to quantitate the contribution of each of the 8 examined risk factors in the development of aterosclerosis. The most important factors were found to be the age, arterial hypertension, and heredity. Occupational hypodynamia and increased fatness were more important in men, whereas diabetes melitus--in women. The registration of this combination of risk factors by MSA methods provides for more reliable prognosis of the likelihood of coronary heart disease with a fatal outcome than prognosis of the degree of coronary aterosclerosis.
TG study of the Li0.4Fe2.4Zn0.2O4 ferrite synthesis
NASA Astrophysics Data System (ADS)
Lysenko, E. N.; Nikolaev, E. V.; Surzhikov, A. P.
2016-02-01
In this paper, the kinetic analysis of Li-Zn ferrite synthesis was studied using thermogravimetry (TG) method through the simultaneous application of non-linear regression to several measurements run at different heating rates (multivariate non-linear regression). Using TG-curves obtained for the four heating rates and Netzsch Thermokinetics software package, the kinetic models with minimal adjustable parameters were selected to quantitatively describe the reaction of Li-Zn ferrite synthesis. It was shown that the experimental TG-curves clearly suggest a two-step process for the ferrite synthesis and therefore a model-fitting kinetic analysis based on multivariate non-linear regressions was conducted. The complex reaction was described by a two-step reaction scheme consisting of sequential reaction steps. It is established that the best results were obtained using the Yander three-dimensional diffusion model at the first stage and Ginstling-Bronstein model at the second step. The kinetic parameters for lithium-zinc ferrite synthesis reaction were found and discussed.
Lu, Tsui-Shan; Longnecker, Matthew P; Zhou, Haibo
2017-03-15
Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data, and the general ODS design for a continuous response. While substantial work has been carried out for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome-dependent sampling (multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the multivariate-ODS or the estimator from a simple random sample with the same sample size. The multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of polychlorinated biphenyl exposure to hearing loss in children born to the Collaborative Perinatal Study. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
TOF-SIMS imaging technique with information entropy
NASA Astrophysics Data System (ADS)
Aoyagi, Satoka; Kawashima, Y.; Kudo, Masahiro
2005-05-01
Time-of-flight secondary ion mass spectrometry (TOF-SIMS) is capable of chemical imaging of proteins on insulated samples in principal. However, selection of specific peaks related to a particular protein, which are necessary for chemical imaging, out of numerous candidates had been difficult without an appropriate spectrum analysis technique. Therefore multivariate analysis techniques, such as principal component analysis (PCA), and analysis with mutual information defined by information theory, have been applied to interpret SIMS spectra of protein samples. In this study mutual information was applied to select specific peaks related to proteins in order to obtain chemical images. Proteins on insulated materials were measured with TOF-SIMS and then SIMS spectra were analyzed by means of the analysis method based on the comparison using mutual information. Chemical mapping of each protein was obtained using specific peaks related to each protein selected based on values of mutual information. The results of TOF-SIMS images of proteins on the materials provide some useful information on properties of protein adsorption, optimality of immobilization processes and reaction between proteins. Thus chemical images of proteins by TOF-SIMS contribute to understand interactions between material surfaces and proteins and to develop sophisticated biomaterials.
Regression analysis for LED color detection of visual-MIMO system
NASA Astrophysics Data System (ADS)
Banik, Partha Pratim; Saha, Rappy; Kim, Ki-Doo
2018-04-01
Color detection from a light emitting diode (LED) array using a smartphone camera is very difficult in a visual multiple-input multiple-output (visual-MIMO) system. In this paper, we propose a method to determine the LED color using a smartphone camera by applying regression analysis. We employ a multivariate regression model to identify the LED color. After taking a picture of an LED array, we select the LED array region, and detect the LED using an image processing algorithm. We then apply the k-means clustering algorithm to determine the number of potential colors for feature extraction of each LED. Finally, we apply the multivariate regression model to predict the color of the transmitted LEDs. In this paper, we show our results for three types of environmental light condition: room environmental light, low environmental light (560 lux), and strong environmental light (2450 lux). We compare the results of our proposed algorithm from the analysis of training and test R-Square (%) values, percentage of closeness of transmitted and predicted colors, and we also mention about the number of distorted test data points from the analysis of distortion bar graph in CIE1931 color space.
Dominance Genetic Variance for Traits Under Directional Selection in Drosophila serrata
Sztepanacz, Jacqueline L.; Blows, Mark W.
2015-01-01
In contrast to our growing understanding of patterns of additive genetic variance in single- and multi-trait combinations, the relative contribution of nonadditive genetic variance, particularly dominance variance, to multivariate phenotypes is largely unknown. While mechanisms for the evolution of dominance genetic variance have been, and to some degree remain, subject to debate, the pervasiveness of dominance is widely recognized and may play a key role in several evolutionary processes. Theoretical and empirical evidence suggests that the contribution of dominance variance to phenotypic variance may increase with the correlation between a trait and fitness; however, direct tests of this hypothesis are few. Using a multigenerational breeding design in an unmanipulated population of Drosophila serrata, we estimated additive and dominance genetic covariance matrices for multivariate wing-shape phenotypes, together with a comprehensive measure of fitness, to determine whether there is an association between directional selection and dominance variance. Fitness, a trait unequivocally under directional selection, had no detectable additive genetic variance, but significant dominance genetic variance contributing 32% of the phenotypic variance. For single and multivariate morphological traits, however, no relationship was observed between trait–fitness correlations and dominance variance. A similar proportion of additive and dominance variance was found to contribute to phenotypic variance for single traits, and double the amount of additive compared to dominance variance was found for the multivariate trait combination under directional selection. These data suggest that for many fitness components a positive association between directional selection and dominance genetic variance may not be expected. PMID:25783700
A time domain frequency-selective multivariate Granger causality approach.
Leistritz, Lutz; Witte, Herbert
2016-08-01
The investigation of effective connectivity is one of the major topics in computational neuroscience to understand the interaction between spatially distributed neuronal units of the brain. Thus, a wide variety of methods has been developed during the last decades to investigate functional and effective connectivity in multivariate systems. Their spectrum ranges from model-based to model-free approaches with a clear separation into time and frequency range methods. We present in this simulation study a novel time domain approach based on Granger's principle of predictability, which allows frequency-selective considerations of directed interactions. It is based on a comparison of prediction errors of multivariate autoregressive models fitted to systematically modified time series. These modifications are based on signal decompositions, which enable a targeted cancellation of specific signal components with specific spectral properties. Depending on the embedded signal decomposition method, a frequency-selective or data-driven signal-adaptive Granger Causality Index may be derived.
Multivariate Models for Normal and Binary Responses in Intervention Studies
ERIC Educational Resources Information Center
Pituch, Keenan A.; Whittaker, Tiffany A.; Chang, Wanchen
2016-01-01
Use of multivariate analysis (e.g., multivariate analysis of variance) is common when normally distributed outcomes are collected in intervention research. However, when mixed responses--a set of normal and binary outcomes--are collected, standard multivariate analyses are no longer suitable. While mixed responses are often obtained in…
Empirical performance of the multivariate normal universal portfolio
NASA Astrophysics Data System (ADS)
Tan, Choon Peng; Pang, Sook Theng
2013-09-01
Universal portfolios generated by the multivariate normal distribution are studied with emphasis on the case where variables are dependent, namely, the covariance matrix is not diagonal. The moving-order multivariate normal universal portfolio requires very long implementation time and large computer memory in its implementation. With the objective of reducing memory and implementation time, the finite-order universal portfolio is introduced. Some stock-price data sets are selected from the local stock exchange and the finite-order universal portfolio is run on the data sets, for small finite order. Empirically, it is shown that the portfolio can outperform the moving-order Dirichlet universal portfolio of Cover and Ordentlich[2] for certain parameters in the selected data sets.
NASA Astrophysics Data System (ADS)
Hudson, Brian D.; George, Ashley R.; Ford, Martyn G.; Livingstone, David J.
1992-04-01
Molecular dynamics simulations have been performed on a number of conformationally flexible pyrethroid insecticides. The results indicate that molecular dynamics is a suitable tool for conformational searching of small molecules given suitable simulation parameters. The structures derived from the simulations are compared with the static conformation used in a previous study. Various physicochemical parameters have been calculated for a set of conformations selected from the simulations using multivariate analysis. The averaged values of the parameters over the selected set (and the factors derived from them) are compared with the single conformation values used in the previous study.
Natural Resources Inventory and Land Evaluation in Switzerland
NASA Technical Reports Server (NTRS)
Haefner, H. (Principal Investigator)
1975-01-01
The author has identified the following significant results. A system was developed to operationally map and measure the areal extent of various land use categories for updating existing and producing new and actual thematic maps showing the latest state of rural and urban landscapes and its changes. The processing system includes: (1) preprocessing steps for radiometric and geometric corrections; (2) classification of the data by a multivariate procedure, using a stepwise linear discriminant analysis based on carefully selected training cells; and (3) output in form of color maps by printing black and white theme overlays of a selected scale with photomation system and its coloring and combination into a color composite.
Robust Selectivity for Faces in the Human Amygdala in the Absence of Expressions
Mende-Siedlecki, Peter; Verosky, Sara C.; Turk-Browne, Nicholas B.; Todorov, Alexander
2014-01-01
There is a well-established posterior network of cortical regions that plays a central role in face processing and that has been investigated extensively. In contrast, although responsive to faces, the amygdala is not considered a core face-selective region, and its face selectivity has never been a topic of systematic research in human neuroimaging studies. Here, we conducted a large-scale group analysis of fMRI data from 215 participants. We replicated the posterior network observed in prior studies but found equally robust and reliable responses to faces in the amygdala. These responses were detectable in most individual participants, but they were also highly sensitive to the initial statistical threshold and habituated more rapidly than the responses in posterior face-selective regions. A multivariate analysis showed that the pattern of responses to faces across voxels in the amygdala had high reliability over time. Finally, functional connectivity analyses showed stronger coupling between the amygdala and posterior face-selective regions during the perception of faces than during the perception of control visual categories. These findings suggest that the amygdala should be considered a core face-selective region. PMID:23984945
Rank estimation and the multivariate analysis of in vivo fast-scan cyclic voltammetric data
Keithley, Richard B.; Carelli, Regina M.; Wightman, R. Mark
2010-01-01
Principal component regression has been used in the past to separate current contributions from different neuromodulators measured with in vivo fast-scan cyclic voltammetry. Traditionally, a percent cumulative variance approach has been used to determine the rank of the training set voltammetric matrix during model development, however this approach suffers from several disadvantages including the use of arbitrary percentages and the requirement of extreme precision of training sets. Here we propose that Malinowski’s F-test, a method based on a statistical analysis of the variance contained within the training set, can be used to improve factor selection for the analysis of in vivo fast-scan cyclic voltammetric data. These two methods of rank estimation were compared at all steps in the calibration protocol including the number of principal components retained, overall noise levels, model validation as determined using a residual analysis procedure, and predicted concentration information. By analyzing 119 training sets from two different laboratories amassed over several years, we were able to gain insight into the heterogeneity of in vivo fast-scan cyclic voltammetric data and study how differences in factor selection propagate throughout the entire principal component regression analysis procedure. Visualizing cyclic voltammetric representations of the data contained in the retained and discarded principal components showed that using Malinowski’s F-test for rank estimation of in vivo training sets allowed for noise to be more accurately removed. Malinowski’s F-test also improved the robustness of our criterion for judging multivariate model validity, even though signal-to-noise ratios of the data varied. In addition, pH change was the majority noise carrier of in vivo training sets while dopamine prediction was more sensitive to noise. PMID:20527815
Chen, Qiang; Chen, Yunhao; Jiang, Weiguo
2016-07-30
In the field of multiple features Object-Based Change Detection (OBCD) for very-high-resolution remotely sensed images, image objects have abundant features and feature selection affects the precision and efficiency of OBCD. Through object-based image analysis, this paper proposes a Genetic Particle Swarm Optimization (GPSO)-based feature selection algorithm to solve the optimization problem of feature selection in multiple features OBCD. We select the Ratio of Mean to Variance (RMV) as the fitness function of GPSO, and apply the proposed algorithm to the object-based hybrid multivariate alternative detection model. Two experiment cases on Worldview-2/3 images confirm that GPSO can significantly improve the speed of convergence, and effectively avoid the problem of premature convergence, relative to other feature selection algorithms. According to the accuracy evaluation of OBCD, GPSO is superior at overall accuracy (84.17% and 83.59%) and Kappa coefficient (0.6771 and 0.6314) than other algorithms. Moreover, the sensitivity analysis results show that the proposed algorithm is not easily influenced by the initial parameters, but the number of features to be selected and the size of the particle swarm would affect the algorithm. The comparison experiment results reveal that RMV is more suitable than other functions as the fitness function of GPSO-based feature selection algorithm.
Baker, Jannah; White, Nicole; Mengersen, Kerrie
2014-11-20
Spatial analysis is increasingly important for identifying modifiable geographic risk factors for disease. However, spatial health data from surveys are often incomplete, ranging from missing data for only a few variables, to missing data for many variables. For spatial analyses of health outcomes, selection of an appropriate imputation method is critical in order to produce the most accurate inferences. We present a cross-validation approach to select between three imputation methods for health survey data with correlated lifestyle covariates, using as a case study, type II diabetes mellitus (DM II) risk across 71 Queensland Local Government Areas (LGAs). We compare the accuracy of mean imputation to imputation using multivariate normal and conditional autoregressive prior distributions. Choice of imputation method depends upon the application and is not necessarily the most complex method. Mean imputation was selected as the most accurate method in this application. Selecting an appropriate imputation method for health survey data, after accounting for spatial correlation and correlation between covariates, allows more complete analysis of geographic risk factors for disease with more confidence in the results to inform public policy decision-making.
Robertson, David S; Prevost, A Toby; Bowden, Jack
2016-10-01
The problem of selection bias has long been recognized in the analysis of two-stage trials, where promising candidates are selected in stage 1 for confirmatory analysis in stage 2. To efficiently correct for bias, uniformly minimum variance conditionally unbiased estimators (UMVCUEs) have been proposed for a wide variety of trial settings, but where the population parameter estimates are assumed to be independent. We relax this assumption and derive the UMVCUE in the multivariate normal setting with an arbitrary known covariance structure. One area of application is the estimation of odds ratios (ORs) when combining a genome-wide scan with a replication study. Our framework explicitly accounts for correlated single nucleotide polymorphisms, as might occur due to linkage disequilibrium. We illustrate our approach on the measurement of the association between 11 genetic variants and the risk of Crohn's disease, as reported in Parkes and others (2007. Sequence variants in the autophagy gene IRGM and multiple other replicating loci contribute to Crohn's disease susceptibility. Nat. Gen. 39: (7), 830-832.), and show that the estimated ORs can vary substantially if both selection and correlation are taken into account. © The Author 2016. Published by Oxford University Press.
Popp, Oliver; Müller, Dirk; Didzus, Katharina; Paul, Wolfgang; Lipsmeier, Florian; Kirchner, Florian; Niklas, Jens; Mauch, Klaus; Beaucamp, Nicola
2016-09-01
In-depth characterization of high-producer cell lines and bioprocesses is vital to ensure robust and consistent production of recombinant therapeutic proteins in high quantity and quality for clinical applications. This requires applying appropriate methods during bioprocess development to enable meaningful characterization of CHO clones and processes. Here, we present a novel hybrid approach for supporting comprehensive characterization of metabolic clone performance. The approach combines metabolite profiling with multivariate data analysis and fluxomics to enable a data-driven mechanistic analysis of key metabolic traits associated with desired cell phenotypes. We applied the methodology to quantify and compare metabolic performance in a set of 10 recombinant CHO-K1 producer clones and a host cell line. The comprehensive characterization enabled us to derive an extended set of clone performance criteria that not only captured growth and product formation, but also incorporated information on intracellular clone physiology and on metabolic changes during the process. These criteria served to establish a quantitative clone ranking and allowed us to identify metabolic differences between high-producing CHO-K1 clones yielding comparably high product titers. Through multivariate data analysis of the combined metabolite and flux data we uncovered common metabolic traits characteristic of high-producer clones in the screening setup. This included high intracellular rates of glutamine synthesis, low cysteine uptake, reduced excretion of aspartate and glutamate, and low intracellular degradation rates of branched-chain amino acids and of histidine. Finally, the above approach was integrated into a workflow that enables standardized high-content selection of CHO producer clones in a high-throughput fashion. In conclusion, the combination of quantitative metabolite profiling, multivariate data analysis, and mechanistic network model simulations can identify metabolic traits characteristic of high-performance clones and enables informed decisions on which clones provide a good match for a particular process platform. The proposed approach also provides a mechanistic link between observed clone phenotype, process setup, and feeding regimes, and thereby offers concrete starting points for subsequent process optimization. Biotechnol. Bioeng. 2016;113: 2005-2019. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Tyagi, Neelam; Sutton, Elizabeth; Hunt, Margie; Zhang, Jing; Oh, Jung Hun; Apte, Aditya; Mechalakos, James; Wilgucki, Molly; Gelb, Emily; Mehrara, Babak; Matros, Evan; Ho, Alice
2017-02-01
Capsular contracture (CC) is a serious complication in patients receiving implant-based reconstruction for breast cancer. Currently, no objective methods are available for assessing CC. The goal of the present study was to identify image-based surrogates of CC using magnetic resonance imaging (MRI). We analyzed a retrospective data set of 50 patients who had undergone both a diagnostic MRI scan and a plastic surgeon's evaluation of the CC score (Baker's score) within a 6-month period after mastectomy and reconstructive surgery. The MRI scans were assessed for morphologic shape features of the implant and histogram features of the pectoralis muscle. The shape features, such as roundness, eccentricity, solidity, extent, and ratio length for the implant, were compared with the Baker score. For the pectoralis muscle, the muscle width and median, skewness, and kurtosis of the intensity were compared with the Baker score. Univariate analysis (UVA) using a Wilcoxon rank-sum test and multivariate analysis with the least absolute shrinkage and selection operator logistic regression was performed to determine significant differences in these features between the patient groups categorized according to their Baker's scores. UVA showed statistically significant differences between grade 1 and grade ≥2 for morphologic shape features and histogram features, except for volume and skewness. Only eccentricity, ratio length, and volume were borderline significant in differentiating grade ≤2 and grade ≥3. Features with P<.1 on UVA were used in the multivariate least absolute shrinkage and selection operator logistic regression analysis. Multivariate analysis showed a good level of predictive power for grade 1 versus grade ≥2 CC (area under the receiver operating characteristic curve 0.78, sensitivity 0.78, and specificity 0.82) and for grade ≤2 versus grade ≥3 CC (area under the receiver operating characteristic curve 0.75, sensitivity 0.75, and specificity 0.79). The morphologic shape features described on MR images were associated with the severity of CC. MRI has the potential to further improve the diagnostic ability of the Baker score in breast cancer patients who undergo implant reconstruction. Copyright © 2016 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Liu, Zhenyu; Cui, Xingwei; Tang, Zhenchao; Dong, Di; Zang, Yali; Tian, Jie
2017-03-01
Previous researches have shown that type 2 diabetes mellitus (T2DM) is associated with an increased risk of cognitive impairment. Early detection of brain abnormalities at the preclinical stage can be useful for developing preventive interventions to abate cognitive decline. We aimed to investigate the whole-brain resting-state functional connectivity (RSFC) patterns of T2DM patients between 90 regions of interest (ROIs) based on the RS-fMRI data, which can be used to test the feasibility of identifying T2DM patients with cognitive impairment from other T2DM patients. 74 patients were recruited in this study and multivariate pattern analysis was utilized to assess the prediction performance. Elastic net was firstly used to select the key features for prediction, and then a linear discrimination model was constructed. 23 RSFCs were selected and it achieved the performance with classification accuracy of 90.54% and areas under the receiver operating characteristic curve (AUC) of 0.944 using ten-fold cross-validation. The results provide strong evidence that functional interactions of brain regions undergo notable alterations between T2DM patients with cognitive impairment or not. By analyzing the RSFCs that were selected as key features, we found that most of them involved the frontal or temporal. We speculated that cognitive impairment in T2DM patients mainly impacted these two lobes. Overall, the present study indicated that RSFCs undergo notable alterations associated with the cognitive impairment in T2DM patients, and it is possible to predicted cognitive impairment early with RSFCs.
Nataly, Yogesh; Merrie, Arend E; Stewart, Ian D
2002-03-01
The use of endoscopic retrograde cholangiopancreatography (ERCP) in the management of suspected common bile duct (CBD) stones prior to laparoscopic cholecystectomy is common. The associated morbidity can be significant. The present study determines significant predictors of CBD stones and improves the selection of patients for preoperative ERCP. All preoperative ERCP for suspected CBD stones in the year 1998 were studied retrospectively. Univariate and multivariate analyses of a number of clinical, biochemical and radiological variables were carried out to determine the best predictors of CBD stones. A total of 112 patients had successful preoperative ERCP. Sixty-one per cent of these were negative for stones and the morbidity was 9%. Univariate analysis revealed the following variables as predictors: cholangitis (P = 0.006), abnormal serum bilirubin > or = 3 days (P = 0.002), serum alkaline phosphatase > or = 130 U/L (P = 0.002), deranged liver function tests (P = < 0.001) and CBD diameter > or = 8 mm (P = 0.009) with positive predictive values of 80%, 68%, 49%, 38% and 52%, respectively. Multivariate analysis revealed the model with the best ability to discriminate for CBD stones (P = 0.0005) was cholangitis, abnormal serum bilirubin for > or = 3 days and CBD diameter > or = 8 mm. The best predictors from this study had a sensitivity of 80% and a specificity of 27%. The predictors of CBD stones are imprecise. Until laparoscopic exploration of CBD becomes widely available, ERCP prior to cholecystectomy will remain popular. The use of stricter selection criteria can reduce the number of negative preoperative ERCP.
Gutiérrez-Cacciabue, Dolores; Teich, Ingrid; Poma, Hugo Ramiro; Cruz, Mercedes Cecilia; Balzarini, Mónica; Rajal, Verónica Beatriz
2014-01-01
Several recreational surface waters in Salta, Argentina, were selected to assess their quality. Seventy percent of the measurements exceeded at least one of the limits established by international legislation becoming unsuitable for their use. To interpret results of complex data, multivariate techniques were applied. Arenales River, due to the variability observed in the data, was divided in two: upstream and downstream representing low and high pollution sites, respectively; and Cluster Analysis supported that differentiation. Arenales River downstream and Campo Alegre Reservoir were the most different environments and Vaqueros and La Caldera Rivers were the most similar. Canonical Correlation Analysis allowed exploration of correlations between physicochemical and microbiological variables except in both parts of Arenales River, and Principal Component Analysis allowed finding relationships among the 9 measured variables in all aquatic environments. Variable’s loadings showed that Arenales River downstream was impacted by industrial and domestic activities, Arenales River upstream was affected by agricultural activities, Campo Alegre Reservoir was disturbed by anthropogenic and ecological effects, and La Caldera and Vaqueros Rivers were influenced by recreational activities. Discriminant Analysis allowed identification of subgroup of variables responsible for seasonal and spatial variations. Enterococcus, dissolved oxygen, conductivity, E. coli, pH, and fecal coliforms are sufficient to spatially describe the quality of the aquatic environments. Regarding seasonal variations, dissolved oxygen, conductivity, fecal coliforms, and pH can be used to describe water quality during dry season, while dissolved oxygen, conductivity, total coliforms, E. coli, and Enterococcus during wet season. Thus, the use of multivariate techniques allowed optimizing monitoring tasks and minimizing costs involved. PMID:25190636
Pan, Xin; Lopez-Olivo, Maria A; Song, Juhee; Pratt, Gregory; Suarez-Almazor, Maria E
2017-01-01
Objectives We appraised the methodological and reporting quality of randomised controlled clinical trials (RCTs) evaluating the efficacy and safety of Chinese herbal medicine (CHM) in patients with rheumatoid arthritis (RA). Design For this systematic review, electronic databases were searched from inception until June 2015. The search was limited to humans and non-case report studies, but was not limited by language, year of publication or type of publication. Two independent reviewers selected RCTs, evaluating CHM in RA (herbals and decoctions). Descriptive statistics were used to report on risk of bias and their adherence to reporting standards. Multivariable logistic regression analysis was performed to determine study characteristics associated with high or unclear risk of bias. Results Out of 2342 unique citations, we selected 119 RCTs including 18 919 patients: 10 108 patients received CHM alone and 6550 received one of 11 treatment combinations. A high risk of bias was observed across all domains: 21% had a high risk for selection bias (11% from sequence generation and 30% from allocation concealment), 85% for performance bias, 89% for detection bias, 4% for attrition bias and 40% for reporting bias. In multivariable analysis, fewer authors were associated with selection bias (allocation concealment), performance bias and attrition bias, and earlier year of publication and funding source not reported or disclosed were associated with selection bias (sequence generation). Studies published in non-English language were associated with reporting bias. Poor adherence to recommended reporting standards (<60% of the studies not providing sufficient information) was observed in 11 of the 23 sections evaluated. Limitations Study quality and data extraction were performed by one reviewer and cross-checked by a second reviewer. Translation to English was performed by one reviewer in 85% of the included studies. Conclusions Studies evaluating CHM often fail to meet expected methodological criteria, and high-quality evidence is lacking. PMID:28249848
Deconstructing multivariate decoding for the study of brain function.
Hebart, Martin N; Baker, Chris I
2017-08-04
Multivariate decoding methods were developed originally as tools to enable accurate predictions in real-world applications. The realization that these methods can also be employed to study brain function has led to their widespread adoption in the neurosciences. However, prior to the rise of multivariate decoding, the study of brain function was firmly embedded in a statistical philosophy grounded on univariate methods of data analysis. In this way, multivariate decoding for brain interpretation grew out of two established frameworks: multivariate decoding for predictions in real-world applications, and classical univariate analysis based on the study and interpretation of brain activation. We argue that this led to two confusions, one reflecting a mixture of multivariate decoding for prediction or interpretation, and the other a mixture of the conceptual and statistical philosophies underlying multivariate decoding and classical univariate analysis. Here we attempt to systematically disambiguate multivariate decoding for the study of brain function from the frameworks it grew out of. After elaborating these confusions and their consequences, we describe six, often unappreciated, differences between classical univariate analysis and multivariate decoding. We then focus on how the common interpretation of what is signal and noise changes in multivariate decoding. Finally, we use four examples to illustrate where these confusions may impact the interpretation of neuroimaging data. We conclude with a discussion of potential strategies to help resolve these confusions in interpreting multivariate decoding results, including the potential departure from multivariate decoding methods for the study of brain function. Copyright © 2017. Published by Elsevier Inc.
Real, Jordi; Forné, Carles; Roso-Llorach, Albert; Martínez-Sánchez, Jose M
2016-05-01
Controlling for confounders is a crucial step in analytical observational studies, and multivariable models are widely used as statistical adjustment techniques. However, the validation of the assumptions of the multivariable regression models (MRMs) should be made clear in scientific reporting. The objective of this study is to review the quality of statistical reporting of the most commonly used MRMs (logistic, linear, and Cox regression) that were applied in analytical observational studies published between 2003 and 2014 by journals indexed in MEDLINE.Review of a representative sample of articles indexed in MEDLINE (n = 428) with observational design and use of MRMs (logistic, linear, and Cox regression). We assessed the quality of reporting about: model assumptions and goodness-of-fit, interactions, sensitivity analysis, crude and adjusted effect estimate, and specification of more than 1 adjusted model.The tests of underlying assumptions or goodness-of-fit of the MRMs used were described in 26.2% (95% CI: 22.0-30.3) of the articles and 18.5% (95% CI: 14.8-22.1) reported the interaction analysis. Reporting of all items assessed was higher in articles published in journals with a higher impact factor.A low percentage of articles indexed in MEDLINE that used multivariable techniques provided information demonstrating rigorous application of the model selected as an adjustment method. Given the importance of these methods to the final results and conclusions of observational studies, greater rigor is required in reporting the use of MRMs in the scientific literature.
Multivariate meta-analysis: Potential and promise
Jackson, Dan; Riley, Richard; White, Ian R
2011-01-01
The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day ‘Multivariate meta-analysis’ event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd. PMID:21268052
Walling, Craig A; Morrissey, Michael B; Foerster, Katharina; Clutton-Brock, Tim H; Pemberton, Josephine M; Kruuk, Loeske E B
2014-12-01
Evolutionary theory predicts that genetic constraints should be widespread, but empirical support for their existence is surprisingly rare. Commonly applied univariate and bivariate approaches to detecting genetic constraints can underestimate their prevalence, with important aspects potentially tractable only within a multivariate framework. However, multivariate genetic analyses of data from natural populations are challenging because of modest sample sizes, incomplete pedigrees, and missing data. Here we present results from a study of a comprehensive set of life history traits (juvenile survival, age at first breeding, annual fecundity, and longevity) for both males and females in a wild, pedigreed, population of red deer (Cervus elaphus). We use factor analytic modeling of the genetic variance-covariance matrix ( G: ) to reduce the dimensionality of the problem and take a multivariate approach to estimating genetic constraints. We consider a range of metrics designed to assess the effect of G: on the deflection of a predicted response to selection away from the direction of fastest adaptation and on the evolvability of the traits. We found limited support for genetic constraint through genetic covariances between traits, both within sex and between sexes. We discuss these results with respect to other recent findings and to the problems of estimating these parameters for natural populations. Copyright © 2014 Walling et al.
Walling, Craig A.; Morrissey, Michael B.; Foerster, Katharina; Clutton-Brock, Tim H.; Pemberton, Josephine M.; Kruuk, Loeske E. B.
2014-01-01
Evolutionary theory predicts that genetic constraints should be widespread, but empirical support for their existence is surprisingly rare. Commonly applied univariate and bivariate approaches to detecting genetic constraints can underestimate their prevalence, with important aspects potentially tractable only within a multivariate framework. However, multivariate genetic analyses of data from natural populations are challenging because of modest sample sizes, incomplete pedigrees, and missing data. Here we present results from a study of a comprehensive set of life history traits (juvenile survival, age at first breeding, annual fecundity, and longevity) for both males and females in a wild, pedigreed, population of red deer (Cervus elaphus). We use factor analytic modeling of the genetic variance–covariance matrix (G) to reduce the dimensionality of the problem and take a multivariate approach to estimating genetic constraints. We consider a range of metrics designed to assess the effect of G on the deflection of a predicted response to selection away from the direction of fastest adaptation and on the evolvability of the traits. We found limited support for genetic constraint through genetic covariances between traits, both within sex and between sexes. We discuss these results with respect to other recent findings and to the problems of estimating these parameters for natural populations. PMID:25278555
MAOA, MTHFR, and TNF-β genes polymorphisms and personality traits in the pathogenesis of migraine.
Ishii, Masakazu; Shimizu, Shunichi; Sakairi, Yuki; Nagamine, Ayumu; Naito, Yuika; Hosaka, Yukiko; Naito, Yuko; Kurihara, Tatsuya; Onaya, Tomomi; Oyamada, Hideto; Imagawa, Atsuko; Shida, Kenji; Takahashi, Johji; Oguchi, Katsuji; Masuda, Yutaka; Hara, Hajime; Usami, Shino; Kiuchi, Yuji
2012-04-01
Migraine is a multifactorial disease with various factors, such as genetic polymorphisms and personality traits, but the contribution of those factors is not clear. To clarify the pathogenesis of migraine, the contributions of genetic polymorphisms and personality traits were simultaneously investigated using multivariate analysis. Ninety-one migraine patients and 119 non-headache healthy volunteers were enrolled. The 12 gene polymorphisms analysis and NEO-FFI personality test were performed. At first, the univariate analysis was performed to extract the contributing factors to pathogenesis of migraine. We then extracted the factors that independently contributed to the pathogenesis of migraine using multivariate stepwise logistic regression analysis. Using the multivariate analysis, three gene polymorphisms including monoamine oxidase A (MAOA) T941G, methylenetetrahydrofolate reductase (MTHFR) C677T, and tumor necrosis factor beta (TNF-β) G252Α, and the neuroticism and conscientiousness scores in NEO-FFI were selected as significant factors that independently contributed to the pathogenesis of migraine. Their odds ratios were 1.099 (per point of neuroticism score), 1.080 (per point of conscientiousness score), 2.272 (T and T/T or T/G vs G and G/G genotype of MAOA), 1.939 (C/T or T/T vs C/C genotype of MTHFR), and 2.748 (G/A or A/A vs G/G genotype of TNF-β), respectively. We suggested that multiple factors, such as gene polymorphisms and personality traits, contribute to the pathogenesis of migraine. The contribution of polymorphisms, such as MAOA T941G, MTHFR C677T, and TNF-β G252A, were more important than personality traits in the pathogenesis of migraine, a multifactorial disorder.
Ríos, A; López-Navas, A; Ayala-García, M A; Sebastián, M; Febrero, B; Ramírez, E J; Muñoz, G; Palacios, G; Rodríguez, J S; Martínez, M A; Nieto, A; Martínez-Alarcón, L; Ramis, G; Ramírez, P; Parrilla, P
2012-01-01
Healthcare assistants are an important group of workers who can influence public opinion. Their attitudes toward organ donation may influence public awareness of healthcare matters; negative attitudes toward donation and transplantation could have a negative impact on public attitudes. Our objective was analyze the attitudes of healthcare assistants, in Spanish and Mexican healthcare centers toward organ donation and determine factors affecting them using a multivariate analysis. As part of the "International Collaborative Donor Project," 32 primary care centers and 4 hospitals were selected in Spain and 5 hospitals in Mexico. A randomized sample of healthcare assistants was stratified according to healthcare services. Attitudes were evaluated using a validated questionnaire of the psychosocial aspects of donation, which was self-completed anonymously by the respondent. Statistical analysis used the chi-square test, Student t test, and logistic regression analysis. Of 532 respondents, 66% in favored donation and 34% were against it or undecided. Upon multivariate analysis, the following variables had the most weight: 1) country of origin (Mexicans were more in favor than Spanish; odds ratio [OR]) = 1.964; P = .014); 2) a partner with a favorable attitude (OR = 2.597; P = .013); 3) not being concerned about possible bodily mutilation after donation (OR = 2.631; P = .006); 4) preference for options apart from burial for handling the body after death (OR = 4.694; P < .001) and 5) accepting an autopsy if one was needed (OR = 3.584; P < .001). The attitudes of healthcare assistants toward organ donation varied considerably according to the respondent's country of origin. The psycho-social profile of a person with a positive attitude to donation was similar to that described within the general public. Copyright © 2012 Elsevier Inc. All rights reserved.
Chapat, Ludivine; Hilaire, Florence; Bouvet, Jérome; Pialot, Daniel; Philippe-Reversat, Corinne; Guiot, Anne-Laure; Remolue, Lydie; Lechenet, Jacques; Andreoni, Christine; Poulet, Hervé; Day, Michael J; De Luca, Karelle; Cariou, Carine; Cupillard, Lionel
2017-07-01
The assessment of vaccine combinations, or the evaluation of the impact of minor modifications of one component in well-established vaccines, requires animal challenges in the absence of previously validated correlates of protection. As an alternative, we propose conducting a multivariate analysis of the specific immune response to the vaccine. This approach is consistent with the principles of the 3Rs (Refinement, Reduction and Replacement) and avoids repeating efficacy studies based on infectious challenges in vivo. To validate this approach, a set of nine immunological parameters was selected in order to characterize B and T lymphocyte responses against canine rabies virus and to evaluate the compatibility between two canine vaccines, an inactivated rabies vaccine (RABISIN ® ) and a combined vaccine (EURICAN ® DAPPi-Lmulti) injected at two different sites in the same animals. The analysis was focused on the magnitude and quality of the immune response. The multi-dimensional picture given by this 'immune fingerprint' was used to assess the impact of the concomitant injection of the combined vaccine on the immunogenicity of the rabies vaccine. A principal component analysis fully discriminated the control group from the groups vaccinated with RABISIN ® alone or RABISIN ® +EURICAN ® DAPPi-Lmulti and confirmed the compatibility between the rabies vaccines. This study suggests that determining the immune fingerprint, combined with a multivariate statistical analysis, is a promising approach to characterizing the immunogenicity of a vaccine with an established record of efficacy. It may also avoid the need to repeat efficacy studies involving challenge infection in case of minor modifications of the vaccine or for compatibility studies. Copyright © 2017 Elsevier B.V. All rights reserved.
Domestication and fitness in the wild: A multivariate view.
Tufto, Jarle
2017-09-01
Domesticated species continually escaping and interbreeding with wild relatives impose a migration load on wild populations. As domesticated stocks become increasingly different as a result of artificial and natural selection in captivity, fitness of escapees in the wild is expected to decline, reducing the effective rate of migration into wild populations. Recent theory suggest that this may alleviate and eventually eliminate the resulting migration load. I develop a multivariate model of trait and wild fitness evolution resulting from the joint effects of artificial and natural selection in the captive environment. Initially, the evolutionary trajectory is dominated by the effects of artificial selection causing a fast initial decline in fitness of escapees in the wild. In later phases, through the counteracting effects of correlational multivariate natural selection in captivity, the mean phenotype is pushed in directions of weak stabilizing selection, allowing a sustained response in the trait subject to artificial selection. Provided that there is some alignment between the adaptive landscapes in the wild and in captivity, these phases are associated with slower rates of decline in wild fitness of the domesticated stock, suggesting that detrimental effects on wild populations are likely to remain a concern in the foreseeable future. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.
Luminescence Sensors Applied to Water Analysis of Organic Pollutants—An Update
Ibañez, Gabriela A.; Escandar, Graciela M.
2011-01-01
The development of chemical sensors for environmental analysis based on fluorescence, phosphorescence and chemiluminescence signals continues to be a dynamic topic within the sensor field. This review covers the fundamentals of this type of sensors, and an update on recent works devoted to quantifying organic pollutants in environmental waters, focusing on advances since about 2005. Among the wide variety of these contaminants, special attention has been paid polycyclic aromatic hydrocarbons, pesticides, explosives and emerging organic pollutants. The potential of coupling optical sensors with multivariate calibration methods in order to improve the selectivity is also discussed. PMID:22247654
Multivariate Longitudinal Analysis with Bivariate Correlation Test
Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory
2016-01-01
In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model’s parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated. PMID:27537692
Multivariate Longitudinal Analysis with Bivariate Correlation Test.
Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory
2016-01-01
In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model's parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated.
Del Giudice, G; Padulano, R; Siciliano, D
2016-01-01
The lack of geometrical and hydraulic information about sewer networks often excludes the adoption of in-deep modeling tools to obtain prioritization strategies for funds management. The present paper describes a novel statistical procedure for defining the prioritization scheme for preventive maintenance strategies based on a small sample of failure data collected by the Sewer Office of the Municipality of Naples (IT). Novelty issues involve, among others, considering sewer parameters as continuous statistical variables and accounting for their interdependences. After a statistical analysis of maintenance interventions, the most important available factors affecting the process are selected and their mutual correlations identified. Then, after a Box-Cox transformation of the original variables, a methodology is provided for the evaluation of a vulnerability map of the sewer network by adopting a joint multivariate normal distribution with different parameter sets. The goodness-of-fit is eventually tested for each distribution by means of a multivariate plotting position. The developed methodology is expected to assist municipal engineers in identifying critical sewers, prioritizing sewer inspections in order to fulfill rehabilitation requirements.
Vigli, Georgia; Philippidis, Angelos; Spyros, Apostolos; Dais, Photis
2003-09-10
A combination of (1)H NMR and (31)P NMR spectroscopy and multivariate statistical analysis was used to classify 192 samples from 13 types of vegetable oils, namely, hazelnut, sunflower, corn, soybean, sesame, walnut, rapeseed, almond, palm, groundnut, safflower, coconut, and virgin olive oils from various regions of Greece. 1,2-Diglycerides, 1,3-diglycerides, the ratio of 1,2-diglycerides to total diglycerides, acidity, iodine value, and fatty acid composition determined upon analysis of the respective (1)H NMR and (31)P NMR spectra were selected as variables to establish a classification/prediction model by employing discriminant analysis. This model, obtained from the training set of 128 samples, resulted in a significant discrimination among the different classes of oils, whereas 100% of correct validated assignments for 64 samples were obtained. Different artificial mixtures of olive-hazelnut, olive-corn, olive-sunflower, and olive-soybean oils were prepared and analyzed by (1)H NMR and (31)P NMR spectroscopy. Subsequent discriminant analysis of the data allowed detection of adulteration as low as 5% w/w, provided that fresh virgin olive oil samples were used, as reflected by their high 1,2-diglycerides to total diglycerides ratio (D > or = 0.90).
Economic indicators selection for crime rates forecasting using cooperative feature selection
NASA Astrophysics Data System (ADS)
Alwee, Razana; Shamsuddin, Siti Mariyam Hj; Salleh Sallehuddin, Roselina
2013-04-01
Features selection in multivariate forecasting model is very important to ensure that the model is accurate. The purpose of this study is to apply the Cooperative Feature Selection method for features selection. The features are economic indicators that will be used in crime rate forecasting model. The Cooperative Feature Selection combines grey relational analysis and artificial neural network to establish a cooperative model that can rank and select the significant economic indicators. Grey relational analysis is used to select the best data series to represent each economic indicator and is also used to rank the economic indicators according to its importance to the crime rate. After that, the artificial neural network is used to select the significant economic indicators for forecasting the crime rates. In this study, we used economic indicators of unemployment rate, consumer price index, gross domestic product and consumer sentiment index, as well as data rates of property crime and violent crime for the United States. Levenberg-Marquardt neural network is used in this study. From our experiments, we found that consumer price index is an important economic indicator that has a significant influence on the violent crime rate. While for property crime rate, the gross domestic product, unemployment rate and consumer price index are the influential economic indicators. The Cooperative Feature Selection is also found to produce smaller errors as compared to Multiple Linear Regression in forecasting property and violent crime rates.
An Application of Discriminant Analysis to the Selection of Software Cost Estimating Models.
1984-09-01
the PRICE S Users Manual (29:111-25) was used with a slight modification. Based on the experience and advice of Captain Joe Dean, Electronic System...this study, and EXP is the expansion factor listed in the PRICE S User’s Manual . Another important factor needing explanation is development cost...coefficients and a unique constant. According to the SPSS manual (26:445) "Under the assumption of a multivariate normal distribution, the
Speranza, Barbara; Racioppo, Angela; Beneduce, Luciano; Bevilacqua, Antonio; Sinigaglia, Milena; Corbo, Maria Rosaria
2017-08-01
This study focused on the selection of lactic starters with probiotic properties for the production of fermented fish-products by the use of a multivariate approach (Cluster Analysis and Principal Component Analysis). Seventy-five isolates were recovered from fish intestinal microbiota and characterized by evaluating phenotypical, technological and probiotic traits; the most promising isolates were molecularly identified and then used into fish fermented sausage production. Namely, data from technological characterization were modelled through Growth Index and used as input to run a preliminary selection. Thus, 15 promising strains were selected and subjected to probiotic characterization; considering the results from probiotic tests, 3 promising strains were finally chosen (11, 68 and 69), identified as members of the genus Lactobacillus and used for the validation at laboratory level through the assessment of their performances for the production of fermented fish sausages. The results were promising as the use of the selected strains reduced the fermentation time (2 days) ensuring a good microbiological quality of the final product. Copyright © 2017 Elsevier Ltd. All rights reserved.
Rovadoscki, Gregori A; Petrini, Juliana; Ramirez-Diaz, Johanna; Pertile, Simone F N; Pertille, Fábio; Salvian, Mayara; Iung, Laiza H S; Rodriguez, Mary Ana P; Zampar, Aline; Gaya, Leila G; Carvalho, Rachel S B; Coelho, Antonio A D; Savino, Vicente J M; Coutinho, Luiz L; Mourão, Gerson B
2016-09-01
Repeated measures from the same individual have been analyzed by using repeatability and finite dimension models under univariate or multivariate analyses. However, in the last decade, the use of random regression models for genetic studies with longitudinal data have become more common. Thus, the aim of this research was to estimate genetic parameters for body weight of four experimental chicken lines by using univariate random regression models. Body weight data from hatching to 84 days of age (n = 34,730) from four experimental free-range chicken lines (7P, Caipirão da ESALQ, Caipirinha da ESALQ and Carijó Barbado) were used. The analysis model included the fixed effects of contemporary group (gender and rearing system), fixed regression coefficients for age at measurement, and random regression coefficients for permanent environmental effects and additive genetic effects. Heterogeneous variances for residual effects were considered, and one residual variance was assigned for each of six subclasses of age at measurement. Random regression curves were modeled by using Legendre polynomials of the second and third orders, with the best model chosen based on the Akaike Information Criterion, Bayesian Information Criterion, and restricted maximum likelihood. Multivariate analyses under the same animal mixed model were also performed for the validation of the random regression models. The Legendre polynomials of second order were better for describing the growth curves of the lines studied. Moderate to high heritabilities (h(2) = 0.15 to 0.98) were estimated for body weight between one and 84 days of age, suggesting that selection for body weight at all ages can be used as a selection criteria. Genetic correlations among body weight records obtained through multivariate analyses ranged from 0.18 to 0.96, 0.12 to 0.89, 0.06 to 0.96, and 0.28 to 0.96 in 7P, Caipirão da ESALQ, Caipirinha da ESALQ, and Carijó Barbado chicken lines, respectively. Results indicate that genetic gain for body weight can be achieved by selection. Also, selection for body weight at 42 days of age can be maintained as a selection criterion. © 2016 Poultry Science Association Inc.
Sparse partial least squares regression for simultaneous dimension reduction and variable selection
Chun, Hyonho; Keleş, Sündüz
2010-01-01
Partial least squares regression has been an alternative to ordinary least squares for handling multicollinearity in several areas of scientific research since the 1960s. It has recently gained much attention in the analysis of high dimensional genomic data. We show that known asymptotic consistency of the partial least squares estimator for a univariate response does not hold with the very large p and small n paradigm. We derive a similar result for a multivariate response regression with partial least squares. We then propose a sparse partial least squares formulation which aims simultaneously to achieve good predictive performance and variable selection by producing sparse linear combinations of the original predictors. We provide an efficient implementation of sparse partial least squares regression and compare it with well-known variable selection and dimension reduction approaches via simulation experiments. We illustrate the practical utility of sparse partial least squares regression in a joint analysis of gene expression and genomewide binding data. PMID:20107611
Fernández-Varela, R; Andrade, J M; Muniategui, S; Prada, D; Ramírez-Villalobos, F
2010-04-01
Identifying petroleum-related products released into the environment is a complex and difficult task. To achieve this, polycyclic aromatic hydrocarbons (PAHs) are of outstanding importance nowadays. Despite traditional quantitative fingerprinting uses straightforward univariate statistical analyses to differentiate among oils and to assess their sources, a multivariate strategy based on Procrustes rotation (PR) was applied in this paper. The aim of PR is to select a reduced subset of PAHs still capable of performing a satisfactory identification of petroleum-related hydrocarbons. PR selected two subsets of three (C(2)-naphthalene, C(2)-dibenzothiophene and C(2)-phenanthrene) and five (C(1)-decahidronaphthalene, naphthalene, C(2)-phenanthrene, C(3)-phenanthrene and C(2)-fluoranthene) PAHs for each of the two datasets studied here. The classification abilities of each subset of PAHs were tested using principal components analysis, hierarchical cluster analysis and Kohonen neural networks and it was demonstrated that they unraveled the same patterns as the overall set of PAHs. (c) 2009 Elsevier Ltd. All rights reserved.
Binder, Harald; Sauerbrei, Willi; Royston, Patrick
2013-06-15
In observational studies, many continuous or categorical covariates may be related to an outcome. Various spline-based procedures or the multivariable fractional polynomial (MFP) procedure can be used to identify important variables and functional forms for continuous covariates. This is the main aim of an explanatory model, as opposed to a model only for prediction. The type of analysis often guides the complexity of the final model. Spline-based procedures and MFP have tuning parameters for choosing the required complexity. To compare model selection approaches, we perform a simulation study in the linear regression context based on a data structure intended to reflect realistic biomedical data. We vary the sample size, variance explained and complexity parameters for model selection. We consider 15 variables. A sample size of 200 (1000) and R(2) = 0.2 (0.8) is the scenario with the smallest (largest) amount of information. For assessing performance, we consider prediction error, correct and incorrect inclusion of covariates, qualitative measures for judging selected functional forms and further novel criteria. From limited information, a suitable explanatory model cannot be obtained. Prediction performance from all types of models is similar. With a medium amount of information, MFP performs better than splines on several criteria. MFP better recovers simpler functions, whereas splines better recover more complex functions. For a large amount of information and no local structure, MFP and the spline procedures often select similar explanatory models. Copyright © 2012 John Wiley & Sons, Ltd.
Dominance genetic variance for traits under directional selection in Drosophila serrata.
Sztepanacz, Jacqueline L; Blows, Mark W
2015-05-01
In contrast to our growing understanding of patterns of additive genetic variance in single- and multi-trait combinations, the relative contribution of nonadditive genetic variance, particularly dominance variance, to multivariate phenotypes is largely unknown. While mechanisms for the evolution of dominance genetic variance have been, and to some degree remain, subject to debate, the pervasiveness of dominance is widely recognized and may play a key role in several evolutionary processes. Theoretical and empirical evidence suggests that the contribution of dominance variance to phenotypic variance may increase with the correlation between a trait and fitness; however, direct tests of this hypothesis are few. Using a multigenerational breeding design in an unmanipulated population of Drosophila serrata, we estimated additive and dominance genetic covariance matrices for multivariate wing-shape phenotypes, together with a comprehensive measure of fitness, to determine whether there is an association between directional selection and dominance variance. Fitness, a trait unequivocally under directional selection, had no detectable additive genetic variance, but significant dominance genetic variance contributing 32% of the phenotypic variance. For single and multivariate morphological traits, however, no relationship was observed between trait-fitness correlations and dominance variance. A similar proportion of additive and dominance variance was found to contribute to phenotypic variance for single traits, and double the amount of additive compared to dominance variance was found for the multivariate trait combination under directional selection. These data suggest that for many fitness components a positive association between directional selection and dominance genetic variance may not be expected. Copyright © 2015 by the Genetics Society of America.
Multivariate Regression Analysis and Slaughter Livestock,
AGRICULTURE, *ECONOMICS), (*MEAT, PRODUCTION), MULTIVARIATE ANALYSIS, REGRESSION ANALYSIS , ANIMALS, WEIGHT, COSTS, PREDICTIONS, STABILITY, MATHEMATICAL MODELS, STORAGE, BEEF, PORK, FOOD, STATISTICAL DATA, ACCURACY
The Statistical Consulting Center for Astronomy (SCCA)
NASA Technical Reports Server (NTRS)
Akritas, Michael
2001-01-01
The process by which raw astronomical data acquisition is transformed into scientifically meaningful results and interpretation typically involves many statistical steps. Traditional astronomy limits itself to a narrow range of old and familiar statistical methods: means and standard deviations; least-squares methods like chi(sup 2) minimization; and simple nonparametric procedures such as the Kolmogorov-Smirnov tests. These tools are often inadequate for the complex problems and datasets under investigations, and recent years have witnessed an increased usage of maximum-likelihood, survival analysis, multivariate analysis, wavelet and advanced time-series methods. The Statistical Consulting Center for Astronomy (SCCA) assisted astronomers with the use of sophisticated tools, and to match these tools with specific problems. The SCCA operated with two professors of statistics and a professor of astronomy working together. Questions were received by e-mail, and were discussed in detail with the questioner. Summaries of those questions and answers leading to new approaches were posted on the Web (www.state.psu.edu/ mga/SCCA). In addition to serving individual astronomers, the SCCA established a Web site for general use that provides hypertext links to selected on-line public-domain statistical software and services. The StatCodes site (www.astro.psu.edu/statcodes) provides over 200 links in the areas of: Bayesian statistics; censored and truncated data; correlation and regression, density estimation and smoothing, general statistics packages and information; image analysis; interactive Web tools; multivariate analysis; multivariate clustering and classification; nonparametric analysis; software written by astronomers; spatial statistics; statistical distributions; time series analysis; and visualization tools. StatCodes has received a remarkable high and constant hit rate of 250 hits/week (over 10,000/year) since its inception in mid-1997. It is of interest to scientists both within and outside of astronomy. The most popular sections are multivariate techniques, image analysis, and time series analysis. Hundreds of copies of the ASURV, SLOPES and CENS-TAU codes developed by SCCA scientists were also downloaded from the StatCodes site. In addition to formal SCCA duties, SCCA scientists continued a variety of related activities in astrostatistics, including refereeing of statistically oriented papers submitted to the Astrophysical Journal, talks in meetings including Feigelson's talk to science journalists entitled "The reemergence of astrostatistics" at the American Association for the Advancement of Science meeting, and published papers of astrostatistical content.
Reed, Thomas E; Gienapp, Phillip; Visser, Marcel E
2016-10-01
Key life history traits such as breeding time and clutch size are frequently both heritable and under directional selection, yet many studies fail to document microevolutionary responses. One general explanation is that selection estimates are biased by the omission of correlated traits that have causal effects on fitness, but few valid tests of this exist. Here, we show, using a quantitative genetic framework and six decades of life-history data on two free-living populations of great tits Parus major, that selection estimates for egg-laying date and clutch size are relatively unbiased. Predicted responses to selection based on the Robertson-Price Identity were similar to those based on the multivariate breeder's equation (MVBE), indicating that unmeasured covarying traits were not missing from the analysis. Changing patterns of phenotypic selection on these traits (for laying date, linked to climate change) therefore reflect changing selection on breeding values, and genetic constraints appear not to limit their independent evolution. Quantitative genetic analysis of correlational data from pedigreed populations can be a valuable complement to experimental approaches to help identify whether apparent associations between traits and fitness are biased by missing traits, and to parse the roles of direct versus indirect selection across a range of environments. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
NASA Astrophysics Data System (ADS)
Zhao, Jianhua; Zeng, Haishan; Kalia, Sunil; Lui, Harvey
2017-02-01
Background: Raman spectroscopy is a non-invasive optical technique which can measure molecular vibrational modes within tissue. A large-scale clinical study (n = 518) has demonstrated that real-time Raman spectroscopy could distinguish malignant from benign skin lesions with good diagnostic accuracy; this was validated by a follow-up independent study (n = 127). Objective: Most of the previous diagnostic algorithms have typically been based on analyzing the full band of the Raman spectra, either in the fingerprint or high wavenumber regions. Our objective in this presentation is to explore wavenumber selection based analysis in Raman spectroscopy for skin cancer diagnosis. Methods: A wavenumber selection algorithm was implemented using variably-sized wavenumber windows, which were determined by the correlation coefficient between wavenumbers. Wavenumber windows were chosen based on accumulated frequency from leave-one-out cross-validated stepwise regression or least and shrinkage selection operator (LASSO). The diagnostic algorithms were then generated from the selected wavenumber windows using multivariate statistical analyses, including principal component and general discriminant analysis (PC-GDA) and partial least squares (PLS). A total cohort of 645 confirmed lesions from 573 patients encompassing skin cancers, precancers and benign skin lesions were included. Lesion measurements were divided into training cohort (n = 518) and testing cohort (n = 127) according to the measurement time. Result: The area under the receiver operating characteristic curve (ROC) improved from 0.861-0.891 to 0.891-0.911 and the diagnostic specificity for sensitivity levels of 0.99-0.90 increased respectively from 0.17-0.65 to 0.20-0.75 by selecting specific wavenumber windows for analysis. Conclusion: Wavenumber selection based analysis in Raman spectroscopy improves skin cancer diagnostic specificity at high sensitivity levels.
Saber, W.; Moua, T.; Williams, E. C.; Verso, M.; Agnelli, G.; Couban, S.; Young, A.; De Cicco, M.; Biffi, R.; van Rooden, C. J.; Huisman, M. V.; Fagnani, D.; Cimminiello, C.; Moia, M.; Magagnoli, M.; Povoski, S. P.; Malak, S. F.; Lee, A. Y.
2010-01-01
Background Knowledge of independent, baseline risk factors of catheter-related thrombosis (CRT) may help select adult cancer patients at high risk to receive thromboprophylaxis. Objectives We conducted a meta-analysis of individual patient-level data to identify these baseline risk factors. Patients/Methods MEDLINE, EMBASE, CINAHL, CENTRAL, DARE, Grey literature databases were searched in all languages from 1995-2008. Prospective studies and randomized controlled trials (RCTs) were eligible. Studies were included if original patient-level data were provided by the investigators and if CRT was objectively confirmed with valid imaging. Multivariate logistic regression analysis of 17 prespecified baseline characteristics was conducted. Adjusted odds ratios (OR) and 95% confidence intervals (CI) were estimated. Results A total sample of 5636 subjects from 5 RCTs and 7 prospective studies was included in the analysis. Among these subjects, 425 CRT events were observed. In multivariate logistic regression, the use of implanted ports as compared with peripherally implanted central venous catheters (PICC), decreased CRT risk (OR = 0.43; 95% CI, 0.23-0.80), whereas past history of deep vein thrombosis (DVT) (OR = 2.03; 95% CI, 1.05-3.92), subclavian venipuncture insertion technique (OR = 2.16; 95% CI, 1.07-4.34), and improper catheter tip location (OR = 1.92; 95% CI, 1.22-3.02), increased CRT risk. Conclusions CRT risk is increased with using PICC catheters, previous history of DVT, subclavian venipuncture insertion technique and improper positioning of the catheter tip. These factors may be useful for risk stratifying patients to select those for thromboprophylaxis. Prospective studies are needed to validate these findings. PMID:21040443
Christiansen, H; Sahin, K; Berthold, F; Hero, B; Terpe, H J; Lampert, F
1995-01-01
A comparison of the prognostic impact of five molecular variables in a large series was made, including tests of their nonrandom association and multivariate analysis. Molecular data were available for 377 patients and MYCN amplification, cytogenetic chromosome 1p deletion, loss of chromosome 1p heterozygosity, DNA ploidy and CD44 expression were investigated. Their interdependence and influence on event-free survival was tested uni- and multivariately using Pearson's chi 2-test, Kaplan-Meier estimates, log rank tests and the Cox's regression model. MYCN amplification was present in 18% (58/322) of cases and predicted poorer prognosis in localised (P < 0.001), metastatic (P = 0.002) and even 4S (P = 0.040) disease. CD44 expression was found in 86% (127/148) of cases, and was a marker for favourable outcome in patients with neuroblastoma stages 1-3 (P = 0.003) and 4 (P = 0.017). Chromosome 1p deletion was cytogenetically detected in 51% (28/55), and indicated reduced event-free survival in localised neuroblastoma (P = 0.020). DNA ploidy and loss of heterozygosity on chromosome 1p were of less prognostic value. Most factors of prognostic significance were associated with each other. By multivariate analysis, MYCN was selected as the only relevant factor. Risk estimation of high discriminating power is, therefore, possible for patients with localised and metastatic neuroblastoma using stage and MYCN.
NASA Astrophysics Data System (ADS)
Safi, A.; Campanella, B.; Grifoni, E.; Legnaioli, S.; Lorenzetti, G.; Pagnotta, S.; Poggialini, F.; Ripoll-Seguer, L.; Hidalgo, M.; Palleschi, V.
2018-06-01
The introduction of multivariate calibration curve approach in Laser-Induced Breakdown Spectroscopy (LIBS) quantitative analysis has led to a general improvement of the LIBS analytical performances, since a multivariate approach allows to exploit the redundancy of elemental information that are typically present in a LIBS spectrum. Software packages implementing multivariate methods are available in the most diffused commercial and open source analytical programs; in most of the cases, the multivariate algorithms are robust against noise and operate in unsupervised mode. The reverse of the coin of the availability and ease of use of such packages is the (perceived) difficulty in assessing the reliability of the results obtained which often leads to the consideration of the multivariate algorithms as 'black boxes' whose inner mechanism is supposed to remain hidden to the user. In this paper, we will discuss the dangers of a 'black box' approach in LIBS multivariate analysis, and will discuss how to overcome them using the chemical-physical knowledge that is at the base of any LIBS quantitative analysis.
Dinç, Erdal; Ozdemir, Abdil
2005-01-01
Multivariate chromatographic calibration technique was developed for the quantitative analysis of binary mixtures enalapril maleate (EA) and hydrochlorothiazide (HCT) in tablets in the presence of losartan potassium (LST). The mathematical algorithm of multivariate chromatographic calibration technique is based on the use of the linear regression equations constructed using relationship between concentration and peak area at the five-wavelength set. The algorithm of this mathematical calibration model having a simple mathematical content was briefly described. This approach is a powerful mathematical tool for an optimum chromatographic multivariate calibration and elimination of fluctuations coming from instrumental and experimental conditions. This multivariate chromatographic calibration contains reduction of multivariate linear regression functions to univariate data set. The validation of model was carried out by analyzing various synthetic binary mixtures and using the standard addition technique. Developed calibration technique was applied to the analysis of the real pharmaceutical tablets containing EA and HCT. The obtained results were compared with those obtained by classical HPLC method. It was observed that the proposed multivariate chromatographic calibration gives better results than classical HPLC.
A single determinant dominates the rate of yeast protein evolution.
Drummond, D Allan; Raval, Alpan; Wilke, Claus O
2006-02-01
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
Evaluation and simplification of the occupational slip, trip and fall risk-assessment test
NAKAMURA, Takehiro; OYAMA, Ichiro; FUJINO, Yoshihisa; KUBO, Tatsuhiko; KADOWAKI, Koji; KUNIMOTO, Masamizu; ODOI, Haruka; TABATA, Hidetoshi; MATSUDA, Shinya
2016-01-01
Objective: The purpose of this investigation is to evaluate the efficacy of the occupational slip, trip and fall (STF) risk assessment test developed by the Japan Industrial Safety and Health Association (JISHA). We further intended to simplify the test to improve efficiency. Methods: A previous cohort study was performed using 540 employees aged ≥50 years who took the JISHA’s STF risk assessment test. We conducted multivariate analysis using these previous results as baseline values and answers to questionnaire items or score on physical fitness tests as variables. The screening efficiency of each model was evaluated based on the obtained receiver operating characteristic (ROC) curve. Results: The area under the ROC obtained in multivariate analysis was 0.79 when using all items. Six of the 25 questionnaire items were selected for stepwise analysis, giving an area under the ROC curve of 0.77. Conclusion: Based on the results of follow-up performed one year after the initial examination, we successfully determined the usefulness of the STF risk assessment test. Administering a questionnaire alone is sufficient for screening subjects at risk of STF during the subsequent one-year period. PMID:27021057
Kukreti, B M; Pandey, Pradeep; Singh, R V
2012-08-01
Non-coring based exploratory drilling was under taken in the sedimentary environment of Rangsohkham block, East Khasi Hills district to examine the eastern extension of existing uranium resources located at Domiasiat and Wakhyn in the Mahadek basin of Meghalaya (India). Although radiometric survey and radiometric analysis of surface grab/channel samples in the block indicate high uranium content but the gamma ray logging results of exploratory boreholes in the block, did not obtain the expected results. To understand this abrupt discontinuity between the two sets of data (surface and subsurface) multivariate statistical analysis of primordial radioactive elements (K(40), U(238) and Th(232)) was performed using the concept of representative subsurface samples, drawn from the randomly selected 11 boreholes of this block. The study was performed to a high confidence level (99%), and results are discussed for assessing the U and Th behavior in the block. Results not only confirm the continuation of three distinct geological formations in the area but also the uranium bearing potential in the Mahadek sandstone of the eastern part of Mahadek Basin. Copyright © 2012 Elsevier Ltd. All rights reserved.
Total Shoulder Arthroplasty: Is Less Time in the Hospital Better?
Duchman, Kyle R; Anthony, Chris A; Westermann, Robert W; Pugely, Andrew J; Gao, Yubo; Hettrich, Carolyn M
2017-01-01
The incidence of total shoulder arthroplasty (TSA) has increased significantly over the last decade. Short-stay protocols for other highvolume procedures have been shown to be safe and effective but have yet to be fully explored for TSA. Our purpose in comparing short-stay and inpatient TSA was to determine: (1) patient demographics and comorbidities, (2) 30-day morbidity, mortality, and readmissions using a matched analysis, and (3) independent predictors of 30-day complications. The American College of Surgeons National Surgical Quality Improvement (ACS NSQIP) database was queried and all patients undergoing elective, primary TSA between 2006 and 2013 were identified. Patients were categorized as short-stay or inpatient based on day of discharge. Propensity score matching was used to adjust for selection bias. Univariate and multivariate statistical analysis was used to compare 30-day morbidity and mortality between the two cohorts. Overall, 4,619 cases were available, with inpatient admission occurring in 65.7% of patients. Prior to propensity score matching, short-stay patients were significantly younger, more frequently male, with fewer comorbid conditions. After matching, inpatient admission was associated with increased rates of urinary tract infection (1.1% vs. 0.1%; p = 0.001), blood transfusion (5.3% vs. 0.8%; p < 0.001), and total complications (4.7% vs. 1.8%; p < 0.001). Multivariate analysis identified inpatient admission as an independent risk factor for 30-day complication following TSA. Short-stay TSA is a safe option for the appropriately selected patient. Inpatient admission was an independent risk factor for complication following TSA. Level of Evidence: III.
Milot, Marie-Hélène; Spencer, Steven J.; Chan, Vicky; Allington, James P.; Klein, Julius; Chou, Cathy; Pearson-Fuhrhop, Kristin; Bobrow, James E.; Reinkensmeyer, David J.; Cramer, Steven C.
2014-01-01
Background Robotic training can help improve function of a paretic limb following a stroke, but individuals respond differently to the training. A predictor of functional gains might improve the ability to select those individuals more likely to benefit from robot based therapy. Studies evaluating predictors of functional improvement after a robotic training are scarce. One study has found that white matter tract integrity predicts functional gains following a robotic training of the hand and wrist. Objective Determine the predictive ability of behavioral and brain measures to improve selection of individuals for robotic training. Methods Twenty subjects with chronic stroke participated in an 8-week course of robotic exoskeletal training for the arm. Before training, a clinical evaluation, fMRI, diffusion tensor imaging, and transcranial magnetic stimulation (TMS) were each measured as predictors. Final functional gain was defined as change in the Box and Block Test (BBT). Measures significant in bivariate analysis were fed into a multivariate linear regression model. Results Training was associated with an average gain of 6±5 blocks on the BBT (p<0.0001). Bivariate analysis revealed that lower baseline motor evoked potential (MEP) amplitude on TMS, and lower laterality M1 index on fMRI each significantly correlated with greater BBT change. In the multivariate linear regression analysis, baseline MEP magnitude was the only measure that remained significant. Conclusion Subjects with lower baseline MEP magnitude benefited the most from robotic training of the affected arm. These subjects might have reserve remaining for the training to boost corticospinal excitability, translating into functional gains. PMID:24642382
Capacity for Physical Activity Predicts Weight Loss After Roux-en-Y Gastric Bypass
Hatoum, Ida J.; Stein, Heather K.; Merrifield, Benjamin F.; Kaplan, Lee M.
2014-01-01
Despite its overall excellent outcomes, weight loss after Roux-en-Y gastric bypass (RYGB) is highly variable. We conducted this study to identify clinical predictors of weight loss after RYGB. We reviewed charts from 300 consecutive patients who underwent RYGB from August 1999 to November 2002. Data collected included patient demographics, medical comorbidities, and diet history. Of the 20 variables selected for univariate analysis, 9 with univariate P values ≤ 0.15 were entered into a multivariable regression analysis. Using backward selection, covariates with P < 0.05 were retained. Potential confounders were added back into the model and assessed for effect on all model variables. Complete records were available for 246 of the 300 patients (82%). The patient characteristics were 75% female, 93% white, mean age of 45 years, and mean initial BMI of 52.3 kg/m2. One year after surgery, patients lost an average of 64.8% of their excess weight (s.d. = 20.5%). The multivariable regression analysis revealed that limited physical activity, higher initial BMI, lower educational level, diabetes, and decreased attendance at postoperative appointments had an adverse effect on weight loss after RYGB. A model including these five factors accounts for 41% of the observed variability in weight loss (adjusted r2 = 0.41). In this cohort, higher initial BMI and limited physical activity were the strongest predictors of decreased excess weight loss following RYGB. Limited physical activity may be particularly important because it represents an opportunity for potentially meaningful pre- and postsurgical intervention to maximize weight loss following RYGB. PMID:18997674
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nee, K.; Bryan, S.; Levitskaia, T.
The reliability of chemical processes can be greatly improved by implementing inline monitoring systems. Combining multivariate analysis with non-destructive sensors can enhance the process without interfering with the operation. Here, we present here hierarchical models using both principal component analysis and partial least square analysis developed for different chemical components representative of solvent extraction process streams. A training set of 380 samples and an external validation set of 95 samples were prepared and Near infrared and Raman spectral data as well as conductivity under variable temperature conditions were collected. The results from the models indicate that careful selection of themore » spectral range is important. By compressing the data through Principal Component Analysis (PCA), we lower the rank of the data set to its most dominant features while maintaining the key principal components to be used in the regression analysis. Within the studied data set, concentration of five chemical components were modeled; total nitrate (NO 3 -), total acid (H +), neodymium (Nd 3+), sodium (Na +), and ionic strength (I.S.). The best overall model prediction for each of the species studied used a combined data set comprised of complementary techniques including NIR, Raman, and conductivity. Finally, our study shows that chemometric models are powerful but requires significant amount of carefully analyzed data to capture variations in the chemistry.« less
Nee, K.; Bryan, S.; Levitskaia, T.; ...
2017-12-28
The reliability of chemical processes can be greatly improved by implementing inline monitoring systems. Combining multivariate analysis with non-destructive sensors can enhance the process without interfering with the operation. Here, we present here hierarchical models using both principal component analysis and partial least square analysis developed for different chemical components representative of solvent extraction process streams. A training set of 380 samples and an external validation set of 95 samples were prepared and Near infrared and Raman spectral data as well as conductivity under variable temperature conditions were collected. The results from the models indicate that careful selection of themore » spectral range is important. By compressing the data through Principal Component Analysis (PCA), we lower the rank of the data set to its most dominant features while maintaining the key principal components to be used in the regression analysis. Within the studied data set, concentration of five chemical components were modeled; total nitrate (NO 3 -), total acid (H +), neodymium (Nd 3+), sodium (Na +), and ionic strength (I.S.). The best overall model prediction for each of the species studied used a combined data set comprised of complementary techniques including NIR, Raman, and conductivity. Finally, our study shows that chemometric models are powerful but requires significant amount of carefully analyzed data to capture variations in the chemistry.« less
Construction of Shale Gas Well
NASA Astrophysics Data System (ADS)
Sapińska-Śliwa, Aneta; Wiśniowski, Rafał; Skrzypaszek, Krzysztof
2018-03-01
The paper describes shale gas borehole axes trajectories (vertical, horizontal, multilateral). The methodology of trajectory design in a two-and three-dimensional space has been developed. The selection of the profile type of the trajectory axes of the directional borehole depends on the technical and technological possibilities of its implementation and the results of a comprehensive economic analysis of the availability and development of the field. The work assumes the possibility of a multivariate design of trajectories depending on the accepted (available or imposed) input data.
Ground Vibration Test Planning and Pre-Test Analysis for the X-33 Vehicle
NASA Technical Reports Server (NTRS)
Bedrossian, Herand; Tinker, Michael L.; Hidalgo, Homero
2000-01-01
This paper describes the results of the modal test planning and the pre-test analysis for the X-33 vehicle. The pre-test analysis included the selection of the target modes, selection of the sensor and shaker locations and the development of an accurate Test Analysis Model (TAM). For target mode selection, four techniques were considered, one based on the Modal Cost technique, one based on Balanced Singular Value technique, a technique known as the Root Sum Squared (RSS) method, and a Modal Kinetic Energy (MKE) approach. For selecting sensor locations, four techniques were also considered; one based on the Weighted Average Kinetic Energy (WAKE), one based on Guyan Reduction (GR), one emphasizing engineering judgment, and one based on an optimum sensor selection technique using Genetic Algorithm (GA) search technique combined with a criteria based on Hankel Singular Values (HSV's). For selecting shaker locations, four techniques were also considered; one based on the Weighted Average Driving Point Residue (WADPR), one based on engineering judgment and accessibility considerations, a frequency response method, and an optimum shaker location selection based on a GA search technique combined with a criteria based on HSV's. To evaluate the effectiveness of the proposed sensor and shaker locations for exciting the target modes, extensive numerical simulations were performed. Multivariate Mode Indicator Function (MMIF) was used to evaluate the effectiveness of each sensor & shaker set with respect to modal parameter identification. Several TAM reduction techniques were considered including, Guyan, IRS, Modal, and Hybrid. Based on a pre-test cross-orthogonality checks using various reduction techniques, a Hybrid TAM reduction technique was selected and was used for all three vehicle fuel level configurations.
Multivariate analysis: A statistical approach for computations
NASA Astrophysics Data System (ADS)
Michu, Sachin; Kaushik, Vandana
2014-10-01
Multivariate analysis is a type of multivariate statistical approach commonly used in, automotive diagnosis, education evaluating clusters in finance etc and more recently in the health-related professions. The objective of the paper is to provide a detailed exploratory discussion about factor analysis (FA) in image retrieval method and correlation analysis (CA) of network traffic. Image retrieval methods aim to retrieve relevant images from a collected database, based on their content. The problem is made more difficult due to the high dimension of the variable space in which the images are represented. Multivariate correlation analysis proposes an anomaly detection and analysis method based on the correlation coefficient matrix. Anomaly behaviors in the network include the various attacks on the network like DDOs attacks and network scanning.
Multivariate Cluster Analysis.
ERIC Educational Resources Information Center
McRae, Douglas J.
Procedures for grouping students into homogeneous subsets have long interested educational researchers. The research reported in this paper is an investigation of a set of objective grouping procedures based on multivariate analysis considerations. Four multivariate functions that might serve as criteria for adequate grouping are given and…
Wu, Q-M; Zhao, X-Y; You, H
2016-01-01
Esophageal-gastro Varices (EGV) may develop in any histological stages of primary biliary cirrhosis (PBC). We aim to establish and validate quantitative fibrosis (qFibrosis) parameters in portal, septal and fibrillar areas as ideal predictors of EGV in PBC patients. PBC patients with liver biopsy, esophagogastroscopy and Second Harmonic Generation (SHG)/Two-photon Excited Fluorescence (TPEF) microscopy images were retrospectively enrolled in this study. qFibrosis parameters in portal, septal and fibrillar areas were acquired by computer-assisted SHG/TPEF imaging system. Independent predictor was identified using multivariate logistic regression analysis. PBC patients with liver biopsy, esophagogastroscopy and Second Harmonic Generation (SHG)/Two-photon Excited Fluorescence (TPEF) microscopy images were retrospectively enrolled in this study. qFibrosis parameters in portal, septal and fibrillar areas were acquired by computer-assisted SHG/TPEF imaging system. Independent predictor was identified using multivariate logistic regression analysis. Among the forty-nine PBC patients with qFibrosis images, twenty-nine PBC patients with both esophagogastroscopy data and qFibrosis data were selected out for EGV prognosis analysis and 44.8% (13/29) of them had EGV. The qFibrosis parameters of collagen percentage and number of crosslink in fibrillar area, short/long/thin strings number and length/width of the strings in septa area were associated with EGV (p < 0.05). Multivariate logistic analysis showed that the collagen percentage in fibrillar area ≥ 3.6% was an independent factor to predict EGV (odds ratio 6.9; 95% confidence interval 1.6-27.4). The area under receiver operating characteristic (ROC), diagnostic sensitivity and specificity was 0.9, 100% and 75% respectively. Collagen percentage in Collagen percentage in the fibrillar area as an independent predictor can highly predict EGV in PBC patients.
Van Dongen, Stefan
2014-01-01
Studies of the process of human mate selection and attractiveness have assumed that selection favours morphological features that correlate with (genetic) quality. Degree of masculinity/femininity and fluctuating asymmetry (FA) may signal (genetic) quality, but what information they harboured and how they relate to fitness is still debated. To study strength of associations between facial masculinity/femininity, facial FA, attractiveness and physical strength in humans. Two-hundred young males and females were studied by measuring facial asymmetry and masculinity on the basis of frontal photographs. Attractiveness was determined on the basis of scores given by an anonymous panel, and physical strength using hand grip strength. Patterns differed markedly between males and females and analysis method used (univariate vs multivariate). Overall, no associations between FA and attractiveness, masculinity and physical strength were found. In females, but not males, masculinity and attractiveness correlated negatively and masculinity and physical strength correlated positively. Further research into the differences between males and females in associations between facial morphology, attractiveness and physical strength is clearly needed. The use of a multivariate approach can increase our understanding of which regions of the face harbour specific information of hormone levels and perhaps behavioural traits.
Ruggieri, Flavia; Gil, Raúl A; Fernandez-Turiel, Jose-Luis; Saavedra, Julio; Gimeno, Domingo; Lobo, Agustin; Martinez, Luis D; Rodriguez-Gonzalez, Alejandro
2012-04-30
A method to obtain robust information on short term leaching behaviour of volcanic ashes has been developed independently on the sample age. A mixed factorial design (MFD) was employed as a multivariate strategy for the evaluation of the effects of selected control factors and their interactions (amount of sample (A), contact time (B), and liquid to solid ratio or L/S (C)) on the leaching process of selected metals (Na, K, Mg, Ca, Si, Al, V, Mn, Fe, and Co) and anions (Cl(-) and SO(4)(2-)). Box plots of the data acquired were used to evaluate the reproducibility achieved at different experimental conditions. Both the amount of sample (A) and leaching time (B) had a significant effect on the element stripping whereas the L/S ratio influenced only few elements. The lowest dispersion values have been observed when 1.0 g was leached with an L/S ratio equal to 10, shaking during 4 h. The entire method is completed within few hours, and it is simple, feasible and reliable in laboratory conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
Ho, Ching-Sung
2018-05-01
To analyze the selection of a place to die and its related factors in patients who received hospice shared care service in Taiwan. This study included patients who received hospice shared care service in a metropolitan hospital as the research participants. A total of 172 questionnaires were collected, and 146 of them were used as valid samples for analysis. This study applied the multivariate logistic regression analysis to assess the significance of independent variables associated with the selection of place of death. The results revealed that 52.6% of the patients select hospital as the place to end their life, while 43.8% of them select their home as the place of death. Furthermore, younger adult patients (<65), those who with a higher educational level (≥10 years), and those with a clear cognizance of the disease progression tended to select hospital as the place to spend their last days. The research disclosed that more patients with the hospice shared care service prefer hospital to their home as the place to die. In order to provide end-of-life care for patients with low cost and appropriate treatment, it is important to understand the related sociodemographic factors and the need of the patients to provide well-designed hospice/specialist palliative care regimen.
Chen, Qiang; Chen, Yunhao; Jiang, Weiguo
2016-01-01
In the field of multiple features Object-Based Change Detection (OBCD) for very-high-resolution remotely sensed images, image objects have abundant features and feature selection affects the precision and efficiency of OBCD. Through object-based image analysis, this paper proposes a Genetic Particle Swarm Optimization (GPSO)-based feature selection algorithm to solve the optimization problem of feature selection in multiple features OBCD. We select the Ratio of Mean to Variance (RMV) as the fitness function of GPSO, and apply the proposed algorithm to the object-based hybrid multivariate alternative detection model. Two experiment cases on Worldview-2/3 images confirm that GPSO can significantly improve the speed of convergence, and effectively avoid the problem of premature convergence, relative to other feature selection algorithms. According to the accuracy evaluation of OBCD, GPSO is superior at overall accuracy (84.17% and 83.59%) and Kappa coefficient (0.6771 and 0.6314) than other algorithms. Moreover, the sensitivity analysis results show that the proposed algorithm is not easily influenced by the initial parameters, but the number of features to be selected and the size of the particle swarm would affect the algorithm. The comparison experiment results reveal that RMV is more suitable than other functions as the fitness function of GPSO-based feature selection algorithm. PMID:27483285
2013-01-01
Background The aims were to identify predictors of treatment retention in methadone maintenance treatment (MMT) clinics in Pearl River Delta, China. Methods Retrospective longitudinal study. Participants: 6 MMT clinics in rural and urban area were selected. Statistical analysis: Stratified random sampling was employed, and the data were analyzed using Kaplan-Meier survival curves and life table method. Protective or risk factors were explored using Cox’s proportional hazards model. Independent variables were enrolled in univariate analysis and among which significant variables were analyzed by multivariate analysis. Results A total of 2728 patients were enrolled. The median of the retention duration was 13.63 months, and the cumulative retention rates at 1,2,3 years were 53.0%, 35.0%, 20.0%, respectively. Multivariate Cox analysis showed: age, relationship with family, live on support from family or friends, income, considering treatment cost suitable, considering treatment open time suitable, addiction severity (daily expense for drug), communication with former drug taking peer, living in rural area, daily treatment dosage, sharing needles, re-admission and history of being arrested were predictors for MMT retention. Conclusions MMT retention rate in Guangdong was low and treatment skills and quality should be improved. Meanwhile, participation of family and society should be encouraged. PMID:23497263
Ristivojević, Petar; Trifković, Jelena; Vovk, Irena; Milojković-Opsenica, Dušanka
2017-01-01
Considering the introduction of phytochemical fingerprint analysis, as a method of screening the complex natural products for the presence of most bioactive compounds, use of chemometric classification methods, application of powerful scanning and image capturing and processing devices and algorithms, advancement in development of novel stationary phases as well as various separation modalities, high-performance thin-layer chromatography (HPTLC) fingerprinting is becoming attractive and fruitful field of separation science. Multivariate image analysis is crucial in the light of proper data acquisition. In a current study, different image processing procedures were studied and compared in detail on the example of HPTLC chromatograms of plant resins. In that sense, obtained variables such as gray intensities of pixels along the solvent front, peak area and mean values of peak were used as input data and compared to obtained best classification models. Important steps in image analysis, baseline removal, denoising, target peak alignment and normalization were pointed out. Numerical data set based on mean value of selected bands and intensities of pixels along the solvent front proved to be the most convenient for planar-chromatographic profiling, although required at least the basic knowledge on image processing methodology, and could be proposed for further investigation in HPLTC fingerprinting. Copyright © 2016 Elsevier B.V. All rights reserved.
Chandrasekaran, A; Ravisankar, R; Harikrishnan, N; Satapathy, K K; Prasad, M V R; Kanagasabapathy, K V
2015-02-25
Anthropogenic activities increase the accumulation of heavy metals in the soil environment. Soil pollution significantly reduces environmental quality and affects the human health. In the present study soil samples were collected at different locations of Yelagiri Hills, Tamilnadu, India for heavy metal analysis. The samples were analyzed for twelve selected heavy metals (Mg, Al, K, Ca, Ti, Fe, V, Cr, Mn, Co, Ni and Zn) using energy dispersive X-ray fluorescence (EDXRF) spectroscopy. Heavy metals concentration in soil were investigated using enrichment factor (EF), geo-accumulation index (Igeo), contamination factor (CF) and pollution load index (PLI) to determine metal accumulation, distribution and its pollution status. Heavy metal toxicity risk was assessed using soil quality guidelines (SQGs) given by target and intervention values of Dutch soil standards. The concentration of Ni, Co, Zn, Cr, Mn, Fe, Ti, K, Al, Mg were mainly controlled by natural sources. Multivariate statistical methods such as correlation matrix, principal component analysis and cluster analysis were applied for the identification of heavy metal sources (anthropogenic/natural origin). Geo-statistical methods such as kirging identified hot spots of metal contamination in road areas influenced mainly by presence of natural rocks. Copyright © 2014 Elsevier B.V. All rights reserved.
Wager, M; Menei, P; Guilhot, J; Levillain, P; Michalak, S; Bataille, B; Blanc, J-L; Lapierre, F; Rigoard, P; Milin, S; Duthe, F; Bonneau, D; Larsen, C-J; Karayan-Tapon, L
2008-06-03
This study assessed the prognostic value of several markers involved in gliomagenesis, and compared it with that of other clinical and imaging markers already used. Four-hundred and sixteen adult patients with newly diagnosed glioma were included over a 3-year period and tumour suppressor genes, oncogenes, MGMT and hTERT expressions, losses of heterozygosity, as well as relevant clinical and imaging information were recorded. This prospective study was based on all adult gliomas. Analyses were performed on patient groups selected according to World Health Organization histoprognostic criteria and on the entire cohort. The endpoint was overall survival, estimated by the Kaplan-Meier method. Univariate analysis was followed by multivariate analysis according to a Cox model. p14(ARF), p16(INK4A) and PTEN expressions, and 10p 10q23, 10q26 and 13q LOH for the entire cohort, hTERT expression for high-grade tumours, EGFR for glioblastomas, 10q26 LOH for grade III tumours and anaplastic oligodendrogliomas were found to be correlated with overall survival on univariate analysis and age and grade on multivariate analysis only. This study confirms the prognostic value of several markers. However, the scattering of the values explained by tumour heterogeneity prevents their use in individual decision-making.
Kaier, Klaus; Hagist, Christian; Frank, Uwe; Conrad, Andreas; Meyer, Elisabeth
2009-04-01
To determine the impact of antibiotic consumption and alcohol-based hand disinfection on the incidences of nosocomial methicillin-resistant Staphylococcus aureus (MRSA) infection and Clostridium difficile infection (CDI). Two multivariate time-series analyses were performed that used as dependent variables the monthly incidences of nosocomial MRSA infection and CDI at the Freiburg University Medical Center during the period January 2003 through October 2007. The volume of alcohol-based hand rub solution used per month was quantified in liters per 1,000 patient-days. Antibiotic consumption was calculated in terms of the number of defined daily doses per 1,000 patient-days per month. The use of alcohol-based hand rub was found to have a significant impact on the incidence of nosocomial MRSA infection (P< .001). The multivariate analysis (R2=0.66) showed that a higher volume of use of alcohol-based hand rub was associated with a lower incidence of nosocomial MRSA infection. Conversely, a higher level of consumption of selected antimicrobial agents was associated with a higher incidence of nosocomial MRSA infection. This analysis showed this relationship was the same for the use of second-generation cephalosporins (P= .023), third-generation cephalosporins (P= .05), fluoroquinolones (P= .01), and lincosamides (P= .05). The multivariate analysis (R2=0.55) showed that a higher level of consumption of third-generation cephalosporins (P= .008), fluoroquinolones (P= .084), and/or macrolides (P= .007) was associated with a higher incidence of CDI. A correlation with use of alcohol-based hand rub was not detected. In 2 multivariate time-series analyses, we were able to show the impact of hand hygiene and antibiotic use on the incidence of nosocomial MRSA infection, but we found no association between hand hygiene and incidence of CDI.
Sexual selection on cuticular hydrocarbons in the Australian field cricket, Teleogryllus oceanicus
Thomas, Melissa L; Simmons, Leigh W
2009-01-01
Background Females in a wide range of taxa have been shown to base their choice of mates on pheromone signals. However, little research has focussed specifically on the form and intensity of selection that mate choice imposes on the pheromone signal. Using multivariate selection analysis, we characterise directly the form and intensity of sexual selection acting on cuticular hydrocarbons, chemical compounds widely used in the selection of mates in insects. Using the Australian field cricket Teleogryllus oceanicus as a model organism, we use three measures of male attractiveness to estimate fitness; mating success, the duration of courtship required to elicit copulation, and subsequent spermatophore attachment duration. Results We found that all three measures of male attractiveness generated sexual selection on male cuticular hydrocarbons, however there were differences in the form and intensity of selection among these three measures. Mating success was the only measure of attractiveness that imposed both univariate linear and quadratic selection on cuticular hydrocarbons. Although we found that all three attractiveness measures generated nonlinear selection, again only mating success was found to exert statistically significant stabilizing selection. Conclusion This study shows that sexual selection plays an important role in the evolution of male cuticular hydrocarbon signals. PMID:19594896
Use of the Analysis of the Volatile Faecal Metabolome in Screening for Colorectal Cancer
2015-01-01
Diagnosis of colorectal cancer is an invasive and expensive colonoscopy, which is usually carried out after a positive screening test. Unfortunately, existing screening tests lack specificity and sensitivity, hence many unnecessary colonoscopies are performed. Here we report on a potential new screening test for colorectal cancer based on the analysis of volatile organic compounds (VOCs) in the headspace of faecal samples. Faecal samples were obtained from subjects who had a positive faecal occult blood sample (FOBT). Subjects subsequently had colonoscopies performed to classify them into low risk (non-cancer) and high risk (colorectal cancer) groups. Volatile organic compounds were analysed by selected ion flow tube mass spectrometry (SIFT-MS) and then data were analysed using both univariate and multivariate statistical methods. Ions most likely from hydrogen sulphide, dimethyl sulphide and dimethyl disulphide are statistically significantly higher in samples from high risk rather than low risk subjects. Results using multivariate methods show that the test gives a correct classification of 75% with 78% specificity and 72% sensitivity on FOBT positive samples, offering a potentially effective alternative to FOBT. PMID:26086914
Ishizuka, Mitsuru; Oyama, Yusuke; Abe, Akihito; Tago, Kazuma; Tanaka, Genki; Kubota, Keiichi
2014-08-01
To investigate the influence of clinical characteristics including nutritional markers on postoperative survival in patients undergoing total gastrectomy (TG) for gastric cancer (GC). One hundred fifty-four patients were enrolled. Uni- and multivariate analyses using the Cox proportional hazard model were performed to explore the most valuable clinical characteristic that was associated with postoperative survival. Multivariate analysis using twelve clinical characteristics selected from univariate analyses revealed that age (≤ 72/>72), carcinoembryonic antigen (≤ 20/>20) (ng/ml), white blood cell count (≤ 9.5/>9.5) (× 10(3)/mm(3)), prognostic nutritional index (PNI) (≤ 45/>45) and lymph node metastasis (negative/positive) were associated with postoperative survival. Kaplan-Meier analysis and log-rank test showed that patients with higher PNI (>45) had a higher postoperative survival rate than those with lower PNI (≤ 45) (p<0.001). PNI is associated with postoperative survival of patients undergoing TG for GC and is able to divide such patients into two independent groups before surgery. Copyright© 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Barigye, Stephen J; Freitas, Matheus P; Ausina, Priscila; Zancan, Patricia; Sola-Penna, Mauro; Castillo-Garit, Juan A
2018-02-12
We recently generalized the formerly alignment-dependent multivariate image analysis applied to quantitative structure-activity relationships (MIA-QSAR) method through the application of the discrete Fourier transform (DFT), allowing for its application to noncongruent and structurally diverse chemical compound data sets. Here we report the first practical application of this method in the screening of molecular entities of therapeutic interest, with human aromatase inhibitory activity as the case study. We developed an ensemble classification model based on the two-dimensional (2D) DFT MIA-QSAR descriptors, with which we screened the NCI Diversity Set V (1593 compounds) and obtained 34 chemical compounds with possible aromatase inhibitory activity. These compounds were docked into the aromatase active site, and the 10 most promising compounds were selected for in vitro experimental validation. Of these compounds, 7419 (nonsteroidal) and 89 201 (steroidal) demonstrated satisfactory antiproliferative and aromatase inhibitory activities. The obtained results suggest that the 2D-DFT MIA-QSAR method may be useful in ligand-based virtual screening of new molecular entities of therapeutic utility.
Multivariate analysis in provenance studies: Cerrillos obsidians case, Peru
NASA Astrophysics Data System (ADS)
Bustamante, A.; Delgado, M.; Latini, R. M.; Bellido, A. V. B.
2007-02-01
We present the preliminary results of a provenance study of obsidians samples from Cerrillos (ca. 800 100 b.c.) using Mössbauer Spectroscopy. The Cerrillos archaeological site, located in the Upper Ica Valley, Peru, is the only Paracas ceremonial center excavated so far. The archaeological data collected suggest the existence of a complex social and economic organization on the south coast of Peru. Provenance research of obsidian provides valuable information about the selection of lithic resources by our ancestors and eventually about the existence of communication routes and exchange networks. We characterized 18 obsidian artifacts samples by Mössbauer spectroscopy from Cerrillos. The spectra, recorded at room temperature using different velocities, are mainly composed of broad asymmetric doublets due to the superposition of at least two quadrupole doublets corresponding to Fe2+ in two different sites (species A and B), one weak Fe3+ doublet (specie C) and magnetic components associated to the presence of small particles of magnetite. Multivariate statistical analysis of the Mössbauer data (hyperfine parameters) allows to defined two main groups of obsidians, reflecting different geographical origins.
Bonetti, Jennifer; Quarino, Lawrence
2014-05-01
This study has shown that the combination of simple techniques with the use of multivariate statistics offers the potential for the comparative analysis of soil samples. Five samples were obtained from each of twelve state parks across New Jersey in both the summer and fall seasons. Each sample was examined using particle-size distribution, pH analysis in both water and 1 M CaCl2 , and a loss on ignition technique. Data from each of the techniques were combined, and principal component analysis (PCA) and canonical discriminant analysis (CDA) were used for multivariate data transformation. Samples from different locations could be visually differentiated from one another using these multivariate plots. Hold-one-out cross-validation analysis showed error rates as low as 3.33%. Ten blind study samples were analyzed resulting in no misclassifications using Mahalanobis distance calculations and visual examinations of multivariate plots. Seasonal variation was minimal between corresponding samples, suggesting potential success in forensic applications. © 2014 American Academy of Forensic Sciences.
NASA Astrophysics Data System (ADS)
Ho, K. F.; Lee, S. C.; Chiu, Gloria M. Y.
Volatile organic compounds (VOCs), PAHs and carbonyl compounds are the major toxic components in Hong Kong. Emissions from motor vehicles have been one of the primary pollution sources in the metropolitan areas throughout Hong Kong for a long time. A 1-yr monitoring program for VOCs, PAHs and carbonyl compounds had been performed at a roadside urban station at Hong Kong Polytechnic University in order to determine the variations and correlations of each selected species (VOCs, PAHs and carbonyl compounds). This study is aimed to analyze toxic volatile organic compounds (benzene, toluene, ethylbenzene and xylene), two carbonyl compounds (formaldehyde, acetaldehyde), and selective polycyclic aromatic hydrocarbons. The monitoring program started from 16 April 1999 to 30 March 2000. Ambient VOC concentrations, many of which originate from the same sources as particulate PAHs and carbonyls compounds, show significant quantities of benzene, toluene and xylenes. Correlations and multivariate analysis of selected gaseous and particulate phase organic pollutants were performed. Source identification by principle component analysis and hierarchical cluster analysis allowed the identification of four sources (factors) for the roadside monitoring station. Factor 1 represents the effect of diesel vehicle exhaust. Factor 2 shows the contribution of aromatic compounds. Factor 3 explains photochemical products—formaldehyde and acetaldehyde. Factor 4 explains the effect of gasoline vehicle exhaust.
Quantifying the impact of between-study heterogeneity in multivariate meta-analyses
Jackson, Dan; White, Ian R; Riley, Richard D
2012-01-01
Measures that quantify the impact of heterogeneity in univariate meta-analysis, including the very popular I2 statistic, are now well established. Multivariate meta-analysis, where studies provide multiple outcomes that are pooled in a single analysis, is also becoming more commonly used. The question of how to quantify heterogeneity in the multivariate setting is therefore raised. It is the univariate R2 statistic, the ratio of the variance of the estimated treatment effect under the random and fixed effects models, that generalises most naturally, so this statistic provides our basis. This statistic is then used to derive a multivariate analogue of I2, which we call . We also provide a multivariate H2 statistic, the ratio of a generalisation of Cochran's heterogeneity statistic and its associated degrees of freedom, with an accompanying generalisation of the usual I2 statistic, . Our proposed heterogeneity statistics can be used alongside all the usual estimates and inferential procedures used in multivariate meta-analysis. We apply our methods to some real datasets and show how our statistics are equally appropriate in the context of multivariate meta-regression, where study level covariate effects are included in the model. Our heterogeneity statistics may be used when applying any procedure for fitting the multivariate random effects model. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22763950
Analyzing Multiple Outcomes in Clinical Research Using Multivariate Multilevel Models
Baldwin, Scott A.; Imel, Zac E.; Braithwaite, Scott R.; Atkins, David C.
2014-01-01
Objective Multilevel models have become a standard data analysis approach in intervention research. Although the vast majority of intervention studies involve multiple outcome measures, few studies use multivariate analysis methods. The authors discuss multivariate extensions to the multilevel model that can be used by psychotherapy researchers. Method and Results Using simulated longitudinal treatment data, the authors show how multivariate models extend common univariate growth models and how the multivariate model can be used to examine multivariate hypotheses involving fixed effects (e.g., does the size of the treatment effect differ across outcomes?) and random effects (e.g., is change in one outcome related to change in the other?). An online supplemental appendix provides annotated computer code and simulated example data for implementing a multivariate model. Conclusions Multivariate multilevel models are flexible, powerful models that can enhance clinical research. PMID:24491071
Predictive equations for the estimation of body size in seals and sea lions (Carnivora: Pinnipedia)
Churchill, Morgan; Clementz, Mark T; Kohno, Naoki
2014-01-01
Body size plays an important role in pinniped ecology and life history. However, body size data is often absent for historical, archaeological, and fossil specimens. To estimate the body size of pinnipeds (seals, sea lions, and walruses) for today and the past, we used 14 commonly preserved cranial measurements to develop sets of single variable and multivariate predictive equations for pinniped body mass and total length. Principal components analysis (PCA) was used to test whether separate family specific regressions were more appropriate than single predictive equations for Pinnipedia. The influence of phylogeny was tested with phylogenetic independent contrasts (PIC). The accuracy of these regressions was then assessed using a combination of coefficient of determination, percent prediction error, and standard error of estimation. Three different methods of multivariate analysis were examined: bidirectional stepwise model selection using Akaike information criteria; all-subsets model selection using Bayesian information criteria (BIC); and partial least squares regression. The PCA showed clear discrimination between Otariidae (fur seals and sea lions) and Phocidae (earless seals) for the 14 measurements, indicating the need for family-specific regression equations. The PIC analysis found that phylogeny had a minor influence on relationship between morphological variables and body size. The regressions for total length were more accurate than those for body mass, and equations specific to Otariidae were more accurate than those for Phocidae. Of the three multivariate methods, the all-subsets approach required the fewest number of variables to estimate body size accurately. We then used the single variable predictive equations and the all-subsets approach to estimate the body size of two recently extinct pinniped taxa, the Caribbean monk seal (Monachus tropicalis) and the Japanese sea lion (Zalophus japonicus). Body size estimates using single variable regressions generally under or over-estimated body size; however, the all-subset regression produced body size estimates that were close to historically recorded body length for these two species. This indicates that the all-subset regression equations developed in this study can estimate body size accurately. PMID:24916814
Wong, Evelyn Yi Ting; Tan, Grace Hwei Ching; Ng, Deanna Wan Jie; Koh, Tina Puay Theng; Kumar, Mrinal; Teo, Melissa Ching Ching
2017-12-01
Metastasectomy is accepted as standard of care for selected patients with colorectal pulmonary metastases (CLM); however, the role of cytoreductive surgery (CRS) and hyperthermic intraperitoneal chemotherapy (HIPEC) for colorectal peritoneal metastases (CPM) is not universally accepted. We aim to compare oncological outcomes of patients with CLM and CPM after pulmonary resection and CRS-HIPEC, respectively, by comparing overall survival (OS) and disease-free survival (DFS). A retrospective review of 49 CLM patients who underwent pulmonary resection, and 52 CPM patients who underwent CRS-HIPEC in a single institution from January 2003 to March 2015, was performed. The 5-year OS for CLM patients and CPM patients were 59.6 and 40.5%, respectively (p = 0.100), while the 5-year DFS were 24.0 and 14.2%, respectively (p = 0.173). CPM patients had longer median operative time (8.38 vs. 1.75 h, p < 0.001), median hospital stay (13 vs. 5 days, p < 0.001), a higher rate of intensive care unit (ICU) admissions (67.3 vs. 8.2%, p < 0.001), and a higher rate of high-grade complications (17.3 vs. 4.1%, p < 0.001). Multivariate analysis demonstrated that recurrent lung metastasis after metastasectomy was an independent prognostic factor for OS of CLM patients (OR = 0.045, 95%, CL 0.003-0.622, p = 0.021). There were no independent prognostic factors for OS in CPM patients by multivariate analysis. There were no independent prognostic factors for DFS in CLM patients by multivariate analysis, but peritoneal cancer index score, bladder involvement, and higher nodal stage at presentation of the initial malignancy were independent prognostic factors for DFS in CPM patients. OS and DFS for CPM patients after CRS and HIPEC are comparable to CLM patients after lung resection, although morbidity appears higher. The prognostic factors affecting survival after surgery are different between CPM and CLM patients and must be considered when selecting patients for metastasectomy.
Zhang, Yanbin; Lin, Guanfeng; Wang, Shengru; Zhang, Jianguo; Shen, Jianxiong; Wang, Yipeng; Guo, Jianwei; Yang, Xinyu; Zhao, Lijuan
2016-01-01
Study Design. Retrospective study. Objective. To study the behavior of the unfused thoracic curve in Lenke type 5C during the follow-up and to identify risk factors for its correction loss. Summary of Background Data. Few studies have focused on the spontaneous behaviors of the unfused thoracic curve after selective thoracolumbar or lumbar fusion during the follow-up and the risk factors for spontaneous correction loss. Methods. We retrospectively reviewed 45 patients (41 females and 4 males) with AIS who underwent selective TL/L fusion from 2006 to 2012 in a single institution. The follow-up averaged 36 months (range, 24–105 months). Patients were divided into two groups. Thoracic curves in group A improved or maintained their curve magnitude after spontaneous correction, with a negative or no correction loss during the follow-up. Thoracic curves in group B deteriorated after spontaneous correction with a positive correction loss. Univariate analysis and multivariate analysis were built to identify the risk factors for correction loss of the unfused thoracic curves. Results. The minor thoracic curve was 26° preoperatively. It was corrected to 13° immediately with a spontaneous correction of 48.5%. At final follow-up it was 14° with a correction loss of 1°. Thoracic curves did not deteriorate after spontaneous correction in 23 cases in group A, while 22 cases were identified with thoracic curve progressing in group B. In multivariate analysis, two risk factors were independently associated with thoracic correction loss: higher flexibility and better immediate spontaneous correction rate of thoracic curve. Conclusion. Posterior selective TL/L fusion with pedicle screw constructs is an effective treatment for Lenke 5C AIS patients. Nonstructural thoracic curves with higher flexibility or better immediate correction are more likely to progress during the follow-up and close attentions must be paid to these patients in case of decompensation. Level of Evidence: 4 PMID:27831989
San-Juan, Rafael; Aguado, Jose M; Lumbreras, Carlos; Fortun, Jesus; Len, Oscar; Munoz, Patricia; Montejo, Miguel; Moreno, Asunción; Cordero, Elisa; Blanes, Marino; Ramos, Antonio; Torre-Cisneros, Julian; López-Medrano, Francisco; Carratala, Jordi; Moreno, Enrique
2011-08-01
The role of selective intestinal decontamination with fluoroquinolones (FQ-SID) in the prevention of early bacterial infections (EBIs) in liver transplant recipients (LTRs) is unknown. We used the online database of the Spanish Network of Infection in Transplantation/Spanish Network for Research in Infectious Diseases, which prospectively analyzed 1010 LTRs from 12 Spanish hospitals from September 2003 to February 2005. We compared the incidence and etiology of EBIs (30 days after transplantation) in 415 LTRs from 4 centers that used FQ-SID (>7 days) and in 595 LTRs from 8 hospitals that did not use FQ-SID. A multivariate logistic regression analysis (including an adjustment for the transplant center factor) was performed to evaluate the potential protective factor of FQ-SID in the development of EBIs. We reported 266 EBI episodes in 252 LTRs (incidence = 24.9%). There were no differences in the incidence of EBIs between patients in the FQ-SID group and patients not in the FQ-SID group [109/415 (26.3%) versus 143/595 (24%), P = 0.9]. Although LTRs who received FQ-SID had a lower incidence of infections due to enteric bacteria (2.7% versus 6.5%, P = 0.007) and a higher incidence of infections due to nonfermenting gram-negative bacilli (6.6% versus 2.6%, P = 0.004), these findings could not be confirmed after an adjustment by the center factor in the multivariate models. We found no significant differences in the incidence of enterococcal infections (3.4% with FQ-SID versus 3.9% without FQ-SID, P = 0.5). Multivariate analysis did not confirm any protective effect of FQ-SID against the development of EBIs by enteric bacteria. In conclusion, FQ-SID does not reduce the incidence of EBIs in LTRs and could be withheld from this group of patients. Copyright © 2011 American Association for the Study of Liver Diseases.
A "Model" Multivariable Calculus Course.
ERIC Educational Resources Information Center
Beckmann, Charlene E.; Schlicker, Steven J.
1999-01-01
Describes a rich, investigative approach to multivariable calculus. Introduces a project in which students construct physical models of surfaces that represent real-life applications of their choice. The models, along with student-selected datasets, serve as vehicles to study most of the concepts of the course from both continuous and discrete…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyar, M. Darby; McCanta, Molly; Breves, Elly
2016-03-01
Pre-edge features in the K absorption edge of X-ray absorption spectra are commonly used to predict Fe3+ valence state in silicate glasses. However, this study shows that using the entire spectral region from the pre-edge into the extended X-ray absorption fine-structure region provides more accurate results when combined with multivariate analysis techniques. The least absolute shrinkage and selection operator (lasso) regression technique yields %Fe3+ values that are accurate to ±3.6% absolute when the full spectral region is employed. This method can be used across a broad range of glass compositions, is easily automated, and is demonstrated to yield accurate resultsmore » from different synchrotrons. It will enable future studies involving X-ray mapping of redox gradients on standard thin sections at 1 × 1 μm pixel sizes.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyar, M. Darby; McCanta, Molly; Breves, Elly
2016-03-01
Pre-edge features in the K absorption edge of X-ray absorption spectra are commonly used to predict Fe 3+ valence state in silicate glasses. However, this study shows that using the entire spectral region from the pre-edge into the extended X-ray absorption fine-structure region provides more accurate results when combined with multivariate analysis techniques. The least absolute shrinkage and selection operator (lasso) regression technique yields %Fe 3+ values that are accurate to ±3.6% absolute when the full spectral region is employed. This method can be used across a broad range of glass compositions, is easily automated, and is demonstrated to yieldmore » accurate results from different synchrotrons. It will enable future studies involving X-ray mapping of redox gradients on standard thin sections at 1 × 1 μm pixel sizes.« less
NASA Astrophysics Data System (ADS)
Singh, Veena D.; Daharwal, Sanjay J.
2017-01-01
Three multivariate calibration spectrophotometric methods were developed for simultaneous estimation of Paracetamol (PARA), Enalapril maleate (ENM) and Hydrochlorothiazide (HCTZ) in tablet dosage form; namely multi-linear regression calibration (MLRC), trilinear regression calibration method (TLRC) and classical least square (CLS) method. The selectivity of the proposed methods were studied by analyzing the laboratory prepared ternary mixture and successfully applied in their combined dosage form. The proposed methods were validated as per ICH guidelines and good accuracy; precision and specificity were confirmed within the concentration range of 5-35 μg mL- 1, 5-40 μg mL- 1 and 5-40 μg mL- 1of PARA, HCTZ and ENM, respectively. The results were statistically compared with reported HPLC method. Thus, the proposed methods can be effectively useful for the routine quality control analysis of these drugs in commercial tablet dosage form.
[Multivariate Adaptive Regression Splines (MARS), an alternative for the analysis of time series].
Vanegas, Jairo; Vásquez, Fabián
Multivariate Adaptive Regression Splines (MARS) is a non-parametric modelling method that extends the linear model, incorporating nonlinearities and interactions between variables. It is a flexible tool that automates the construction of predictive models: selecting relevant variables, transforming the predictor variables, processing missing values and preventing overshooting using a self-test. It is also able to predict, taking into account structural factors that might influence the outcome variable, thereby generating hypothetical models. The end result could identify relevant cut-off points in data series. It is rarely used in health, so it is proposed as a tool for the evaluation of relevant public health indicators. For demonstrative purposes, data series regarding the mortality of children under 5 years of age in Costa Rica were used, comprising the period 1978-2008. Copyright © 2016 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
Veale, James P; Pearce, Alan J; Koehn, Stefan; Carlson, John S
2008-04-01
The aim of the study was to compare anthropometric and physical performance data of players who were selected for a Victorian elite junior U18 Australian rules football squad. Prior to the selection of the final training squad, 54 players were assessed using a battery of standard anthropometric and physical performance tests. Multivariate analysis (MANOVA) showed significant (p<0.05) differences between selected and non-selected players when height, mass, 20-m sprint, agility and vertical jump height were considered collectively. Univariate analysis revealed that the vertical jump was the only significant (p<0.05) individual test and a near significant trend (p=0.07) for height differentiating between selected and non-selected players with medium effect sizes for all other tests except endurance. In this elite junior football squad, physical characteristics can be observed that discriminate between players selected and non-selected, and demonstrates the value of physical fitness testing within the talent identification process of junior (16-18 years) players for squad and/or team selection. Based on MANOVA results, the findings from this study suggest team selection appeared to be related to a generally higher performance across the range of tests. Further, age was not a confounding variable as players selected tended to be younger than those non-selected. These findings reflect the general consensus that, in state-based junior competition, there is evidence of promoting overall player development, selecting those who are generally able to fulfil a range of positions and selecting players on their potential.
Shepherd, Emma; Stuart, Graham; Martin, Rob; Walsh, Mark A
2015-06-01
SelectSecure™ pacing leads (Medtronic Inc) are increasingly being used in pediatric patients and adults with structural congenital heart disease. The 4Fr lead is ideal for patients who may require lifelong pacing and can be advantageous for patients with complex anatomy. The purpose of this study was to compare the extraction of SelectSecure leads with conventional (stylette-driven) pacing leads in patients with structural congenital heart disease and congenital atrioventricular block. The data on lead extractions from pediatric and adult congenital heart disease (ACHD) patients from August 2004 to July 2014 at Bristol Royal Hospital for Children and the Bristol Heart Institute were reviewed. Multivariable regression analysis was used to determine whether conventional pacing leads were associated with a more difficult extraction process. A total of 57 patients underwent pacemaker lead extractions (22 SelectSecure, 35 conventional). No deaths occurred. Mean age at the time of extraction was 17.6 ± 10.5 years, mean weight was 47 ± 18 kg, and mean lead age was 5.6 ± 2.6 years (range 1-11 years). Complex extraction (partial extraction/femoral extraction) was more common in patients with conventional pacing leads at univariate (P < .01) and multivariate (P = .04) levels. Lead age was also a significant predictor of complex extraction (P < .01). SelectSecure leads can be successfully extracted using techniques that are used for conventional pacing leads. They are less likely to be partially extracted and are less likely to require extraction using a femoral approach compared with conventional pacing leads. Copyright © 2015 Heart Rhythm Society. Published by Elsevier Inc. All rights reserved.
Analysis techniques for multivariate root loci. [a tool in linear control systems
NASA Technical Reports Server (NTRS)
Thompson, P. M.; Stein, G.; Laub, A. J.
1980-01-01
Analysis and techniques are developed for the multivariable root locus and the multivariable optimal root locus. The generalized eigenvalue problem is used to compute angles and sensitivities for both types of loci, and an algorithm is presented that determines the asymptotic properties of the optimal root locus.
Methods for presentation and display of multivariate data
NASA Technical Reports Server (NTRS)
Myers, R. H.
1981-01-01
Methods for the presentation and display of multivariate data are discussed with emphasis placed on the multivariate analysis of variance problems and the Hotelling T(2) solution in the two-sample case. The methods utilize the concepts of stepwise discrimination analysis and the computation of partial correlation coefficients.
A Primer on Multivariate Analysis of Variance (MANOVA) for Behavioral Scientists
ERIC Educational Resources Information Center
Warne, Russell T.
2014-01-01
Reviews of statistical procedures (e.g., Bangert & Baumberger, 2005; Kieffer, Reese, & Thompson, 2001; Warne, Lazo, Ramos, & Ritter, 2012) show that one of the most common multivariate statistical methods in psychological research is multivariate analysis of variance (MANOVA). However, MANOVA and its associated procedures are often not…
Sun, Jin; Rutkoski, Jessica E; Poland, Jesse A; Crossa, José; Jannink, Jean-Luc; Sorrells, Mark E
2017-07-01
High-throughput phenotyping (HTP) platforms can be used to measure traits that are genetically correlated with wheat ( L.) grain yield across time. Incorporating such secondary traits in the multivariate pedigree and genomic prediction models would be desirable to improve indirect selection for grain yield. In this study, we evaluated three statistical models, simple repeatability (SR), multitrait (MT), and random regression (RR), for the longitudinal data of secondary traits and compared the impact of the proposed models for secondary traits on their predictive abilities for grain yield. Grain yield and secondary traits, canopy temperature (CT) and normalized difference vegetation index (NDVI), were collected in five diverse environments for 557 wheat lines with available pedigree and genomic information. A two-stage analysis was applied for pedigree and genomic selection (GS). First, secondary traits were fitted by SR, MT, or RR models, separately, within each environment. Then, best linear unbiased predictions (BLUPs) of secondary traits from the above models were used in the multivariate prediction models to compare predictive abilities for grain yield. Predictive ability was substantially improved by 70%, on average, from multivariate pedigree and genomic models when including secondary traits in both training and test populations. Additionally, (i) predictive abilities slightly varied for MT, RR, or SR models in this data set, (ii) results indicated that including BLUPs of secondary traits from the MT model was the best in severe drought, and (iii) the RR model was slightly better than SR and MT models under drought environment. Copyright © 2017 Crop Science Society of America.
Can we discover double Higgs production at the LHC?
NASA Astrophysics Data System (ADS)
Alves, Alexandre; Ghosh, Tathagata; Sinha, Kuver
2017-08-01
We explore double Higgs production via gluon fusion in the b b ¯γ γ channel at the high-luminosity LHC using machine learning tools. We first propose a Bayesian optimization approach to select cuts on kinematic variables, obtaining a 30%-50% increase in the significance compared to current results in the literature. We show that this improvement persists once systematic uncertainties are taken into account. We next use boosted decision trees (BDT) to further discriminate signal and background events. Our analysis shows that a joint optimization of kinematic cuts and BDT hyperparameters results in an appreciable improvement in the significance. Finally, we perform a multivariate analysis of the output scores of the BDT. We find that assuming a very low level of systematics, the techniques proposed here will be able to confirm the production of a pair of standard model Higgs bosons at 5 σ level with 3 ab-1 of data. Assuming a more realistic projection of the level of systematics, around 10%, the optimization of cuts to train BDTs combined with a multivariate analysis delivers a respectable significance of 4.6 σ . Even assuming large systematics of 20%, our analysis predicts a 3.6 σ significance, which represents at least strong evidence in favor of double Higgs production. We carefully incorporate background contributions coming from light flavor jets or c jets being misidentified as b jets and jets being misidentified as photons in our analysis.
Lee, Byeong-Ju; Zhou, Yaoyao; Lee, Jae Soung; Shin, Byeung Kon; Seo, Jeong-Ah; Lee, Doyup; Kim, Young-Suk
2018-01-01
The ability to determine the origin of soybeans is an important issue following the inclusion of this information in the labeling of agricultural food products becoming mandatory in South Korea in 2017. This study was carried out to construct a prediction model for discriminating Chinese and Korean soybeans using Fourier-transform infrared (FT-IR) spectroscopy and multivariate statistical analysis. The optimal prediction models for discriminating soybean samples were obtained by selecting appropriate scaling methods, normalization methods, variable influence on projection (VIP) cutoff values, and wave-number regions. The factors for constructing the optimal partial-least-squares regression (PLSR) prediction model were using second derivatives, vector normalization, unit variance scaling, and the 4000–400 cm–1 region (excluding water vapor and carbon dioxide). The PLSR model for discriminating Chinese and Korean soybean samples had the best predictability when a VIP cutoff value was not applied. When Chinese soybean samples were identified, a PLSR model that has the lowest root-mean-square error of the prediction value was obtained using a VIP cutoff value of 1.5. The optimal PLSR prediction model for discriminating Korean soybean samples was also obtained using a VIP cutoff value of 1.5. This is the first study that has combined FT-IR spectroscopy with normalization methods, VIP cutoff values, and selected wave-number regions for discriminating Chinese and Korean soybeans. PMID:29689113
Lee, Byeong-Ju; Kim, Hye-Youn; Lim, Sa Rang; Huang, Linfang; Choi, Hyung-Kyoon
2017-01-01
Panax ginseng C.A. Meyer is a herb used for medicinal purposes, and its discrimination according to cultivation age has been an important and practical issue. This study employed Fourier-transform infrared (FT-IR) spectroscopy with multivariate statistical analysis to obtain a prediction model for discriminating cultivation ages (5 and 6 years) and three different parts (rhizome, tap root, and lateral root) of P. ginseng. The optimal partial-least-squares regression (PLSR) models for discriminating ginseng samples were determined by selecting normalization methods, number of partial-least-squares (PLS) components, and variable influence on projection (VIP) cutoff values. The best prediction model for discriminating 5- and 6-year-old ginseng was developed using tap root, vector normalization applied after the second differentiation, one PLS component, and a VIP cutoff of 1.0 (based on the lowest root-mean-square error of prediction value). In addition, for discriminating among the three parts of P. ginseng, optimized PLSR models were established using data sets obtained from vector normalization, two PLS components, and VIP cutoff values of 1.5 (for 5-year-old ginseng) and 1.3 (for 6-year-old ginseng). To our knowledge, this is the first study to provide a novel strategy for rapidly discriminating the cultivation ages and parts of P. ginseng using FT-IR by selected normalization methods, number of PLS components, and VIP cutoff values.
Lim, Sa Rang; Huang, Linfang
2017-01-01
Panax ginseng C.A. Meyer is a herb used for medicinal purposes, and its discrimination according to cultivation age has been an important and practical issue. This study employed Fourier-transform infrared (FT-IR) spectroscopy with multivariate statistical analysis to obtain a prediction model for discriminating cultivation ages (5 and 6 years) and three different parts (rhizome, tap root, and lateral root) of P. ginseng. The optimal partial-least-squares regression (PLSR) models for discriminating ginseng samples were determined by selecting normalization methods, number of partial-least-squares (PLS) components, and variable influence on projection (VIP) cutoff values. The best prediction model for discriminating 5- and 6-year-old ginseng was developed using tap root, vector normalization applied after the second differentiation, one PLS component, and a VIP cutoff of 1.0 (based on the lowest root-mean-square error of prediction value). In addition, for discriminating among the three parts of P. ginseng, optimized PLSR models were established using data sets obtained from vector normalization, two PLS components, and VIP cutoff values of 1.5 (for 5-year-old ginseng) and 1.3 (for 6-year-old ginseng). To our knowledge, this is the first study to provide a novel strategy for rapidly discriminating the cultivation ages and parts of P. ginseng using FT-IR by selected normalization methods, number of PLS components, and VIP cutoff values. PMID:29049369
Tariq, Saadia R; Shah, Munir H; Shaheen, Nazia
2009-09-30
Two tanning units of Pakistan, namely, Kasur and Mian Channun were investigated with respect to the tanning processes (chrome and vegetable, respectively) and the effects of the tanning agents on the quality of soil in vicinity of tanneries were evaluated. The effluent and soil samples from 16 tanneries each of Kasur and Mian Channun were collected. The levels of selected metals (Na, K, Ca, Mg, Fe, Cr, Mn, Co, Cd, Ni, Pb and Zn) were determined by using flame atomic absorption spectrophotometer under optimum analytical conditions. The data thus obtained were subjected to univariate and multivariate statistical analyses. Most of the metals exhibited considerably higher concentrations in the effluents and soils of Kasur compared with those of Mian Channun. It was observed that the soil of Kasur was highly contaminated by Na, K, Ca and Mg emanating from various processes of leather manufacture. Furthermore, the levels of Cr were also present at much enhanced levels than its background concentration due to the adoption of chrome tanning. The levels of Cr determined in soil samples collected from the vicinity of Mian Channun tanneries were almost comparable to the background levels. The soil of this city was found to have contaminated only by the metals originating from pre-tanning processes. The apportionment of selected metals in the effluent and soil samples was determined by a multivariate cluster analysis, which revealed significant differences in chrome and vegetable tanning processes.
Bhaumik, Runa; Jenkins, Lisanne M; Gowins, Jennifer R; Jacobs, Rachel H; Barba, Alyssa; Bhaumik, Dulal K; Langenecker, Scott A
2017-01-01
Understanding abnormal resting-state functional connectivity of distributed brain networks may aid in probing and targeting mechanisms involved in major depressive disorder (MDD). To date, few studies have used resting state functional magnetic resonance imaging (rs-fMRI) to attempt to discriminate individuals with MDD from individuals without MDD, and to our knowledge no investigations have examined a remitted (r) population. In this study, we examined the efficiency of support vector machine (SVM) classifier to successfully discriminate rMDD individuals from healthy controls (HCs) in a narrow early-adult age range. We empirically evaluated four feature selection methods including multivariate Least Absolute Shrinkage and Selection Operator (LASSO) and Elastic Net feature selection algorithms. Our results showed that SVM classification with Elastic Net feature selection achieved the highest classification accuracy of 76.1% (sensitivity of 81.5% and specificity of 68.9%) by leave-one-out cross-validation across subjects from a dataset consisting of 38 rMDD individuals and 29 healthy controls. The highest discriminating functional connections were between the left amygdala, left posterior cingulate cortex, bilateral dorso-lateral prefrontal cortex, and right ventral striatum. These appear to be key nodes in the etiopathophysiology of MDD, within and between default mode, salience and cognitive control networks. This technique demonstrates early promise for using rs-fMRI connectivity as a putative neurobiological marker capable of distinguishing between individuals with and without rMDD. These methods may be extended to periods of risk prior to illness onset, thereby allowing for earlier diagnosis, prevention, and intervention.
Pre and Post-copulatory Selection Favor Similar Genital Phenotypes in the Male Broad Horned Beetle
House, Clarissa M.; Sharma, M. D.; Okada, Kensuke; Hosken, David J.
2016-01-01
Sexual selection can operate before and after copulation and the same or different trait(s) can be targeted during these episodes of selection. The direction and form of sexual selection imposed on characters prior to mating has been relatively well described, but the same is not true after copulation. In general, when male–male competition and female choice favor the same traits then there is the expectation of reinforcing selection on male sexual traits that improve competitiveness before and after copulation. However, when male–male competition overrides pre-copulatory choice then the opposite could be true. With respect to studies of selection on genitalia there is good evidence that male genital morphology influences mating and fertilization success. However, whether genital morphology affects reproductive success in more than one context (i.e., mating versus fertilization success) is largely unknown. Here we use multivariate analysis to estimate linear and nonlinear selection on male body size and genital morphology in the flour beetle Gnatocerus cornutus, simulated in a non-competitive (i.e., monogamous) setting. This analysis estimates the form of selection on multiple traits and typically, linear (directional) selection is easiest to detect, while nonlinear selection is more complex and can be stabilizing, disruptive, or correlational. We find that mating generates stabilizing selection on male body size and genitalia, and fertilization causes a blend of directional and stabilizing selection. Differences in the form of selection across these bouts of selection result from a significant alteration of nonlinear selection on body size and a marginally significant difference in nonlinear selection on a component of genital shape. This suggests that both bouts of selection favor similar genital phenotypes, whereas the strong stabilizing selection imposed on male body size during mate acquisition is weak during fertilization. PMID:27371390
NASA Astrophysics Data System (ADS)
Yan, Ying; Zhang, Shen; Tang, Jinjun; Wang, Xiaofei
2017-07-01
Discovering dynamic characteristics in traffic flow is the significant step to design effective traffic managing and controlling strategy for relieving traffic congestion in urban cities. A new method based on complex network theory is proposed to study multivariate traffic flow time series. The data were collected from loop detectors on freeway during a year. In order to construct complex network from original traffic flow, a weighted Froenius norm is adopt to estimate similarity between multivariate time series, and Principal Component Analysis is implemented to determine the weights. We discuss how to select optimal critical threshold for networks at different hour in term of cumulative probability distribution of degree. Furthermore, two statistical properties of networks: normalized network structure entropy and cumulative probability of degree, are utilized to explore hourly variation in traffic flow. The results demonstrate these two statistical quantities express similar pattern to traffic flow parameters with morning and evening peak hours. Accordingly, we detect three traffic states: trough, peak and transitional hours, according to the correlation between two aforementioned properties. The classifying results of states can actually represent hourly fluctuation in traffic flow by analyzing annual average hourly values of traffic volume, occupancy and speed in corresponding hours.
Multivariate Analysis and Machine Learning in Cerebral Palsy Research
Zhang, Jing
2017-01-01
Cerebral palsy (CP), a common pediatric movement disorder, causes the most severe physical disability in children. Early diagnosis in high-risk infants is critical for early intervention and possible early recovery. In recent years, multivariate analytic and machine learning (ML) approaches have been increasingly used in CP research. This paper aims to identify such multivariate studies and provide an overview of this relatively young field. Studies reviewed in this paper have demonstrated that multivariate analytic methods are useful in identification of risk factors, detection of CP, movement assessment for CP prediction, and outcome assessment, and ML approaches have made it possible to automatically identify movement impairments in high-risk infants. In addition, outcome predictors for surgical treatments have been identified by multivariate outcome studies. To make the multivariate and ML approaches useful in clinical settings, further research with large samples is needed to verify and improve these multivariate methods in risk factor identification, CP detection, movement assessment, and outcome evaluation or prediction. As multivariate analysis, ML and data processing technologies advance in the era of Big Data of this century, it is expected that multivariate analysis and ML will play a bigger role in improving the diagnosis and treatment of CP to reduce mortality and morbidity rates, and enhance patient care for children with CP. PMID:29312134
Multivariate Analysis and Machine Learning in Cerebral Palsy Research.
Zhang, Jing
2017-01-01
Cerebral palsy (CP), a common pediatric movement disorder, causes the most severe physical disability in children. Early diagnosis in high-risk infants is critical for early intervention and possible early recovery. In recent years, multivariate analytic and machine learning (ML) approaches have been increasingly used in CP research. This paper aims to identify such multivariate studies and provide an overview of this relatively young field. Studies reviewed in this paper have demonstrated that multivariate analytic methods are useful in identification of risk factors, detection of CP, movement assessment for CP prediction, and outcome assessment, and ML approaches have made it possible to automatically identify movement impairments in high-risk infants. In addition, outcome predictors for surgical treatments have been identified by multivariate outcome studies. To make the multivariate and ML approaches useful in clinical settings, further research with large samples is needed to verify and improve these multivariate methods in risk factor identification, CP detection, movement assessment, and outcome evaluation or prediction. As multivariate analysis, ML and data processing technologies advance in the era of Big Data of this century, it is expected that multivariate analysis and ML will play a bigger role in improving the diagnosis and treatment of CP to reduce mortality and morbidity rates, and enhance patient care for children with CP.
Huang, Jun; Kaul, Goldi; Cai, Chunsheng; Chatlapalli, Ramarao; Hernandez-Abad, Pedro; Ghosh, Krishnendu; Nagi, Arwinder
2009-12-01
To facilitate an in-depth process understanding, and offer opportunities for developing control strategies to ensure product quality, a combination of experimental design, optimization and multivariate techniques was integrated into the process development of a drug product. A process DOE was used to evaluate effects of the design factors on manufacturability and final product CQAs, and establish design space to ensure desired CQAs. Two types of analyses were performed to extract maximal information, DOE effect & response surface analysis and multivariate analysis (PCA and PLS). The DOE effect analysis was used to evaluate the interactions and effects of three design factors (water amount, wet massing time and lubrication time), on response variables (blend flow, compressibility and tablet dissolution). The design space was established by the combined use of DOE, optimization and multivariate analysis to ensure desired CQAs. Multivariate analysis of all variables from the DOE batches was conducted to study relationships between the variables and to evaluate the impact of material attributes/process parameters on manufacturability and final product CQAs. The integrated multivariate approach exemplifies application of QbD principles and tools to drug product and process development.
Varekar, Vikas; Karmakar, Subhankar; Jha, Ramakar
2016-02-01
The design of surface water quality sampling location is a crucial decision-making process for rationalization of monitoring network. The quantity, quality, and types of available dataset (watershed characteristics and water quality data) may affect the selection of appropriate design methodology. The modified Sanders approach and multivariate statistical techniques [particularly factor analysis (FA)/principal component analysis (PCA)] are well-accepted and widely used techniques for design of sampling locations. However, their performance may vary significantly with quantity, quality, and types of available dataset. In this paper, an attempt has been made to evaluate performance of these techniques by accounting the effect of seasonal variation, under a situation of limited water quality data but extensive watershed characteristics information, as continuous and consistent river water quality data is usually difficult to obtain, whereas watershed information may be made available through application of geospatial techniques. A case study of Kali River, Western Uttar Pradesh, India, is selected for the analysis. The monitoring was carried out at 16 sampling locations. The discrete and diffuse pollution loads at different sampling sites were estimated and accounted using modified Sanders approach, whereas the monitored physical and chemical water quality parameters were utilized as inputs for FA/PCA. The designed optimum number of sampling locations for monsoon and non-monsoon seasons by modified Sanders approach are eight and seven while that for FA/PCA are eleven and nine, respectively. Less variation in the number and locations of designed sampling sites were obtained by both techniques, which shows stability of results. A geospatial analysis has also been carried out to check the significance of designed sampling location with respect to river basin characteristics and land use of the study area. Both methods are equally efficient; however, modified Sanders approach outperforms FA/PCA when limited water quality and extensive watershed information is available. The available water quality dataset is limited and FA/PCA-based approach fails to identify monitoring locations with higher variation, as these multivariate statistical approaches are data-driven. The priority/hierarchy and number of sampling sites designed by modified Sanders approach are well justified by the land use practices and observed river basin characteristics of the study area.
Stukel, Thérèse A.; Fisher, Elliott S; Wennberg, David E.; Alter, David A.; Gottlieb, Daniel J.; Vermeulen, Marian J.
2007-01-01
Context Comparisons of outcomes between patients treated and untreated in observational studies may be biased due to differences in patient prognosis between groups, often because of unobserved treatment selection biases. Objective To compare 4 analytic methods for removing the effects of selection bias in observational studies: multivariable model risk adjustment, propensity score risk adjustment, propensity-based matching, and instrumental variable analysis. Design, Setting, and Patients A national cohort of 122 124 patients who were elderly (aged 65–84 years), receiving Medicare, and hospitalized with acute myocardial infarction (AMI) in 1994–1995, and who were eligible for cardiac catheterization. Baseline chart reviews were taken from the Cooperative Cardiovascular Project and linked to Medicare health administrative data to provide a rich set of prognostic variables. Patients were followed up for 7 years through December 31, 2001, to assess the association between long-term survival and cardiac catheterization within 30 days of hospital admission. Main Outcome Measure Risk-adjusted relative mortality rate using each of the analytic methods. Results Patients who received cardiac catheterization (n=73 238) were younger and had lower AMI severity than those who did not. After adjustment for prognostic factors by using standard statistical risk-adjustment methods, cardiac catheterization was associated with a 50% relative decrease in mortality (for multivariable model risk adjustment: adjusted relative risk [RR], 0.51; 95% confidence interval [CI], 0.50–0.52; for propensity score risk adjustment: adjusted RR, 0.54; 95% CI, 0.53–0.55; and for propensity-based matching: adjusted RR, 0.54; 95% CI, 0.52–0.56). Using regional catheterization rate as an instrument, instrumental variable analysis showed a 16% relative decrease in mortality (adjusted RR, 0.84; 95% CI, 0.79–0.90). The survival benefits of routine invasive care from randomized clinical trials are between 8% and 21 %. Conclusions Estimates of the observational association of cardiac catheterization with long-term AMI mortality are highly sensitive to analytic method. All standard risk-adjustment methods have the same limitations regarding removal of unmeasured treatment selection biases. Compared with standard modeling, instrumental variable analysis may produce less biased estimates of treatment effects, but is more suited to answering policy questions than specific clinical questions. PMID:17227979
Principal components analysis in clinical studies.
Zhang, Zhongheng; Castelló, Adela
2017-09-01
In multivariate analysis, independent variables are usually correlated to each other which can introduce multicollinearity in the regression models. One approach to solve this problem is to apply principal components analysis (PCA) over these variables. This method uses orthogonal transformation to represent sets of potentially correlated variables with principal components (PC) that are linearly uncorrelated. PCs are ordered so that the first PC has the largest possible variance and only some components are selected to represent the correlated variables. As a result, the dimension of the variable space is reduced. This tutorial illustrates how to perform PCA in R environment, the example is a simulated dataset in which two PCs are responsible for the majority of the variance in the data. Furthermore, the visualization of PCA is highlighted.
[Measurement of Water COD Based on UV-Vis Spectroscopy Technology].
Wang, Xiao-ming; Zhang, Hai-liang; Luo, Wei; Liu, Xue-mei
2016-01-01
Ultraviolet/visible (UV/Vis) spectroscopy technology was used to measure water COD. A total of 135 water samples were collected from Zhejiang province. Raw spectra with 3 different pretreatment methods (Multiplicative Scatter Correction (MSC), Standard Normal Variate (SNV) and 1st Derivatives were compared to determine the optimal pretreatment method for analysis. Spectral variable selection is an important strategy in spectrum modeling analysis, because it tends to parsimonious data representation and can lead to multivariate models with better performance. In order to simply calibration models, the preprocessed spectra were then used to select sensitive wavelengths by competitive adaptive reweighted sampling (CARS), Random frog and Successive Genetic Algorithm (GA) methods. Different numbers of sensitive wavelengths were selected by different variable selection methods with SNV preprocessing method. Partial least squares (PLS) was used to build models with the full spectra, and Extreme Learning Machine (ELM) was applied to build models with the selected wavelength variables. The overall results showed that ELM model performed better than PLS model, and the ELM model with the selected wavelengths based on CARS obtained the best results with the determination coefficient (R2), RMSEP and RPD were 0.82, 14.48 and 2.34 for prediction set. The results indicated that it was feasible to use UV/Vis with characteristic wavelengths which were obtained by CARS variable selection method, combined with ELM calibration could apply for the rapid and accurate determination of COD in aquaculture water. Moreover, this study laid the foundation for further implementation of online analysis of aquaculture water and rapid determination of other water quality parameters.
Estimating an Effect Size in One-Way Multivariate Analysis of Variance (MANOVA)
ERIC Educational Resources Information Center
Steyn, H. S., Jr.; Ellis, S. M.
2009-01-01
When two or more univariate population means are compared, the proportion of variation in the dependent variable accounted for by population group membership is eta-squared. This effect size can be generalized by using multivariate measures of association, based on the multivariate analysis of variance (MANOVA) statistics, to establish whether…
An Application of Multivariate Generalizability in Selection of Mathematically Gifted Students
ERIC Educational Resources Information Center
Kim, Sungyeun; Berebitsky, Dan
2016-01-01
This study investigates error sources and the effects of each error source to determine optimal weights of the composite score of teacher recommendation letters and self-introduction letters using multivariate generalizability theory. Data were collected from the science education institute for the gifted attached to the university located within…
Choosing the Greenest Synthesis: A Multivariate Metric Green Chemistry Exercise
ERIC Educational Resources Information Center
Mercer, Sean M.; Andraos, John; Jessop, Philip G.
2012-01-01
The ability to correctly identify the greenest of several syntheses is a particularly useful asset for young chemists in the growing green economy. The famous univariate metrics atom economy and environmental factor provide insufficient information to allow for a proper selection of a green process. Multivariate metrics, such as those used in…
Dangers in Using Analysis of Covariance Procedures.
ERIC Educational Resources Information Center
Campbell, Kathleen T.
Problems associated with the use of analysis of covariance (ANCOVA) as a statistical control technique are explained. Three problems relate to the use of "OVA" methods (analysis of variance, analysis of covariance, multivariate analysis of variance, and multivariate analysis of covariance) in general. These are: (1) the wasting of information when…
Royston, Patrick; Sauerbrei, Willi
2016-01-01
In a recent article, Royston (2015, Stata Journal 15: 275-291) introduced the approximate cumulative distribution (acd) transformation of a continuous covariate x as a route toward modeling a sigmoid relationship between x and an outcome variable. In this article, we extend the approach to multivariable modeling by modifying the standard Stata program mfp. The result is a new program, mfpa, that has all the features of mfp plus the ability to fit a new model for user-selected covariates that we call fp1( p 1 , p 2 ). The fp1( p 1 , p 2 ) model comprises the best-fitting combination of a dimension-one fractional polynomial (fp1) function of x and an fp1 function of acd ( x ). We describe a new model-selection algorithm called function-selection procedure with acd transformation, which uses significance testing to attempt to simplify an fp1( p 1 , p 2 ) model to a submodel, an fp1 or linear model in x or in acd ( x ). The function-selection procedure with acd transformation is related in concept to the fsp (fp function-selection procedure), which is an integral part of mfp and which is used to simplify a dimension-two (fp2) function. We describe the mfpa command and give univariable and multivariable examples with real data to demonstrate its use.
Pan, Xin; Lopez-Olivo, Maria A; Song, Juhee; Pratt, Gregory; Suarez-Almazor, Maria E
2017-03-01
We appraised the methodological and reporting quality of randomised controlled clinical trials (RCTs) evaluating the efficacy and safety of Chinese herbal medicine (CHM) in patients with rheumatoid arthritis (RA). For this systematic review, electronic databases were searched from inception until June 2015. The search was limited to humans and non-case report studies, but was not limited by language, year of publication or type of publication. Two independent reviewers selected RCTs, evaluating CHM in RA (herbals and decoctions). Descriptive statistics were used to report on risk of bias and their adherence to reporting standards. Multivariable logistic regression analysis was performed to determine study characteristics associated with high or unclear risk of bias. Out of 2342 unique citations, we selected 119 RCTs including 18 919 patients: 10 108 patients received CHM alone and 6550 received one of 11 treatment combinations. A high risk of bias was observed across all domains: 21% had a high risk for selection bias (11% from sequence generation and 30% from allocation concealment), 85% for performance bias, 89% for detection bias, 4% for attrition bias and 40% for reporting bias. In multivariable analysis, fewer authors were associated with selection bias (allocation concealment), performance bias and attrition bias, and earlier year of publication and funding source not reported or disclosed were associated with selection bias (sequence generation). Studies published in non-English language were associated with reporting bias. Poor adherence to recommended reporting standards (<60% of the studies not providing sufficient information) was observed in 11 of the 23 sections evaluated. Study quality and data extraction were performed by one reviewer and cross-checked by a second reviewer. Translation to English was performed by one reviewer in 85% of the included studies. Studies evaluating CHM often fail to meet expected methodological criteria, and high-quality evidence is lacking. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Insausti, Matías; Gomes, Adriano A; Cruz, Fernanda V; Pistonesi, Marcelo F; Araujo, Mario C U; Galvão, Roberto K H; Pereira, Claudete F; Band, Beatriz S F
2012-08-15
This paper investigates the use of UV-vis, near infrared (NIR) and synchronous fluorescence (SF) spectrometries coupled with multivariate classification methods to discriminate biodiesel samples with respect to the base oil employed in their production. More specifically, the present work extends previous studies by investigating the discrimination of corn-based biodiesel from two other biodiesel types (sunflower and soybean). Two classification methods are compared, namely full-spectrum SIMCA (soft independent modelling of class analogies) and SPA-LDA (linear discriminant analysis with variables selected by the successive projections algorithm). Regardless of the spectrometric technique employed, full-spectrum SIMCA did not provide an appropriate discrimination of the three biodiesel types. In contrast, all samples were correctly classified on the basis of a reduced number of wavelengths selected by SPA-LDA. It can be concluded that UV-vis, NIR and SF spectrometries can be successfully employed to discriminate corn-based biodiesel from the two other biodiesel types, but wavelength selection by SPA-LDA is key to the proper separation of the classes. Copyright © 2012 Elsevier B.V. All rights reserved.
Sørensen, Klavs M; Westley, Chloe; Goodacre, Royston; Engelsen, Søren Balling
2015-10-01
This study investigates the feasibility of using surface-enhanced Raman scattering (SERS) for the quantification of absolute levels of the boar-taint compounds skatole and androstenone in porcine fat. By investigation of different types of nanoparticles, pH and aggregating agents, an optimized environment that promotes SERS of the analytes was developed and tested with different multivariate spectral pre-processing techniques, and this was combined with variable selection on a series of analytical standards. The resulting method exhibited prediction errors (root mean square error of cross validation, RMSECV) of 2.4 × 10(-6) M skatole and 1.2 × 10(-7) M androstenone, with a limit of detection corresponding to approximately 2.1 × 10(-11) M for skatole and approximately 1.8 × 10(-10) for androstenone. The method was subsequently tested on porcine fat extract, leading to prediction errors (RMSECV) of 0.17 μg/g for skatole and 1.5 μg/g for androstenone. It is clear that this optimized SERS method, when combined with multivariate analysis, shows great potential for optimization into an on-line application, which will be the first of its kind, and opens up possibilities for simultaneous detection of other meat-quality metabolites or pathogen markers. Graphical abstract Artistic rendering of a laser-illuminated gold colloid sphere with skatole and androstenone adsorbed on the surface.
Sun, Wei; Huang, Guo H; Zeng, Guangming; Qin, Xiaosheng; Yu, Hui
2011-03-01
It is widely known that variation of the C/N ratio is dependent on many state variables during composting processes. This study attempted to develop a genetic algorithm aided stepwise cluster analysis (GASCA) method to describe the nonlinear relationships between the selected state variables and the C/N ratio in food waste composting. The experimental data from six bench-scale composting reactors were used to demonstrate the applicability of GASCA. Within the GASCA framework, GA searched optimal sets of both specified state variables and SCA's internal parameters; SCA established statistical nonlinear relationships between state variables and the C/N ratio; to avoid unnecessary and time-consuming calculation, a proxy table was introduced to save around 70% computational efforts. The obtained GASCA cluster trees had smaller sizes and higher prediction accuracy than the conventional SCA trees. Based on the optimal GASCA tree, the effects of the GA-selected state variables on the C/N ratio were ranged in a descending order as: NH₄+-N concentration>Moisture content>Ash Content>Mean Temperature>Mesophilic bacteria biomass. Such a rank implied that the variation of ammonium nitrogen concentration, the associated temperature and the moisture conditions, the total loss of both organic matters and available mineral constituents, and the mesophilic bacteria activity, were critical factors affecting the C/N ratio during the investigated food waste composting. This first application of GASCA to composting modelling indicated that more direct search algorithms could be coupled with SCA or other multivariate analysis methods to analyze complicated relationships during composting and many other environmental processes. Copyright © 2010 Elsevier B.V. All rights reserved.
Milot, Marie-Hélène; Spencer, Steven J; Chan, Vicky; Allington, James P; Klein, Julius; Chou, Cathy; Pearson-Fuhrhop, Kristin; Bobrow, James E; Reinkensmeyer, David J; Cramer, Steven C
2014-01-01
Robotic training can help improve function of a paretic limb following a stroke, but individuals respond differently to the training. A predictor of functional gains might improve the ability to select those individuals more likely to benefit from robot-based therapy. Studies evaluating predictors of functional improvement after a robotic training are scarce. One study has found that white matter tract integrity predicts functional gains following a robotic training of the hand and wrist. Objective. To determine the predictive ability of behavioral and brain measures in order to improve selection of individuals for robotic training. Twenty subjects with chronic stroke participated in an 8-week course of robotic exoskeletal training for the arm. Before training, a clinical evaluation, functional magnetic resonance imaging (fMRI), diffusion tensor imaging, and transcranial magnetic stimulation (TMS) were each measured as predictors. Final functional gain was defined as change in the Box and Block Test (BBT). Measures significant in bivariate analysis were fed into a multivariate linear regression model. Training was associated with an average gain of 6 ± 5 blocks on the BBT (P < .0001). Bivariate analysis revealed that lower baseline motor-evoked potential (MEP) amplitude on TMS, and lower laterality M1 index on fMRI each significantly correlated with greater BBT change. In the multivariate linear regression analysis, baseline MEP magnitude was the only measure that remained significant. Subjects with lower baseline MEP magnitude benefited the most from robotic training of the affected arm. These subjects might have reserve remaining for the training to boost corticospinal excitability, translating into functional gains. © The Author(s) 2014.
Wong, Andrew T; Shao, Meng; Rineer, Justin; Lee, Anna; Schwartz, David; Schreiber, David
2017-06-01
The objective of this study was to analyze the impact on overall survival (OS) from the addition of postoperative radiation with or without chemotherapy after esophagectomy, using a large, hospital-based dataset. Previous retrospective studies have suggested an OS advantage for postoperative chemoradiation over surgery alone, although prospective data are lacking. The National Cancer Data Base was queried to select patients diagnosed with stage pT3-4Nx-0M0 or pT1-4N1-3M0 esophageal carcinoma (squamous cell or adenocarcinoma) from 1998 to 2011 treated with definitive esophagectomy ± postoperative radiation and/or chemotherapy. OS was analyzed using the Kaplan-Meier method and compared using the log-rank test. Multivariate Cox regression analysis was used to identify covariates associated with OS. There were 4893 patients selected, of whom 1153 (23.6%) received postoperative radiation. Most patients receiving radiation also received sequential/concomitant chemotherapy (89.9%). For the entire cohort, postoperative radiation was associated with a statistically significant but modest absolute improvement in survival (hazard ratio 0.77; 95% CI, 0.71-0.83; P < 0.001). On subgroup analysis, postoperative radiation was associated with improved OS for patients with node-positive disease (3-yr OS 34.3 % vs 27.8%, P < 0.001) or positive margins (3-yr OS 36.4% vs 18.0%, P < 0.001). When chemotherapy usage was incorporated, sequential chemotherapy was associated with the best survival (P < 0.001). Multivariate analysis revealed that the addition of chemotherapy to radiation therapy, whether sequentially or concurrently, was a strong prognostic factor for OS. In this hospital-based study, the addition of postoperative chemoradiation (either sequentially or concomitantly) after esophagectomy was associated with improved OS for patients with node-positive disease or positive margins.
Tada, Toshifumi; Kumada, Takashi; Toyoda, Hidenori; Kiriyama, Seiki; Tanikawa, Makoto; Hisanaga, Yasuhiro; Kanamori, Akira; Kitabatake, Shusuke; Yama, Tsuyoki
2015-09-01
It has been reported that the branched-chain amino acid (BCAA) to tyrosine ratio (BTR) is a useful indicator of liver function and BCAA therapy is associated with a decreased incidence of hepatocellular carcinoma (HCC). However, there has not been sufficient research on the relationship between BTR and the effects of BCAA therapy after initial treatment of HCC. We investigated the impact of BTR and BCAA therapy on survival in patients with HCC. A total of 315 patients with HCC who were treated (n = 66) or not treated (n = 249) with BCAA were enrolled; of these, 66 were selected from each group using propensity score matching. Survival from liver-related mortality was analyzed. In patients who did not receive BCAA therapy (n = 249), multivariate analysis for factors associated with survival indicated that low BTR (≤ 4.4) was independently associated with poor prognosis in patients with HCC (hazard ratio, 1.880; 95% confidence interval, 1.125-3.143; P = 0.016). In addition, among patients selected by propensity score matching (n = 132), multivariate analysis indicated that BCAA therapy was independently associated with good prognosis in patients with HCC (hazard ratio, 0.524; 95% confidence interval, 0.282-0.973; P = 0.041). BTR was not significantly associated with survival. Intervention involving BCAA therapy improved survival in patients with HCC versus untreated controls, regardless of BTR. In addition, low BTR was associated with poor prognosis in patients who did not receive BCAA therapy. © 2015 Journal of Gastroenterology and Hepatology Foundation and Wiley Publishing Asia Pty Ltd.
Risk factors associated with gastric cancer in Mexico: education, breakfast and chili.
Trujillo Rivera, Alejandro; Sampieri, Clara Luz; Morales Romero, Jaime; Montero, Hilda; Acosta Mesa, Héctor Gabriel; Cruz Ramírez, Nicandro; Novoa Del Toro, Elva María; León Córdoba, Kenneth
2018-06-01
the aim of the study was to use a validated questionnaire to identify factors associated with the development of gastric cancer (GC) in the Mexican population. the study included cases and controls that were paired by sex and ± 10 years of age at diagnosis. In relation to cases, 46 patients with a confirmed histopathological diagnosis of adenocarcinoma-type GC, as reported in the hospital records, were selected, and 46 blood bank donors from the same hospital were included as controls. The previously validated Questionnaire to Find Factors Associated with Gastric Cancer (QUFA-GC©) was used to collect data. Odds ratio (OR) and 95% confidence interval (IC) were estimated via univariate analysis (paired OR). Multivariate analysis was performed by logistic regression. A decision tree was constructed using the J48 algorithm. an association was found by univariate analysis between GC risk and a lack of formal education, having smoked for ≥ 10 years, eating rapidly, consuming very hot food and drinks, a non-suitable breakfast within two hours of waking, pickled food and capsaicin. In contrast, a protective association against GC was found with taking recreational exercise and consuming fresh fruit and vegetables. No association was found between the development of GC and having an income that reflected poverty, using a refrigerator, perception of the omission of breakfast and time period of alcoholism. In the final multivariate analysis model, having no formal education (OR = 17.47, 95% CI = 5.17-76.69), consuming a non-suitable breakfast within two hours of waking (OR = 8.99, 95% CI = 2.85-35.50) and the consumption of capsaicin ˃ 29.9 mg capsaicin per day (OR = 3.77, 95% CI = 1.21-13.11) were factors associated with GC. an association was found by multivariate analysis between the presence of GC and education, type of breakfast and the consumption of capsaicin. These variables are susceptible to intervention and can be identified via the QUFA-GC ©.
Fast Genome-Wide QTL Association Mapping on Pedigree and Population Data.
Zhou, Hua; Blangero, John; Dyer, Thomas D; Chan, Kei-Hang K; Lange, Kenneth; Sobel, Eric M
2017-04-01
Since most analysis software for genome-wide association studies (GWAS) currently exploit only unrelated individuals, there is a need for efficient applications that can handle general pedigree data or mixtures of both population and pedigree data. Even datasets thought to consist of only unrelated individuals may include cryptic relationships that can lead to false positives if not discovered and controlled for. In addition, family designs possess compelling advantages. They are better equipped to detect rare variants, control for population stratification, and facilitate the study of parent-of-origin effects. Pedigrees selected for extreme trait values often segregate a single gene with strong effect. Finally, many pedigrees are available as an important legacy from the era of linkage analysis. Unfortunately, pedigree likelihoods are notoriously hard to compute. In this paper, we reexamine the computational bottlenecks and implement ultra-fast pedigree-based GWAS analysis. Kinship coefficients can either be based on explicitly provided pedigrees or automatically estimated from dense markers. Our strategy (a) works for random sample data, pedigree data, or a mix of both; (b) entails no loss of power; (c) allows for any number of covariate adjustments, including correction for population stratification; (d) allows for testing SNPs under additive, dominant, and recessive models; and (e) accommodates both univariate and multivariate quantitative traits. On a typical personal computer (six CPU cores at 2.67 GHz), analyzing a univariate HDL (high-density lipoprotein) trait from the San Antonio Family Heart Study (935,392 SNPs on 1,388 individuals in 124 pedigrees) takes less than 2 min and 1.5 GB of memory. Complete multivariate QTL analysis of the three time-points of the longitudinal HDL multivariate trait takes less than 5 min and 1.5 GB of memory. The algorithm is implemented as the Ped-GWAS Analysis (Option 29) in the Mendel statistical genetics package, which is freely available for Macintosh, Linux, and Windows platforms from http://genetics.ucla.edu/software/mendel. © 2016 WILEY PERIODICALS, INC.
Tables of Optimal Allocations of Observations for Comparing Treatments with a Control.
1981-01-01
variatevpp v ,p,P t-distribution (p-variate Itl-distribution) with d.f. v and equal corre- lations p = N /C (N ); tables of t are given by Krishnaiah and1 0 1...values of (tp,/) 2 tabulated for selected -d and 1-a by Krishnaiah and Armitage (1965). Entries for v = 60 1.. the tables of Hahn and Hendrickson (1971...with a Control," Multivariate Analysis - II (Ed. P.R. Krishnaiah ), New York: Academic Press, 463-473. Bechhofer, R.E. and Nocturne, D.J. (1972), "Optimal
Strasberg, Steven M; Gao, Feng; Sanford, Dominic; Linehan, David C; Hawkins, William G; Fields, Ryan; Carpenter, Danielle H; Brunt, Elizabeth M; Phillips, Carolyn
2014-01-01
Objectives: Jaundice impairs cellular immunity, an important defence against the dissemination of cancer. Jaundice is a common mode of presentation in pancreatic head adenocarcinoma. The purpose of this study was to determine whether there is an association between preoperative jaundice and survival in patients who have undergone resection of such tumours. Methods: Thirty possible survival risk factors were evaluated in a database of over 400 resected patients. Univariate analysis was used to determine odds ratio for death. All factors for which a P-value of <0.30 was obtained were entered into a multivariate analysis using the Cox model with backward selection. Results: Preoperative jaundice, age, positive node status, poor differentiation and lymphatic invasion were significant indicators of poor outcome in multivariate analysis. Absence of jaundice was a highly favourable prognostic factor. Interaction emerged between jaundice and nodal status. The benefit conferred by the absence of jaundice was restricted to patients in whom negative node status was present. Five-year overall survival in this group was 66%. Jaundiced patients who underwent preoperative stenting had a survival advantage. Conclusions: Preoperative jaundice is a negative risk factor in adenocarcinoma of the pancreas. Additional studies are required to determine the exact mechanism for this effect. PMID:23600768
Hutengs, Christopher; Ludwig, Bernard; Jung, András; Eisele, Andreas; Vohland, Michael
2018-03-27
Mid-infrared (MIR) spectroscopy has received widespread interest as a method to complement traditional soil analysis. Recently available portable MIR spectrometers additionally offer potential for on-site applications, given sufficient spectral data quality. We therefore tested the performance of the Agilent 4300 Handheld FTIR (DRIFT spectra) in comparison to a Bruker Tensor 27 bench-top instrument in terms of (i) spectral quality and measurement noise quantified by wavelet analysis; (ii) accuracy of partial least squares (PLS) calibrations for soil organic carbon (SOC), total nitrogen (N), pH, clay and sand content with a repeated cross-validation analysis; and (iii) key spectral regions for these soil properties identified with a Monte Carlo spectral variable selection approach. Measurements and multivariate calibrations with the handheld device were as good as or slightly better than Bruker equipped with a DRIFT accessory, but not as accurate as with directional hemispherical reflectance (DHR) data collected with an integrating sphere. Variations in noise did not markedly affect the accuracy of multivariate PLS calibrations. Identified key spectral regions for PLS calibrations provided a good match between Agilent and Bruker DHR data, especially for SOC and N. Our findings suggest that portable FTIR instruments are a viable alternative for MIR measurements in the laboratory and offer great potential for on-site applications.
H. Pylori as a predictor of marginal ulceration: A nationwide analysis.
Schulman, Allison R; Abougergi, Marwan S; Thompson, Christopher C
2017-03-01
Helicobacter pylori has been implicated as a risk factor for development of marginal ulceration following gastric bypass, although studies have been small and yielded conflicting results. This study sought to determine the relationship between H. pylori infection and development of marginal ulceration following bariatric surgery in a nationwide analysis. This was a retrospective cohort study using the 2012 Nationwide Inpatient Sample (NIS) database. Discharges with ICD-9-CM code indicating marginal ulceration and a secondary ICD-9-CM code for bariatric surgery were included. Primary outcome was incidence of marginal ulceration. A stepwise forward selection model was used to build the multivariate logistic regression model based on known risk factors. A P value of 0.05 was considered significant. There were 253,765 patients who met inclusion criteria. Prevalence of marginal ulceration was 3.90%. Of those patients found to have marginal ulceration, 31.20% of patients were H. pylori-positive. Final multivariate regression analysis revealed that H. pylori was the strongest independent predictor of marginal ulceration. H. pylori is an independent predictor of marginal ulceration using a large national database. Preoperative testing for and eradication of H. pylori prior to bariatric surgery may be an important preventive measure to reduce the incidence of ulcer development. © 2017 The Obesity Society.
Villiger, P; Ryan, E A; Owen, R; O'Kelly, K; Oberholzer, J; Al Saif, F; Kin, T; Wang, H; Larsen, I; Blitz, S L; Menon, V; Senior, P; Bigam, D L; Paty, B; Kneteman, N M; Lakey, J R T; Shapiro, A M James
2005-12-01
Islet transplantation is being offered increasingly for selected patients with unstable type 1 diabetes. Percutaneous transhepatic portal access avoids a need for surgery, but is associated with potential risk of bleeding. Between 1999 and 2005, we performed 132 percutaneous transhepatic islet transplants in 67 patients. We encountered bleeding in 18/132 cases (13.6%). In univariate analysis, the risk of bleeding in the absence of effective track ablation was associated with an increasing number of procedures (2nd and 3rd procedures with an odds ratio (OR) of 9.5 and 20.9, respectively), platelets count <150,000 (OR 4.4), elevated portal pressure (OR 1.1 per mm Hg rise), heparin dose > or =45 U/kg (OR 9.8) and pre-transplant aspirin (81 mg per day) (OR 2.6, p = 0.05). A multivariate analysis further confirmed the cumulative transplant procedure number (p < 0.001) and heparin dose > or =45 U/kg (p = 0.02) as independent risk factors for bleeding. Effective mechanical sealing of the intrahepatic portal catheter tract with thrombostatic coils and tissue fibrin glue completely prevented bleeding in all subsequent procedures (n = 26, p = 0.02). We conclude that bleeding after percutaneous islet implantation is an avoidable complication provided the intraparenchymal liver tract is sealed effectively.
Endpoint in plasma etch process using new modified w-multivariate charts and windowed regression
NASA Astrophysics Data System (ADS)
Zakour, Sihem Ben; Taleb, Hassen
2017-09-01
Endpoint detection is very important undertaking on the side of getting a good understanding and figuring out if a plasma etching process is done in the right way, especially if the etched area is very small (0.1%). It truly is a crucial part of supplying repeatable effects in every single wafer. When the film being etched has been completely cleared, the endpoint is reached. To ensure the desired device performance on the produced integrated circuit, the high optical emission spectroscopy (OES) sensor is employed. The huge number of gathered wavelengths (profiles) is then analyzed and pre-processed using a new proposed simple algorithm named Spectra peak selection (SPS) to select the important wavelengths, then we employ wavelet analysis (WA) to enhance the performance of detection by suppressing noise and redundant information. The selected and treated OES wavelengths are then used in modified multivariate control charts (MEWMA and Hotelling) for three statistics (mean, SD and CV) and windowed polynomial regression for mean. The employ of three aforementioned statistics is motivated by controlling mean shift, variance shift and their ratio (CV) if both mean and SD are not stable. The control charts show their performance in detecting endpoint especially W-mean Hotelling chart and the worst result is given by CV statistic. As the best detection of endpoint is given by the W-Hotelling mean statistic, this statistic will be used to construct a windowed wavelet Hotelling polynomial regression. This latter can only identify the window containing endpoint phenomenon.
Jerlström, Tomas; Gårdmark, Truls; Carringer, Malcolm; Holmäng, Sten; Liedberg, Fredrik; Hosseini, Abolfazl; Malmström, Per-Uno; Ljungberg, Börje; Hagberg, Oskar; Jahnson, Staffan
2014-08-01
Cystectomy combined with pelvic lymph-node dissection and urinary diversion entails high morbidity and mortality. Improvements are needed, and a first step is to collect information on the current situation. In 2011, this group took the initiative to start a population-based database in Sweden (population 9.5 million in 2011) with prospective registration of patients and complications until 90 days after cystectomy. This article reports findings from the first year of registration. Participation was voluntary, and data were reported by local urologists or research nurses. Perioperative parameters and early complications classified according to the modified Clavien system were registered, and selected variables of possible importance for complications were analysed by univariate and multivariate logistic regression. During 2011, 285 (65%) of 435 cystectomies performed in Sweden were registered in the database, the majority reported by the seven academic centres. Median blood loss was 1000 ml, operating time 318 min, and length of hospital stay 15 days. Any complications were registered for 103 patients (36%). Clavien grades 1-2 and 3-5 were noted in 19% and 15%, respectively. Thirty-seven patients (13%) were reoperated on at least once. In logistic regression analysis elevated risk of complications was significantly associated with operating time exceeding 318 min in both univariate and multivariate analysis, and with age 76-89 years only in multivariate analysis. It was feasible to start a national population-based registry of radical cystectomies for bladder cancer. The evaluation of the first year shows an increased risk of complications in patients with longer operating time and higher age. The results agree with some previously published series but should be interpreted with caution considering the relatively low coverage, which is expected to be higher in the future.
NASA Astrophysics Data System (ADS)
Eilert, Tobias; Beckers, Maximilian; Drechsler, Florian; Michaelis, Jens
2017-10-01
The analysis tool and software package Fast-NPS can be used to analyse smFRET data to obtain quantitative structural information about macromolecules in their natural environment. In the algorithm a Bayesian model gives rise to a multivariate probability distribution describing the uncertainty of the structure determination. Since Fast-NPS aims to be an easy-to-use general-purpose analysis tool for a large variety of smFRET networks, we established an MCMC based sampling engine that approximates the target distribution and requires no parameter specification by the user at all. For an efficient local exploration we automatically adapt the multivariate proposal kernel according to the shape of the target distribution. In order to handle multimodality, the sampler is equipped with a parallel tempering scheme that is fully adaptive with respect to temperature spacing and number of chains. Since the molecular surrounding of a dye molecule affects its spatial mobility and thus the smFRET efficiency, we introduce dye models which can be selected for every dye molecule individually. These models allow the user to represent the smFRET network in great detail leading to an increased localisation precision. Finally, a tool to validate the chosen model combination is provided. Programme Files doi:http://dx.doi.org/10.17632/7ztzj63r68.1 Licencing provisions: Apache-2.0 Programming language: GUI in MATLAB (The MathWorks) and the core sampling engine in C++ Nature of problem: Sampling of highly diverse multivariate probability distributions in order to solve for macromolecular structures from smFRET data. Solution method: MCMC algorithm with fully adaptive proposal kernel and parallel tempering scheme.
Sun, Hui; Wang, Huiyu; Zhang, Aihua; Yan, Guangli; Han, Ying; Li, Yuan; Wu, Xiuhong; Meng, Xiangcai; Wang, Xijun
2016-01-01
As herbal medicines have an important position in health care systems worldwide, their current assessment, and quality control are a major bottleneck. Cortex Phellodendri chinensis (CPC) and Cortex Phellodendri amurensis (CPA) are widely used in China, however, how to identify species of CPA and CPC has become urgent. In this study, multivariate analysis approach was performed to the investigation of chemical discrimination of CPA and CPC. Principal component analysis showed that two herbs could be separated clearly. The chemical markers such as berberine, palmatine, phellodendrine, magnoflorine, obacunone, and obaculactone were identified through the orthogonal partial least squared discriminant analysis, and were identified tentatively by the accurate mass of quadruple-time-of-flight mass spectrometry. A total of 29 components can be used as the chemical markers for discrimination of CPA and CPC. Of them, phellodenrine is significantly higher in CPC than that of CPA, whereas obacunone and obaculactone are significantly higher in CPA than that of CPC. The present study proves that multivariate analysis approach based chemical analysis greatly contributes to the investigation of CPA and CPC, and showed that the identified chemical markers as a whole should be used to discriminate the two herbal medicines, and simultaneously the results also provided chemical information for their quality assessment. Multivariate analysis approach was performed to the investigate the herbal medicineThe chemical markers were identified through multivariate analysis approachA total of 29 components can be used as the chemical markers. UPLC-Q/TOF-MS-based multivariate analysis method for the herbal medicine samples Abbreviations used: CPC: Cortex Phellodendri chinensis, CPA: Cortex Phellodendri amurensis, PCA: Principal component analysis, OPLS-DA: Orthogonal partial least squares discriminant analysis, BPI: Base peaks ion intensity.
Refusal of postoperative radiotherapy and its association with survival in head and neck cancer.
Schwam, Zachary G; Husain, Zain; Judson, Benjamin L
2015-11-01
Administering postoperative radiotherapy (PORT) is associated with improved survival and slower disease progression in select head and neck cancer patients. Predictive factors for PORT refusal have not been described in this population. Retrospective analysis of 6127 head and neck cancer patients who received or refused PORT in the National Cancer Database (2003-2006) was performed. Statistical analysis included Chi-square, multivariable logistic regression, Kaplan-Meier, and Cox proportional hazards analysis. In total, 247 patients (4.0%) refused PORT. Three-year overall survival was 62.8% versus 53.4% for those who received and refused PORT, respectively. PORT refusers were more likely to have negative nodes than those who underwent PORT (37.4% versus 20.1%, p<.001). In multivariate analysis, predictive factors for refusing PORT included living far from the treatment facility (OR 1.92), having negative nodes (OR 2.14), and Charlson score of ⩾ 2 (OR 2.14) (all p ⩽.001). PORT refusal was associated with increased mortality (hazard ratio 1.20, p=.044). A significant proportion of head and neck cancer patients refused PORT; this was associated with compromised overall survival. Predictive factors for PORT refusal included socioeconomic, demographic, and pathologic variables. Elucidating root causes of refusal may lead to interventions that improve long-term outcomes. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Devarajan, Karthik; Parsons, Theodore; Wang, Qiong; O'Neill, Raymond; Solomides, Charalambos; Peiper, Stephen C.; Testa, Joseph R.; Uzzo, Robert; Yang, Haifeng
2017-01-01
Intratumoral heterogeneity (ITH) is a prominent feature of kidney cancer. It is not known whether it has utility in finding associations between protein expression and clinical parameters. We used ITH that is detected by immunohistochemistry (IHC) to aid the association analysis between the loss of SWI/SNF components and clinical parameters.160 ccRCC tumors (40 per tumor stage) were used to generate tissue microarray (TMA). Four foci from different regions of each tumor were selected. IHC was performed against PBRM1, ARID1A, SETD2, SMARCA4, and SMARCA2. Statistical analyses were performed to correlate biomarker losses with patho-clinical parameters. Categorical variables were compared between groups using Fisher's exact tests. Univariate and multivariable analyses were used to correlate biomarker changes and patient survivals. Multivariable analyses were performed by constructing decision trees using the classification and regression trees (CART) methodology. IHC detected widespread ITH in ccRCC tumors. The statistical analysis of the “Truncal loss” (root loss) found additional correlations between biomarker losses and tumor stages than the traditional “Loss in tumor (total)”. Losses of SMARCA4 or SMARCA2 significantly improved prognosis for overall survival (OS). Losses of PBRM1, ARID1A or SETD2 had the opposite effect. Thus “Truncal Loss” analysis revealed hidden links between protein losses and patient survival in ccRCC. PMID:28445125
Belaz, Kátia Roberta A; Pereira-Filho, Edenir Rodrigues; Oliveira, Regina V
2013-08-01
In this work, the development of two multidimensional liquid chromatography methods coupled to a fluorescence detector is described for direct analysis of microsomal fractions obtained from rat livers. The chiral multidimensional method was then applied for the optimization of the in vitro metabolism of albendazole by experimental design. Albendazole was selected as a model drug because of its anthelmintics properties and recent potential for cancer treatment. The development of two fully automated achiral-chiral and chiral-chiral high performance liquid chromatography (HPLC) methods for the determination of albendazole (ABZ) and its metabolites albendazole sulphoxide (ABZ-SO), albendazole sulphone (ABZ-SO2) and albendazole 2-aminosulphone (ABZ-SO2NH2) in microsomal fractions are described. These methods involve the use of a phenyl (RAM-phenyl-BSA) or octyl (RAM-C8-BSA) restricted access media bovine serum albumin column for the sample clean-up, followed by an achiral phenyl column (15.0×0.46cmI.D.) or a chiral amylose tris(3,5-dimethylphenylcarbamate) column (15.0×0.46cmI.D.). The chiral 2D HPLC method was applied to the development of a compromise condition for the in vitro metabolism of ABZ by means of experimental design involving multivariate analysis. Copyright © 2013 Elsevier B.V. All rights reserved.
Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti
2016-07-01
A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness.Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Code is available at https://github.com/aalto-ics-kepaco anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J.; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T.; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti
2016-01-01
Motivation: A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. Results: We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness. Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Availability and implementation: Code is available at https://github.com/aalto-ics-kepaco Contacts: anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153689
Bayesian Factor Analysis as a Variable Selection Problem: Alternative Priors and Consequences
Lu, Zhao-Hua; Chow, Sy-Miin; Loken, Eric
2016-01-01
Factor analysis is a popular statistical technique for multivariate data analysis. Developments in the structural equation modeling framework have enabled the use of hybrid confirmatory/exploratory approaches in which factor loading structures can be explored relatively flexibly within a confirmatory factor analysis (CFA) framework. Recently, a Bayesian structural equation modeling (BSEM) approach (Muthén & Asparouhov, 2012) has been proposed as a way to explore the presence of cross-loadings in CFA models. We show that the issue of determining factor loading patterns may be formulated as a Bayesian variable selection problem in which Muthén and Asparouhov’s approach can be regarded as a BSEM approach with ridge regression prior (BSEM-RP). We propose another Bayesian approach, denoted herein as the Bayesian structural equation modeling with spike and slab prior (BSEM-SSP), which serves as a one-stage alternative to the BSEM-RP. We review the theoretical advantages and disadvantages of both approaches and compare their empirical performance relative to two modification indices-based approaches and exploratory factor analysis with target rotation. A teacher stress scale data set (Byrne, 2012; Pettegrew & Wolf, 1982) is used to demonstrate our approach. PMID:27314566
Effectively Transforming IMC Flight into VMC Flight: An SVS Case Study
NASA Technical Reports Server (NTRS)
Glaab, Louis J.; Hughes, Monic F.; Parrish, Russell V.; Takallu, Mohammad A.
2006-01-01
A flight-test experiment was conducted using the NASA LaRC Cessna 206 aircraft. Four primary flight and navigation display concepts, including baseline and Synthetic Vision System (SVS) concepts, were evaluated in the local area of Roanoke Virginia Airport, flying visual and instrument approach procedures. A total of 19 pilots, from 3 pilot groups reflecting the diverse piloting skills of the GA population, served as evaluation pilots. Multi-variable Discriminant Analysis was applied to three carefully selected and markedly different operating conditions with conventional instrumentation to provide an extension of traditional analysis methods as well as provide an assessment of the effectiveness of SVS displays to effectively transform IMC flight into VMC flight.
Comparative effectiveness research in cancer with observational data.
Giordano, Sharon H
2015-01-01
Observational studies are increasingly being used for comparative effectiveness research. These studies can have the greatest impact when randomized trials are not feasible or when randomized studies have not included the population or outcomes of interest. However, careful attention must be paid to study design to minimize the likelihood of selection biases. Analytic techniques, such as multivariable regression modeling, propensity score analysis, and instrumental variable analysis, also can also be used to help address confounding. Oncology has many existing large and clinically rich observational databases that can be used for comparative effectiveness research. With careful study design, observational studies can produce valid results to assess the benefits and harms of a treatment or intervention in representative real-world populations.
Using Interactive Graphics to Teach Multivariate Data Analysis to Psychology Students
ERIC Educational Resources Information Center
Valero-Mora, Pedro M.; Ledesma, Ruben D.
2011-01-01
This paper discusses the use of interactive graphics to teach multivariate data analysis to Psychology students. Three techniques are explored through separate activities: parallel coordinates/boxplots; principal components/exploratory factor analysis; and cluster analysis. With interactive graphics, students may perform important parts of the…
Liljeholm, Mimi; Zika, Ondrej; O'Doherty, John P.
2015-01-01
While there is accumulating evidence for the existence of distinct neural systems supporting goal-directed and habitual action selection in the mammalian brain, much less is known about the nature of the information being processed in these different brain regions. Associative learning theory predicts that brain systems involved in habitual control, such as the dorsolateral striatum, should contain stimulus and response information only, but not outcome information, while regions involved in goal-directed action, such as ventromedial and dorsolateral prefrontal cortex and dorsomedial striatum, should be involved in processing information about outcomes as well as stimuli and responses. To test this prediction, human participants underwent fMRI while engaging in a binary choice task designed to enable the separate identification of these different representations with a multivariate classification analysis approach. Consistent with our predictions, the dorsolateral striatum contained information about responses but not outcomes at the time of an initial stimulus, while the regions implicated in goal-directed action selection contained information about both responses and outcomes. These findings suggest that differential contributions of these regions to habitual and goal-directed behavioral control may depend in part on basic differences in the type of information that these regions have access to at the time of decision making. PMID:25740507
A power analysis for multivariate tests of temporal trend in species composition.
Irvine, Kathryn M; Dinger, Eric C; Sarr, Daniel
2011-10-01
Long-term monitoring programs emphasize power analysis as a tool to determine the sampling effort necessary to effectively document ecologically significant changes in ecosystems. Programs that monitor entire multispecies assemblages require a method for determining the power of multivariate statistical models to detect trend. We provide a method to simulate presence-absence species assemblage data that are consistent with increasing or decreasing directional change in species composition within multiple sites. This step is the foundation for using Monte Carlo methods to approximate the power of any multivariate method for detecting temporal trends. We focus on comparing the power of the Mantel test, permutational multivariate analysis of variance, and constrained analysis of principal coordinates. We find that the power of the various methods we investigate is sensitive to the number of species in the community, univariate species patterns, and the number of sites sampled over time. For increasing directional change scenarios, constrained analysis of principal coordinates was as or more powerful than permutational multivariate analysis of variance, the Mantel test was the least powerful. However, in our investigation of decreasing directional change, the Mantel test was typically as or more powerful than the other models.
Miaw, Carolina Sheng Whei; Assis, Camila; Silva, Alessandro Rangel Carolino Sales; Cunha, Maria Luísa; Sena, Marcelo Martins; de Souza, Scheilla Vitorino Carvalho
2018-07-15
Grape, orange, peach and passion fruit nectars were formulated and adulterated by dilution with syrup, apple and cashew juices at 10 levels for each adulterant. Attenuated total reflectance Fourier transform mid infrared (ATR-FTIR) spectra were obtained. Partial least squares (PLS) multivariate calibration models allied to different variable selection methods, such as interval partial least squares (iPLS), ordered predictors selection (OPS) and genetic algorithm (GA), were used to quantify the main fruits. PLS improved by iPLS-OPS variable selection showed the highest predictive capacity to quantify the main fruit contents. The selected variables in the final models varied from 72 to 100; the root mean square errors of prediction were estimated from 0.5 to 2.6%; the correlation coefficients of prediction ranged from 0.948 to 0.990; and, the mean relative errors of prediction varied from 3.0 to 6.7%. All of the developed models were validated. Copyright © 2018 Elsevier Ltd. All rights reserved.
Mueller, Daniela; Ferrão, Marco Flôres; Marder, Luciano; da Costa, Adilson Ben; de Cássia de Souza Schneider, Rosana
2013-01-01
The main objective of this study was to use infrared spectroscopy to identify vegetable oils used as raw material for biodiesel production and apply multivariate analysis to the data. Six different vegetable oil sources—canola, cotton, corn, palm, sunflower and soybeans—were used to produce biodiesel batches. The spectra were acquired by Fourier transform infrared spectroscopy using a universal attenuated total reflectance sensor (FTIR-UATR). For the multivariate analysis principal component analysis (PCA), hierarchical cluster analysis (HCA), interval principal component analysis (iPCA) and soft independent modeling of class analogy (SIMCA) were used. The results indicate that is possible to develop a methodology to identify vegetable oils used as raw material in the production of biodiesel by FTIR-UATR applying multivariate analysis. It was also observed that the iPCA found the best spectral range for separation of biodiesel batches using FTIR-UATR data, and with this result, the SIMCA method classified 100% of the soybean biodiesel samples. PMID:23539030
Multivariate meta-analysis for non-linear and other multi-parameter associations
Gasparrini, A; Armstrong, B; Kenward, M G
2012-01-01
In this paper, we formalize the application of multivariate meta-analysis and meta-regression to synthesize estimates of multi-parameter associations obtained from different studies. This modelling approach extends the standard two-stage analysis used to combine results across different sub-groups or populations. The most straightforward application is for the meta-analysis of non-linear relationships, described for example by regression coefficients of splines or other functions, but the methodology easily generalizes to any setting where complex associations are described by multiple correlated parameters. The modelling framework of multivariate meta-analysis is implemented in the package mvmeta within the statistical environment R. As an illustrative example, we propose a two-stage analysis for investigating the non-linear exposure–response relationship between temperature and non-accidental mortality using time-series data from multiple cities. Multivariate meta-analysis represents a useful analytical tool for studying complex associations through a two-stage procedure. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22807043
Fitzpatrick, John L; Simmons, Leigh W; Evans, Jonathan P
2012-08-01
Assessing how selection operates on several, potentially interacting, components of the ejaculate is a challenging endeavor. Ejaculates can be subject to natural and/or sexual selection, which can impose both linear (directional) and nonlinear (stabilizing, disruptive, and correlational) selection on different ejaculate components. Most previous studies have examined linear selection of ejaculate components and, consequently, we know very little about patterns of nonlinear selection on the ejaculate. Even less is known about how selection acts on the ejaculate as a functionally integrated unit, despite evidence of covariance among ejaculate components. Here, we assess how selection acts on multiple ejaculate components simultaneously in the broadcast spawning sessile invertebrate Mytilus galloprovincialis using the statistical tools of multivariate selection analyses. Our analyses of relative fertilization rates revealed complex patterns of selection on sperm velocity, motility, and morphology. Interestingly, the most successful ejaculates were made up of slower swimming sperm with relatively low percentages of motile cells, and sperm with smaller head volumes that swam in highly pronounced curved swimming trajectories. These results are consistent with an emerging body of literature on fertilization kinetics in broadcast spawners, and shed light on the fundamental nature of selection acting on the ejaculate as a functionally integrated unit. © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.
The Potential of Multivariate Analysis in Assessing Students' Attitude to Curriculum Subjects
ERIC Educational Resources Information Center
Gaotlhobogwe, Michael; Laugharne, Janet; Durance, Isabelle
2011-01-01
Background: Understanding student attitudes to curriculum subjects is central to providing evidence-based options to policy makers in education. Purpose: We illustrate how quantitative approaches used in the social sciences and based on multivariate analysis (categorical Principal Components Analysis, Clustering Analysis and General Linear…
Guzman, L; Ortega-Hrepich, C; Polyzos, N P; Anckaert, E; Verheyen, G; Coucke, W; Devroey, P; Tournaye, H; Smitz, J; De Vos, M
2013-05-01
Which baseline patient characteristics can help assisted reproductive technology practitioners to identify patients who are suitable for in-vitro maturation (IVM) treatment? In patients with polycystic ovary syndrome (PCOS) who undergo oocyte IVM in a non-hCG-triggered system, circulating anti-Müllerian hormone (AMH), antral follicle count (AFC) and total testosterone are independently related to the number of immature oocytes and hold promise as outcome predictors to guide the patient selection process for IVM. Patient selection criteria for IVM treatment have been described in normo-ovulatory patients, although patients with PCOS constitute the major target population for IVM. With this study, we assessed the independent predictive value of clinical and endocrine parameters that are related to oocyte yield in patients with PCOS undergoing IVM. Cohort study involving 124 consecutive patients with PCOS undergoing IVM whose data were prospectively collected. Enrolment took place between January 2010 and January 2012. Only data relating to the first IVM cycle of each patient were included. Patients with PCOS underwent oocyte retrieval for IVM after minimal gonadotrophin stimulation and no hCG trigger. Correlation coefficients were calculated to investigate which parameters are related to immature oocyte yield (patient's age, BMI, baseline hormonal profile and AMH, AFC). The independence of predictive parameters was tested using multivariate linear regression analysis. Finally, multivariate receiver operating characteristic (ROC) analyses for cumulus oocyte complexes (COC) yield were performed to assess the efficiency of the prediction model to select suitable candidates for IVM. Using multivariate regression analysis, circulating baseline AMH, AFC and baseline total testosterone serum concentration were incorporated into a model to predict the number of COC retrieved in an IVM cycle, with unstandardized coefficients [95% confidence interval (CI)] of 0.03 (0.02-0.03) (P < 0.001), 0.012 (0.008-0.017) (P < 0.001) and 0.37 (0.18-0.57) (P < 0.001), respectively. Logistic regression analysis shows that a prediction model based on AMH and AFC, with unstandardized coefficients (95% CI) of 0.148 (0.03-0.25) (P < 0.001) and 0.034 (-0.003-0.07) (P = 0.025), respectively, is a useful patient selection tool to predict the probability to yield at least eight COCs for IVM in patients with PCOS. In this population, patients with at least eight COC available for IVM have a statistically higher number of embryos of good morphological quality (2.9 ± 2.3; 0.9 ± 0.9; P < 0.001) and cumulative ongoing pregnancy rate [30.4% (24 out of 79); 11% (5 out of 45); P = 0.01] when compared with patients with less than eight COC. ROC curve analysis showed that this prediction model has an area under the curve of 0.7864 (95% CI = 0.6997-0.8732) for the prediction of oocyte yield in IVM. The proposed model has been constructed based on a genuine IVM system, i.e. no hCG trigger was given and none of the oocytes matured in vivo. However, other variables, such as needle type, aspiration technique and whether or not hCG-triggering is used, should be considered as confounding factors. The results of this study have to be confirmed using a second independent validation sample. The proposed model could be applied to patients with PCOS after confirmation through a further validation study. This study was supported by a research grant by the Institute for the Promotion of Innovation by Science and Technology in Flanders, Project number IWT 070719.
Integrative Exploratory Analysis of Two or More Genomic Datasets.
Meng, Chen; Culhane, Aedin
2016-01-01
Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on a same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing a more easily identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed to this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.
Two-sample tests and one-way MANOVA for multivariate biomarker data with nondetects.
Thulin, M
2016-09-10
Testing whether the mean vector of a multivariate set of biomarkers differs between several populations is an increasingly common problem in medical research. Biomarker data is often left censored because some measurements fall below the laboratory's detection limit. We investigate how such censoring affects multivariate two-sample and one-way multivariate analysis of variance tests. Type I error rates, power and robustness to increasing censoring are studied, under both normality and non-normality. Parametric tests are found to perform better than non-parametric alternatives, indicating that the current recommendations for analysis of censored multivariate data may have to be revised. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
A non-iterative extension of the multivariate random effects meta-analysis.
Makambi, Kepher H; Seung, Hyunuk
2015-01-01
Multivariate methods in meta-analysis are becoming popular and more accepted in biomedical research despite computational issues in some of the techniques. A number of approaches, both iterative and non-iterative, have been proposed including the multivariate DerSimonian and Laird method by Jackson et al. (2010), which is non-iterative. In this study, we propose an extension of the method by Hartung and Makambi (2002) and Makambi (2001) to multivariate situations. A comparison of the bias and mean square error from a simulation study indicates that, in some circumstances, the proposed approach perform better than the multivariate DerSimonian-Laird approach. An example is presented to demonstrate the application of the proposed approach.
Multivariate Autoregressive Modeling and Granger Causality Analysis of Multiple Spike Trains
Krumin, Michael; Shoham, Shy
2010-01-01
Recent years have seen the emergence of microelectrode arrays and optical methods allowing simultaneous recording of spiking activity from populations of neurons in various parts of the nervous system. The analysis of multiple neural spike train data could benefit significantly from existing methods for multivariate time-series analysis which have proven to be very powerful in the modeling and analysis of continuous neural signals like EEG signals. However, those methods have not generally been well adapted to point processes. Here, we use our recent results on correlation distortions in multivariate Linear-Nonlinear-Poisson spiking neuron models to derive generalized Yule-Walker-type equations for fitting ‘‘hidden” Multivariate Autoregressive models. We use this new framework to perform Granger causality analysis in order to extract the directed information flow pattern in networks of simulated spiking neurons. We discuss the relative merits and limitations of the new method. PMID:20454705
A refined method for multivariate meta-analysis and meta-regression.
Jackson, Daniel; Riley, Richard D
2014-02-20
Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects' standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. Copyright © 2013 John Wiley & Sons, Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Johnson, Kevin J.; Wright, Bob W.; Jarman, Kristin H.
2003-05-09
A rapid retention time alignment algorithm was developed as a preprocessing utility to be used prior to chemometric analysis of large datasets of diesel fuel gas chromatographic profiles. Retention time variation from chromatogram-to-chromatogram has been a significant impediment against the use of chemometric techniques in the analysis of chromatographic data due to the inability of current multivariate techniques to correctly model information that shifts from variable to variable within a dataset. The algorithm developed is shown to increase the efficacy of pattern recognition methods applied to a set of diesel fuel chromatograms by retaining chemical selectivity while reducing chromatogram-to-chromatogram retentionmore » time variations and to do so on a time scale that makes analysis of large sets of chromatographic data practical.« less
Lai, Shih-Wei; Lai, Hsueh-Chou; Lin, Cheng-Li; Liao, Kuan-Fu; Tseng, Chun-Hung
2015-07-01
The objective of this study was to examine the relationship between chronic osteomyelitis and acute pancreatitis in Taiwan. This was a population-based case-control study utilizing the database of the Taiwan National Health Insurance Program. We identified 7678 cases aged 20-84 with newly diagnosed acute pancreatitis during the period of 1998 to 2011. From the same database, 30,712 subjects without diagnosis of acute pancreatitis were selected as controls. The cases and controls were matched with sex, age and index year of diagnosing acute pancreatitis. The odds ratio with 95% confidence interval of acute pancreatitis associated with chronic osteomyelitis was examined by the multivariable unconditional logistic regression analysis. After adjustment for multiple confounders, the multivariable analysis showed that the adjusted odds ratio of acute pancreatitis was 1.93 for subjects with chronic osteomyelitis (95% confidence interval 1.01, 3.69), when compared with subjects without chronic osteomyelitis. Chronic osteomyelitis correlates with increased risk of acute pancreatitis. Patients with chronic osteomyelitis should be carefully monitored about the risk of acute pancreatitis. Copyright © 2015 European Federation of Internal Medicine. Published by Elsevier B.V. All rights reserved.
Retro-regression--another important multivariate regression improvement.
Randić, M
2001-01-01
We review the serious problem associated with instabilities of the coefficients of regression equations, referred to as the MRA (multivariate regression analysis) "nightmare of the first kind". This is manifested when in a stepwise regression a descriptor is included or excluded from a regression. The consequence is an unpredictable change of the coefficients of the descriptors that remain in the regression equation. We follow with consideration of an even more serious problem, referred to as the MRA "nightmare of the second kind", arising when optimal descriptors are selected from a large pool of descriptors. This process typically causes at different steps of the stepwise regression a replacement of several previously used descriptors by new ones. We describe a procedure that resolves these difficulties. The approach is illustrated on boiling points of nonanes which are considered (1) by using an ordered connectivity basis; (2) by using an ordering resulting from application of greedy algorithm; and (3) by using an ordering derived from an exhaustive search for optimal descriptors. A novel variant of multiple regression analysis, called retro-regression (RR), is outlined showing how it resolves the ambiguities associated with both "nightmares" of the first and the second kind of MRA.
[Violence and post-traumatic stress disorder in childhood].
Ximenes, Liana Furtado; de Oliveira, Raquel de Vasconcelos Carvalhães; de Assis, Simone Gonçalves
2009-01-01
This study presents the prevalence of symptoms of Posttraumatic Stress Disorder (PTSD) in 500 schoolchildren (6-13 years old) in São Gonçalo, Rio de Janeiro. It also investigates the association between PTSD, violence and other adverse events in the lives of these children. The multi-stage cluster sampling strategy involved three selection stages. Parents were interviewed about their children's behavior. The instrument used to screen symptoms of PTSD was the Child Behavior Checklist-Posttraumatic Stress Disorder Scale (CBCL-PTSD). Conflict Tactics Scales (CTS) were applied to evaluate family violence and other scales to investigate the socioeconomic profile, familiar relationship, characteristics and adverse events in the lives of the children. Multivariate analysis was performed using a hierarchical model with a significance level of 5%. The prevalence of clinical symptoms of PTSD was of 6.5%. The multivariate analysis suggested an explanation model of PTSD characterized by 18 variables, such as the child's characteristics; specific life events; family violence; and other family factors. The results reveal that it is necessary to work with the child in particularly difficult moments of his/her life in order to prevent or minimize the impact of adverse events on their mental and social functioning.
Tufto, Jarle
2010-01-01
Domesticated species frequently spread their genes into populations of wild relatives through interbreeding. The domestication process often involves artificial selection for economically desirable traits. This can lead to an indirect response in unknown correlated traits and a reduction in fitness of domesticated individuals in the wild. Previous models for the effect of gene flow from domesticated species to wild relatives have assumed that evolution occurs in one dimension. Here, I develop a quantitative genetic model for the balance between migration and multivariate stabilizing selection. Different forms of correlational selection consistent with a given observed ratio between average fitness of domesticated and wild individuals offsets the phenotypic means at migration-selection balance away from predictions based on simpler one-dimensional models. For almost all parameter values, correlational selection leads to a reduction in the migration load. For ridge selection, this reduction arises because the distance the immigrants deviates from the local optimum in effect is reduced. For realistic parameter values, however, the effect of correlational selection on the load is small, suggesting that simpler one-dimensional models may still be adequate in terms of predicting mean population fitness and viability.
A Machine Learning Approach to Automated Gait Analysis for the Noldus Catwalk System.
Frohlich, Holger; Claes, Kasper; De Wolf, Catherine; Van Damme, Xavier; Michel, Anne
2018-05-01
Gait analysis of animal disease models can provide valuable insights into in vivo compound effects and thus help in preclinical drug development. The purpose of this paper is to establish a computational gait analysis approach for the Noldus Catwalk system, in which footprints are automatically captured and stored. We present a - to our knowledge - first machine learning based approach for the Catwalk system, which comprises a step decomposition, definition and extraction of meaningful features, multivariate step sequence alignment, feature selection, and training of different classifiers (gradient boosting machine, random forest, and elastic net). Using animal-wise leave-one-out cross validation we demonstrate that with our method we can reliable separate movement patterns of a putative Parkinson's disease animal model and several control groups. Furthermore, we show that we can predict the time point after and the type of different brain lesions and can even forecast the brain region, where the intervention was applied. We provide an in-depth analysis of the features involved into our classifiers via statistical techniques for model interpretation. A machine learning method for automated analysis of data from the Noldus Catwalk system was established. Our works shows the ability of machine learning to discriminate pharmacologically relevant animal groups based on their walking behavior in a multivariate manner. Further interesting aspects of the approach include the ability to learn from past experiments, improve with more data arriving and to make predictions for single animals in future studies.
Meta-analysis of prognostic value of inflammation parameter in breast cancer.
Chen, Jie; Pan, Yuqin; He, Bangshun; Ying, Houqun; Sun, Huiling; Deng, Qiwen; Liu, Xian; Wang, Shukui
2018-01-01
Recently, increasing studies investigated the association between inflammation parameter such as neutrophil to lymphocyte ratio (NLR) and the prognosis of cancers. However, the clinical and prognostic significance of NLR in breast cancer remains controversial. This meta-analysis was conducted to establish the overall accuracy of the NLR test in the diagnosis of breast cancer. A comprehensive search of the literature was conducted using PubMed and Web of Science. Six studies dating up to July 2014 with 2267 patients were enrolled in the present study. STATA 11.0 software (STATA Corporation, College Station, TX, USA) was selected for data analysis. In order to evaluate the association between NLR and overall survival (OS), disease-free survival (DFS), recurrence-free survival or cancer-specific survival, the hazard ratios (HRs), and their 95% confidence intervals (CIs) were extracted. Subgroup analyses showed that NLR was a strong prognostic factor for OS in multivariate analysis (HR = 2.81, 95% CI = 2.13-3.71, P H = 0.992) and without metastasis (HR = 1.45, 95% CI = 0.37-5.66, P H < 0.001). Elevated NLR was associated with a high risk for DFS in subgroups of multivariate analysis (HR = 2.16, 95% CI = 1.67-2.80, P H = 0.977) and mixed metastasis (HR = 2.13, 95% CI = 1.38-3.30, P H = 0.84). In summary, NLR may be considered as a predictive factor for patients with breast cancer.
Chasset, Thibaut; Häbe, Tim T; Ristivojevic, Petar; Morlock, Gertrud E
2016-09-23
Quality control of propolis is challenging, as it is a complex natural mixture of compounds, and thus, very difficult to analyze and standardize. Shown on the example of 30 French propolis samples, a strategy for an improved quality control was demonstrated in which high-performance thin-layer chromatography (HPTLC) fingerprints were evaluated in combination with selected mass signals obtained by desorption-based scanning mass spectrometry (MS). The French propolis sample extracts were separated by a newly developed reversed phase (RP)-HPTLC method. The fingerprints obtained by two different detection modes, i.e. after (1) derivatization and fluorescence detection (FLD) at UV 366nm and (2) scanning direct analysis in real time (DART)-MS, were analyzed by multivariate data analysis. Thus, RP-HPTLC-FLD and RP-HPTLC-DART-MS fingerprints were explored and the best classification was obtained using both methods in combination with pattern recognition techniques, such as principal component analysis. All investigated French propolis samples were divided in two types and characteristic patterns were observed. Phenolic compounds such as caffeic acid, p-coumaric acid, chrysin, pinobanksin, pinobanksin-3-acetate, galangin, kaempferol, tectochrysin and pinocembrin were identified as characteristic marker compounds of French propolis samples. This study expanded the research on the European poplar type of propolis and confirmed the presence of two botanically different types of propolis, known as the blue and orange types. Copyright © 2016 Elsevier B.V. All rights reserved.
Simulation Exploration through Immersive Parallel Planes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brunhart-Lupo, Nicholas J; Bush, Brian W; Gruchalla, Kenny M
We present a visualization-driven simulation system that tightly couples systems dynamics simulations with an immersive virtual environment to allow analysts to rapidly develop and test hypotheses in a high-dimensional parameter space. To accomplish this, we generalize the two-dimensional parallel-coordinates statistical graphic as an immersive 'parallel-planes' visualization for multivariate time series emitted by simulations running in parallel with the visualization. In contrast to traditional parallel coordinate's mapping the multivariate dimensions onto coordinate axes represented by a series of parallel lines, we map pairs of the multivariate dimensions onto a series of parallel rectangles. As in the case of parallel coordinates, eachmore » individual observation in the dataset is mapped to a polyline whose vertices coincide with its coordinate values. Regions of the rectangles can be 'brushed' to highlight and select observations of interest: a 'slider' control allows the user to filter the observations by their time coordinate. In an immersive virtual environment, users interact with the parallel planes using a joystick that can select regions on the planes, manipulate selection, and filter time. The brushing and selection actions are used to both explore existing data as well as to launch additional simulations corresponding to the visually selected portions of the input parameter space. As soon as the new simulations complete, their resulting observations are displayed in the virtual environment. This tight feedback loop between simulation and immersive analytics accelerates users' realization of insights about the simulation and its output.« less
Simulation Exploration through Immersive Parallel Planes: Preprint
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brunhart-Lupo, Nicholas; Bush, Brian W.; Gruchalla, Kenny
We present a visualization-driven simulation system that tightly couples systems dynamics simulations with an immersive virtual environment to allow analysts to rapidly develop and test hypotheses in a high-dimensional parameter space. To accomplish this, we generalize the two-dimensional parallel-coordinates statistical graphic as an immersive 'parallel-planes' visualization for multivariate time series emitted by simulations running in parallel with the visualization. In contrast to traditional parallel coordinate's mapping the multivariate dimensions onto coordinate axes represented by a series of parallel lines, we map pairs of the multivariate dimensions onto a series of parallel rectangles. As in the case of parallel coordinates, eachmore » individual observation in the dataset is mapped to a polyline whose vertices coincide with its coordinate values. Regions of the rectangles can be 'brushed' to highlight and select observations of interest: a 'slider' control allows the user to filter the observations by their time coordinate. In an immersive virtual environment, users interact with the parallel planes using a joystick that can select regions on the planes, manipulate selection, and filter time. The brushing and selection actions are used to both explore existing data as well as to launch additional simulations corresponding to the visually selected portions of the input parameter space. As soon as the new simulations complete, their resulting observations are displayed in the virtual environment. This tight feedback loop between simulation and immersive analytics accelerates users' realization of insights about the simulation and its output.« less
Jędrkiewicz, Renata; Tsakovski, Stefan; Lavenu, Aurore; Namieśnik, Jacek; Tobiszewski, Marek
2018-02-01
Novel methodology for grouping and ranking with application of self-organizing maps and multicriteria decision analysis is presented. The dataset consists of 22 objects that are analytical procedures applied to furan determination in food samples. They are described by 10 variables, referred to their analytical performance, environmental and economic aspects. Multivariate statistics analysis allows to limit the amount of input data for ranking analysis. Assessment results show that the most beneficial procedures are based on microextraction techniques with GC-MS final determination. It is presented how the information obtained from both tools complement each other. The applicability of combination of grouping and ranking is also discussed. Copyright © 2017 Elsevier B.V. All rights reserved.
Analysis of Exhaled Breath Volatile Organic Compounds in Inflammatory Bowel Disease: A Pilot Study.
Hicks, Lucy C; Huang, Juzheng; Kumar, Sacheen; Powles, Sam T; Orchard, Timothy R; Hanna, George B; Williams, Horace R T
2015-09-01
Distinguishing between the inflammatory bowel diseases [IBD], Crohn's disease [CD] and ulcerative colitis [UC], is important for determining management and prognosis. Selected ion flow tube mass spectrometry [SIFT-MS] may be used to analyse volatile organic compounds [VOCs] in exhaled breath: these may be altered in disease states, and distinguishing breath VOC profiles can be identified. The aim of this pilot study was to identify, quantify, and analyse VOCs present in the breath of IBD patients and controls, potentially providing insights into disease pathogenesis and complementing current diagnostic algorithms. SIFT-MS breath profiling of 56 individuals [20 UC, 18 CD, and 18 healthy controls] was undertaken. Multivariate analysis included principal components analysis and partial least squares discriminant analysis with orthogonal signal correction [OSC-PLS-DA]. Receiver operating characteristic [ROC] analysis was performed for each comparative analysis using statistically significant VOCs. OSC-PLS-DA modelling was able to distinguish both CD and UC from healthy controls and from one other with good sensitivity and specificity. ROC analysis using combinations of statistically significant VOCs [dimethyl sulphide, hydrogen sulphide, hydrogen cyanide, ammonia, butanal, and nonanal] gave integrated areas under the curve of 0.86 [CD vs healthy controls], 0.74 [UC vs healthy controls], and 0.83 [CD vs UC]. Exhaled breath VOC profiling was able to distinguish IBD patients from controls, as well as to separate UC from CD, using both multivariate and univariate statistical techniques. Copyright © 2015 European Crohn’s and Colitis Organisation (ECCO). Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Multivariate missing data in hydrology - Review and applications
NASA Astrophysics Data System (ADS)
Ben Aissia, Mohamed-Aymen; Chebana, Fateh; Ouarda, Taha B. M. J.
2017-12-01
Water resources planning and management require complete data sets of a number of hydrological variables, such as flood peaks and volumes. However, hydrologists are often faced with the problem of missing data (MD) in hydrological databases. Several methods are used to deal with the imputation of MD. During the last decade, multivariate approaches have gained popularity in the field of hydrology, especially in hydrological frequency analysis (HFA). However, treating the MD remains neglected in the multivariate HFA literature whereas the focus has been mainly on the modeling component. For a complete analysis and in order to optimize the use of data, MD should also be treated in the multivariate setting prior to modeling and inference. Imputation of MD in the multivariate hydrological framework can have direct implications on the quality of the estimation. Indeed, the dependence between the series represents important additional information that can be included in the imputation process. The objective of the present paper is to highlight the importance of treating MD in multivariate hydrological frequency analysis by reviewing and applying multivariate imputation methods and by comparing univariate and multivariate imputation methods. An application is carried out for multiple flood attributes on three sites in order to evaluate the performance of the different methods based on the leave-one-out procedure. The results indicate that, the performance of imputation methods can be improved by adopting the multivariate setting, compared to mean substitution and interpolation methods, especially when using the copula-based approach.
1993-06-18
the exception. In the Standardized Aquatic Microcosm and the Mixed Flask Culture (MFC) microcosms, multivariate analysis and clustering methods...rule rather than the exception. In the Standardized Aquatic Microcosm and the Mixed Flask Culture (MFC) microcosms, multivariate analysis and...experiments using two microcosm protocols. We use nonmetric clustering, a multivariate pattern recognition technique developed by Matthews and Heame (1991
NASA Astrophysics Data System (ADS)
Song, Sutao; Huang, Yuxia; Long, Zhiying; Zhang, Jiacai; Chen, Gongxiang; Wang, Shuqing
2016-03-01
Recently, several studies have successfully applied multivariate pattern analysis methods to predict the categories of emotions. These studies are mainly focused on self-experienced emotions, such as the emotional states elicited by music or movie. In fact, most of our social interactions involve perception of emotional information from the expressions of other people, and it is an important basic skill for humans to recognize the emotional facial expressions of other people in a short time. In this study, we aimed to determine the discriminability of perceived emotional facial expressions. In a rapid event-related fMRI design, subjects were instructed to classify four categories of facial expressions (happy, disgust, angry and neutral) by pressing different buttons, and each facial expression stimulus lasted for 2s. All participants performed 5 fMRI runs. One multivariate pattern analysis method, support vector machine was trained to predict the categories of facial expressions. For feature selection, ninety masks defined from anatomical automatic labeling (AAL) atlas were firstly generated and each were treated as the input of the classifier; then, the most stable AAL areas were selected according to prediction accuracies, and comprised the final feature sets. Results showed that: for the 6 pair-wise classification conditions, the accuracy, sensitivity and specificity were all above chance prediction, among which, happy vs. neutral , angry vs. disgust achieved the lowest results. These results suggested that specific neural signatures of perceived emotional facial expressions may exist, and happy vs. neutral, angry vs. disgust might be more similar in information representation in the brain.
Fatty acid methyl ester analysis to identify sources of soil in surface water.
Banowetz, Gary M; Whittaker, Gerald W; Dierksen, Karen P; Azevedo, Mark D; Kennedy, Ann C; Griffith, Stephen M; Steiner, Jeffrey J
2006-01-01
Efforts to improve land-use practices to prevent contamination of surface waters with soil are limited by an inability to identify the primary sources of soil present in these waters. We evaluated the utility of fatty acid methyl ester (FAME) profiles of dry reference soils for multivariate statistical classification of soils collected from surface waters adjacent to agricultural production fields and a wooded riparian zone. Trials that compared approaches to concentrate soil from surface water showed that aluminum sulfate precipitation provided comparable yields to that obtained by vacuum filtration and was more suitable for handling large numbers of samples. Fatty acid methyl ester profiles were developed from reference soils collected from contrasting land uses in different seasons to determine whether specific fatty acids would consistently serve as variables in multivariate statistical analyses to permit reliable classification of soils. We used a Bayesian method and an independent iterative process to select appropriate fatty acids and found that variable selection was strongly impacted by the season during which soil was collected. The apparent seasonal variation in the occurrence of marker fatty acids in FAME profiles from reference soils prevented preparation of a standardized set of variables. Nevertheless, accurate classification of soil in surface water was achieved utilizing fatty acid variables identified in seasonally matched reference soils. Correlation analysis of entire chromatograms and subsequent discriminant analyses utilizing a restricted number of fatty acid variables showed that FAME profiles of soils exposed to the aquatic environment still had utility for classification at least 1 wk after submersion.
He, Alex Jingwei
2017-08-25
Struggling to correct the public-private imbalance in its health care system, the Hong Kong SAR Government seeks to introduce a government-regulated voluntary health insurance scheme, or VHIS, a distinctive financing instrument that combines the characteristics of private insurance with strong government regulation. This study examines citizens' responses to the new scheme and their willingness to subscribe. First-hand data were collected from a telephone survey that randomly sampled 1793 Hong Kong adults from September 2014 to February 2015. Univariate and multivariate methods were employed in data analysis. More than one third of the respondents explicitly stated intention of subscribing to the VHIS, a fairly high figure considering the scheme's voluntary nature. Multivariate analysis revealed moderate evidence of adverse selection, defined as individuals' opportunistic behaviors when making insurance purchasing decision based on their own assessment of risks or likelihood of making a claim. The excellent performance of Hong Kong's public medical system has had two parallel impacts. On the one hand, high-risk residents, particularly the uninsured, do not face a pressing need to switch out of the overloaded public system despite its inadequacies; this, in turn, may reduce the impact of adverse selection that may lead to detrimental effects to the insurance market. On the other hand, high satisfaction reinforces the interests of those who have both the need for better services and the ability to pay for supplementary insurance. Furthermore, the high-risk population demonstrates a moderate interest in the insurance despite the availability of government subsidies. This may offset the intended effect of the reform to some extent.
Chino, Junzo; Schroeck, Florian R; Sun, Leon; Lee, W Robert; Albala, David M; Moul, Judd W; Koontz, Bridget F
2009-11-01
To compare open radical prostatectomy (RP) and robot-assisted laparoscopic prostatectomy (RALP), and to determine whether RALP is associated with a higher risk of features that determine recommendations for postoperative radiation therapy (RT). Patients undergoing RP from 2003 to 2007 were stratified into two groups: open RP and RALP. Preoperative (PSA level, T stage and Gleason score), pathological factors (T stage, Gleason score, extracapsular extension [ECE] and the status of surgical margins and seminal vesicle invasion [SVI]) and early treatment with RT or referral for RT within 6 months were compared between the groups. Multivariate analysis was used to control for selection bias in the RALP group. In all, 904 patients were identified; 368 underwent RALP and 536 underwent open RP (retropubic or perineal). Patients undergoing open RP had a higher pathological stage with ECE present in 24.8% vs 19.3% in RALP (P = 0.05) and SVI in 10.3% vs 3.8% (P < 0.001). In the RALP vs open RP group, there were positive surgical margins in 31.5% vs 31.9% (P = 0.9) and there were postoperative PSA levels of (3) 0.2 ng/mL in 5.7% vs 6.3% (P = 0.7), respectively. On multivariate analysis to control for selection bias, RALP was not associated with indication for RT (odds ratio (OR) 1.10, P = 0.55), or referral for RT (OR 1.04, P = 0.86). RALP was not associated with an increase in either indication or referral for early postoperative RT.
Development of a prediction model for residual disease in newly diagnosed advanced ovarian cancer.
Janco, Jo Marie Tran; Glaser, Gretchen; Kim, Bohyun; McGree, Michaela E; Weaver, Amy L; Cliby, William A; Dowdy, Sean C; Bakkum-Gamez, Jamie N
2015-07-01
To construct a tool, using computed tomography (CT) imaging and preoperative clinical variables, to estimate successful primary cytoreduction for advanced epithelial ovarian cancer (EOC). Women who underwent primary cytoreductive surgery for stage IIIC/IV EOC at Mayo Clinic between 1/2/2003 and 12/30/2011 and had preoperative CT images of the abdomen and pelvis within 90days prior to their surgery available for review were included. CT images were reviewed for large-volume ascites, diffuse peritoneal thickening (DPT), omental cake, lymphadenopathy (LP), and spleen or liver involvement. Preoperative factors included age, body mass index (BMI), Eastern Cooperative Oncology Group performance status (ECOG PS), American Society of Anesthesiologists (ASA) score, albumin, CA-125, and thrombocytosis. Two prediction models were developed to estimate the probability of (i) complete and (ii) suboptimal cytoreduction (residual disease (RD) >1cm) using multivariable logistic analysis with backward and stepwise variable selection methods. Internal validation was assessed using bootstrap resampling to derive an optimism-corrected estimate of the c-index. 279 patients met inclusion criteria: 143 had complete cytoreduction, 26 had suboptimal cytoreduction (RD>1cm), and 110 had measurable RD ≤1cm. On multivariable analysis, age, absence of ascites, omental cake, and DPT on CT imaging independently predicted complete cytoreduction (c-index=0.748). Conversely, predictors of suboptimal cytoreduction were ECOG PS, DPT, and LP on preoperative CT imaging (c-index=0.685). The generated models serve as preoperative evaluation tools that may improve counseling and selection for primary surgery, but need to be externally validated. Copyright © 2015 Elsevier Inc. All rights reserved.
Murphy, Terrence E; Gaughan, Monica; Hume, Robert; Moore, S Gordon
2010-03-01
There are many approaches to solving the problem of underrepresentation of some racial and ethnic groups and women in scientific and technical disciplines. Here, the authors evaluate the association of a summer bridge program with the graduation rate of underrepresented minority (URM) students at a selective technical university. They demonstrate that this 5-week program prior to the fall of the 1st year contains elements reported as vital for successful student retention. Using multivariable survival analysis, they show that for URM students entering as fall-semester freshmen, relative to their nonparticipating peers, participation in this accelerated summer bridge program is associated with higher likelihood of graduation. The longitudinal panel data include more than 2,200 URM students.
Kobayashi, Yoshikazu; Habara, Masaaki; Ikezazki, Hidekazu; Chen, Ronggang; Naito, Yoshinobu; Toko, Kiyoshi
2010-01-01
Effective R&D and strict quality control of a broad range of foods, beverages, and pharmaceutical products require objective taste evaluation. Advanced taste sensors using artificial-lipid membranes have been developed based on concepts of global selectivity and high correlation with human sensory score. These sensors respond similarly to similar basic tastes, which they quantify with high correlations to sensory score. Using these unique properties, these sensors can quantify the basic tastes of saltiness, sourness, bitterness, umami, astringency and richness without multivariate analysis or artificial neural networks. This review describes all aspects of these taste sensors based on artificial lipid, ranging from the response principle and optimal design methods to applications in the food, beverage, and pharmaceutical markets. PMID:22319306
Murphy, Terrence E.; Gaughan, Monica; Hume, Robert; Moore, S. Gordon
2012-01-01
There are many approaches to solving the problem of underrepresentation of some racial and ethnic groups and women in scientific and technical disciplines. Here, the authors evaluate the association of a summer bridge program with the graduation rate of underrepresented minority (URM) students at a selective technical university. They demonstrate that this 5-week program prior to the fall of the 1st year contains elements reported as vital for successful student retention. Using multivariable survival analysis, they show that for URM students entering as fall-semester freshmen, relative to their nonparticipating peers, participation in this accelerated summer bridge program is associated with higher likelihood of graduation. The longitudinal panel data include more than 2,200 URM students. PMID:23136456
Multivariate analysis for scanning tunneling spectroscopy data
NASA Astrophysics Data System (ADS)
Yamanishi, Junsuke; Iwase, Shigeru; Ishida, Nobuyuki; Fujita, Daisuke
2018-01-01
We applied principal component analysis (PCA) to two-dimensional tunneling spectroscopy (2DTS) data obtained on a Si(111)-(7 × 7) surface to explore the effectiveness of multivariate analysis for interpreting 2DTS data. We demonstrated that several components that originated mainly from specific atoms at the Si(111)-(7 × 7) surface can be extracted by PCA. Furthermore, we showed that hidden components in the tunneling spectra can be decomposed (peak separation), which is difficult to achieve with normal 2DTS analysis without the support of theoretical calculations. Our analysis showed that multivariate analysis can be an additional powerful way to analyze 2DTS data and extract hidden information from a large amount of spectroscopic data.
A modal analysis of flexible aircraft dynamics with handling qualities implications
NASA Technical Reports Server (NTRS)
Schmidt, D. K.
1983-01-01
A multivariable modal analysis technique is presented for evaluating flexible aircraft dynamics, focusing on meaningful vehicle responses to pilot inputs and atmospheric turbulence. Although modal analysis is the tool, vehicle time response is emphasized, and the analysis is performed on the linear, time-domain vehicle model. In evaluating previously obtained experimental pitch tracking data for a family of vehicle dynamic models, it is shown that flexible aeroelastic effects can significantly affect pitch attitude handling qualities. Consideration of the eigenvalues alone, of both rigid-body and aeroelastic modes, does not explain the simulation results. Modal analysis revealed, however, that although the lowest aeroelastic mode frequency was still three times greater than the short-period frequency, the rigid-body attitude response was dominated by this aeroelastic mode. This dominance was defined in terms of the relative magnitudes of the modal residues in selected vehicle responses.
Jung, Jiwon; Moon, Song Mi; Jang, Hee-Chang; Kang, Cheol-In; Jun, Jae-Bum; Cho, Yong Kyun; Kang, Seung-Ji; Seo, Bo-Jeong; Kim, Young-Joo; Park, Seong-Beom; Lee, Juneyoung; Yu, Chang Sik; Kim, Sung-Han
2018-01-01
The aim of this study was to investigate the incidence and risk factors of postoperative pneumonia (POP) within 1 year after cancer surgery in patients with the five most common cancers (gastric, colorectal, lung, breast cancer, and hepatocellular carcinoma [HCC]) in South Korea. This was a multicenter and retrospective cohort study performed at five nationwide cancer centers. The number of cancer patients in each center was allocated by the proportion of cancer surgery. Adult patients were randomly selected according to the allocated number, among those who underwent cancer surgery from January to December 2014 within 6 months after diagnosis of cancer. One-year cumulative incidence of POP was estimated using Kaplan-Meier analysis. An univariable Cox's proportional hazard regression analysis was performed to identify risk factors for POP development. As a multivariable analysis, confounders were adjusted using multiple Cox's PH regression model. Among the total 2000 patients, the numbers of patients with gastric cancer, colorectal cancer, lung cancer, breast cancer, and HCC were 497 (25%), 525 (26%), 277 (14%), 552 (28%), and 149 (7%), respectively. Overall, the 1-year cumulative incidence of POP was 2.0% (95% CI, 1.4-2.6). The 1-year cumulative incidences in each cancer were as follows: lung 8.0%, gastric 1.8%, colorectal 1.0%, HCC 0.7%, and breast 0.4%. In multivariable analysis, older age, higher Charlson comorbidity index (CCI) score, ulcer disease, history of pneumonia, and smoking were related with POP development. In conclusions, the 1-year cumulative incidence of POP in the five most common cancers was 2%. Older age, higher CCI scores, smoker, ulcer disease, and previous pneumonia history increased the risk of POP development in cancer patients. © 2017 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Zhang, Yongyong; Zhou, Yujian; Shao, Quanxi; Liu, Hongbin; Lei, Qiuliang; Zhai, Xiaoyan; Wang, Xuelei
2016-12-01
Diffuse nutrient loss mechanism is complicated and shows remarkably regional differences due to spatial heterogeneities of underlying surface conditions, climate and agricultural practices. Moreover, current available observations are still hard to support the identification of impact factors due to different time or space steps. In this study, an integrated water system model (HEQM) was adopted to obtain the simulated loads of diffuse components (carriers: runoff and sediment; nutrient: total nitrogen (TN) and total phosphorous (TP)) with synchronous scales. Multivariable statistical analysis approaches (Analysis of Similarity and redundancy analysis) were used to assess the regional differences, and to identify impact factors as well as their contributions. Four catchments were selected as our study areas, i.e., Xiahui and Zhangjiafen Catchments of Miyun Basin in North China, Yuliang and Tunxi Catchments of Xin'anjiang Basin in South China. Results showed that the model performances of monthly processes were very good for runoff and good for sediment, TN and TP. The annual average coefficients of all the diffuse components in Xin'anjiang Basin were much greater than those in Miyun Basin, and showed significantly regional differences. All the selected impact factors interpreted 72.87-82.16% of the regional differences of carriers, and 62.72-71.62% of those of nutrient coefficients, respectively. For individual impact factor categories, the critical category was geography, followed by land-use/cover, carriers, climate, as well as soil and agricultural practices in Miyun Basin, or agricultural practices and soil in Xin'anjiang Basin. For individual factors, the critical factors were locations for the carrier regional differences, and carriers or chemical fertilizer for the nutrient regional differences. This study is expected to promote further applications of integrated water system model and multivariable statistical analysis in the diffuse nutrient studies, and provide a scientific support for the diffuse pollution control and management in China.
Multivariate Analysis of Schools and Educational Policy.
ERIC Educational Resources Information Center
Kiesling, Herbert J.
This report describes a multivariate analysis technique that approaches the problems of educational production function analysis by (1) using comparable measures of output across large experiments, (2) accounting systematically for differences in socioeconomic background, and (3) treating the school as a complete system in which different…
Brodsky, Burton S; Owens, Gary M; Scotti, Dennis J; Needham, Keith A; Cool, Christina L
2017-10-01
Ovarian cancer is the eighth most common cancer among women, but ranks fifth in cancer-related causes of death, the majority of which are detected in late stages, after the cancer has metastasized. The CA125 test is the standard of care for assessing suspicious pelvic masses. However, the primary use of CA125 is to monitor treatment progress rather than to screen for disease, and its sensitivity is exceedingly low, unlike the multivariate assay OVA1. A cost-effective treatment of ovarian cancer requires early and accurate diagnosis of pelvic masses and reduced referrals of patients with benign tumors to a gynecologic oncologist. To analyze the economic impact of increased utilization of a multivariate assay, such as OVA1, to guide the treatment of ovarian cancer. The study population was drawn from Medicare and commercial health plan claims data. A budget impact model was constructed to estimate the economic consequences of substituting the multivariate assay OVA1 to replace the single biomarker assay CA125 to assess the likelihood of pelvic mass malignancy in premenopausal and/or postmenopausal women. All patients selected for the analysis had CA125 testing before surgical intervention. A total of 92,843 health plan members were included for analysis, comprising 48,113 commercially insured members and 44,730 Medicare beneficiaries. Estimates of future health plan expenditures, which were calculated from base-case assumptions, projected overall savings of $0.05 per-member per-month (PMPM) for commercially insured members and $0.01 PMPM for Medicare beneficiaries as a result of increased utilization of OVA1. Sensitivity analysis revealed potential savings of up to $0.17 PMPM for commercially insured patients and up to $0.05 for Medicare beneficiaries. The results of the budget impact model support the use of OVA1 instead of CA125 by indicating that modest cost-savings can be achieved, while reaping the clinical benefits of improved diagnostic accuracy, early disease detection, and reductions in multiple, and possibly unnecessary, referrals to gynecologic oncologists.
Ballabio, Davide; Consonni, Viviana; Mauri, Andrea; Todeschini, Roberto
2010-01-11
In multivariate regression and classification issues variable selection is an important procedure used to select an optimal subset of variables with the aim of producing more parsimonious and eventually more predictive models. Variable selection is often necessary when dealing with methodologies that produce thousands of variables, such as Quantitative Structure-Activity Relationships (QSARs) and highly dimensional analytical procedures. In this paper a novel method for variable selection for classification purposes is introduced. This method exploits the recently proposed Canonical Measure of Correlation between two sets of variables (CMC index). The CMC index is in this case calculated for two specific sets of variables, the former being comprised of the independent variables and the latter of the unfolded class matrix. The CMC values, calculated by considering one variable at a time, can be sorted and a ranking of the variables on the basis of their class discrimination capabilities results. Alternatively, CMC index can be calculated for all the possible combinations of variables and the variable subset with the maximal CMC can be selected, but this procedure is computationally more demanding and classification performance of the selected subset is not always the best one. The effectiveness of the CMC index in selecting variables with discriminative ability was compared with that of other well-known strategies for variable selection, such as the Wilks' Lambda, the VIP index based on the Partial Least Squares-Discriminant Analysis, and the selection provided by classification trees. A variable Forward Selection based on the CMC index was finally used in conjunction of Linear Discriminant Analysis. This approach was tested on several chemical data sets. Obtained results were encouraging.
NASA Technical Reports Server (NTRS)
Wolf, S. F.; Lipschutz, M. E.
1993-01-01
Multivariate statistical analysis techniques (linear discriminant analysis and logistic regression) can provide powerful discrimination tools which are generally unfamiliar to the planetary science community. Fall parameters were used to identify a group of 17 H chondrites (Cluster 1) that were part of a coorbital stream which intersected Earth's orbit in May, from 1855 - 1895, and can be distinguished from all other H chondrite falls. Using multivariate statistical techniques, it was demonstrated that a totally different criterion, labile trace element contents - hence thermal histories - or 13 Cluster 1 meteorites are distinguishable from those of 45 non-Cluster 1 H chondrites. Here, we focus upon the principles of multivariate statistical techniques and illustrate their application using non-meteoritic and meteoritic examples.
Haller, Sven; Lovblad, Karl-Olof; Giannakopoulos, Panteleimon; Van De Ville, Dimitri
2014-05-01
Many diseases are associated with systematic modifications in brain morphometry and function. These alterations may be subtle, in particular at early stages of the disease progress, and thus not evident by visual inspection alone. Group-level statistical comparisons have dominated neuroimaging studies for many years, proving fascinating insight into brain regions involved in various diseases. However, such group-level results do not warrant diagnostic value for individual patients. Recently, pattern recognition approaches have led to a fundamental shift in paradigm, bringing multivariate analysis and predictive results, notably for the early diagnosis of individual patients. We review the state-of-the-art fundamentals of pattern recognition including feature selection, cross-validation and classification techniques, as well as limitations including inter-individual variation in normal brain anatomy and neurocognitive reserve. We conclude with the discussion of future trends including multi-modal pattern recognition, multi-center approaches with data-sharing and cloud-computing.
Moya, Claudio E; Raiber, Matthias; Taulis, Mauricio; Cox, Malcolm E
2015-03-01
The Galilee and Eromanga basins are sub-basins of the Great Artesian Basin (GAB). In this study, a multivariate statistical approach (hierarchical cluster analysis, principal component analysis and factor analysis) is carried out to identify hydrochemical patterns and assess the processes that control hydrochemical evolution within key aquifers of the GAB in these basins. The results of the hydrochemical assessment are integrated into a 3D geological model (previously developed) to support the analysis of spatial patterns of hydrochemistry, and to identify the hydrochemical and hydrological processes that control hydrochemical variability. In this area of the GAB, the hydrochemical evolution of groundwater is dominated by evapotranspiration near the recharge area resulting in a dominance of the Na-Cl water types. This is shown conceptually using two selected cross-sections which represent discrete groundwater flow paths from the recharge areas to the deeper parts of the basins. With increasing distance from the recharge area, a shift towards a dominance of carbonate (e.g. Na-HCO3 water type) has been observed. The assessment of hydrochemical changes along groundwater flow paths highlights how aquifers are separated in some areas, and how mixing between groundwater from different aquifers occurs elsewhere controlled by geological structures, including between GAB aquifers and coal bearing strata of the Galilee Basin. The results of this study suggest that distinct hydrochemical differences can be observed within the previously defined Early Cretaceous-Jurassic aquifer sequence of the GAB. A revision of the two previously recognised hydrochemical sequences is being proposed, resulting in three hydrochemical sequences based on systematic differences in hydrochemistry, salinity and dominant hydrochemical processes. The integrated approach presented in this study which combines different complementary multivariate statistical techniques with a detailed assessment of the geological framework of these sedimentary basins, can be adopted in other complex multi-aquifer systems to assess hydrochemical evolution and its geological controls. Copyright © 2014 Elsevier B.V. All rights reserved.
Santalla, M; De Ron, A M; De La Fuente, M
2010-05-01
Southwestern Europe has been considered as a secondary centre of genetic diversity for the common bean. The dispersal of domesticated materials from their centres of origin provides an experimental system that reveals how human selection during cultivation and adaptation to novel environments affects the genetic composition. In this paper, our goal was to elucidate how distinct events could modify the structure and level of genetic diversity in the common bean. The genome-wide genetic composition was analysed at 42 microsatellite loci in individuals of 22 landraces of domesticated common bean from the Mesoamerican gene pool. The accessions were also characterised for phaseolin seed protein and for nine allozyme polymorphisms and phenotypic traits. One of this study's important findings was the complementary information obtained from all the polymorphisms examined. Most of the markers found to be potentially under the influence of selection were located in the proximity of previously mapped genes and quantitative trait loci (QTLs) related to important agronomic traits, which indicates that population genomics approaches are very efficient in detecting QTLs. As it was revealed by outlier simple sequence repeats, loci analysis with STRUCTURE software and multivariate analysis of phenotypic data, the landraces were grouped into three clusters according to seed size and shape, vegetative growth habit and genetic resistance. A total of 151 alleles were detected with an average of 4 alleles per locus and an average polymorphism information content of 0.31. Using a model-based approach, on the basis of neutral markers implemented in the software STRUCTURE, three clusters were inferred, which were in good agreement with multivariate analysis. Geographic and genetic distances were congruent with the exception of a few putative hybrids identified in this study, suggesting a predominant effect of isolation by distance. Genomic scans using both markers linked to genes affected by selection (outlier) and neutral markers showed advantages relative to other approaches, since they help to create a more complete picture of how adaptation to environmental conditions has sculpted the common bean genomes in southern Europe. The use of outlier loci also gives a clue about what selective forces gave rise to the actual phenotypes of the analysed landraces.
Regression Model Optimization for the Analysis of Experimental Data
NASA Technical Reports Server (NTRS)
Ulbrich, N.
2009-01-01
A candidate math model search algorithm was developed at Ames Research Center that determines a recommended math model for the multivariate regression analysis of experimental data. The search algorithm is applicable to classical regression analysis problems as well as wind tunnel strain gage balance calibration analysis applications. The algorithm compares the predictive capability of different regression models using the standard deviation of the PRESS residuals of the responses as a search metric. This search metric is minimized during the search. Singular value decomposition is used during the search to reject math models that lead to a singular solution of the regression analysis problem. Two threshold dependent constraints are also applied. The first constraint rejects math models with insignificant terms. The second constraint rejects math models with near-linear dependencies between terms. The math term hierarchy rule may also be applied as an optional constraint during or after the candidate math model search. The final term selection of the recommended math model depends on the regressor and response values of the data set, the user s function class combination choice, the user s constraint selections, and the result of the search metric minimization. A frequently used regression analysis example from the literature is used to illustrate the application of the search algorithm to experimental data.
NASA Astrophysics Data System (ADS)
Kiss, I.; Cioată, V. G.; Alexa, V.; Raţiu, S. A.
2017-05-01
The braking system is one of the most important and complex subsystems of railway vehicles, especially when it comes for safety. Therefore, installing efficient safe brakes on the modern railway vehicles is essential. Nowadays is devoted attention to solving problems connected with using high performance brake materials and its impact on thermal and mechanical loading of railway wheels. The main factor that influences the selection of a friction material for railway applications is the performance criterion, due to the interaction between the brake block and the wheel produce complex thermos-mechanical phenomena. In this work, the investigated subjects are the cast-iron brake shoes, which are still widely used on freight wagons. Therefore, the cast-iron brake shoes - with lamellar graphite and with a high content of phosphorus (0.8-1.1%) - need a special investigation. In order to establish the optimal condition for the cast-iron brake shoes we proposed a mathematical modelling study by using the statistical analysis and multiple regression equations. Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. Multivariate visualization comes to the fore when researchers have difficulties in comprehending many dimensions at one time. Technological data (hardness and chemical composition) obtained from cast-iron brake shoes were used for this purpose. In order to settle the multiple correlation between the hardness of the cast-iron brake shoes, and the chemical compositions elements several model of regression equation types has been proposed. Because a three-dimensional surface with variables on three axes is a common way to illustrate multivariate data, in which the maximum and minimum values are easily highlighted, we plotted graphical representation of the regression equations in order to explain interaction of the variables and locate the optimal level of each variable for maximal response. For the calculation of the regression coefficients, dispersion and correlation coefficients, the software Matlab was used.
Multivariate Analysis of Genotype-Phenotype Association.
Mitteroecker, Philipp; Cheverud, James M; Pavlicev, Mihaela
2016-04-01
With the advent of modern imaging and measurement technology, complex phenotypes are increasingly represented by large numbers of measurements, which may not bear biological meaning one by one. For such multivariate phenotypes, studying the pairwise associations between all measurements and all alleles is highly inefficient and prevents insight into the genetic pattern underlying the observed phenotypes. We present a new method for identifying patterns of allelic variation (genetic latent variables) that are maximally associated-in terms of effect size-with patterns of phenotypic variation (phenotypic latent variables). This multivariate genotype-phenotype mapping (MGP) separates phenotypic features under strong genetic control from less genetically determined features and thus permits an analysis of the multivariate structure of genotype-phenotype association, including its dimensionality and the clustering of genetic and phenotypic variables within this association. Different variants of MGP maximize different measures of genotype-phenotype association: genetic effect, genetic variance, or heritability. In an application to a mouse sample, scored for 353 SNPs and 11 phenotypic traits, the first dimension of genetic and phenotypic latent variables accounted for >70% of genetic variation present in all 11 measurements; 43% of variation in this phenotypic pattern was explained by the corresponding genetic latent variable. The first three dimensions together sufficed to account for almost 90% of genetic variation in the measurements and for all the interpretable genotype-phenotype association. Each dimension can be tested as a whole against the hypothesis of no association, thereby reducing the number of statistical tests from 7766 to 3-the maximal number of meaningful independent tests. Important alleles can be selected based on their effect size (additive or nonadditive effect on the phenotypic latent variable). This low dimensionality of the genotype-phenotype map has important consequences for gene identification and may shed light on the evolvability of organisms. Copyright © 2016 by the Genetics Society of America.
Ferreira, Ana P; Tobyn, Mike
2015-01-01
In the pharmaceutical industry, chemometrics is rapidly establishing itself as a tool that can be used at every step of product development and beyond: from early development to commercialization. This set of multivariate analysis methods allows the extraction of information contained in large, complex data sets thus contributing to increase product and process understanding which is at the core of the Food and Drug Administration's Process Analytical Tools (PAT) Guidance for Industry and the International Conference on Harmonisation's Pharmaceutical Development guideline (Q8). This review is aimed at providing pharmaceutical industry professionals an introduction to multivariate analysis and how it is being adopted and implemented by companies in the transition from "quality-by-testing" to "quality-by-design". It starts with an introduction to multivariate analysis and the two methods most commonly used: principal component analysis and partial least squares regression, their advantages, common pitfalls and requirements for their effective use. That is followed with an overview of the diverse areas of application of multivariate analysis in the pharmaceutical industry: from the development of real-time analytical methods to definition of the design space and control strategy, from formulation optimization during development to the application of quality-by-design principles to improve manufacture of existing commercial products.
Uppugunduri, Chakradhara Rao S.; Storelli, Flavia; Mlakar, Vid; Huezo-Diaz Curtis, Patricia; Rezgui, Aziz; Théorêt, Yves; Marino, Denis; Doffey-Lazeyras, Fabienne; Chalandon, Yves; Bader, Peter; Daali, Youssef; Bittencourt, Henrique; Krajinovic, Maja; Ansari, Marc
2017-01-01
Hemorrhagic cystitis (HC) is one of the complications of busulfan-cyclophosphamide (BU-CY) conditioning regimen during allogeneic hematopoietic stem cell transplantation (HSCT) in children. Identifying children at high risk of developing HC in a HSCT setting could facilitate the evaluation and implementation of effective prophylactic measures. In this retrospective analysis genotyping of selected candidate gene variants was performed in 72 children and plasma Sulfolane (Su, water soluble metabolite of BU) levels were measured in 39 children following treatment with BU-CY regimen. The cytotoxic effects of Su and acrolein (Ac, water soluble metabolite of CY) were tested on human urothelial cells (HUCs). The effect of Su was also tested on cytochrome P 450 (CYP) function in HepaRG hepatic cells. Cumulative incidences of HC before day 30 post HSCT were estimated using Kaplan–Meier curves and log-rank test was used to compare the difference between groups in a univariate analysis. Multivariate Cox regression was used to estimate hazard ratios with 95% confidence intervals (CIs). Multivariate analysis included co-variables that were significantly associated with HC in a univariate analysis. Cumulative incidence of HC was 15.3%. In the univariate analysis, HC incidence was significantly (p < 0.05) higher in children older than 10 years (28.6 vs. 6.8%) or in children with higher Su levels (>40 vs. <11%) or in carriers of both functional GSTM1 and CYP2C9 (33.3 vs. 6.3%) compared to the other group. In a multivariate analysis, combined GSTM1 and CYP2C9 genotype status was associated with HC occurrence with a hazards ratio of 4.8 (95% CI: 1.3–18.4; p = 0.02). Ac was found to be toxic to HUC cells at lower concentrations (33 μM), Su was not toxic to HUC cells at concentrations below 1 mM and did not affect CYP function in HepaRG cells. Our observations suggest that pre-emptive genotyping of CYP2C9 and GSTM1 may aid in selection of more effective prophylaxis to reduce HC development in pediatric patients undergoing allogeneic HSCT. Article summary: (1) Children carrying functional alleles in GSTM1 and CYP2C9 are at high risk for developing hemorrhagic cystitis following treatment with busulfan and cyclophosphamide based conditioning regimen. (2) Identification of children at high risk for developing hemorrhagic cystitis in an allogeneic HSCT setting will enable us to evaluate and implement optimal strategies for its prevention. Trial registration: This study is a part of the trail “clinicaltrials.gov identifier: NCT01257854.” PMID:28744217
Uppugunduri, Chakradhara Rao S; Storelli, Flavia; Mlakar, Vid; Huezo-Diaz Curtis, Patricia; Rezgui, Aziz; Théorêt, Yves; Marino, Denis; Doffey-Lazeyras, Fabienne; Chalandon, Yves; Bader, Peter; Daali, Youssef; Bittencourt, Henrique; Krajinovic, Maja; Ansari, Marc
2017-01-01
Hemorrhagic cystitis (HC) is one of the complications of busulfan-cyclophosphamide (BU-CY) conditioning regimen during allogeneic hematopoietic stem cell transplantation (HSCT) in children. Identifying children at high risk of developing HC in a HSCT setting could facilitate the evaluation and implementation of effective prophylactic measures. In this retrospective analysis genotyping of selected candidate gene variants was performed in 72 children and plasma Sulfolane (Su, water soluble metabolite of BU) levels were measured in 39 children following treatment with BU-CY regimen. The cytotoxic effects of Su and acrolein (Ac, water soluble metabolite of CY) were tested on human urothelial cells (HUCs). The effect of Su was also tested on cytochrome P 450 (CYP) function in HepaRG hepatic cells. Cumulative incidences of HC before day 30 post HSCT were estimated using Kaplan-Meier curves and log-rank test was used to compare the difference between groups in a univariate analysis. Multivariate Cox regression was used to estimate hazard ratios with 95% confidence intervals (CIs). Multivariate analysis included co-variables that were significantly associated with HC in a univariate analysis. Cumulative incidence of HC was 15.3%. In the univariate analysis, HC incidence was significantly ( p < 0.05) higher in children older than 10 years (28.6 vs. 6.8%) or in children with higher Su levels (>40 vs. <11%) or in carriers of both functional GSTM1 and CYP2C9 (33.3 vs. 6.3%) compared to the other group. In a multivariate analysis, combined GSTM1 and CYP2C9 genotype status was associated with HC occurrence with a hazards ratio of 4.8 (95% CI: 1.3-18.4; p = 0.02). Ac was found to be toxic to HUC cells at lower concentrations (33 μM), Su was not toxic to HUC cells at concentrations below 1 mM and did not affect CYP function in HepaRG cells. Our observations suggest that pre-emptive genotyping of CYP2C9 and GSTM1 may aid in selection of more effective prophylaxis to reduce HC development in pediatric patients undergoing allogeneic HSCT. Article summary : (1) Children carrying functional alleles in GSTM1 and CYP2C9 are at high risk for developing hemorrhagic cystitis following treatment with busulfan and cyclophosphamide based conditioning regimen. (2) Identification of children at high risk for developing hemorrhagic cystitis in an allogeneic HSCT setting will enable us to evaluate and implement optimal strategies for its prevention. Trial registration : This study is a part of the trail "clinicaltrials.gov identifier: NCT01257854."
Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.
Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao
2016-11-30
Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed steps of analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses were focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The following gene-gene interaction analysis, which was focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk score evidenced that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association test presented that all multi-variant associations contained one or several combinations of particular alleles or haplotypes, associated with increased obesity risk. Our study evidenced that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.
Linking brain-wide multivoxel activation patterns to behaviour: Examples from language and math.
Raizada, Rajeev D S; Tsao, Feng-Ming; Liu, Huei-Mei; Holloway, Ian D; Ansari, Daniel; Kuhl, Patricia K
2010-05-15
A key goal of cognitive neuroscience is to find simple and direct connections between brain and behaviour. However, fMRI analysis typically involves choices between many possible options, with each choice potentially biasing any brain-behaviour correlations that emerge. Standard methods of fMRI analysis assess each voxel individually, but then face the problem of selection bias when combining those voxels into a region-of-interest, or ROI. Multivariate pattern-based fMRI analysis methods use classifiers to analyse multiple voxels together, but can also introduce selection bias via data-reduction steps as feature selection of voxels, pre-selecting activated regions, or principal components analysis. We show here that strong brain-behaviour links can be revealed without any voxel selection or data reduction, using just plain linear regression as a classifier applied to the whole brain at once, i.e. treating each entire brain volume as a single multi-voxel pattern. The brain-behaviour correlations emerged despite the fact that the classifier was not provided with any information at all about subjects' behaviour, but instead was given only the neural data and its condition-labels. Surprisingly, more powerful classifiers such as a linear SVM and regularised logistic regression produce very similar results. We discuss some possible reasons why the very simple brain-wide linear regression model is able to find correlations with behaviour that are as strong as those obtained on the one hand from a specific ROI and on the other hand from more complex classifiers. In a manner which is unencumbered by arbitrary choices, our approach offers a method for investigating connections between brain and behaviour which is simple, rigorous and direct. Copyright (c) 2010 Elsevier Inc. All rights reserved.
Linking brain-wide multivoxel activation patterns to behaviour: Examples from language and math
Raizada, Rajeev D.S.; Tsao, Feng-Ming; Liu, Huei-Mei; Holloway, Ian D.; Ansari, Daniel; Kuhl, Patricia K.
2010-01-01
A key goal of cognitive neuroscience is to find simple and direct connections between brain and behaviour. However, fMRI analysis typically involves choices between many possible options, with each choice potentially biasing any brain–behaviour correlations that emerge. Standard methods of fMRI analysis assess each voxel individually, but then face the problem of selection bias when combining those voxels into a region-of-interest, or ROI. Multivariate pattern-based fMRI analysis methods use classifiers to analyse multiple voxels together, but can also introduce selection bias via data-reduction steps as feature selection of voxels, pre-selecting activated regions, or principal components analysis. We show here that strong brain–behaviour links can be revealed without any voxel selection or data reduction, using just plain linear regression as a classifier applied to the whole brain at once, i.e. treating each entire brain volume as a single multi-voxel pattern. The brain–behaviour correlations emerged despite the fact that the classifier was not provided with any information at all about subjects' behaviour, but instead was given only the neural data and its condition-labels. Surprisingly, more powerful classifiers such as a linear SVM and regularised logistic regression produce very similar results. We discuss some possible reasons why the very simple brain-wide linear regression model is able to find correlations with behaviour that are as strong as those obtained on the one hand from a specific ROI and on the other hand from more complex classifiers. In a manner which is unencumbered by arbitrary choices, our approach offers a method for investigating connections between brain and behaviour which is simple, rigorous and direct. PMID:20132896
Sexual selection on cuticular hydrocarbons of male sagebrush crickets in the wild
Steiger, Sandra; Ower, Geoffrey D.; Stökl, Johannes; Mitchell, Christopher; Hunt, John; Sakaluk, Scott K.
2013-01-01
Cuticular hydrocarbons (CHCs) play an essential role in mate recognition in insects but the form and intensity of sexual selection on CHCs has only been evaluated in a handful of studies, and never in a natural population. We quantified sexual selection operating on CHCs in a wild population of sagebrush crickets, a species in which nuptial feeding by females imposes an unambiguous phenotypic marker on males. Multivariate selection analysis revealed a saddle-shaped fitness surface, suggesting a complex interplay between the total abundance of CHCs and specific CHC combinations in their influence on female choice. The fitness surface resulting from two axes of disruptive selection reflected a trade-off between short- and long-chained CHCs, suggesting that males may be sacrificing some level of desiccation resistance in favour of increased attractiveness. There was a significant correlation between male body size and total CHC abundance, suggesting that male CHCs provide females with a reliable cue for maximizing benefits obtained from males. Notwithstanding the conspicuousness of males’ acoustic signals, our results suggest that selection imposed on males via female mating preferences may be far more complex than previously appreciated and operating in multiple sensory modalities. PMID:24197415
Sexual selection on cuticular hydrocarbons of male sagebrush crickets in the wild.
Steiger, Sandra; Ower, Geoffrey D; Stökl, Johannes; Mitchell, Christopher; Hunt, John; Sakaluk, Scott K
2013-12-22
Cuticular hydrocarbons (CHCs) play an essential role in mate recognition in insects but the form and intensity of sexual selection on CHCs has only been evaluated in a handful of studies, and never in a natural population. We quantified sexual selection operating on CHCs in a wild population of sagebrush crickets, a species in which nuptial feeding by females imposes an unambiguous phenotypic marker on males. Multivariate selection analysis revealed a saddle-shaped fitness surface, suggesting a complex interplay between the total abundance of CHCs and specific CHC combinations in their influence on female choice. The fitness surface resulting from two axes of disruptive selection reflected a trade-off between short- and long-chained CHCs, suggesting that males may be sacrificing some level of desiccation resistance in favour of increased attractiveness. There was a significant correlation between male body size and total CHC abundance, suggesting that male CHCs provide females with a reliable cue for maximizing benefits obtained from males. Notwithstanding the conspicuousness of males' acoustic signals, our results suggest that selection imposed on males via female mating preferences may be far more complex than previously appreciated and operating in multiple sensory modalities.
FT-IR-cPAS—New Photoacoustic Measurement Technique for Analysis of Hot Gases: A Case Study on VOCs
Hirschmann, Christian Bernd; Koivikko, Niina Susanna; Raittila, Jussi; Tenhunen, Jussi; Ojala, Satu; Rahkamaa-Tolonen, Katariina; Marbach, Ralf; Hirschmann, Sarah; Keiski, Riitta Liisa
2011-01-01
This article describes a new photoacoustic FT-IR system capable of operating at elevated temperatures. The key hardware component is an optical-readout cantilever microphone that can work up to 200 °C. All parts in contact with the sample gas were put into a heated oven, incl. the photoacoustic cell. The sensitivity of the built photoacoustic system was tested by measuring 18 different VOCs. At 100 ppm gas concentration, the univariate signal to noise ratios (1σ, measurement time 25.5 min, at highest peak, optical resolution 8 cm−1) of the spectra varied from minimally 19 for o-xylene up to 329 for butyl acetate. The sensitivity can be improved by multivariate analyses over broad wavelength ranges, which effectively co-adds the univariate sensitivities achievable at individual wavelengths. The multivariate limit of detection (3σ, 8.5 min, full useful wavelength range), i.e., the best possible inverse analytical sensitivity achievable at optimum calibration, was calculated using the SBC method and varied from 2.60 ppm for dichloromethane to 0.33 ppm for butyl acetate. Depending on the shape of the spectra, which often only contain a few sharp peaks, the multivariate analysis improved the analytical sensitivity by 2.2 to 9.2 times compared to the univariate case. Selectivity and multi component ability were tested by a SBC calibration including 5 VOCs and water. The average cross selectivities turned out to be less than 2% and the resulting inverse analytical sensitivities of the 5 interfering VOCs was increased by maximum factor of 2.2 compared to the single component sensitivities. Water subtraction using SBC gave the true analyte concentration with a variation coefficient of 3%, although the sample spectra (methyl ethyl ketone, 200 ppm) contained water from 1,400 to 100k ppm and for subtraction only one water spectra (10k ppm) was used. The developed device shows significant improvement to the current state-of-the-art measurement methods used in industrial VOC measurements. PMID:22163900
NASA Astrophysics Data System (ADS)
Tucić, Branka; Tomić, Vladimir; Avramov, Stevan; Pemac, Danijela
1998-12-01
A multivariate selection analysis has been used to test the adaptiveness of several Iris pumila leaf traits that display plasticity to natural light conditions. Siblings of a synthetic population comprising 31 families of two populations from contrasting light habitats were grown at an open dune site and in the understory of a Pinus nigra stand in order to score variation in phenotypic expression of six leaf traits: number of senescent leaves, number of live leaves, leaf length, leaf width, leaf angle, and specific leaf area. The ambient light conditions affected the values of all traits studied except for specific leaf area. In accordance to ecophysiological expectations for an adaptive response to light, both leaf length and width were significantly greater while the angle between sequential leaves was significantly smaller in the woodland understory than at the exposed dune site. The relationship between leaf traits and vegetative fitness (total leaf area) differed across light habitats as predicted by functional hypotheses. The standardized linear selection gradient ( β') for leaf length and width were positive in sign in both environments, but their magnitude for leaf length was higher in the shade than under full sunlight. Since plasticity of leaf length in the woodland shade has been recognized as adaptive, fitness cost of producing plastic change in leaf length was assessed. In both of the available methods used, the two-step and the multivariate regression procedures, a rather high negative association between the fitness value and the plasticity of leaf length was obtained, indicating a cost of plasticity. The selection gradient for leaf angle was weak and significant only in the woodland understory. Genetic correlations between trait expressions in contrasting light environments were negative in sign and low in magnitude, implying a significant genetic variation for plasticity in these leaf traits. Furthermore, leaf length and leaf width were found to be genetically positively coupled, which indicates that there is a potential for these two traits to evolve toward their optimal phenotypic values even faster than would be expected if they were genetically independent.
Cider fermentation process monitoring by Vis-NIR sensor system and chemometrics.
Villar, Alberto; Vadillo, Julen; Santos, Jose I; Gorritxategi, Eneko; Mabe, Jon; Arnaiz, Aitor; Fernández, Luis A
2017-04-15
Optimization of a multivariate calibration process has been undertaken for a Visible-Near Infrared (400-1100nm) sensor system, applied in the monitoring of the fermentation process of the cider produced in the Basque Country (Spain). The main parameters that were monitored included alcoholic proof, l-lactic acid content, glucose+fructose and acetic acid content. The multivariate calibration was carried out using a combination of different variable selection techniques and the most suitable pre-processing strategies were selected based on the spectra characteristics obtained by the sensor system. The variable selection techniques studied in this work include Martens Uncertainty test, interval Partial Least Square Regression (iPLS) and Genetic Algorithm (GA). This procedure arises from the need to improve the calibration models prediction ability for cider monitoring. Copyright © 2016 Elsevier Ltd. All rights reserved.
Instrumental Neutron Activation Analysis and Multivariate Statistics for Pottery Provenance
NASA Astrophysics Data System (ADS)
Glascock, M. D.; Neff, H.; Vaughn, K. J.
2004-06-01
The application of instrumental neutron activation analysis and multivariate statistics to archaeological studies of ceramics and clays is described. A small pottery data set from the Nasca culture in southern Peru is presented for illustration.
Pre and Post-copulatory Selection Favor Similar Genital Phenotypes in the Male Broad Horned Beetle.
House, Clarissa M; Sharma, M D; Okada, Kensuke; Hosken, David J
2016-10-01
Sexual selection can operate before and after copulation and the same or different trait(s) can be targeted during these episodes of selection. The direction and form of sexual selection imposed on characters prior to mating has been relatively well described, but the same is not true after copulation. In general, when male-male competition and female choice favor the same traits then there is the expectation of reinforcing selection on male sexual traits that improve competitiveness before and after copulation. However, when male-male competition overrides pre-copulatory choice then the opposite could be true. With respect to studies of selection on genitalia there is good evidence that male genital morphology influences mating and fertilization success. However, whether genital morphology affects reproductive success in more than one context (i.e., mating versus fertilization success) is largely unknown. Here we use multivariate analysis to estimate linear and nonlinear selection on male body size and genital morphology in the flour beetle Gnatocerus cornutus, simulated in a non-competitive (i.e., monogamous) setting. This analysis estimates the form of selection on multiple traits and typically, linear (directional) selection is easiest to detect, while nonlinear selection is more complex and can be stabilizing, disruptive, or correlational. We find that mating generates stabilizing selection on male body size and genitalia, and fertilization causes a blend of directional and stabilizing selection. Differences in the form of selection across these bouts of selection result from a significant alteration of nonlinear selection on body size and a marginally significant difference in nonlinear selection on a component of genital shape. This suggests that both bouts of selection favor similar genital phenotypes, whereas the strong stabilizing selection imposed on male body size during mate acquisition is weak during fertilization. © The Author 2016. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology.
A Study of Effects of MultiCollinearity in the Multivariable Analysis
Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; (Peter) He, Qinghua; Lillard, James W.
2015-01-01
A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, efficiency of multivariable analysis highly depends on correlation structure among predictive variables. When the covariates in the model are not independent one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the most simple collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables. PMID:25664257
A Study of Effects of MultiCollinearity in the Multivariable Analysis.
Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; Peter He, Qinghua; Lillard, James W
2014-10-01
A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, efficiency of multivariable analysis highly depends on correlation structure among predictive variables. When the covariates in the model are not independent one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the most simple collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables.
Localization of genes involved in the metabolic syndrome using multivariate linkage analysis.
Olswold, Curtis; de Andrade, Mariza
2003-12-31
There are no well accepted criteria for the diagnosis of the metabolic syndrome. However, the metabolic syndrome is identified clinically by the presence of three or more of these five variables: larger waist circumference, higher triglyceride levels, lower HDL-cholesterol concentrations, hypertension, and impaired fasting glucose. We use sets of two or three variables, which are available in the Framingham Heart Study data set, to localize genes responsible for this syndrome using multivariate quantitative linkage analysis. This analysis demonstrates the applicability of using multivariate linkage analysis and how its use increases the power to detect linkage when genes are involved in the same disease mechanism.
ECOPASS - a multivariate model used as an index of growth performance of poplar clones
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ceulemans, R.; Impens, I.
The model (ECOlogical PASSport) reported was constructed by principal component analysis from a combination of biochemical, anatomical/morphological and ecophysiological gas exchange parameters measured on 5 fast growing poplar clones. Productivity data were 10 selected trees in 3 plantations in Belgium and given as m.a.i.(b.a.). The model is shown to be able to reflect not only genetic origin and the relative effects of the different parameters of the clones, but also their production potential. Multiple regression analysis of the 4 principal components showed a high cumulative correlation (96%) between the 3 components related to ecophysiological, biochemical and morphological parameters, and productivity;more » the ecophysiological component alone correlated 85% with productivity.« less
Rupert, Michael G.
2003-01-01
Draft Federal regulations may require that each State develop a State Pesticide Management Plan for the herbicides atrazine, alachlor, metolachlor, and simazine. Maps were developed that the State of Colorado could use to predict the probability of detecting atrazine and desethyl-atrazine (a breakdown product of atrazine) in ground water in Colorado. These maps can be incorporated into the State Pesticide Management Plan and can help provide a sound hydrogeologic basis for atrazine management in Colorado. Maps showing the probability of detecting elevated nitrite plus nitrate as nitrogen (nitrate) concentrations in ground water in Colorado also were developed because nitrate is a contaminant of concern in many areas of Colorado. Maps showing the probability of detecting atrazine and(or) desethyl-atrazine (atrazine/DEA) at or greater than concentrations of 0.1 microgram per liter and nitrate concentrations in ground water greater than 5 milligrams per liter were developed as follows: (1) Ground-water quality data were overlaid with anthropogenic and hydrogeologic data using a geographic information system to produce a data set in which each well had corresponding data on atrazine use, fertilizer use, geology, hydrogeomorphic regions, land cover, precipitation, soils, and well construction. These data then were downloaded to a statistical software package for analysis by logistic regression. (2) Relations were observed between ground-water quality and the percentage of land-cover categories within circular regions (buffers) around wells. Several buffer sizes were evaluated; the buffer size that provided the strongest relation was selected for use in the logistic regression models. (3) Relations between concentrations of atrazine/DEA and nitrate in ground water and atrazine use, fertilizer use, geology, hydrogeomorphic regions, land cover, precipitation, soils, and well-construction data were evaluated, and several preliminary multivariate models with various combinations of independent variables were constructed. (4) The multivariate models that best predicted the presence of atrazine/DEA and elevated concentrations of nitrate in ground water were selected. (5) The accuracy of the multivariate models was confirmed by validating the models with an independent set of ground-water quality data. (6) The multivariate models were entered into a geographic information system and the probability maps were constructed.
Binquet, C; Abrahamowicz, M; Mahboubi, A; Jooste, V; Faivre, J; Bonithon-Kopp, C; Quantin, C
2008-12-30
Flexible survival models, which avoid assumptions about hazards proportionality (PH) or linearity of continuous covariates effects, bring the issues of model selection to a new level of complexity. Each 'candidate covariate' requires inter-dependent decisions regarding (i) its inclusion in the model, and representation of its effects on the log hazard as (ii) either constant over time or time-dependent (TD) and, for continuous covariates, (iii) either loglinear or non-loglinear (NL). Moreover, 'optimal' decisions for one covariate depend on the decisions regarding others. Thus, some efficient model-building strategy is necessary.We carried out an empirical study of the impact of the model selection strategy on the estimates obtained in flexible multivariable survival analyses of prognostic factors for mortality in 273 gastric cancer patients. We used 10 different strategies to select alternative multivariable parametric as well as spline-based models, allowing flexible modeling of non-parametric (TD and/or NL) effects. We employed 5-fold cross-validation to compare the predictive ability of alternative models.All flexible models indicated significant non-linearity and changes over time in the effect of age at diagnosis. Conventional 'parametric' models suggested the lack of period effect, whereas more flexible strategies indicated a significant NL effect. Cross-validation confirmed that flexible models predicted better mortality. The resulting differences in the 'final model' selected by various strategies had also impact on the risk prediction for individual subjects.Overall, our analyses underline (a) the importance of accounting for significant non-parametric effects of covariates and (b) the need for developing accurate model selection strategies for flexible survival analyses. Copyright 2008 John Wiley & Sons, Ltd.
PERIODIC AUTOREGRESSIVE-MOVING AVERAGE (PARMA) MODELING WITH APPLICATIONS TO WATER RESOURCES.
Vecchia, A.V.
1985-01-01
Results involving correlation properties and parameter estimation for autogressive-moving average models with periodic parameters are presented. A multivariate representation of the PARMA model is used to derive parameter space restrictions and difference equations for the periodic autocorrelations. Close approximation to the likelihood function for Gaussian PARMA processes results in efficient maximum-likelihood estimation procedures. Terms in the Fourier expansion of the parameters are sequentially included, and a selection criterion is given for determining the optimal number of harmonics to be included. Application of the techniques is demonstrated through analysis of a monthly streamflow time series.
Multivariate frequency domain analysis of protein dynamics
NASA Astrophysics Data System (ADS)
Matsunaga, Yasuhiro; Fuchigami, Sotaro; Kidera, Akinori
2009-03-01
Multivariate frequency domain analysis (MFDA) is proposed to characterize collective vibrational dynamics of protein obtained by a molecular dynamics (MD) simulation. MFDA performs principal component analysis (PCA) for a bandpass filtered multivariate time series using the multitaper method of spectral estimation. By applying MFDA to MD trajectories of bovine pancreatic trypsin inhibitor, we determined the collective vibrational modes in the frequency domain, which were identified by their vibrational frequencies and eigenvectors. At near zero temperature, the vibrational modes determined by MFDA agreed well with those calculated by normal mode analysis. At 300 K, the vibrational modes exhibited characteristic features that were considerably different from the principal modes of the static distribution given by the standard PCA. The influences of aqueous environments were discussed based on two different sets of vibrational modes, one derived from a MD simulation in water and the other from a simulation in vacuum. Using the varimax rotation, an algorithm of the multivariate statistical analysis, the representative orthogonal set of eigenmodes was determined at each vibrational frequency.
Imaging of polysaccharides in the tomato cell wall with Raman microspectroscopy
2014-01-01
Background The primary cell wall of fruits and vegetables is a structure mainly composed of polysaccharides (pectins, hemicelluloses, cellulose). Polysaccharides are assembled into a network and linked together. It is thought that the percentage of components and of plant cell wall has an important influence on mechanical properties of fruits and vegetables. Results In this study the Raman microspectroscopy technique was introduced to the visualization of the distribution of polysaccharides in cell wall of fruit. The methodology of the sample preparation, the measurement using Raman microscope and multivariate image analysis are discussed. Single band imaging (for preliminary analysis) and multivariate image analysis methods (principal component analysis and multivariate curve resolution) were used for the identification and localization of the components in the primary cell wall. Conclusions Raman microspectroscopy supported by multivariate image analysis methods is useful in distinguishing cellulose and pectins in the cell wall in tomatoes. It presents how the localization of biopolymers was possible with minimally prepared samples. PMID:24917885
A refined method for multivariate meta-analysis and meta-regression
Jackson, Daniel; Riley, Richard D
2014-01-01
Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects’ standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. PMID:23996351
Sexual Selection on male cuticular hydrocarbons via male-male competition and female choice.
Lane, S M; Dickinson, A W; Tregenza, T; House, C M
2016-07-01
Traditional views of sexual selection assumed that male-male competition and female mate choice work in harmony, selecting upon the same traits in the same direction. However, we now know that this is not always the case and that these two mechanisms often impose conflicting selection on male sexual traits. Cuticular hydrocarbons (CHCs) have been shown to be linked to both social dominance and male attractiveness in several insect species. However, although several studies have estimated the strength and form of sexual selection imposed on male CHCs by female mate choice, none have established whether these chemical traits are also subject to sexual selection via male-male competition. Using a multivariate selection analysis, we estimate and compare sexual selection exerted by male-male competition and female mate choice on male CHC composition in the broad-horned flour beetle Gnatocerus cornutus. We show that male-male competition exerts strong linear selection on both overall CHC abundance and body size in males, while female mate choice exerts a mixture of linear and nonlinear selection, targeting not just the overall amount of CHCs expressed but the relative abundance of specific hydrocarbons as well. We discuss the potential implications of this antagonistic selection with regard to male reproductive success. © 2016 The Authors. Journal of Evolutionary Biology published by John Wiley & Sons Ltd on behalf of European Society for Evolutionary Biology.
Malone, Patrick S; Glezer, Laurie S; Kim, Judy; Jiang, Xiong; Riesenhuber, Maximilian
2016-09-28
The neural substrates of semantic representation have been the subject of much controversy. The study of semantic representations is complicated by difficulty in disentangling perceptual and semantic influences on neural activity, as well as in identifying stimulus-driven, "bottom-up" semantic selectivity unconfounded by top-down task-related modulations. To address these challenges, we trained human subjects to associate pseudowords (TPWs) with various animal and tool categories. To decode semantic representations of these TPWs, we used multivariate pattern classification of fMRI data acquired while subjects performed a semantic oddball detection task. Crucially, the classifier was trained and tested on disjoint sets of TPWs, so that the classifier had to use the semantic information from the training set to correctly classify the test set. Animal and tool TPWs were successfully decoded based on fMRI activity in spatially distinct subregions of the left medial anterior temporal lobe (LATL). In addition, tools (but not animals) were successfully decoded from activity in the left inferior parietal lobule. The tool-selective LATL subregion showed greater functional connectivity with left inferior parietal lobule and ventral premotor cortex, indicating that each LATL subregion exhibits distinct patterns of connectivity. Our findings demonstrate category-selective organization of semantic representations in LATL into spatially distinct subregions, continuing the lateral-medial segregation of activation in posterior temporal cortex previously observed in response to images of animals and tools, respectively. Together, our results provide evidence for segregation of processing hierarchies for different classes of objects and the existence of multiple, category-specific semantic networks in the brain. The location and specificity of semantic representations in the brain are still widely debated. We trained human participants to associate specific pseudowords with various animal and tool categories, and used multivariate pattern classification of fMRI data to decode the semantic representations of the trained pseudowords. We found that: (1) animal and tool information was organized in category-selective subregions of medial left anterior temporal lobe (LATL); (2) tools, but not animals, were encoded in left inferior parietal lobe; and (3) LATL subregions exhibited distinct patterns of functional connectivity with category-related regions across cortex. Our findings suggest that semantic knowledge in LATL is organized in category-related subregions, providing evidence for the existence of multiple, category-specific semantic representations in the brain. Copyright © 2016 the authors 0270-6474/16/3610089-08$15.00/0.
Jackson, Dan; White, Ian R; Riley, Richard D
2013-01-01
Multivariate meta-analysis is becoming more commonly used. Methods for fitting the multivariate random effects model include maximum likelihood, restricted maximum likelihood, Bayesian estimation and multivariate generalisations of the standard univariate method of moments. Here, we provide a new multivariate method of moments for estimating the between-study covariance matrix with the properties that (1) it allows for either complete or incomplete outcomes and (2) it allows for covariates through meta-regression. Further, for complete data, it is invariant to linear transformations. Our method reduces to the usual univariate method of moments, proposed by DerSimonian and Laird, in a single dimension. We illustrate our method and compare it with some of the alternatives using a simulation study and a real example. PMID:23401213
Biostatistics Series Module 10: Brief Overview of Multivariate Methods.
Hazra, Avijit; Gogtay, Nithya
2017-01-01
Multivariate analysis refers to statistical techniques that simultaneously look at three or more variables in relation to the subjects under investigation with the aim of identifying or clarifying the relationships between them. These techniques have been broadly classified as dependence techniques, which explore the relationship between one or more dependent variables and their independent predictors, and interdependence techniques, that make no such distinction but treat all variables equally in a search for underlying relationships. Multiple linear regression models a situation where a single numerical dependent variable is to be predicted from multiple numerical independent variables. Logistic regression is used when the outcome variable is dichotomous in nature. The log-linear technique models count type of data and can be used to analyze cross-tabulations where more than two variables are included. Analysis of covariance is an extension of analysis of variance (ANOVA), in which an additional independent variable of interest, the covariate, is brought into the analysis. It tries to examine whether a difference persists after "controlling" for the effect of the covariate that can impact the numerical dependent variable of interest. Multivariate analysis of variance (MANOVA) is a multivariate extension of ANOVA used when multiple numerical dependent variables have to be incorporated in the analysis. Interdependence techniques are more commonly applied to psychometrics, social sciences and market research. Exploratory factor analysis and principal component analysis are related techniques that seek to extract from a larger number of metric variables, a smaller number of composite factors or components, which are linearly related to the original variables. Cluster analysis aims to identify, in a large number of cases, relatively homogeneous groups called clusters, without prior information about the groups. The calculation intensive nature of multivariate analysis has so far precluded most researchers from using these techniques routinely. The situation is now changing with wider availability, and increasing sophistication of statistical software and researchers should no longer shy away from exploring the applications of multivariate methods to real-life data sets.
Prognostic Significance of Tumor Necrosis in Hilar Cholangiocarcinoma.
Atanasov, Georgi; Schierle, Katrin; Hau, Hans-Michael; Dietel, Corinna; Krenzien, Felix; Brandl, Andreas; Wiltberger, Georg; Englisch, Julianna Paulina; Robson, Simon C; Reutzel-Selke, Anja; Pascher, Andreas; Jonas, Sven; Pratschke, Johann; Benzing, Christian; Schmelzle, Moritz
2017-02-01
Tumor necrosis and peritumoral fibrosis have both been suggested to have a prognostic value in selected solid tumors. However, little is known regarding their influence on tumor progression and prognosis in hilar cholangiocarcinoma (HC). Surgically resected tumor specimens of HC (n = 47) were analyzed for formation of necrosis and extent of peritumoral fibrosis. Tumor necrosis and grade of fibrosis were assessed histologically and correlated with clinicopathological characteristics, tumor recurrence, and patients' survival. Univariate Kaplan-Meier analysis and a stepwise multivariable Cox regression model were applied. Mild peritumoral fibrosis was evident in 12 tumor samples, moderate peritumoral fibrosis in 20, and high-grade fibrosis in 15. Necrosis was evident in 19 of 47 tumor samples. Patients with tumors characterized by necrosis showed a significantly decreased 5-year recurrence-free survival (37.9 vs. 25.7 %; p < .05) and a significantly decreased 5-year overall survival (42.6 vs. 12.4 %; p < .05), when compared with patients with tumors showing no necrosis. R status, tumor recurrence, and tumor necrosis were of prognostic value in the univariate analysis (all p < .05). Multivariate survival analysis confirmed tumor necrosis (p = .038) as the only independent prognostic variable. The assessment of tumor necrosis appears as a valuable additional prognostic tool in routine histopathological evaluation of HC. These observations might have implications for monitoring and more individualized multimodal therapeutic strategies.
Xu, Chunsheng; Sun, Jianping; Ji, Fuling; Tian, Xiaocao; Duan, Haiping; Zhai, Yaoming; Wang, Shaojie; Pang, Zengchang; Zhang, Dongfeng; Zhao, Zhongtang; Li, Shuxia; Hjelmborg, Jacob V B; Christensen, Kaare; Tan, Qihua
2015-02-01
The genetic influences on aging-related phenotypes, including cognition and depression, have been well confirmed in the Western populations. We performed the first twin-based analysis on cognitive performance, memory and depression status in middle-aged and elderly Chinese twins, representing the world's largest and most rapidly aging population. The sample consisted of 384 twin pairs with a median age of 50 years. Cognitive function was measured using the Montreal Cognitive Assessment (MoCA) scale; memory was assessed using the revised Wechsler Adult Intelligence scale; depression symptomatology was evaluated by the self-reported 30-item Geriatric Depression (GDS-30)scale. Both univariate and multivariate twin models were fitted to the three phenotypes with full and nested models and compared to select the best fitting models. Univariate analysis showed moderate-to-high genetic influences with heritability 0.44 for cognition and 0.56 for memory. Multivariate analysis by the reduced Cholesky model estimated significant genetic (rG = 0.69) and unique environmental (rE = 0.25) correlation between cognitive ability and memory. The model also estimated weak but significant inverse genetic correlation for depression with cognition (-0.31) and memory (-0.28). No significant unique environmental correlation was found for depression with other two phenotypes. In conclusion, there can be a common genetic architecture for cognitive ability and memory that weakly correlates with depression symptomatology, but in the opposite direction.
Takahashi, Masazumi; Terashima, Masanori; Kawahira, Hiroshi; Nagai, Eishi; Uenosono, Yoshikazu; Kinami, Shinichi; Nagata, Yasuhiro; Yoshida, Masashi; Aoyagi, Keishiro; Kodera, Yasuhiro; Nakada, Koji
2017-01-01
AIM To investigate the detrimental impact of loss of reservoir capacity by comparing total gastrectomy (TGRY) and distal gastrectomy with the same Roux-en-Y (DGRY) reconstruction. The study was conducted using an integrated questionnaire, the Postgastrectomy Syndrome Assessment Scale (PGSAS)-45, recently developed by the Japan Postgastrectomy Syndrome Working Party. METHODS The PGSAS-45 comprises 8 items from the Short Form-8, 15 from the Gastrointestinal Symptom Rating Scale, and 22 newly selected items. Uni- and multivariate analysis was performed on 868 questionnaires completed by patients who underwent either TGRY (n = 393) or DGRY (n = 475) for stage I gastric cancer (52 institutions). Multivariate analysis weighed of six explanatory variables, including the type of gastrectomy (TGRY/DGRY), interval after surgery, age, gender, surgical approach (laparoscopic/open), and whether the celiac branch of the vagus nerve was preserved/divided on the quality of life (QOL). RESULTS The patients who underwent TGRY experienced the poorer QOL compared to DGRY in the 15 of 19 main outcome measures of PGSAS-45. Moreover, multiple regression analysis indicated that the type of gastrectomy, TGRY, most strongly and broadly impaired the postoperative QOL among six explanatory variables. CONCLUSION The results of the present study suggested that TGRY had a certain detrimental impact on the postoperative QOL, and the loss of reservoir capacity could be a major cause. PMID:28373774
Robust tumor morphometry in multispectral fluorescence microscopy
NASA Astrophysics Data System (ADS)
Tabesh, Ali; Vengrenyuk, Yevgen; Teverovskiy, Mikhail; Khan, Faisal M.; Sapir, Marina; Powell, Douglas; Mesa-Tejada, Ricardo; Donovan, Michael J.; Fernandez, Gerardo
2009-02-01
Morphological and architectural characteristics of primary tissue compartments, such as epithelial nuclei (EN) and cytoplasm, provide important cues for cancer diagnosis, prognosis, and therapeutic response prediction. We propose two feature sets for the robust quantification of these characteristics in multiplex immunofluorescence (IF) microscopy images of prostate biopsy specimens. To enable feature extraction, EN and cytoplasm regions were first segmented from the IF images. Then, feature sets consisting of the characteristics of the minimum spanning tree (MST) connecting the EN and the fractal dimension (FD) of gland boundaries were obtained from the segmented compartments. We demonstrated the utility of the proposed features in prostate cancer recurrence prediction on a multi-institution cohort of 1027 patients. Univariate analysis revealed that both FD and one of the MST features were highly effective for predicting cancer recurrence (p <= 0.0001). In multivariate analysis, an MST feature was selected for a model incorporating clinical and image features. The model achieved a concordance index (CI) of 0.73 on the validation set, which was significantly higher than the CI of 0.69 for the standard multivariate model based solely on clinical features currently used in clinical practice (p < 0.0001). The contributions of this work are twofold. First, it is the first demonstration of the utility of the proposed features in morphometric analysis of IF images. Second, this is the largest scale study of the efficacy and robustness of the proposed features in prostate cancer prognosis.
DiMagno, Matthew J; Spaete, Joshua P; Ballard, Darren D; Wamsteker, Erik-Jan; Saini, Sameer D
2013-08-01
We investigated which variables independently associated with protection against or development of postendoscopic retrograde cholangiopancreatography (ERCP) pancreatitis (PEP) and severity of PEP. Subsequently, we derived predictive risk models for PEP. In a case-control design, 6505 patients had 8264 ERCPs, 211 patients had PEP, and 22 patients had severe PEP. We randomly selected 348 non-PEP controls. We examined 7 established- and 9 investigational variables. In univariate analysis, 7 variables predicted PEP: younger age, female sex, suspected sphincter of Oddi dysfunction (SOD), pancreatic sphincterotomy, moderate-difficult cannulation (MDC), pancreatic stent placement, and lower Charlson score. Protective variables were current smoking, former drinking, diabetes, and chronic liver disease (CLD, biliary/transplant complications). Multivariate analysis identified seven independent variables for PEP, three protective (current smoking, CLD-biliary, CLD-transplant/hepatectomy complications) and 4 predictive (younger age, suspected SOD, pancreatic sphincterotomy, MDC). Pre- and post-ERCP risk models of 7 variables have a C-statistic of 0.74. Removing age (seventh variable) did not significantly affect the predictive value (C-statistic of 0.73) and reduced model complexity. Severity of PEP did not associate with any variables by multivariate analysis. By using the newly identified protective variables with 3 predictive variables, we derived 2 risk models with a higher predictive value for PEP compared to prior studies.
Factors associated with abnormal eating attitudes among Greek adolescents.
Bilali, Aggeliki; Galanis, Petros; Velonakis, Emmanuel; Katostaras, Theofanis
2010-01-01
To estimate the prevalence of abnormal eating attitudes among Greek adolescents and identify possible risk factors associated with these attitudes. Cross-sectional, school-based study. Six randomly selected schools in Patras, southern Greece. The study population consisted of 540 Greek students aged 13-18 years, and the response rate was 97%. The dependent variable was scores on the Eating Attitudes Test-26, with scores > or = 20 indicating abnormal eating attitudes. Bivariate analysis included independent Student t test, chi-square test, and Fisher's exact test. Multivariate logistic regression analysis was applied for the identification of the predictive factors, which were associated independently with abnormal eating attitudes. A 2-sided P value of less than .05 was considered statistically significant. The prevalence of abnormal eating attitudes was 16.7%. Multivariate logistic regression analysis demonstrated that females, urban residents, and those with a body mass index outside normal range, a perception of being overweight, body dissatisfaction, and a family member on a diet were independently related to abnormal eating attitudes. The results indicate that a proportion of Greek adolescents report abnormal eating attitudes and suggest that multiple factors contribute to the development of these attitudes. These findings are useful for further research into this topic and would be valuable in designing preventive interventions. Copyright 2010 Society for Nutrition Education. Published by Elsevier Inc. All rights reserved.
Barreto, Goncalo; Soininen, Antti; Sillat, Tarvo; Konttinen, Yrjö T; Kaivosoja, Emilia
2014-01-01
Time-of-flight secondary ion mass spectrometry (ToF-SIMS) is increasingly being used in analysis of biological samples. For example, it has been applied to distinguish healthy and osteoarthritic human cartilage. This chapter discusses ToF-SIMS principle and instrumentation including the three modes of analysis in ToF-SIMS. ToF-SIMS sets certain requirements for the samples to be analyzed; for example, the samples have to be vacuum compatible. Accordingly, sample processing steps for different biological samples, i.e., proteins, cells, frozen and paraffin-embedded tissues and extracellular matrix for the ToF-SIMS are presented. Multivariate analysis of the ToF-SIMS data and the necessary data preprocessing steps (peak selection, data normalization, mean-centering, and scaling and transformation) are discussed in this chapter.
Moreno-Opo, Rubén; Fernández-Olalla, Mariana; Margalida, Antoni; Arredondo, Ángel; Guil, Francisco
2012-01-01
The application of scientific-based conservation measures requires that sampling methodologies in studies modelling similar ecological aspects produce comparable results making easier their interpretation. We aimed to show how the choice of different methodological and ecological approaches can affect conclusions in nest-site selection studies along different Palearctic meta-populations of an indicator species. First, a multivariate analysis of the variables affecting nest-site selection in a breeding colony of cinereous vulture (Aegypius monachus) in central Spain was performed. Then, a meta-analysis was applied to establish how methodological and habitat-type factors determine differences and similarities in the results obtained by previous studies that have modelled the forest breeding habitat of the species. Our results revealed patterns in nesting-habitat modelling by the cinereous vulture throughout its whole range: steep and south-facing slopes, great cover of large trees and distance to human activities were generally selected. The ratio and situation of the studied plots (nests/random), the use of plots vs. polygons as sampling units and the number of years of data set determined the variability explained by the model. Moreover, a greater size of the breeding colony implied that ecological and geomorphological variables at landscape level were more influential. Additionally, human activities affected in greater proportion to colonies situated in Mediterranean forests. For the first time, a meta-analysis regarding the factors determining nest-site selection heterogeneity for a single species at broad scale was achieved. It is essential to homogenize and coordinate experimental design in modelling the selection of species' ecological requirements in order to avoid that differences in results among studies would be due to methodological heterogeneity. This would optimize best conservation and management practices for habitats and species in a global context. PMID:22413023
Blankers, T; Lübke, A K; Hennig, R M
2015-09-01
Studying the genetic architecture of sexual traits provides insight into the rate and direction at which traits can respond to selection. Traits associated with few loci and limited genetic and phenotypic constraints tend to evolve at high rates typically observed for secondary sexual characters. Here, we examined the genetic architecture of song traits and female song preferences in the field crickets Gryllus rubens and Gryllus texensis. Song and preference data were collected from both species and interspecific F1 and F2 hybrids. We first analysed phenotypic variation to examine interspecific differentiation and trait distributions in parental and hybrid generations. Then, the relative contribution of additive and additive-dominance variation was estimated. Finally, phenotypic variance-covariance (P) matrices were estimated to evaluate the multivariate phenotype available for selection. Song traits and preferences had unimodal trait distributions, and hybrid offspring were intermediate with respect to the parents. We uncovered additive and dominance variation in song traits and preferences. For two song traits, we found evidence for X-linked inheritance. On the one hand, the observed genetic architecture does not suggest rapid divergence, although sex linkage may have allowed for somewhat higher evolutionary rates. On the other hand, P matrices revealed that multivariate variation in song traits aligned with major dimensions in song preferences, suggesting a strong selection response. We also found strong covariance between the main traits that are sexually selected and traits that are not directly selected by females, providing an explanation for the striking multivariate divergence in male calling songs despite limited divergence in female preferences. © 2015 European Society For Evolutionary Biology.
NASA Astrophysics Data System (ADS)
Naguib, Ibrahim A.; Darwish, Hany W.
2012-02-01
A comparison between support vector regression (SVR) and Artificial Neural Networks (ANNs) multivariate regression methods is established showing the underlying algorithm for each and making a comparison between them to indicate the inherent advantages and limitations. In this paper we compare SVR to ANN with and without variable selection procedure (genetic algorithm (GA)). To project the comparison in a sensible way, the methods are used for the stability indicating quantitative analysis of mixtures of mebeverine hydrochloride and sulpiride in binary mixtures as a case study in presence of their reported impurities and degradation products (summing up to 6 components) in raw materials and pharmaceutical dosage form via handling the UV spectral data. For proper analysis, a 6 factor 5 level experimental design was established resulting in a training set of 25 mixtures containing different ratios of the interfering species. An independent test set consisting of 5 mixtures was used to validate the prediction ability of the suggested models. The proposed methods (linear SVR (without GA) and linear GA-ANN) were successfully applied to the analysis of pharmaceutical tablets containing mebeverine hydrochloride and sulpiride mixtures. The results manifest the problem of nonlinearity and how models like the SVR and ANN can handle it. The methods indicate the ability of the mentioned multivariate calibration models to deconvolute the highly overlapped UV spectra of the 6 components' mixtures, yet using cheap and easy to handle instruments like the UV spectrophotometer.
Li, Y-H; Li, G-Q; Guo, S-M; Che, Y-N; Wang, X; Cheng, F-T
2017-10-01
To analyze the related influencing factors of urinary tract infection in patients undergoing transurethral resection of the prostate (TURP). A total of 343 patients with benign prostatic hyperplasia admitted to this hospital from January 2013 to December 2016, were selected and treated by TURP. Patients were divided into infection group and non-infection group according to the occurrence of urinary tract infection after operation. The possible influencing factors were collected to perform univariate and multivariate logistic regression analysis. There were 53 cases with urinary tract infection after operation among 343 patients with benign prostatic hyperplasia, accounting for 15.5%. The univariate analysis displayed that the occurrence of urinary tract infection in patients undergoing TURP was closely associated with patient's age ≥ 65 years old, complicated diabetes, catheterization for urinary retention before operation, no use of antibiotics before operation and postoperative indwelling catheter duration ≥ 5 d (p < 0.05). Multivariate logistic regression analysis revealed that age ≥ 65 years old, complicated diabetes, catheterization before operation, indwelling catheter duration ≥ 5 d and no use of antibiotics before operation were risk factors of urinary tract infection in patients receiving TURP (p < 0.05). The patient's age ≥ 65 years old, catheterization before operation, complicated diabetes and long-term indwelling catheter after operation, can increase the occurrence of urinary tract infection after TURP, while preoperative prophylactic utilization of anti-infective drugs can reduce the occurrence of postoperative urinary tract infection.
Shimura, Hiroshi; Mitsui, Takahiko; Kira, Satoru; Ihara, Tatsuya; Sawada, Norifumi; Nakagomi, Hiroshi; Miyamoto, Tatsuya; Tsuchiya, Sachiko; Kanda, Mie; Takeda, Masayuki
2018-05-09
To identify metabolites that are associated with an overactive bladder (OAB) using metabolomics. A total of 58 male patients without apparent neurologic disease completed 24-hour bladder diaries of their micturition behavior and International Prostate Symptom Score (IPSS) for the assessment of micturition behavior and lower urinary tract symptoms. Urgency was defined as an IPSS urgency score of ≥2 (OAB group), and patients with IPSS urgency scores of ≤1 belonged to the control group. A comprehensive study of plasma metabolites was also conducted using capillary electrophoresis time-of-flight mass spectrometry. Metabolite levels were compared between the control and OAB groups using the Mann-Whitney U test. Potential metabolite biomarkers were selected using multivariate logistic regression analysis. Of the 58 subjects, the control and OAB groups consisted of 32 and 26 male patients, respectively. Nocturnal urinary volume, 24-hour micturition frequency, nocturnal micturition frequency, and the nocturia index were significantly higher in the OAB group. Metabolomic analysis revealed 60 metabolites in the subjects' plasma. The levels of 11 metabolites differed between the control and OAB groups. Multivariate analysis showed that an increased glutamate level and reduced arginine, glutamine, and inosine monophosphate levels are significantly associated with OAB in male patients. Reduced levels of asparagine and hydroxyproline could also be associated with OAB. Urgency is associated with abnormal metabolism. Analyses of amino acid profiles might aid the search for new treatment targets for OAB. Copyright © 2018 Elsevier Inc. All rights reserved.
Cho, Hyun-Deok; Kim, Unyong; Suh, Joon Hyuk; Eom, Han Young; Kim, Junghyun; Lee, Seul Gi; Choi, Yong Seok; Han, Sang Beom
2016-04-01
Analytical methods using high-performance liquid chromatography with diode array and tandem mass spectrometry detection were developed for the discrimination of the rhizomes of four Atractylodes medicinal plants: A. japonica, A. macrocephala, A. chinensis, and A. lancea. A quantitative study was performed, selecting five bioactive components, including atractylenolide I, II, III, eudesma-4(14),7(11)-dien-8-one and atractylodin, on twenty-six Atractylodes samples of various origins. Sample extraction was optimized to sonication with 80% methanol for 40 min at room temperature. High-performance liquid chromatography with diode array detection was established using a C18 column with a water/acetonitrile gradient system at a flow rate of 1.0 mL/min, and the detection wavelength was set at 236 nm. Liquid chromatography with tandem mass spectrometry was applied to certify the reliability of the quantitative results. The developed methods were validated by ensuring specificity, linearity, limit of quantification, accuracy, precision, recovery, robustness, and stability. Results showed that cangzhu contained higher amounts of atractylenolide I and atractylodin than baizhu, and especially atractylodin contents showed the greatest variation between baizhu and cangzhu. Multivariate statistical analysis, such as principal component analysis and hierarchical cluster analysis, were also employed for further classification of the Atractylodes plants. The established method was suitable for quality control of the Atractylodes plants. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Xu, Cheng-Jian; van der Schaaf, Arjen; Schilstra, Cornelis; Langendijk, Johannes A; van't Veld, Aart A
2012-03-15
To study the impact of different statistical learning methods on the prediction performance of multivariate normal tissue complication probability (NTCP) models. In this study, three learning methods, stepwise selection, least absolute shrinkage and selection operator (LASSO), and Bayesian model averaging (BMA), were used to build NTCP models of xerostomia following radiotherapy treatment for head and neck cancer. Performance of each learning method was evaluated by a repeated cross-validation scheme in order to obtain a fair comparison among methods. It was found that the LASSO and BMA methods produced models with significantly better predictive power than that of the stepwise selection method. Furthermore, the LASSO method yields an easily interpretable model as the stepwise method does, in contrast to the less intuitive BMA method. The commonly used stepwise selection method, which is simple to execute, may be insufficient for NTCP modeling. The LASSO method is recommended. Copyright © 2012 Elsevier Inc. All rights reserved.
Fatty acid composition and its association with chemical and sensory analysis of boar taint.
Liu, Xiaoye; Trautmann, Johanna; Wigger, Ruth; Zhou, Guanghong; Mörlein, Daniel
2017-09-15
A certain level of disagreement between the chemical analysis of androstenone and skatole and the human perception of boar taint has been found in many studies. Here we analyze whether the fatty acid composition can explain such inconsistency between sensory evaluation and chemical analysis of boar taint compounds. Therefore, back fat samples (n=143) were selected according to their sensory evaluation by a 10-person sensory panel, and the chemical analysis (stable isotope dilution analysis with headspace solid-phase microextraction and gas chromatography-mass spectrometry) of androstenone and skatole. Subsequently a quantification of fatty acids using gas chromatography-flame ionization detection was conducted. The correlation analyses revealed that several fatty acids are significantly correlated with androstenone, skatole, and the sensory rating. However, multivariate analyses (principal component analysis) revealed no explanation of the fatty acid composition with respect to the (dis-)agreement between sensory and chemical analysis. Copyright © 2017 Elsevier Ltd. All rights reserved.
Multivariate time series analysis of neuroscience data: some challenges and opportunities.
Pourahmadi, Mohsen; Noorbaloochi, Siamak
2016-04-01
Neuroimaging data may be viewed as high-dimensional multivariate time series, and analyzed using techniques from regression analysis, time series analysis and spatiotemporal analysis. We discuss issues related to data quality, model specification, estimation, interpretation, dimensionality and causality. Some recent research areas addressing aspects of some recurring challenges are introduced. Copyright © 2015 Elsevier Ltd. All rights reserved.
Tyler, Susan; Truong, Pauline T; Lesperance, Mary; Nichol, Alan; Baliski, Chris; Warburton, Rebecca; Tyldesley, Scott
2018-03-13
The 2014 Society of Surgical Oncology-American Society for Radiation Oncology consensus suggested "no ink on tumor" is a sufficient surgical margin for invasive breast cancer treated with breast-conserving surgery (BCS). Whether close margins <2 mm are associated with inferior outcomes remains controversial. This study evaluated 10-year outcomes by margin status in a population-based cohort treated with BCS and adjuvant radiation therapy (RT). The subjects were 10,863 women with invasive cancer categorized as pT1 to T3, any N, and M0 referred from 2001 to 2011, an era in which the institutional policy was to re-excise close or positive margins, except in select cases. All women underwent BCS and whole-breast RT with or without boost RT. Local recurrence (LR) and breast cancer-specific survival (BCSS) were examined using competing-risk analysis in cohorts with negative (≥2 mm; n = 9241, 85%), close (<2 mm; n = 1310, 12%), or positive (tumor touching ink; n = 312, 3%) margins. Multivariable analysis and matched-pair analysis were performed. The median follow-up period was 8 years. Systemic therapy was used in 87% of patients. Boost RT was used in 34.1%, 76.9%, and 79.5% of patients with negative, close, and positive margins, respectively. In the negative, close, and positive margin cohorts, the 10-year cumulative incidence of LR was 1.8%, 2.0%, and 1.1%, respectively (P = .759). Corresponding BCSS estimates were 93.9%, 91.8%, and 87.9%, respectively (P < .001). On multivariable analysis, close margins were not associated with increased LR (hazard ratio, 1.25; 95% confidence interval 0.79-1.97; P = .350) or reduced BCSS (hazard ratio, 1.25; 95% confidence interval 0.98-1.58, P = .071) relative to negative margins. On matched-pair analysis, close margin cases had similar LR (P = .114) and BCSS (P = .100) to negative margin controls. Select cases with close or positive margins in this population-based analysis had similar LR and BCSS to cases with negative margins. While these findings do not endorse omitting re-excision for all cases, the data support a policy of accepting carefully selected cases with close margins for adjuvant RT without re-excision. Copyright © 2018 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Park, Steve
1990-01-01
A large and diverse number of computational techniques are routinely used to process and analyze remotely sensed data. These techniques include: univariate statistics; multivariate statistics; principal component analysis; pattern recognition and classification; other multivariate techniques; geometric correction; registration and resampling; radiometric correction; enhancement; restoration; Fourier analysis; and filtering. Each of these techniques will be considered, in order.
Chemical structure of wood charcoal by infrared spectroscopy and multivariate analysis
Nicole Labbe; David Harper; Timothy Rials; Thomas Elder
2006-01-01
In this work, the effect of temperature on charcoal structure and chemical composition is investigated for four tree species. Wood charcoal carbonized at various temperatures is analyzed by mid infrared spectroscopy coupled with multivariate analysis and by thermogravimetric analysis to characterize the chemical composition during the carbonization process. The...
Yan, Binjun; Fang, Zhonghua; Shen, Lijuan; Qu, Haibin
2015-01-01
The batch-to-batch quality consistency of herbal drugs has always been an important issue. To propose a methodology for batch-to-batch quality control based on HPLC-MS fingerprints and process knowledgebase. The extraction process of Compound E-jiao Oral Liquid was taken as a case study. After establishing the HPLC-MS fingerprint analysis method, the fingerprints of the extract solutions produced under normal and abnormal operation conditions were obtained. Multivariate statistical models were built for fault detection and a discriminant analysis model was built using the probabilistic discriminant partial-least-squares method for fault diagnosis. Based on multivariate statistical analysis, process knowledge was acquired and the cause-effect relationship between process deviations and quality defects was revealed. The quality defects were detected successfully by multivariate statistical control charts and the type of process deviations were diagnosed correctly by discriminant analysis. This work has demonstrated the benefits of combining HPLC-MS fingerprints, process knowledge and multivariate analysis for the quality control of herbal drugs. Copyright © 2015 John Wiley & Sons, Ltd.
Hevesi, Joseph A.; Istok, Jonathan D.; Flint, Alan L.
1992-01-01
Values of average annual precipitation (AAP) are desired for hydrologic studies within a watershed containing Yucca Mountain, Nevada, a potential site for a high-level nuclear-waste repository. Reliable values of AAP are not yet available for most areas within this watershed because of a sparsity of precipitation measurements and the need to obtain measurements over a sufficient length of time. To estimate AAP over the entire watershed, historical precipitation data and station elevations were obtained from a network of 62 stations in southern Nevada and southeastern California. Multivariate geostatistics (cokriging) was selected as an estimation method because of a significant (p = 0.05) correlation of r = .75 between the natural log of AAP and station elevation. A sample direct variogram for the transformed variable, TAAP = ln [(AAP) 1000], was fitted with an isotropic, spherical model defined by a small nugget value of 5000, a range of 190 000 ft, and a sill value equal to the sample variance of 163 151. Elevations for 1531 additional locations were obtained from topographic maps to improve the accuracy of cokriged estimates. A sample direct variogram for elevation was fitted with an isotropic model consisting of a nugget value of 5500 and three nested transition structures: a Gaussian structure with a range of 61 000 ft, a spherical structure with a range of 70 000 ft, and a quasi-stationary, linear structure. The use of an isotropic, stationary model for elevation was considered valid within a sliding-neighborhood radius of 120 000 ft. The problem of fitting a positive-definite, nonlinear model of coregionalization to an inconsistent sample cross variogram for TAAP and elevation was solved by a modified use of the Cauchy-Schwarz inequality. A selected cross-variogram model consisted of two nested structures: a Gaussian structure with a range of 61 000 ft and a spherical structure with a range of 190 000 ft. Cross validation was used for model selection and for comparing the geostatistical model with six alternate estimation methods. Multivariate geostatistics provided the best cross-validation results.
Multivariate analysis: greater insights into complex systems
USDA-ARS?s Scientific Manuscript database
Many agronomic researchers measure and collect multiple response variables in an effort to understand the more complex nature of the system being studied. Multivariate (MV) statistical methods encompass the simultaneous analysis of all random variables (RV) measured on each experimental or sampling ...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Van Benthem, Mark Hilary; Mowry, Curtis Dale; Kotula, Paul Gabriel
Thermal decomposition of poly dimethyl siloxane compounds, Sylgard{reg_sign} 184 and 186, were examined using thermal desorption coupled gas chromatography-mass spectrometry (TD/GC-MS) and multivariate analysis. This work describes a method of producing multiway data using a stepped thermal desorption. The technique involves sequentially heating a sample of the material of interest with subsequent analysis in a commercial GC/MS system. The decomposition chromatograms were analyzed using multivariate analysis tools including principal component analysis (PCA), factor rotation employing the varimax criterion, and multivariate curve resolution. The results of the analysis show seven components related to offgassing of various fractions of siloxanes that varymore » as a function of temperature. Thermal desorption coupled with gas chromatography-mass spectrometry (TD/GC-MS) is a powerful analytical technique for analyzing chemical mixtures. It has great potential in numerous analytic areas including materials analysis, sports medicine, in the detection of designer drugs; and biological research for metabolomics. Data analysis is complicated, far from automated and can result in high false positive or false negative rates. We have demonstrated a step-wise TD/GC-MS technique that removes more volatile compounds from a sample before extracting the less volatile compounds. This creates an additional dimension of separation before the GC column, while simultaneously generating three-way data. Sandia's proven multivariate analysis methods, when applied to these data, have several advantages over current commercial options. It also has demonstrated potential for success in finding and enabling identification of trace compounds. Several challenges remain, however, including understanding the sources of noise in the data, outlier detection, improving the data pretreatment and analysis methods, developing a software tool for ease of use by the chemist, and demonstrating our belief that this multivariate analysis will enable superior differentiation capabilities. In addition, noise and system artifacts challenge the analysis of GC-MS data collected on lower cost equipment, ubiquitous in commercial laboratories. This research has the potential to affect many areas of analytical chemistry including materials analysis, medical testing, and environmental surveillance. It could also provide a method to measure adsorption parameters for chemical interactions on various surfaces by measuring desorption as a function of temperature for mixtures. We have presented results of a novel method for examining offgas products of a common PDMS material. Our method involves utilizing a stepped TD/GC-MS data acquisition scheme that may be almost totally automated, coupled with multivariate analysis schemes. This method of data generation and analysis can be applied to a number of materials aging and thermal degradation studies.« less
Tang, Rongnian; Chen, Xupeng; Li, Chuang
2018-05-01
Near-infrared spectroscopy is an efficient, low-cost technology that has potential as an accurate method in detecting the nitrogen content of natural rubber leaves. Successive projections algorithm (SPA) is a widely used variable selection method for multivariate calibration, which uses projection operations to select a variable subset with minimum multi-collinearity. However, due to the fluctuation of correlation between variables, high collinearity may still exist in non-adjacent variables of subset obtained by basic SPA. Based on analysis to the correlation matrix of the spectra data, this paper proposed a correlation-based SPA (CB-SPA) to apply the successive projections algorithm in regions with consistent correlation. The result shows that CB-SPA can select variable subsets with more valuable variables and less multi-collinearity. Meanwhile, models established by the CB-SPA subset outperform basic SPA subsets in predicting nitrogen content in terms of both cross-validation and external prediction. Moreover, CB-SPA is assured to be more efficient, for the time cost in its selection procedure is one-twelfth that of the basic SPA.
Marzaro, Giovanni; Ferrarese, Alessandro; Chilin, Adriana
2014-08-01
The selection of the most appropriate protein conformation is a crucial aspect in molecular docking experiments. In order to reduce the errors arising from the use of a single protein conformation, several authors suggest the use of several tridimensional structures for the target. However, the selection of the most appropriate protein conformations still remains a challenging goal. The protein 3D-structures selection is mainly performed based on pairwise root-mean-square-deviation (RMSD) values computation, followed by hierarchical clustering. Herein we report an alternative strategy, based on the computation of only two atom affinity map for each protein conformation, followed by multivariate analysis and hierarchical clustering. This methodology was applied on seven different kinases of pharmaceutical interest. The comparison with the classical RMSD-based strategy was based on cross-docking of co-crystallized ligands. In the case of epidermal growth factor receptor kinase, also the docking performance on 220 known ligands were evaluated, followed by 3D-QSAR studies. In all the cases, the herein proposed methodology outperformed the RMSD-based one.
McNamee, Daniel; Liljeholm, Mimi; Zika, Ondrej; O'Doherty, John P
2015-03-04
While there is accumulating evidence for the existence of distinct neural systems supporting goal-directed and habitual action selection in the mammalian brain, much less is known about the nature of the information being processed in these different brain regions. Associative learning theory predicts that brain systems involved in habitual control, such as the dorsolateral striatum, should contain stimulus and response information only, but not outcome information, while regions involved in goal-directed action, such as ventromedial and dorsolateral prefrontal cortex and dorsomedial striatum, should be involved in processing information about outcomes as well as stimuli and responses. To test this prediction, human participants underwent fMRI while engaging in a binary choice task designed to enable the separate identification of these different representations with a multivariate classification analysis approach. Consistent with our predictions, the dorsolateral striatum contained information about responses but not outcomes at the time of an initial stimulus, while the regions implicated in goal-directed action selection contained information about both responses and outcomes. These findings suggest that differential contributions of these regions to habitual and goal-directed behavioral control may depend in part on basic differences in the type of information that these regions have access to at the time of decision making. Copyright © 2015 the authors 0270-6474/15/353764-08$15.00/0.
2011-01-01
Background In Taiwan, there is a high incidence of breast cancer and a high prevalence of viral hepatitis. In this case-control study, we used a population-based insurance dataset to evaluate whether breast cancer in women is associated with chronic viral hepatitis infection. Methods From the claims data, we identified 1,958 patients with newly diagnosed breast cancer during the period 2000-2008. A randomly selected, age-matched cohort of 7,832 subjects without cancer was selected for comparison. Multivariable logistic regression models were constructed to calculate odds ratios of breast cancer associated with viral hepatitis after adjustment for age, residential area, occupation, urbanization, and income. The age-specific (<50 years and ≥50 years) risk of breast cancer was also evaluated. Results There were no significant differences in the prevalence of hepatitis C virus (HCV) infection, hepatitis B virus (HBV), or the prevalence of combined HBC/HBV infection between breast cancer patients and control subjects (p = 0.48). Multivariable logistic regression analysis, however, revealed that age <50 years was associated with a 2-fold greater risk of developing breast cancer (OR = 2.03, 95% CI = 1.23-3.34). Conclusions HCV infection, but not HBV infection, appears to be associated with early onset risk of breast cancer in areas endemic for HCV and HBV. This finding needs to be replicated in further studies. PMID:22115285
ERIC Educational Resources Information Center
Jones, Gerald L.; Westen, Risdon J.
The multivariate approach of canonical correlation was used to assess selection procedures of the Air Force Academy. It was felt that improved student selection methods might reduce the number of dropouts while maintaining or improving the quality of graduates. The method of canonical correlation was designed to maximize prediction of academic…
Evolutionary rates for multivariate traits: the role of selection and genetic variation
Pitchers, William; Wolf, Jason B.; Tregenza, Tom; Hunt, John; Dworkin, Ian
2014-01-01
A fundamental question in evolutionary biology is the relative importance of selection and genetic architecture in determining evolutionary rates. Adaptive evolution can be described by the multivariate breeders' equation (), which predicts evolutionary change for a suite of phenotypic traits () as a product of directional selection acting on them (β) and the genetic variance–covariance matrix for those traits (G). Despite being empirically challenging to estimate, there are enough published estimates of G and β to allow for synthesis of general patterns across species. We use published estimates to test the hypotheses that there are systematic differences in the rate of evolution among trait types, and that these differences are, in part, due to genetic architecture. We find some evidence that sexually selected traits exhibit faster rates of evolution compared with life-history or morphological traits. This difference does not appear to be related to stronger selection on sexually selected traits. Using numerous proposed approaches to quantifying the shape, size and structure of G, we examine how these parameters relate to one another, and how they vary among taxonomic and trait groupings. Despite considerable variation, they do not explain the observed differences in evolutionary rates. PMID:25002697
Hou, Deyi; O'Connor, David; Nathanail, Paul; Tian, Li; Ma, Yan
2017-12-01
Heavy metal soil contamination is associated with potential toxicity to humans or ecotoxicity. Scholars have increasingly used a combination of geographical information science (GIS) with geostatistical and multivariate statistical analysis techniques to examine the spatial distribution of heavy metals in soils at a regional scale. A review of such studies showed that most soil sampling programs were based on grid patterns and composite sampling methodologies. Many programs intended to characterize various soil types and land use types. The most often used sampling depth intervals were 0-0.10 m, or 0-0.20 m, below surface; and the sampling densities used ranged from 0.0004 to 6.1 samples per km 2 , with a median of 0.4 samples per km 2 . The most widely used spatial interpolators were inverse distance weighted interpolation and ordinary kriging; and the most often used multivariate statistical analysis techniques were principal component analysis and cluster analysis. The review also identified several determining and correlating factors in heavy metal distribution in soils, including soil type, soil pH, soil organic matter, land use type, Fe, Al, and heavy metal concentrations. The major natural and anthropogenic sources of heavy metals were found to derive from lithogenic origin, roadway and transportation, atmospheric deposition, wastewater and runoff from industrial and mining facilities, fertilizer application, livestock manure, and sewage sludge. This review argues that the full potential of integrated GIS and multivariate statistical analysis for assessing heavy metal distribution in soils on a regional scale has not yet been fully realized. It is proposed that future research be conducted to map multivariate results in GIS to pinpoint specific anthropogenic sources, to analyze temporal trends in addition to spatial patterns, to optimize modeling parameters, and to expand the use of different multivariate analysis tools beyond principal component analysis (PCA) and cluster analysis (CA). Copyright © 2017 Elsevier Ltd. All rights reserved.
Multi-criteria evaluation of CMIP5 GCMs for climate change impact analysis
NASA Astrophysics Data System (ADS)
Ahmadalipour, Ali; Rana, Arun; Moradkhani, Hamid; Sharma, Ashish
2017-04-01
Climate change is expected to have severe impacts on global hydrological cycle along with food-water-energy nexus. Currently, there are many climate models used in predicting important climatic variables. Though there have been advances in the field, there are still many problems to be resolved related to reliability, uncertainty, and computing needs, among many others. In the present work, we have analyzed performance of 20 different global climate models (GCMs) from Climate Model Intercomparison Project Phase 5 (CMIP5) dataset over the Columbia River Basin (CRB) in the Pacific Northwest USA. We demonstrate a statistical multicriteria approach, using univariate and multivariate techniques, for selecting suitable GCMs to be used for climate change impact analysis in the region. Univariate methods includes mean, standard deviation, coefficient of variation, relative change (variability), Mann-Kendall test, and Kolmogorov-Smirnov test (KS-test); whereas multivariate methods used were principal component analysis (PCA), singular value decomposition (SVD), canonical correlation analysis (CCA), and cluster analysis. The analysis is performed on raw GCM data, i.e., before bias correction, for precipitation and temperature climatic variables for all the 20 models to capture the reliability and nature of the particular model at regional scale. The analysis is based on spatially averaged datasets of GCMs and observation for the period of 1970 to 2000. Ranking is provided to each of the GCMs based on the performance evaluated against gridded observational data on various temporal scales (daily, monthly, and seasonal). Results have provided insight into each of the methods and various statistical properties addressed by them employed in ranking GCMs. Further; evaluation was also performed for raw GCM simulations against different sets of gridded observational dataset in the area.
Comparison of connectivity analyses for resting state EEG data
NASA Astrophysics Data System (ADS)
Olejarczyk, Elzbieta; Marzetti, Laura; Pizzella, Vittorio; Zappasodi, Filippo
2017-06-01
Objective. In the present work, a nonlinear measure (transfer entropy, TE) was used in a multivariate approach for the analysis of effective connectivity in high density resting state EEG data in eyes open and eyes closed. Advantages of the multivariate approach in comparison to the bivariate one were tested. Moreover, the multivariate TE was compared to an effective linear measure, i.e. directed transfer function (DTF). Finally, the existence of a relationship between the information transfer and the level of brain synchronization as measured by phase synchronization value (PLV) was investigated. Approach. The comparison between the connectivity measures, i.e. bivariate versus multivariate TE, TE versus DTF, TE versus PLV, was performed by means of statistical analysis of indexes based on graph theory. Main results. The multivariate approach is less sensitive to false indirect connections with respect to the bivariate estimates. The multivariate TE differentiated better between eyes closed and eyes open conditions compared to DTF. Moreover, the multivariate TE evidenced non-linear phenomena in information transfer, which are not evidenced by the use of DTF. We also showed that the target of information flow, in particular the frontal region, is an area of greater brain synchronization. Significance. Comparison of different connectivity analysis methods pointed to the advantages of nonlinear methods, and indicated a relationship existing between the flow of information and the level of synchronization of the brain.
Motion Direction Biases and Decoding in Human Visual Cortex
Wang, Helena X.; Merriam, Elisha P.; Freeman, Jeremy
2014-01-01
Functional magnetic resonance imaging (fMRI) studies have relied on multivariate analysis methods to decode visual motion direction from measurements of cortical activity. Above-chance decoding has been commonly used to infer the motion-selective response properties of the underlying neural populations. Moreover, patterns of reliable response biases across voxels that underlie decoding have been interpreted to reflect maps of functional architecture. Using fMRI, we identified a direction-selective response bias in human visual cortex that: (1) predicted motion-decoding accuracy; (2) depended on the shape of the stimulus aperture rather than the absolute direction of motion, such that response amplitudes gradually decreased with distance from the stimulus aperture edge corresponding to motion origin; and 3) was present in V1, V2, V3, but not evident in MT+, explaining the higher motion-decoding accuracies reported previously in early visual cortex. These results demonstrate that fMRI-based motion decoding has little or no dependence on the underlying functional organization of motion selectivity. PMID:25209297
Lee, Tsair-Fwu; Liou, Ming-Hsiang; Huang, Yu-Jie; Chao, Pei-Ju; Ting, Hui-Min; Lee, Hsiao-Yi
2014-01-01
To predict the incidence of moderate-to-severe patient-reported xerostomia among head and neck squamous cell carcinoma (HNSCC) and nasopharyngeal carcinoma (NPC) patients treated with intensity-modulated radiotherapy (IMRT). Multivariable normal tissue complication probability (NTCP) models were developed by using quality of life questionnaire datasets from 152 patients with HNSCC and 84 patients with NPC. The primary endpoint was defined as moderate-to-severe xerostomia after IMRT. The numbers of predictive factors for a multivariable logistic regression model were determined using the least absolute shrinkage and selection operator (LASSO) with bootstrapping technique. Four predictive models were achieved by LASSO with the smallest number of factors while preserving predictive value with higher AUC performance. For all models, the dosimetric factors for the mean dose given to the contralateral and ipsilateral parotid gland were selected as the most significant predictors. Followed by the different clinical and socio-economic factors being selected, namely age, financial status, T stage, and education for different models were chosen. The predicted incidence of xerostomia for HNSCC and NPC patients can be improved by using multivariable logistic regression models with LASSO technique. The predictive model developed in HNSCC cannot be generalized to NPC cohort treated with IMRT without validation and vice versa. PMID:25163814
Determinants of orphan drugs prices in France: a regression analysis.
Korchagina, Daria; Millier, Aurelie; Vataire, Anne-Lise; Aballea, Samuel; Falissard, Bruno; Toumi, Mondher
2017-04-21
The introduction of the orphan drug legislation led to the increase in the number of available orphan drugs, but the access to them is often limited due to the high price. Social preferences regarding funding orphan drugs as well as the criteria taken into consideration while setting the price remain unclear. The study aimed at identifying the determinant of orphan drug prices in France using a regression analysis. All drugs with a valid orphan designation at the moment of launch for which the price was available in France were included in the analysis. The selection of covariates was based on a literature review and included drug characteristics (Anatomical Therapeutic Chemical (ATC) class, treatment line, age of target population), diseases characteristics (severity, prevalence, availability of alternative therapeutic options), health technology assessment (HTA) details (actual benefit (AB) and improvement in actual benefit (IAB) scores, delay between the HTA and commercialisation), and study characteristics (type of study, comparator, type of endpoint). The main data sources were European public assessment reports, HTA reports, summaries of opinion on orphan designation of the European Medicines Agency, and the French insurance database of drugs and tariffs. A generalized regression model was developed to test the association between the annual treatment cost and selected covariates. A total of 68 drugs were included. The mean annual treatment cost was €96,518. In the univariate analysis, the ATC class (p = 0.01), availability of alternative treatment options (p = 0.02) and the prevalence (p = 0.02) showed a significant correlation with the annual cost. The multivariate analysis demonstrated significant association between the annual cost and availability of alternative treatment options, ATC class, IAB score, type of comparator in the pivotal clinical trial, as well as commercialisation date and delay between the HTA and commercialisation. The orphan drug pricing is a multivariate phenomenon. The complex association between drug prices and the studied attributes and shows that payers integrate multiple variables in decision making when setting orphan drug prices. The interpretation of the study results is limited by the small sample size and the complex data structure.
Perception of olive oils sensory defects using a potentiometric taste device.
Veloso, Ana C A; Silva, Lucas M; Rodrigues, Nuno; Rebello, Ligia P G; Dias, Luís G; Pereira, José A; Peres, António M
2018-01-01
The capability of perceiving olive oils sensory defects and intensities plays a key role on olive oils quality grade classification since olive oils can only be classified as extra-virgin if no defect can be perceived by a human trained sensory panel. Otherwise, olive oils may be classified as virgin or lampante depending on the median intensity of the defect predominantly perceived and on the physicochemical levels. However, sensory analysis is time-consuming and requires an official sensory panel, which can only evaluate a low number of samples per day. In this work, the potential use of an electronic tongue as a taste sensor device to identify the defect predominantly perceived in olive oils was evaluated. The potentiometric profiles recorded showed that intra- and inter-day signal drifts could be neglected (i.e., relative standard deviations lower than 25%), being not statistically significant the effect of the analysis day on the overall recorded E-tongue sensor fingerprints (P-value = 0.5715, for multivariate analysis of variance using Pillai's trace test), which significantly differ according to the olive oils' sensory defect (P-value = 0.0084, for multivariate analysis of variance using Pillai's trace test). Thus, a linear discriminant model based on 19 potentiometric signal sensors, selected by the simulated annealing algorithm, could be established to correctly predict the olive oil main sensory defect (fusty, rancid, wet-wood or winey-vinegary) with average sensitivity of 75 ± 3% and specificity of 73 ± 4% (repeated K-fold cross-validation variant: 4 folds×10 repeats). Similarly, a linear discriminant model, based on 24 selected sensors, correctly classified 92 ± 3% of the olive oils as virgin or lampante, being an average specificity of 93 ± 3% achieved. The overall satisfactory predictive performances strengthen the feasibility of the developed taste sensor device as a complementary methodology for olive oils' defects analysis and subsequent quality grade classification. Furthermore, the capability of identifying the type of sensory defect of an olive oil may allow establishing helpful insights regarding bad practices of olives or olive oils production, harvesting, transport and storage. Copyright © 2017 Elsevier B.V. All rights reserved.
MULTIVARIATE CURVE RESOLUTION OF NMR SPECTROSCOPY METABONOMIC DATA
Sandia National Laboratories is working with the EPA to evaluate and develop mathematical tools for analysis of the collected NMR spectroscopy data. Initially, we have focused on the use of Multivariate Curve Resolution (MCR) also known as molecular factor analysis (MFA), a tech...
Characterizing multivariate decoding models based on correlated EEG spectral features
McFarland, Dennis J.
2013-01-01
Objective Multivariate decoding methods are popular techniques for analysis of neurophysiological data. The present study explored potential interpretative problems with these techniques when predictors are correlated. Methods Data from sensorimotor rhythm-based cursor control experiments was analyzed offline with linear univariate and multivariate models. Features were derived from autoregressive (AR) spectral analysis of varying model order which produced predictors that varied in their degree of correlation (i.e., multicollinearity). Results The use of multivariate regression models resulted in much better prediction of target position as compared to univariate regression models. However, with lower order AR features interpretation of the spectral patterns of the weights was difficult. This is likely to be due to the high degree of multicollinearity present with lower order AR features. Conclusions Care should be exercised when interpreting the pattern of weights of multivariate models with correlated predictors. Comparison with univariate statistics is advisable. Significance While multivariate decoding algorithms are very useful for prediction their utility for interpretation may be limited when predictors are correlated. PMID:23466267
Drunk driving detection based on classification of multivariate time series.
Li, Zhenlong; Jin, Xue; Zhao, Xiaohua
2015-09-01
This paper addresses the problem of detecting drunk driving based on classification of multivariate time series. First, driving performance measures were collected from a test in a driving simulator located in the Traffic Research Center, Beijing University of Technology. Lateral position and steering angle were used to detect drunk driving. Second, multivariate time series analysis was performed to extract the features. A piecewise linear representation was used to represent multivariate time series. A bottom-up algorithm was then employed to separate multivariate time series. The slope and time interval of each segment were extracted as the features for classification. Third, a support vector machine classifier was used to classify driver's state into two classes (normal or drunk) according to the extracted features. The proposed approach achieved an accuracy of 80.0%. Drunk driving detection based on the analysis of multivariate time series is feasible and effective. The approach has implications for drunk driving detection. Copyright © 2015 Elsevier Ltd and National Safety Council. All rights reserved.
Melucci, Dora; Bendini, Alessandra; Tesini, Federica; Barbieri, Sara; Zappi, Alessandro; Vichi, Stefania; Conte, Lanfranco; Gallina Toschi, Tullia
2016-08-01
At present, the geographical origin of extra virgin olive oils can be ensured by documented traceability, although chemical analysis may add information that is useful for possible confirmation. This preliminary study investigated the effectiveness of flash gas chromatography electronic nose and multivariate data analysis to perform rapid screening of commercial extra virgin olive oils characterized by a different geographical origin declared in the label. A comparison with solid phase micro extraction coupled to gas chromatography mass spectrometry was also performed. The new method is suitable to verify the geographic origin of extra virgin olive oils based on principal components analysis and discriminant analysis applied to the volatile profile of the headspace as a fingerprint. The selected variables were suitable in discriminating between "100% Italian" and "non-100% Italian" oils. Partial least squares discriminant analysis also allowed prediction of the degree of membership of unknown samples to the classes examined. Copyright © 2016. Published by Elsevier Ltd.
PROSPECT Eligibility and Clinical Outcomes: Results From the Pan-Canadian Rectal Cancer Consortium.
Bossé, Dominick; Mercer, Jamison; Raissouni, Soundouss; Dennis, Kristopher; Goodwin, Rachel; Jiang, Di; Powell, Erin; Kumar, Aalok; Lee-Ying, Richard; Price-Hiller, Julie; Heng, Daniel Y C; Tang, Patricia A; MacLean, Anthony; Cheung, Winson Y; Vickers, Michael M
2016-09-01
The PROSPECT trial (N1048) is evaluating the selective use of chemoradiation in patients with cT2N1 and cT3N0-1 rectal cancer undergoing sphincter-sparing low anterior resection. We evaluated outcomes of PROSPECT-eligible and -ineligible patients from a multi-institutional database. Data from patients with locally advanced rectal cancer who received chemoradiation and low anterior resection from 2005 to 2014 were retrospectively collected from 5 Canadian centers. Overall survival, disease-free survival (DFS), recurrence-free survival (RFS), and time to local recurrence (LR) were estimated using the Kaplan-Meier method, and a multivariate analysis was performed adjusting for prognostic factors. A total of 566 (37%) of 1531 patients met the PROSPECT eligibility criteria. Eligible patients were more likely to have better PS (P = .0003) and negative circumferential resection margin (P < .0001). PROSPECT eligibility was associated with improved DFS (hazard ratio [HR], 0.75; 95% confidence interval [CI], 0.61-0.91), overall survival (HR, 0.73; 95% CI, 0.57-0.95), and RFS (HR, 0.68; 95% CI, 0.54-0.86) in univariate analyses. In multivariate analysis, only RFS remained significantly improved for PROSPECT-eligible patients (HR, 0.75; 95% CI, 0.57-1.00, P = .0499). The 3-year DFS and freedom from LR for PROSPECT-eligible patients were 79.1% and 97.4%, respectively, compared to 71.1% and 96.8% for PROSPECT-ineligible patients. Real-world data corroborate the eligibility criteria used in the PROSPECT study; the criteria identify a subgroup of patients in whom risk of recurrence is lower and in whom selective use of chemoradiation should be actively examined. Copyright © 2016 Elsevier Inc. All rights reserved.
Kaseda, Kaoru; Asakura, Keisuke; Kazama, Akio; Ozawa, Yukihiko
2016-12-01
Lymph nodes in patients with non-small cell lung cancer (NSCLC) are often staged using integrated 18F-fluorodeoxyglucose positron emission tomography/computed tomography (FDG-PET/CT). However, this modality has limited ability to detect micrometastases. We aimed to define risk factors for occult lymph node metastasis in patients with clinical stage I NSCLC diagnosed by preoperative integrated FDG-PET/CT. We retrospectively reviewed the records of 246 patients diagnosed with clinical stage I NSCLC based on integrated FDG-PET/CT between April 2007 and May 2015. All patients were treated by complete surgical resection. The prevalence of occult lymph node metastasis in patients with clinical stage I NSCLC was analysed according to clinicopathological factors. Risk factors for occult lymph node metastasis were defined using univariate and multivariate analyses. Occult lymph node metastasis was detected in 31 patients (12.6 %). Univariate analysis revealed CEA (P = 0.04), SUV max of the primary tumour (P = 0.031), adenocarcinoma (P = 0.023), tumour size (P = 0.002) and pleural invasion (P = 0.046) as significant predictors of occult lymph node metastasis. Multivariate analysis selected SUV max of the primary tumour (P = 0.049), adenocarcinoma (P = 0.003) and tumour size (P = 0.019) as independent predictors of occult lymph node metastasis. The SUV max of the primary tumour, adenocarcinoma and tumour size were risk factors for occult lymph node metastasis in patients with NSCLC diagnosed as clinical stage I by preoperative integrated FDG-PET/CT. These findings would be helpful in selecting candidates for mediastinoscopy or endobronchial ultrasound-guided transbronchial needle aspiration.
Prevalence of kidney stones and associated risk factors in the Shunyi District of Beijing, China.
Jiang, Y G; He, L H; Luo, G T; Zhang, X D
2017-10-01
Kidney stone formation is a multifactorial condition that involves interaction of environmental and genetic factors. Presence of kidney stones is strongly related to other diseases, which may result in a heavy economic and social burden. Clinical data on the prevalence and influencing factors in kidney stone disease in the north of China are scarce. In this study, we explored the prevalence of kidney stone and potentially associated risk factors in the Shunyi District of Beijing, China. A population-based cross-sectional study was conducted from December 2011 to November 2012 in a northern area of China. Participants were interviewed in randomly selected towns. Univariate analysis of continuous and categorical variables was first performed by calculation of Spearman's correlation coefficient and Pearson Chi squared value, respectively. Variables with statistical significance were further analysed by multivariate logistic regression to explore the potential influencing factors. A total of 3350 participants (1091 males and 2259 females) completed the survey and the response rate was 99.67%. Among the participants, 3.61% were diagnosed with kidney stone. Univariate analysis showed that significant differences were evident in 31 variables. Blood and urine tests were performed in 100 randomly selected patients with kidney stone and 100 healthy controls. Serum creatinine, calcium, and uric acid were significantly different between the patients with kidney stone and healthy controls. Multivariate logistic regression revealed that being male (odds ratio=102.681; 95% confidence interval, 1.062-9925.797), daily intake of white spirits (6.331; 1.204-33.282), and a history of urolithiasis (1797.775; 24.228-133 396.982) were factors potentially associated with kidney stone prevalence. Male gender, drinking white spirits, and a history of urolithiasis are potentially associated with kidney stone formation.
Wang, Cun; Huang, Qiaorong; Meng, Wentong; Yu, Yongyang; Yang, Lie; Peng, Zhihai; Hu, Jiankun; Li, Yuan; Mo, Xianming; Zhou, Zongguang
2016-01-01
Introduction Liver is the most common site of distant metastasis in colorectal cancer (CRC). Early diagnosis and appropriate treatment selection decides overall prognosis of patients. However, current diagnostic measures were basically imaging but not functional. Circulating tumor cells (CTCs) known as hold the key to understand the biology of metastatic mechanism provide a novel and auxiliary diagnostic strategy for CRC with liver metastasis (CRC-LM). Results The expression of CD133+ and CD133+CD54+CD44+ cellular subpopulations were higher in the peripheral blood of CRC-LM patients when compared with those without metastasis (P<0.001). Multivariate analysis proved the association between the expression of CD133+CD44+CD54+ cellular subpopulation and the existence of CRC-LM (P<0.001). The combination of abdominal CT/MRI, CEA and the CD133+CD44+CD54+ cellular subpopulation showed increased detection and discrimination rate for liver metastasis, with a sensitivity of 88.2% and a specificity of 92.4%. Meanwhile, it also show accurate predictive value for liver metastasis (OR=2.898, 95% C.I.1.374–6.110). Materials and Method Flow cytometry and multivariate analysis was performed to detect the expression of cancer initiating cells the correlation between cellular subpopulations and liver metastasis in patients with CRC. The receiver operating characteristic curves combined with the area under the curve were generated to compare the predictive ability of the cellular subpopulation for liver metastasis with current CT and MRI images. Conclusions The identification, expression and application of CTC subpopulations will provide an ideal cellular predictive marker for CRC liver metastasis and a potential marker for further investigation. PMID:27764803
Zhou, Jia; Ma, Yinghua; Ma, Jun; Zou, Zhiyong; Meng, Xiangkun; Tao, Fangbiao; Luo, Chunyan; Jing, Jin; Pan, Dehong; Luo, Jiayou; Zhang, Xin; Wang, Hong; Zhao, Haiping
2016-01-01
To understand the prevalence of myopia in primary and middle school students in 6 provinces and the possible influencing factors. Primary and middle school students were selected through multistage cluster sampling in 60 primary and middle schools in 6 provinces in China. The questionnaire survey and eyesight test were conducted among all the students selected according to the national student's physique and health survey protocol. Pearson chi-square test and binary multivariate logistic regression analysis were done to identify the influencing factors for myopia in students. The prevalence of myopia among primary and middle school students surveyed was 55.7%, the gender specific difference was statistically significant (59.7% for girls, 51.9% for boys) (P<0.01). The prevalence of myopia increased with age obviously. The prevalence was 35.8% in age group 6-8 years, 58.9% in age group 10-12 years, 73.4% in age group 13-15 years and 81.2% in age group 16-18 years, the differences were statistically significant (P<0.001). Single factor and multivariate analysis showed that parents' myopia, distance between computer screen and eyes, distance less than 30 cm between eyes and book while reading, distance less than 10 cm between chest and the table edge while studying, distance less than 3 cm between fingers and pen tip, sleep time, average outdoor activity time during last week, school sport activities in the afternoon, the size of television set at home, time spent on watching TV and playing computer were the influencing factors for myopia. The prevalence of myopia is till high in primary and middle school students. Myopia is associated with both genetic factors and individual eye health related behaviors.
Lungscape: resected non-small-cell lung cancer outcome by clinical and pathological parameters.
Peters, Solange; Weder, Walter; Dafni, Urania; Kerr, Keith M; Bubendorf, Lukas; Meldgaard, Peter; O'Byrne, Kenneth J; Wrona, Anna; Vansteenkiste, Johan; Felip, Enriqueta; Marchetti, Antonio; Savic, Spasenija; Lu, Shun; Smit, Egbert; Dingemans, Anne-Marie; Blackhall, Fiona H; Baas, Paul; Camps, Carlos; Rosell, Rafael; Stahel, Rolf A
2014-11-01
The Lungscape project was designed to address the impact of clinical, pathological, and molecular characteristics on outcome in resected non-small- cell lung cancer (NSCLC). A decentralized biobank with fully annotated tissue samples was established. Selection criteria for participating centers included sufficient number of cases, tissue microarray building capability, and documented ethical approval. Patient selection was based on availability of comprehensive clinical data, radical resection between 2003 and 2009 with adequate follow-up, and adequate quantity and quality of formalin-fixed tissue. Fifteen centers contributed 2449 cases. The 5-year overall survival (OS) was 69.6% and 63.6% for stages IA and IB, 51.6% and 47.7% for stages IIA and IIB, and 29.0% and 13.0% for stages IIIA and IIIB, respectively (p < 0.001). Median and 5-year relapse-free survival (RFS) were 52.8 months and 47.3%, respectively. Distant relapse was recorded for 44.4%, local for 26.0%, and both for 16.9% of patients. Based on multivariate analysis for the OS, RFS, and time to relapse, the factors significantly associated with all of them are performance status and pathological stage. The aim of this report is to present the results from Lungscape, the first large series reporting on NSCLC surgical outcome measured not only by OS but also by RFS and time to relapse and including multivariate analysis by significant clinical and pathological prognostic parameters. As tissue from all patients is preserved locally and is available for detailed molecular investigations, Lungscape provides an excellent basis to evaluate the influence of molecular parameters on the disease outcome after radical resection, besides providing an overview of the molecular landscape of stage I to III NSCLC.
Parotid gland shrinkage during IMRT predicts the time to Xerostomia resolution.
Sanguineti, Giuseppe; Ricchetti, Francesco; Wu, Binbin; McNutt, Todd; Fiorino, Claudio
2015-01-17
To assess the impact of mid-treatment parotid gland shrinkage on long term xerostomia during IMRT for oropharyngeal SCC. All patients treated with IMRT at a single Institution from November 2007 to June 2010 and undergoing weekly CT scans were selected. Parotid glands were contoured retrospectively on the mid treatment CT scan. For each parotid gland, the percent change relative to the planning volume was calculated and combined as weighted average. Patients were considered to be xerostomic if developed GR2+ dry mouth according to CTCAE v3.0. Predictors of the time to xerostomia resolution or downgrade to 1 were investigated at both uni- and multivariate analysis. 85 patients were selected. With a median follow up of 35.8 months (range: 2.4-62.6 months), the actuarial rate of xerostomia is 26.2% (SD: 5.3%) and 15.9% (SD: 5.3%) at 2 and 3 yrs, respectively. At multivariate analysis, mid-treatment shrink along with weighted average mean parotid dose at planning and body mass index are independent predictors of the time to xerostomia resolution. Patients were pooled in 4 groups based on median values of both mid-treatment shrink (cut-off: 19.6%) and mean WA parotid pl-D (cut-off: 35.7 Gy). Patients with a higher than median parotid dose at planning and who showed poor shrinkage at mid treatment are the ones with the outcome significantly worse (3-yr rate of xerostomia ≈ 50%) than the other three subgroups (3-yr rate of xerostomia ≈ 10%). For a given planned dose, patients whose parotids significantly shrink during IMRT are less likely to be long-term supplemental fluids dependent.
Hurtado Rúa, Sandra M; Mazumdar, Madhu; Strawderman, Robert L
2015-12-30
Bayesian meta-analysis is an increasingly important component of clinical research, with multivariate meta-analysis a promising tool for studies with multiple endpoints. Model assumptions, including the choice of priors, are crucial aspects of multivariate Bayesian meta-analysis (MBMA) models. In a given model, two different prior distributions can lead to different inferences about a particular parameter. A simulation study was performed in which the impact of families of prior distributions for the covariance matrix of a multivariate normal random effects MBMA model was analyzed. Inferences about effect sizes were not particularly sensitive to prior choice, but the related covariance estimates were. A few families of prior distributions with small relative biases, tight mean squared errors, and close to nominal coverage for the effect size estimates were identified. Our results demonstrate the need for sensitivity analysis and suggest some guidelines for choosing prior distributions in this class of problems. The MBMA models proposed here are illustrated in a small meta-analysis example from the periodontal field and a medium meta-analysis from the study of stroke. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.
ERIC Educational Resources Information Center
Barton, Mitch; Yeatts, Paul E.; Henson, Robin K.; Martin, Scott B.
2016-01-01
There has been a recent call to improve data reporting in kinesiology journals, including the appropriate use of univariate and multivariate analysis techniques. For example, a multivariate analysis of variance (MANOVA) with univariate post hocs and a Bonferroni correction is frequently used to investigate group differences on multiple dependent…
Henry, Stephen G.; Jerant, Anthony; Iosif, Ana-Maria; Feldman, Mitchell D.; Cipri, Camille; Kravitz, Richard L.
2015-01-01
Objective To identify factors associated with participant consent to record visits; to estimate effects of recording on patient-clinician interactions Methods Secondary analysis of data from a randomized trial studying communication about depression; participants were asked for optional consent to audio record study visits. Multiple logistic regression was used to model likelihood of patient and clinician consent. Multivariable regression and propensity score analyses were used to estimate effects of audio recording on 6 dependent variables: discussion of depressive symptoms, preventive health, and depression diagnosis; depression treatment recommendations; visit length; visit difficulty. Results Of 867 visits involving 135 primary care clinicians, 39% were recorded. For clinicians, only working in academic settings (P=0.003) and having worked longer at their current practice (P=0.02) were associated with increased likelihood of consent. For patients, white race (P=0.002) and diabetes (P=0.03) were associated with increased likelihood of consent. Neither multivariable regression nor propensity score analyses revealed any significant effects of recording on the variables examined. Conclusion Few clinician or patient characteristics were significantly associated with consent. Audio recording had no significant effect on any dependent variables. Practice Implications Benefits of recording clinic visits likely outweigh the risks of bias in this setting. PMID:25837372
Hadrevi, Jenny; Ghafouri, Bijar; Larsson, Britt; Gerdle, Björn; Hellström, Fredrik
2013-01-01
The prevalence of chronic trapezius myalgia is high in women with high exposure to awkward working positions, repetitive movements and movements with high precision demands. The mechanisms behind chronic trapezius myalgia are not fully understood. The purpose of this study was to explore the differences in protein content between healthy and myalgic trapezius muscle using proteomics. Muscle biopsies from 12 female cleaners with work-related trapezius myalgia and 12 pain free female cleaners were obtained from the descending part of the trapezius. Proteins were separated with two-dimensional differential gel electrophoresis (2D-DIGE) and selected proteins were identified with mass spectrometry. In order to discriminate the two groups, quantified proteins were fitted to a multivariate analysis: partial least square discriminate analysis. The model separated 28 unique proteins which were related to glycolysis, the tricaboxylic acid cycle, to the contractile apparatus, the cytoskeleton and to acute response proteins. The results suggest altered metabolism, a higher abundance of proteins related to inflammation in myalgic cleaners compared to healthy, and a possible alteration of the contractile apparatus. This explorative proteomic screening of proteins related to chronic pain in the trapezius muscle provides new important aspects of the pathophysiology behind chronic trapezius myalgia. PMID:24023854
Hadrevi, Jenny; Ghafouri, Bijar; Larsson, Britt; Gerdle, Björn; Hellström, Fredrik
2013-01-01
The prevalence of chronic trapezius myalgia is high in women with high exposure to awkward working positions, repetitive movements and movements with high precision demands. The mechanisms behind chronic trapezius myalgia are not fully understood. The purpose of this study was to explore the differences in protein content between healthy and myalgic trapezius muscle using proteomics. Muscle biopsies from 12 female cleaners with work-related trapezius myalgia and 12 pain free female cleaners were obtained from the descending part of the trapezius. Proteins were separated with two-dimensional differential gel electrophoresis (2D-DIGE) and selected proteins were identified with mass spectrometry. In order to discriminate the two groups, quantified proteins were fitted to a multivariate analysis: partial least square discriminate analysis. The model separated 28 unique proteins which were related to glycolysis, the tricaboxylic acid cycle, to the contractile apparatus, the cytoskeleton and to acute response proteins. The results suggest altered metabolism, a higher abundance of proteins related to inflammation in myalgic cleaners compared to healthy, and a possible alteration of the contractile apparatus. This explorative proteomic screening of proteins related to chronic pain in the trapezius muscle provides new important aspects of the pathophysiology behind chronic trapezius myalgia.
Relationship of breastfeeding self-efficacy with quality of life in Iranian breastfeeding mothers.
Mirghafourvand, Mojgan; Kamalifard, Mahin; Ranjbar, Fatemeh; Gordani, Nasrin
2017-07-20
Due to the importance of breastfeeding, we decided to conduct a study to examine the relationship between breastfeeding self-efficacy and quality of life. This study was a cross-sectional study, which was carried out on 547 breastfeeding mothers that had 2-6 months old infants. The participants were selected randomly, and the sociodemographic characteristics questionnaire, Dennis' breastfeeding self-efficacy scale, and WHO's Quality of Life (WHOQOL) questionnaire were completed through interview. The multivariate linear regression model was used for data analysis. The means (standard deviations) of breastfeeding self-efficacy score and quality of life score were 134.5 (13.3) and 67.7 (13.7), respectively. Quality of life and all of its dimensions were directly and significantly related to breastfeeding self-efficacy. According to the results of multivariate linear regression analysis, there was a relationship between breastfeeding self-efficacy and the following variables: environmental dimension of quality of life, education, spouse's age, spouse's job, average duration of previous breastfeeding period and receiving breastfeeding training. Findings showed that there is direct and significant relationship between breastfeeding self-efficacy and quality of life. Moreover, it seems that the development of appropriate training programs is necessary for improving the quality of life of pregnant women, as it consequently enhances breastfeeding self-efficacy.
Blasco-Costa, I; Pankov, P; Gibson, D I; Balbuena, J A; Raga, J A; Sarabeev, V L; Kostadinova, A
2006-09-01
Three species of the bunocotyline genus Saturnius Manter, 1969 are described from the stomach lining of mugilid fishes of the Mediterranean and Black Seas. Two of the species are new: S. minutus n. sp. occurs in Mugil cephalus off the Mediterranean coast of Spain; and S. dimitrovi n. sp., a parasite of M. cephalus off the Bulgarian Black Sea coast and the Spanish Mediterranean coast, was originally described as S. papernai by Dimitrov et al. (1998). In addition, S. papernai Overstreet, 1977 is redescribed from M. cephalus off the Spanish Mediterranean coast and from Liza aurata and L. saliens off the Bulgarian Black Sea coast. The three species are distinguished morphometrically using univariate and multivariate analyses. These results were verified using Linear Discriminant Analysis which correctly allocated all specimens to their species designations based on morphology (i.e. 100% successful classification rate) and assigned almost all specimens to the correct population (locality). The following variables were selected for optimal separation between samples: the length of the forebody, ventral sucker and posterior testis, the length and width of the posteriormost pseudosegment, and the width of the muscular flange at ventral sucker level.
De Stefano, Valerio; Za, Tommaso; Rossi, Elena; Vannucchi, Alessandro M; Ruggeri, Marco; Elli, Elena; Micò, Caterina; Tieghi, Alessia; Cacciola, Rossella R; Santoro, Cristina; Gerli, Giancarla; Guglielmelli, Paola; Pieri, Lisa; Scognamiglio, Francesca; Rodeghiero, Francesco; Pogliani, Enrico M; Finazzi, Guido; Gugliotta, Luigi; Leone, Giuseppe; Barbui, Tiziano
2010-02-01
There is evidence that leukocytosis is associated with an increased risk of first thrombosis in patients with polycythemia vera (PV) and essential thrombocythemia (ET). Whether it is a risk factor for recurrent thrombosis too is currently unknown. In the frame of a multicenter retrospective cohort study, we recruited 253 patients with PV (n = 133) or ET (n = 120), who were selected on the basis of a first arterial (70%) or venous major thrombosis (27.6%) or both (2.4%), and who were not receiving cytoreduction at the time of thrombosis. The probability of recurrent thrombosis associated with the leukocyte count recorded at the time of the first thrombosis was estimated by a receiver operating characteristic analysis and a multivariable Cox proportional hazards regression model. Thrombosis recurred in 78 patients (30.7%); multivariable analysis showed an independent risk of arterial recurrence (hazard ratio [HR] 2.16, 95% CI 1.12-4.18) in patients with a leukocyte count that was >12.4 x 10(9)/L at the time of the first thrombotic episode. The prognostic role for leukocytosis was age-related, as it was only significant in patients that were aged <60 years (HR for arterial recurrence 3.35, 95% CI 1.22-9.19).
Kou, Peng Meng; Pallassana, Narayanan; Bowden, Rebeca; Cunningham, Barry; Joy, Abraham; Kohn, Joachim; Babensee, Julia E.
2011-01-01
Dendritic cells (DCs) play a critical role in orchestrating the host responses to a wide variety of foreign antigens and are essential in maintaining immune tolerance. Distinct biomaterials have been shown to differentially affect the phenotype of DCs, which suggested that biomaterials may be used to modulate immune response towards the biologic component in combination products. The elucidation of biomaterial property-DC phenotype relationships is expected to inform rational design of immuno-modulatory biomaterials. In this study, DC response to a set of 12 polymethacrylates (pMAs) was assessed in terms of surface marker expression and cytokine profile. Principal component analysis (PCA) determined that surface carbon correlated with enhanced DC maturation, while surface oxygen was associated with an immature DC phenotype. Partial square linear regression, a multivariate modeling approach, was implemented and successfully predicted biomaterial-induced DC phenotype in terms of surface marker expression from biomaterial properties with R2prediction = 0.76. Furthermore, prediction of DC phenotype was effective based on only theoretical chemical composition of the bulk polymers with R2prediction = 0.80. These results demonstrated that immune cell response can be predicted from biomaterial properties, and computational models will expedite future biomaterial design and selection. PMID:22136715
Seroepidemiology of HBV infection in South-East of iran; a population based study.
Salehi, M; Alavian, S M; Tabatabaei, S V; Izadi, Sh; Sanei Moghaddam, E; Amini Kafi-Abad, S; Gharehbaghian, A; Khosravi, S; Abolghasemi, H
2012-05-01
Hepatitis B virus (HBV) infection is a major risk factor of cirrhosis and hepatocellular carcinoma affecting billions of people globally. Since information on its prevalence in general population is mandatory for formulating effective policies, this population based serological survey was conducted in Sistan and Baluchistan, where no previous epidemiological data were available. Using random cluster sampling 3989 healthy subjects were selected from 9 districts of Sistan and Baluchistan Province in southeastern Iran. The subjects' age ranged from 6 to 65 years old. Serum samples were tested for HBcAb, HBsAg. Screening tests were carried out by the third generation of ELISA. Various risk factors were recorded and multivariate analysis was performed. The prevalence of HBsAg and HBcAb in Sistan and Baluchistan was 3.38% (95% CI 2.85; 3.98) and 23.58% (95% CI 22.29; 24.93) respectively. We found 8 cases of positive anti-HDV antibody. Predictors of HBsAg or HBcAb in multivariate analysis were age, marital status and addiction. The rate of HBV infection in Sistan and Baluchistan was higher than other parts of Iran. Approximately 25% of general population in this province had previous exposure to HBV and 3% were HBsAg carriers. Intrafamilial and addiction were major routes of HBV transmission in this province.
Nikolić, Biljana; Martinović, Jelena; Matić, Milan; Stefanović, Đorđe
2018-05-29
Different variables determine the performance of cyclists, which brings up the question how these parameters may help in their classification by specialty. The aim of the study was to determine differences in cardiorespiratory parameters of male cyclists according to their specialty, flat rider (N=21), hill rider (N=35) and sprinter (N=20) and obtain the multivariate model for further cyclists classification by specialties, based on selected variables. Seventeen variables were measured at submaximal and maximum load on the cycle ergometer Cosmed E 400HK (Cosmed, Rome, Italy) (initial 100W with 25W increase, 90-100 rpm). Multivariate discriminant analysis was used to determine which variables group cyclists within their specialty, and to predict which variables can direct cyclists to a particular specialty. Among nine variables that statistically contribute to the discriminant power of the model, achieved power on the anaerobic threshold and the produced CO2 had the biggest impact. The obtained discriminatory model correctly classified 91.43% of flat riders, 85.71% of hill riders, while sprinters were classified completely correct (100%), i.e. 92.10% of examinees were correctly classified, which point out the strength of the discriminatory model. Respiratory indicators mostly contribute to the discriminant power of the model, which may significantly contribute to training practice and laboratory tests in future.
MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.
Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin
2015-04-01
Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with an incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Using phenotypic manipulations to study multivariate selection of floral trait associations.
Campbell, Diane R
2009-06-01
A basic theme in the study of plant-pollinator interactions is that pollinators select not just for single floral traits, but for associations of traits. Responses of pollinators to sets of traits are inherent in the idea of pollinator syndromes. In its most extreme form, selection on a suite of traits can take the form of correlational selection, in which a response to one trait depends on the value of another, thereby favouring floral integration. Despite the importance of selection for combinations of traits in the evolution of flowers, evidence is relatively sparse and relies mostly on observational approaches. Here, methods for measuring selection on multivariate suites of floral traits are presented, and the studies to date are reviewed. It is argued that phenotypic manipulations present a powerful, but rarely used, approach to teasing apart the separate and combined effects of particular traits. The approach is illustrated with data from studies of alpine plants in Colorado and New Zealand, and recommendations are made about several features of the design of such experiments. Phenotypic manipulations of two or more traits in combination provide a direct way of testing for selection of floral trait associations. Such experiments will be particularly valuable if rooted in hypotheses about differences between types of pollinators and tied to a proposed evolutionary history.
ERIC Educational Resources Information Center
Henry, Gary T.; And Others
1992-01-01
A statistical technique is presented for developing performance standards based on benchmark groups. The benchmark groups are selected using a multivariate technique that relies on a squared Euclidean distance method. For each observation unit (a school district in the example), a unique comparison group is selected. (SLD)
Mas, Sergi; Gassó, Patricia; Morer, Astrid; Calvo, Anna; Bargalló, Nuria; Lafuente, Amalia; Lázaro, Luisa
2016-01-01
We propose an integrative approach that combines structural magnetic resonance imaging data (MRI), diffusion tensor imaging data (DTI), neuropsychological data, and genetic data to predict early-onset obsessive compulsive disorder (OCD) severity. From a cohort of 87 patients, 56 with complete information were used in the present analysis. First, we performed a multivariate genetic association analysis of OCD severity with 266 genetic polymorphisms. This association analysis was used to select and prioritize the SNPs that would be included in the model. Second, we split the sample into a training set (N = 38) and a validation set (N = 18). Third, entropy-based measures of information gain were used for feature selection with the training subset. Fourth, the selected features were fed into two supervised methods of class prediction based on machine learning, using the leave-one-out procedure with the training set. Finally, the resulting model was validated with the validation set. Nine variables were used for the creation of the OCD severity predictor, including six genetic polymorphisms and three variables from the neuropsychological data. The developed model classified child and adolescent patients with OCD by disease severity with an accuracy of 0.90 in the testing set and 0.70 in the validation sample. Above its clinical applicability, the combination of particular neuropsychological, neuroimaging, and genetic characteristics could enhance our understanding of the neurobiological basis of the disorder. PMID:27093171
Hou, Ting; Sun, Yidi; Li, Junyi; Liu, Chenglin; Wang, Zhen; Li, Yixue; Lu, Yuan
2017-01-01
Most patients with early stage endometrial endometrioid adenocarcinoma (EEAC) are treated with hysterectomy and bilateral oophorectomy. But this surgical menopause leads to long-term sequelae for premenopausal women, especially for young women of childbearing age. This population-based study was to evaluate the safety of ovarian preservation in young women with stage I EEAC. Patients of age 50 or younger with stage I EEAC were explored from the Surveillance, Epidemiology and End Results program database during 2004 to 2013. Propensity score matching was used to randomize the data set and reduce the selection biases of doctors. Univariate analysis and multivariate cox proportional hazards model were utilized to estimate the safety of ovarian preservation. A total of 7183 patients were identified, and ovarian preservation was performed in 863 (12 %) patients. Compared with women treated with oophorectomy, patients with ovarian preservation significantly tend to be younger at diagnosis (P-value < 0.001) and more likely diagnosed as stage IA EEAC, to have better differentiated tumor tissues and smaller tumors, as well as less likely to undergo radiation and lymphadenectomy. 863 patients treated with oophorectomy were selected by propensity score matching. After propensity score matching, the differences of all characteristics between ovarian preservation and oophorectomy were not significant and potential confounders in the two groups decreased. In univariate analysis of matched population, ovarian preservation had no effect on overall (P-value=0.928) and cancer-specific (P-value=0.390) mortality. In propensityadjusted multivariate analysis, ovarian preservation was not significantly associated with overall (HR=0.69, 95%CI=0.41-1.68, P-value=0.611) and cancer-specific (HR=1.65, 95%CI=0.54-5.06, Pvalue= 0.379) survival. Ovarian preservation is safe for young women with stage I EEAC, which is not significantly associated with overall and cancer-specific mortality. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Croome, Kristopher P; Lee, David D; Keaveny, Andrew P; Taner, C Burcin
2016-12-01
Published reports describing the national experience with liver grafts from donation after cardiac death (DCD) donors have resulted in reservations with their widespread utilization. The present study aimed to investigate if temporal improvements in outcomes have been observed on a national level and to determine if donor and recipient selection have been modified in a fashion consistent with published data on DCD use in liver transplantation (LT). Patients undergoing DCD LT between 2003 and 2014 were obtained from the United Network of Organ Sharing Standard Transplant Analysis and Research file and divided into 3 equal eras based on the date of DCD LT: era 1 (2003-2006), era 2 (2007-2010), and era 3 (2011-2014). Improvement in graft survival was seen between era 1 and era 2 (P = 0.001) and between era 2 and era 3 (P < 0.001). Concurrently, an increase in the proportion of patients with hepatocellular carcinoma and a decrease in critically ill patients, retransplant recipients, donor age, warm ischemia time greater than 30 minutes and cold ischemic time also occurred over the same period. On multivariate analysis, significant predictors of graft survival included: recipient age, biologic MELD score, recipient on ventilator, recipient hepatitis C virus + serology, donor age and cold ischemic time. In addition, even after adjustment for all of the aforementioned variables, both era 2 (hazard ratio, 0.81; confidence interval, 0.69-0.94; P = 0.007), and era 3 (hazard ratio, 0.61; confidence interval, 0.5-0.73; P < 0.001) had a protective effect compared to era 1. The national outcomes for DCD LT have improved over the last 12 years. This change was associated with modifications in both recipient and donor selection. Furthermore, an era effect was observed, even after adjustment for all recipient and donor variables on multivariate analysis.
Comorbidity and prognosis in advanced hypopharyngeal-laryngeal cancer under combined therapy.
Montero, Elena Hernández; Trufero, Javier Martínez; Romeo, Javier Azúa; Terré, Fernando Clau
2008-01-01
The success of combined treatment in head and neck cancer resides largely in its completion, which can be compromised when the patient's general health status is precarious. The objective of this investigation was to study the role of comorbidity as a prognostic factor in a large, homogeneous population affected by locally advanced pharyngeal-laryngeal cancer, under a combined protocol treatment. The a priori hypothesis is that comorbidity strongly conditions overall survival and specific overall survival in these patients and can aid in the selection and individualization of treatments. After a 24-month follow-up, a univariate and multivariate retrospective analysis of survival and prognostic factors was performed using 14 clinical, pathological and molecular variables including the comorbidity index calculated following the Picarillo method. The settings were the Otolaryngology, Oncology and Pathology Departments of the Miguel Servet University Hospital, Zaragoza, Spain, a referral center of the National Health System. Of the original 114 patients selected, 15 were withdrawn because the tumor spread to maxillofacial areas, or due to the lack of attendance at the clinic, incomplete clinical data or coexistent primary tumors. The group under analysis consisted of the 99 remaining patients affected by stage III and IV laryngeal and/or hypopharyngeal cancers that had not received previous treatments. The main outcomes to analyze were overall survival, specific overall survival and relative risk. Overall survival at 2.5 years was 68.1% (95% CI, 57.7-78.5). Specific overall survival at 2.5 years was 74.8% (95% CI, 64.9-84.6). In the multivariate analysis, tumor staging, neoadjuvant chemotherapy response and comorbidity (RR = 1.55 and 1.44 for overall and specific overall survival, respectively) present themselves as three prognostic factors independent of overall and specific overall survival. The role of comorbidity as an independent prognostic factor in patients affected by laryngeal and/or hypopharyngeal cancer treated with chemo-radiotherapy should be taken into account in the tailoring of treatments and the improvement of therapeutic results.
Halabi, Wissam J; Kang, Celeste Y; Jafari, Mehraneh D; Nguyen, Vinh Q; Carmichael, Joseph C; Mills, Steven; Stamos, Michael J; Pigazzi, Alessio
2013-12-01
While robotic-assisted colorectal surgery (RACS) is becoming increasingly popular, data comparing its outcomes to other established techniques remain limited to small case series. Moreover, there are no large studies evaluating the trends of RACS at the national level. The Nationwide Inpatient Sample 2009-2010 was retrospectively reviewed for robotic-assisted and laparoscopic colorectal procedures performed for cancer, benign polyps, and diverticular disease. Trends in different settings, indications, and demographics were analyzed. Multivariate regression analysis was used to compare selected outcomes between RACS and conventional laparoscopic surgery (CLS). An estimated 128,288 colorectal procedures were performed through minimally invasive techniques over the study period, and RACS was used in 2.78 % of cases. From 2009 to 2010, the use of robotics increased in all hospital settings but was still more common in large, urban, and teaching hospitals. Rectal cancer was the most common indication for RACS, with a tendency toward its selective use in male patients. On multivariate analysis, robotic surgery was associated with higher hospital charges in colonic ($11,601.39; 95 % CI 6,921.82-16,280.97) and rectal cases ($12,964.90; 95 % CI 6,534.79-19,395.01), and higher rates of postoperative bleeding in colonic cases (OR = 2.15; 95 % CI 1.27- 3.65). RACS was similar to CLS with respect to length of hospital stay, morbidity, anastomotic leak, and ileus. Conversion to open surgery was significantly lower in robotic colonic and rectal procedures (0.41; 95 % CI 0.25-0.67) and (0.10; 95 % CI 0.06-0.16), respectively. The use of RACS is still limited in the United States. However, its use increased over the study period despite higher associated charges and no real advantages over laparoscopy in terms of outcome. The one advantage is lower conversion rates.
Hybrid least squares multivariate spectral analysis methods
Haaland, David M.
2002-01-01
A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following estimation or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The "hybrid" method herein means a combination of an initial classical least squares analysis calibration step with subsequent analysis by an inverse multivariate analysis method. A "spectral shape" herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The "shape" can be continuous, discontinuous, or even discrete points illustrative of the particular effect.
PAPANNA, Ramesha; BLOCK-ABRAHAM, Dana; Mann, Lovepreet K; BUHIMSCHI, Irina A.; BEBBINGTON, Michael; GARCIA, Elisa; KAHLEK, Nahla; HARMAN, Christopher; JOHNSON, Anthony; BASCHAT, Ahmet; MOISE, Kenneth J.
2014-01-01
OBJECTIVE Despite improved perinatal survival following fetoscopic laser surgery (FLS) for twin twin transfusion syndrome (TTTS), prematurity remains an important contributor to perinatal mortality and morbidity. The objective of the study was to identify risk factors for complicated preterm delivery after FLS. STUDY DESIGN Retrospective cohort study of prospectively collected data on maternal/fetal demographics and pre-operative, operative and post-operative variables of 459 patients treated in 3 U.S. fetal centers. Multivariate linear regression was performed to identify significant risk factors associated with preterm delivery, which was cross-validated using K-fold method. Multivariate logistic regression was performed to identify risk factors for early vs. late preterm delivery based on median gestational age at delivery of 32 weeks. RESULTS There were significant differences in case selection and outcomes between the centers. After controlling for the center of surgery, a multivariate analysis indicated a lower maternal age at procedure, history of previous prematurity, shortened cervical length, use of amnioinfusion, 12 Fr cannula diameter, lack of a collagen plug placement and iatrogenic preterm premature rupture of membranes (iPPROM) were significantly associated with a lower gestational age at delivery. CONCLUSION Specific fetal/maternal and operative variables are associated with preterm delivery after FLS for the treatment of TTTS. Further studies to modify some of these variables may decrease the perinatal morbidity after laser therapy. PMID:24013922
Heidema, A Geert; Thissen, Uwe; Boer, Jolanda M A; Bouwman, Freek G; Feskens, Edith J M; Mariman, Edwin C M
2009-06-01
In this study, we applied the multivariate statistical tool Partial Least Squares (PLS) to analyze the relative importance of 83 plasma proteins in relation to coronary heart disease (CHD) mortality and the intermediate end points body mass index, HDL-cholesterol and total cholesterol. From a Dutch monitoring project for cardiovascular disease risk factors, men who died of CHD between initial participation (1987-1991) and end of follow-up (January 1, 2000) (N = 44) and matched controls (N = 44) were selected. Baseline plasma concentrations of proteins were measured by a multiplex immunoassay. With the use of PLS, we identified 15 proteins with prognostic value for CHD mortality and sets of proteins associated with the intermediate end points. Subsequently, sets of proteins and intermediate end points were analyzed together by Principal Components Analysis, indicating that proteins involved in inflammation explained most of the variance, followed by proteins involved in metabolism and proteins associated with total-C. This study is one of the first in which the association of a large number of plasma proteins with CHD mortality and intermediate end points is investigated by applying multivariate statistics, providing insight in the relationships among proteins, intermediate end points and CHD mortality, and a set of proteins with prognostic value.
Analysis of private health insurance premium growth rates: 1985-1992.
Feldstein, P J; Wickizer, T M
1995-10-01
The rate of increase in health care expenditures has been a central policy concern for well over a decade, yet little empirical research has been conducted to examine expenditure growth rates. This study analyzed health insurance premium growth rates for a selected sample of 95 insured groups over the period 1985 to 1992. During this time, premiums increased by approximately 150% in nominal terms and by 45% in real terms. The observed rate of growth was not constant over time, however. The most rapid growth occurred during the years 1986 to 1989; thereafter, the rate of increase in premiums declined. Multivariate analysis was conducted to assess the effects on premium growth rates of selected variables representing insurance benefit design features, market competitive factors, insurance system factors, and group-specific factors. In addition to the percentage increase in benefit payments, other factors found to affect premium growth rates were health maintenance organization market penetration, deductible level, the coinsurance rate, and state insurance mandates. Further, this analysis suggests that the insurance underwriting cycle may play an important role in influencing insurance premium growth rates. These results support the belief that health maintenance organization induced competition has potential to control the rate of increase in health care costs.
Cai, Rui; Wang, Shisheng; Tang, Bo; Li, Yueqing; Zhao, Weijie
2018-01-01
Sea cucumber is the major tonic seafood worldwide, and geographical origin traceability is an important part of its quality and safety control. In this work, a non-destructive method for origin traceability of sea cucumber (Apostichopus japonicus) from northern China Sea and East China Sea using near infrared spectroscopy (NIRS) and multivariate analysis methods was proposed. Total fat contents of 189 fresh sea cucumber samples were determined and partial least-squares (PLS) regression was used to establish the quantitative NIRS model. The ordered predictor selection algorithm was performed to select feasible wavelength regions for the construction of PLS and identification models. The identification model was developed by principal component analysis combined with Mahalanobis distance and scaling to the first range algorithms. In the test set of the optimum PLS models, the root mean square error of prediction was 0.45, and correlation coefficient was 0.90. The correct classification rates of 100% were obtained in both identification calibration model and test model. The overall results indicated that NIRS method combined with chemometric analysis was a suitable tool for origin traceability and identification of fresh sea cucumber samples from nine origins in China. PMID:29410795
Guo, Xiuhan; Cai, Rui; Wang, Shisheng; Tang, Bo; Li, Yueqing; Zhao, Weijie
2018-01-01
Sea cucumber is the major tonic seafood worldwide, and geographical origin traceability is an important part of its quality and safety control. In this work, a non-destructive method for origin traceability of sea cucumber ( Apostichopus japonicus ) from northern China Sea and East China Sea using near infrared spectroscopy (NIRS) and multivariate analysis methods was proposed. Total fat contents of 189 fresh sea cucumber samples were determined and partial least-squares (PLS) regression was used to establish the quantitative NIRS model. The ordered predictor selection algorithm was performed to select feasible wavelength regions for the construction of PLS and identification models. The identification model was developed by principal component analysis combined with Mahalanobis distance and scaling to the first range algorithms. In the test set of the optimum PLS models, the root mean square error of prediction was 0.45, and correlation coefficient was 0.90. The correct classification rates of 100% were obtained in both identification calibration model and test model. The overall results indicated that NIRS method combined with chemometric analysis was a suitable tool for origin traceability and identification of fresh sea cucumber samples from nine origins in China.
Pion, Johan A; Fransen, Job; Deprez, Dieter N; Segers, Veerle I; Vaeyens, Roel; Philippaerts, Renaat M; Lenoir, Matthieu
2015-06-01
It was hypothesized that differences in anthropometry, physical performance, and motor coordination would be found between Belgian elite and sub-elite level female volleyball players using a retrospective analysis of test results gathered over a 5-year period. The test sample in this study consisted of 21 young female volleyball players (15.3 ± 1.5 years) who were selected to train at the Flemish Top Sports Academy for Volleyball in 2008. All players (elite, n = 13; sub-elite, n = 8) were included in the same talent development program, and the elite-level athletes were of a high to very high performance levels according to European competition level in 2013. Five multivariate analyses of variance were used. There was no significant effect of playing level on measures of anthropometry (F = 0.455, p = 0.718, (Equation is included in full-text article.)= 0.07), flexibility (F = 1.861, p = 0.188, (Equation is included in full-text article.)= 0.19), strength (F = 1.218, p = 0.355, (Equation is included in full-text article.)= 0.32); and speed and agility (F = 1.176, p = 0.350, (Equation is included in full-text article.)= 0.18). Multivariate analyses of variance revealed significant multivariate effects between playing levels for motor coordination (F = 3.470, p = 0.036, (Equation is included in full-text article.)= 0.59). A Mann-Whitney U test and a sequential discriminant analysis confirmed these results. Previous research revealed that stature and jump height are prerequisites for talent identification in female volleyball. In addition, the results show that motor coordination is an important factor in determining inclusion into the elite level in female volleyball.
Vegetation characteristics important to common songbirds in east Texas
Conner, Richard N.; Dickson, James G.; Locke, Brian A.; Segelquist, Charles A.
1983-01-01
Multivariate studies of breeding bird communities have used principal component analysis (PCA) or several-group (three or more groups) discriminant function analysis (DFA) to ordinate bird species on vegetational continua (Cody 1968, James 1971, Whitmore 1975). In community studies, high resolution of habitat requirements for individual species is not always possible with either PCA or several-group DFA. When habitat characteristics of several species are examined with a DFA the resultant axes optimally discriminate among all species simultaneously. Hence, the characteristics assigned to a particular species reflect in part the presence of other species in the analyses. A better resolution of each species' habitat requirements may be obtained from a two-group DFA, wherein habitats selected by a species are discriminated from all other available habitats. Analyses using two-group DFAs to compare habitat used by a species with habitat unused by the same species have the potential to provide an optimal frame of reference from which to examine habitat variables (Martinka 1972, Conner and Adkisson 1976, Whitmore 1981). Mathematically (DFA) it is possible to maximally separate two groups of multivariate observations with a single axis (Harner and whitmore 1977). A line drawn in three or n-dimensional space can easily be positioned to intersect two multivariate means (centroids). If three or more centroids for species are analyzed simultaneously, a single line can no longer intersect all centroids unless a perfectly linear relationship exists for the species being examined. The probability of such an occurrence is extremely low. Thus, a high degree of resolution can be realized when a two-group DFA is used to determine habitat parameters important to individual species. We have used two-group DFA to identify vegetation variable important to 12 common species of songbirds in East Texas.
Exploring connectivity with large-scale Granger causality on resting-state functional MRI.
DSouza, Adora M; Abidin, Anas Z; Leistritz, Lutz; Wismüller, Axel
2017-08-01
Large-scale Granger causality (lsGC) is a recently developed, resting-state functional MRI (fMRI) connectivity analysis approach that estimates multivariate voxel-resolution connectivity. Unlike most commonly used multivariate approaches, which establish coarse-resolution connectivity by aggregating voxel time-series avoiding an underdetermined problem, lsGC estimates voxel-resolution, fine-grained connectivity by incorporating an embedded dimension reduction. We investigate application of lsGC on realistic fMRI simulations, modeling smoothing of neuronal activity by the hemodynamic response function and repetition time (TR), and empirical resting-state fMRI data. Subsequently, functional subnetworks are extracted from lsGC connectivity measures for both datasets and validated quantitatively. We also provide guidelines to select lsGC free parameters. Results indicate that lsGC reliably recovers underlying network structure with area under receiver operator characteristic curve (AUC) of 0.93 at TR=1.5s for a 10-min session of fMRI simulations. Furthermore, subnetworks of closely interacting modules are recovered from the aforementioned lsGC networks. Results on empirical resting-state fMRI data demonstrate recovery of visual and motor cortex in close agreement with spatial maps obtained from (i) visuo-motor fMRI stimulation task-sequence (Accuracy=0.76) and (ii) independent component analysis (ICA) of resting-state fMRI (Accuracy=0.86). Compared with conventional Granger causality approach (AUC=0.75), lsGC produces better network recovery on fMRI simulations. Furthermore, it cannot recover functional subnetworks from empirical fMRI data, since quantifying voxel-resolution connectivity is not possible as consequence of encountering an underdetermined problem. Functional network recovery from fMRI data suggests that lsGC gives useful insight into connectivity patterns from resting-state fMRI at a multivariate voxel-resolution. Copyright © 2017 Elsevier B.V. All rights reserved.
Teixeira, Kelly Sivocy Sampaio; da Cruz Fonseca, Said Gonçalves; de Moura, Luís Carlos Brigido; de Moura, Mario Luís Ribeiro; Borges, Márcia Herminia Pinheiro; Barbosa, Euzébio Guimaraes; De Lima E Moura, Túlio Flávio Accioly
2018-02-05
The World Health Organization recommends that TB treatment be administered using combination therapy. The methodologies for quantifying simultaneously associated drugs are highly complex, being costly, extremely time consuming and producing chemical residues harmful to the environment. The need to seek alternative techniques that minimize these drawbacks is widely discussed in the pharmaceutical industry. Therefore, the objective of this study was to develop and validate a multivariate calibration model in association with the near infrared spectroscopy technique (NIR) for the simultaneous determination of rifampicin, isoniazid, pyrazinamide and ethambutol. These models allow the quality control of these medicines to be optimized using simple, fast, low-cost techniques that produce no chemical waste. In the NIR - PLS method, spectra readings were acquired in the 10,000-4000cm -1 range using an infrared spectrophotometer (IRPrestige - 21 - Shimadzu) with a resolution of 4cm -1 , 20 sweeps, under controlled temperature and humidity. For construction of the model, the central composite experimental design was employed on the program Statistica 13 (StatSoft Inc.). All spectra were treated by computational tools for multivariate analysis using partial least squares regression (PLS) on the software program Pirouette 3.11 (Infometrix, Inc.). Variable selections were performed by the QSAR modeling program. The models developed by NIR in association with multivariate analysis provided good prediction of the APIs for the external samples and were therefore validated. For the tablets, however, the slightly different quantitative compositions of excipients compared to the mixtures prepared for building the models led to results that were not statistically similar, despite having prediction errors considered acceptable in the literature. Copyright © 2017 Elsevier B.V. All rights reserved.
Multivariate generalized multifactor dimensionality reduction to detect gene-gene interactions
2013-01-01
Background Recently, one of the greatest challenges in genome-wide association studies is to detect gene-gene and/or gene-environment interactions for common complex human diseases. Ritchie et al. (2001) proposed multifactor dimensionality reduction (MDR) method for interaction analysis. MDR is a combinatorial approach to reduce multi-locus genotypes into high-risk and low-risk groups. Although MDR has been widely used for case-control studies with binary phenotypes, several extensions have been proposed. One of these methods, a generalized MDR (GMDR) proposed by Lou et al. (2007), allows adjusting for covariates and applying to both dichotomous and continuous phenotypes. GMDR uses the residual score of a generalized linear model of phenotypes to assign either high-risk or low-risk group, while MDR uses the ratio of cases to controls. Methods In this study, we propose multivariate GMDR, an extension of GMDR for multivariate phenotypes. Jointly analysing correlated multivariate phenotypes may have more power to detect susceptible genes and gene-gene interactions. We construct generalized estimating equations (GEE) with multivariate phenotypes to extend generalized linear models. Using the score vectors from GEE we discriminate high-risk from low-risk groups. We applied the multivariate GMDR method to the blood pressure data of the 7,546 subjects from the Korean Association Resource study: systolic blood pressure (SBP) and diastolic blood pressure (DBP). We compare the results of multivariate GMDR for SBP and DBP to the results from separate univariate GMDR for SBP and DBP, respectively. We also applied the multivariate GMDR method to the repeatedly measured hypertension status from 5,466 subjects and compared its result with those of univariate GMDR at each time point. Results Results from the univariate GMDR and multivariate GMDR in two-locus model with both blood pressures and hypertension phenotypes indicate best combinations of SNPs whose interaction has significant association with risk for high blood pressures or hypertension. Although the test balanced accuracy (BA) of multivariate analysis was not always greater than that of univariate analysis, the multivariate BAs were more stable with smaller standard deviations. Conclusions In this study, we have developed multivariate GMDR method using GEE approach. It is useful to use multivariate GMDR with correlated multiple phenotypes of interests. PMID:24565370
Shahzad, Khalid; Menon, Ashok; Turner, Paul; Ward, Jeremy; Pursnani, Kishore; Alkhaffaf, Bilal
2015-08-01
The prompt recognition of complications is essential in reducing morbidity following anti-reflux surgery. Consequently, many centres employ a policy of routine post-operative contrast studies. The study aimed to examine whether routine contrast studies more effectively recognised early post-operative complications following anti-reflux surgery compared with selective use. This was a retrospective analysis of 240 adults who had undergone primary anti-reflux surgery. Selective use of water-soluble contrast swallows was employed for 115 patients (Group 1) while 125 patients (Group 2) had routine studies. 10 (0.9%) patients from Group 1 underwent contrast studies, four (40%) of which were abnormal. Routine studies in Group 2 identified thirty-two abnormalities (27%) however the inter-group difference was not significant (p = 0.32). Only one case from group 2 required immediate re-intervention. This was not statistically significant (p = 0.78). Multivariate analysis found no significant association between selective or routine imaging and re-intervention rates. One patient from group 2 presented three days following discharge with wrap migration requiring reoperation despite a normal post-operative study. Routine use of contrast imaging following anti-reflux and hiatus hernia surgery is not necessary. It does not identify a significantly greater number of post-operative complications in comparison to selective use. Additionally, routine use of contrast studies does not ensure the diagnosis of all complications in the post-operative period. Copyright © 2015 IJS Publishing Group Limited. Published by Elsevier Ltd. All rights reserved.
D'Amico, E J; Neilands, T B; Zambarano, R
2001-11-01
Although power analysis is an important component in the planning and implementation of research designs, it is often ignored. Computer programs for performing power analysis are available, but most have limitations, particularly for complex multivariate designs. An SPSS procedure is presented that can be used for calculating power for univariate, multivariate, and repeated measures models with and without time-varying and time-constant covariates. Three examples provide a framework for calculating power via this method: an ANCOVA, a MANOVA, and a repeated measures ANOVA with two or more groups. The benefits and limitations of this procedure are discussed.
Delay differential analysis of time series.
Lainscsek, Claudia; Sejnowski, Terrence J
2015-03-01
Nonlinear dynamical system analysis based on embedding theory has been used for modeling and prediction, but it also has applications to signal detection and classification of time series. An embedding creates a multidimensional geometrical object from a single time series. Traditionally either delay or derivative embeddings have been used. The delay embedding is composed of delayed versions of the signal, and the derivative embedding is composed of successive derivatives of the signal. The delay embedding has been extended to nonuniform embeddings to take multiple timescales into account. Both embeddings provide information on the underlying dynamical system without having direct access to all the system variables. Delay differential analysis is based on functional embeddings, a combination of the derivative embedding with nonuniform delay embeddings. Small delay differential equation (DDE) models that best represent relevant dynamic features of time series data are selected from a pool of candidate models for detection or classification. We show that the properties of DDEs support spectral analysis in the time domain where nonlinear correlation functions are used to detect frequencies, frequency and phase couplings, and bispectra. These can be efficiently computed with short time windows and are robust to noise. For frequency analysis, this framework is a multivariate extension of discrete Fourier transform (DFT), and for higher-order spectra, it is a linear and multivariate alternative to multidimensional fast Fourier transform of multidimensional correlations. This method can be applied to short or sparse time series and can be extended to cross-trial and cross-channel spectra if multiple short data segments of the same experiment are available. Together, this time-domain toolbox provides higher temporal resolution, increased frequency and phase coupling information, and it allows an easy and straightforward implementation of higher-order spectra across time compared with frequency-based methods such as the DFT and cross-spectral analysis.
Arase, Shuntaro; Horie, Kanta; Kato, Takashi; Noda, Akira; Mito, Yasuhiro; Takahashi, Masatoshi; Yanagisawa, Toshinobu
2016-10-21
Multivariate curve resolution-alternating least squares (MCR-ALS) method was investigated for its potential to accelerate pharmaceutical research and development. The fast and efficient separation of complex mixtures consisting of multiple components, including impurities as well as major drug substances, remains a challenging application for liquid chromatography in the field of pharmaceutical analysis. In this paper we suggest an integrated analysis algorithm functioning on a matrix of data generated from HPLC coupled with photo-diode array detector (HPLC-PDA) and consisting of the mathematical program for the developed multivariate curve resolution method using an expectation maximization (EM) algorithm with a bidirectional exponentially modified Gaussian (BEMG) model function as a constraint for chromatograms and numerous PDA spectra aligned with time axis. The algorithm provided less than ±1.0% error between true and separated peak area values at resolution (R s ) of 0.6 using simulation data for a three-component mixture with an elution order of a/b/c with similarity (a/b)=0.8410, (b/c)=0.9123 and (a/c)=0.9809 of spectra at peak apex. This software concept provides fast and robust separation analysis even when method development efforts fail to achieve complete separation of the target peaks. Additionally, this approach is potentially applicable to peak deconvolution, allowing quantitative analysis of co-eluted compounds having exactly the same molecular weight. This is complementary to the use of LC-MS to perform quantitative analysis on co-eluted compounds using selected ions to differentiate the proportion of response attributable to each compound. Copyright © 2016 Elsevier B.V. All rights reserved.
Distinct but Overlapping Patterns of Response to Words and Faces in the Fusiform Gyrus.
Harris, Richard J; Rice, Grace E; Young, Andrew W; Andrews, Timothy J
2016-07-01
Converging evidence suggests that the fusiform gyrus is involved in the processing of both faces and words. We used fMRI to investigate the extent to which the representation of words and faces in this region of the brain is based on a common neural representation. In Experiment 1, a univariate analysis revealed regions in the fusiform gyrus that were only selective for faces and other regions that were only selective for words. However, we also found regions that showed both word-selective and face-selective responses, particularly in the left hemisphere. We then used a multivariate analysis to measure the pattern of response to faces and words. Despite the overlap in regional responses, we found distinct patterns of response to both faces and words in the left and right fusiform gyrus. In Experiment 2, fMR adaptation was used to determine whether information about familiar faces and names is integrated in the fusiform gyrus. Distinct regions of the fusiform gyrus showed adaptation to either familiar faces or familiar names. However, there was no adaptation to sequences of faces and names with the same identity. Taken together, these results provide evidence for distinct, but overlapping, neural representations for words and faces in the fusiform gyrus. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.