On the explaining-away phenomenon in multivariate latent variable models.
van Rijn, Peter; Rijmen, Frank
2015-02-01
Many probabilistic models for psychological and educational measurements contain latent variables. Well-known examples are factor analysis, item response theory, and latent class model families. We discuss what is referred to as the 'explaining-away' phenomenon in the context of such latent variable models. This phenomenon can occur when multiple latent variables are related to the same observed variable, and can elicit seemingly counterintuitive conditional dependencies between latent variables given observed variables. We illustrate the implications of explaining away for a number of well-known latent variable models by using both theoretical and real data examples. © 2014 The British Psychological Society.
A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses
ERIC Educational Resources Information Center
Vasdekis, Vassilis G. S.; Cagnone, Silvia; Moustaki, Irini
2012-01-01
The paper proposes a composite likelihood estimation approach that uses bivariate instead of multivariate marginal probabilities for ordinal longitudinal responses using a latent variable model. The model considers time-dependent latent variables and item-specific random effects to be accountable for the interdependencies of the multivariate…
Variable Importance in Multivariate Group Comparisons.
ERIC Educational Resources Information Center
Huberty, Carl J.; Wisenbaker, Joseph M.
1992-01-01
Interpretations of relative variable importance in multivariate analysis of variance are discussed, with attention to (1) latent construct definition; (2) linear discriminant function scores; and (3) grouping variable effects. Two numerical ranking methods are proposed and compared by the bootstrap approach using two real data sets. (SLD)
Multivariate Analysis of Genotype-Phenotype Association.
Mitteroecker, Philipp; Cheverud, James M; Pavlicev, Mihaela
2016-04-01
With the advent of modern imaging and measurement technology, complex phenotypes are increasingly represented by large numbers of measurements, which may not bear biological meaning one by one. For such multivariate phenotypes, studying the pairwise associations between all measurements and all alleles is highly inefficient and prevents insight into the genetic pattern underlying the observed phenotypes. We present a new method for identifying patterns of allelic variation (genetic latent variables) that are maximally associated-in terms of effect size-with patterns of phenotypic variation (phenotypic latent variables). This multivariate genotype-phenotype mapping (MGP) separates phenotypic features under strong genetic control from less genetically determined features and thus permits an analysis of the multivariate structure of genotype-phenotype association, including its dimensionality and the clustering of genetic and phenotypic variables within this association. Different variants of MGP maximize different measures of genotype-phenotype association: genetic effect, genetic variance, or heritability. In an application to a mouse sample, scored for 353 SNPs and 11 phenotypic traits, the first dimension of genetic and phenotypic latent variables accounted for >70% of genetic variation present in all 11 measurements; 43% of variation in this phenotypic pattern was explained by the corresponding genetic latent variable. The first three dimensions together sufficed to account for almost 90% of genetic variation in the measurements and for all the interpretable genotype-phenotype association. Each dimension can be tested as a whole against the hypothesis of no association, thereby reducing the number of statistical tests from 7766 to 3-the maximal number of meaningful independent tests. Important alleles can be selected based on their effect size (additive or nonadditive effect on the phenotypic latent variable). This low dimensionality of the genotype-phenotype map has important consequences for gene identification and may shed light on the evolvability of organisms. Copyright © 2016 by the Genetics Society of America.
Estimation of Latent Group Effects: Psychometric Technical Report No. 2.
ERIC Educational Resources Information Center
Mislevy, Robert J.
Conventional methods of multivariate normal analysis do not apply when the variables of interest are not observed directly, but must be inferred from fallible or incomplete data. For example, responses to mental test items may depend upon latent aptitude variables, which modeled in turn as functions of demographic effects in the population. A…
The spatial pattern of suicide in the US in relation to deprivation, fragmentation and rurality.
Congdon, Peter
2011-01-01
Analysis of geographical patterns of suicide and psychiatric morbidity has demonstrated the impact of latent ecological variables (such as deprivation, rurality). Such latent variables may be derived by conventional multivariate techniques from sets of observed indices (for example, by principal components), by composite variable methods or by methods which explicitly consider the spatial framework of areas and, in particular, the spatial clustering of latent risks and outcomes. This article considers a latent random variable approach to explaining geographical contrasts in suicide in the US; and it develops a spatial structural equation model incorporating deprivation, social fragmentation and rurality. The approach allows for such latent spatial constructs to be correlated both within and between areas. Potential effects of area ethnic mix are also included. The model is applied to male and female suicide deaths over 2002–06 in 3142 US counties.
Estimators for longitudinal latent exposure models: examining measurement model assumptions.
Sánchez, Brisa N; Kim, Sehee; Sammel, Mary D
2017-06-15
Latent variable (LV) models are increasingly being used in environmental epidemiology as a way to summarize multiple environmental exposures and thus minimize statistical concerns that arise in multiple regression. LV models may be especially useful when multivariate exposures are collected repeatedly over time. LV models can accommodate a variety of assumptions but, at the same time, present the user with many choices for model specification particularly in the case of exposure data collected repeatedly over time. For instance, the user could assume conditional independence of observed exposure biomarkers given the latent exposure and, in the case of longitudinal latent exposure variables, time invariance of the measurement model. Choosing which assumptions to relax is not always straightforward. We were motivated by a study of prenatal lead exposure and mental development, where assumptions of the measurement model for the time-changing longitudinal exposure have appreciable impact on (maximum-likelihood) inferences about the health effects of lead exposure. Although we were not particularly interested in characterizing the change of the LV itself, imposing a longitudinal LV structure on the repeated multivariate exposure measures could result in high efficiency gains for the exposure-disease association. We examine the biases of maximum likelihood estimators when assumptions about the measurement model for the longitudinal latent exposure variable are violated. We adapt existing instrumental variable estimators to the case of longitudinal exposures and propose them as an alternative to estimate the health effects of a time-changing latent predictor. We show that instrumental variable estimators remain unbiased for a wide range of data generating models and have advantages in terms of mean squared error. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
On the Power of Multivariate Latent Growth Curve Models to Detect Correlated Change
ERIC Educational Resources Information Center
Hertzog, Christopher; Lindenberger, Ulman; Ghisletta, Paolo; Oertzen, Timo von
2006-01-01
We evaluated the statistical power of single-indicator latent growth curve models (LGCMs) to detect correlated change between two variables (covariance of slopes) as a function of sample size, number of longitudinal measurement occasions, and reliability (measurement error variance). Power approximations following the method of Satorra and Saris…
MULTIVARIATE LINEAR MIXED MODELS FOR MULTIPLE OUTCOMES. (R824757)
We propose a multivariate linear mixed (MLMM) for the analysis of multiple outcomes, which generalizes the latent variable model of Sammel and Ryan. The proposed model assumes a flexible correlation structure among the multiple outcomes, and allows a global test of the impact of ...
FACTOR ANALYTIC MODELS OF CLUSTERED MULTIVARIATE DATA WITH INFORMATIVE CENSORING
This paper describes a general class of factor analytic models for the analysis of clustered multivariate data in the presence of informative missingness. We assume that there are distinct sets of cluster-level latent variables related to the primary outcomes and to the censorin...
Squeezing Interval Change From Ordinal Panel Data: Latent Growth Curves With Ordinal Outcomes
ERIC Educational Resources Information Center
Mehta, Paras D.; Neale, Michael C.; Flay, Brian R.
2004-01-01
A didactic on latent growth curve modeling for ordinal outcomes is presented. The conceptual aspects of modeling growth with ordinal variables and the notion of threshold invariance are illustrated graphically using a hypothetical example. The ordinal growth model is described in terms of 3 nested models: (a) multivariate normality of the…
Gale, Shawn D; Erickson, Lance D; Brown, Bruce L; Hedges, Dawson W
2015-01-01
Helicobacter pylori and latent toxoplasmosis are widespread diseases that have been associated with cognitive deficits and Alzheimer's disease. We sought to determine whether interactions between Helicobacter pylori and latent toxoplasmosis, age, race-ethnicity, educational attainment, economic status, and general health predict cognitive function in young and middle-aged adults. To do so, we used multivariable regression and multivariate models to analyze data obtained from the United States' National Health and Nutrition Examination Survey from the Centers for Disease Control and Prevention, which can be weighted to represent the US population. In this sample, we found that 31.6 percent of women and 36.2 percent of men of the overall sample had IgG Antibodies against Helicobacter pylori, although the seroprevalence of Helicobacter pylori varied with sociodemographic variables. There were no main effects for Helicobacter pylori or latent toxoplasmosis for any of the cognitive measures in models adjusting for age, sex, race-ethnicity, educational attainment, economic standing, and self-rated health predicting cognitive function. However, interactions between Helicobacter pylori and race-ethnicity, educational attainment, latent toxoplasmosis in the fully adjusted models predicted cognitive function. People seropositive for both Helicobacter pylori and latent toxoplasmosis - both of which appear to be common in the general population - appear to be more susceptible to cognitive deficits than are people seropositive for either Helicobacter pylori and or latent toxoplasmosis alone, suggesting a synergistic effect between these two infectious diseases on cognition in young to middle-aged adults.
Gale, Shawn D.; Erickson, Lance D.; Brown, Bruce L.; Hedges, Dawson W.
2015-01-01
Helicobacter pylori and latent toxoplasmosis are widespread diseases that have been associated with cognitive deficits and Alzheimer’s disease. We sought to determine whether interactions between Helicobacter pylori and latent toxoplasmosis, age, race-ethnicity, educational attainment, economic status, and general health predict cognitive function in young and middle-aged adults. To do so, we used multivariable regression and multivariate models to analyze data obtained from the United States’ National Health and Nutrition Examination Survey from the Centers for Disease Control and Prevention, which can be weighted to represent the US population. In this sample, we found that 31.6 percent of women and 36.2 percent of men of the overall sample had IgG Antibodies against Helicobacter pylori, although the seroprevalence of Helicobacter pylori varied with sociodemographic variables. There were no main effects for Helicobacter pylori or latent toxoplasmosis for any of the cognitive measures in models adjusting for age, sex, race-ethnicity, educational attainment, economic standing, and self-rated health predicting cognitive function. However, interactions between Helicobacter pylori and race-ethnicity, educational attainment, latent toxoplasmosis in the fully adjusted models predicted cognitive function. People seropositive for both Helicobacter pylori and latent toxoplasmosis – both of which appear to be common in the general population – appear to be more susceptible to cognitive deficits than are people seropositive for either Helicobacter pylori and or latent toxoplasmosis alone, suggesting a synergistic effect between these two infectious diseases on cognition in young to middle-aged adults. PMID:25590622
Mean Comparison: Manifest Variable versus Latent Variable
ERIC Educational Resources Information Center
Yuan, Ke-Hai; Bentler, Peter M.
2006-01-01
An extension of multiple correspondence analysis is proposed that takes into account cluster-level heterogeneity in respondents' preferences/choices. The method involves combining multiple correspondence analysis and k-means in a unified framework. The former is used for uncovering a low-dimensional space of multivariate categorical variables…
Specifying and Refining a Complex Measurement Model.
ERIC Educational Resources Information Center
Levy, Roy; Mislevy, Robert J.
This paper aims to describe a Bayesian approach to modeling and estimating cognitive models both in terms of statistical machinery and actual instrument development. Such a method taps the knowledge of experts to provide initial estimates for the probabilistic relationships among the variables in a multivariate latent variable model and refines…
On Fitting a Multivariate Two-Part Latent Growth Model
Xu, Shu; Blozis, Shelley A.; Vandewater, Elizabeth A.
2017-01-01
A 2-part latent growth model can be used to analyze semicontinuous data to simultaneously study change in the probability that an individual engages in a behavior, and if engaged, change in the behavior. This article uses a Monte Carlo (MC) integration algorithm to study the interrelationships between the growth factors of 2 variables measured longitudinally where each variable can follow a 2-part latent growth model. A SAS macro implementing Mplus is developed to estimate the model to take into account the sampling uncertainty of this simulation-based computational approach. A sample of time-use data is used to show how maximum likelihood estimates can be obtained using a rectangular numerical integration method and an MC integration method. PMID:29333054
Partial Granger causality--eliminating exogenous inputs and latent variables.
Guo, Shuixia; Seth, Anil K; Kendrick, Keith M; Zhou, Cong; Feng, Jianfeng
2008-07-15
Attempts to identify causal interactions in multivariable biological time series (e.g., gene data, protein data, physiological data) can be undermined by the confounding influence of environmental (exogenous) inputs. Compounding this problem, we are commonly only able to record a subset of all related variables in a system. These recorded variables are likely to be influenced by unrecorded (latent) variables. To address this problem, we introduce a novel variant of a widely used statistical measure of causality--Granger causality--that is inspired by the definition of partial correlation. Our 'partial Granger causality' measure is extensively tested with toy models, both linear and nonlinear, and is applied to experimental data: in vivo multielectrode array (MEA) local field potentials (LFPs) recorded from the inferotemporal cortex of sheep. Our results demonstrate that partial Granger causality can reveal the underlying interactions among elements in a network in the presence of exogenous inputs and latent variables in many cases where the existing conditional Granger causality fails.
Goold, Conor; Newberry, Ruth C
2017-01-01
Studies of animal personality attempt to uncover underlying or "latent" personality traits that explain broad patterns of behaviour, often by applying latent variable statistical models (e.g., factor analysis) to multivariate data sets. Two integral, but infrequently confirmed, assumptions of latent variable models in animal personality are: i) behavioural variables are independent (i.e., uncorrelated) conditional on the latent personality traits they reflect (local independence), and ii) personality traits are associated with behavioural variables in the same way across individuals or groups of individuals (measurement invariance). We tested these assumptions using observations of aggression in four age classes (4-10 months, 10 months-3 years, 3-6 years, over 6 years) of male and female shelter dogs (N = 4,743) in 11 different contexts. A structural equation model supported the hypothesis of two positively correlated personality traits underlying aggression across contexts: aggressiveness towards people and aggressiveness towards dogs (comparative fit index: 0.96; Tucker-Lewis index: 0.95; root mean square error of approximation: 0.03). Aggression across contexts was moderately repeatable (towards people: intraclass correlation coefficient (ICC) = 0.479; towards dogs: ICC = 0.303). However, certain contexts related to aggressiveness towards people (but not dogs) shared significant residual relationships unaccounted for by latent levels of aggressiveness. Furthermore, aggressiveness towards people and dogs in different contexts interacted with sex and age. Thus, sex and age differences in displays of aggression were not simple functions of underlying aggressiveness. Our results illustrate that the robustness of traits in latent variable models must be critically assessed before making conclusions about the effects of, or factors influencing, animal personality. Our findings are of concern because inaccurate "aggressive personality" trait attributions can be costly to dogs, recipients of aggression and society in general.
Ning, Jing; Rahbar, Mohammad H; Choi, Sangbum; Piao, Jin; Hong, Chuan; Del Junco, Deborah J; Rahbar, Elaheh; Fox, Erin E; Holcomb, John B; Wang, Mei-Cheng
2017-08-01
In comparative effectiveness studies of multicomponent, sequential interventions like blood product transfusion (plasma, platelets, red blood cells) for trauma and critical care patients, the timing and dynamics of treatment relative to the fragility of a patient's condition is often overlooked and underappreciated. While many hospitals have established massive transfusion protocols to ensure that physiologically optimal combinations of blood products are rapidly available, the period of time required to achieve a specified massive transfusion standard (e.g. a 1:1 or 1:2 ratio of plasma or platelets:red blood cells) has been ignored. To account for the time-varying characteristics of transfusions, we use semiparametric rate models for multivariate recurrent events to estimate blood product ratios. We use latent variables to account for multiple sources of informative censoring (early surgical or endovascular hemorrhage control procedures or death). The major advantage is that the distributions of latent variables and the dependence structure between the multivariate recurrent events and informative censoring need not be specified. Thus, our approach is robust to complex model assumptions. We establish asymptotic properties and evaluate finite sample performance through simulations, and apply the method to data from the PRospective Observational Multicenter Major Trauma Transfusion study.
2017-01-01
Studies of animal personality attempt to uncover underlying or “latent” personality traits that explain broad patterns of behaviour, often by applying latent variable statistical models (e.g., factor analysis) to multivariate data sets. Two integral, but infrequently confirmed, assumptions of latent variable models in animal personality are: i) behavioural variables are independent (i.e., uncorrelated) conditional on the latent personality traits they reflect (local independence), and ii) personality traits are associated with behavioural variables in the same way across individuals or groups of individuals (measurement invariance). We tested these assumptions using observations of aggression in four age classes (4–10 months, 10 months–3 years, 3–6 years, over 6 years) of male and female shelter dogs (N = 4,743) in 11 different contexts. A structural equation model supported the hypothesis of two positively correlated personality traits underlying aggression across contexts: aggressiveness towards people and aggressiveness towards dogs (comparative fit index: 0.96; Tucker-Lewis index: 0.95; root mean square error of approximation: 0.03). Aggression across contexts was moderately repeatable (towards people: intraclass correlation coefficient (ICC) = 0.479; towards dogs: ICC = 0.303). However, certain contexts related to aggressiveness towards people (but not dogs) shared significant residual relationships unaccounted for by latent levels of aggressiveness. Furthermore, aggressiveness towards people and dogs in different contexts interacted with sex and age. Thus, sex and age differences in displays of aggression were not simple functions of underlying aggressiveness. Our results illustrate that the robustness of traits in latent variable models must be critically assessed before making conclusions about the effects of, or factors influencing, animal personality. Our findings are of concern because inaccurate “aggressive personality” trait attributions can be costly to dogs, recipients of aggression and society in general. PMID:28854267
Revisiting the Dedifferentiation Hypothesis with Longitudinal Multi-Cohort Data
ERIC Educational Resources Information Center
de Frias, Cindy M.; Lovden, Martin; Lindenberger, Ulman; Nilsson, Lars-Goran
2007-01-01
The present longitudinal multi-cohort study examines whether interindividual variability in cognitive performance and change increases in old age, and whether associations among developments of different cognitive functions increase with adult age. Multivariate multiple-group latent growth modeling was applied to data from narrow cohorts separated…
Structural Equation Model Trees
ERIC Educational Resources Information Center
Brandmaier, Andreas M.; von Oertzen, Timo; McArdle, John J.; Lindenberger, Ulman
2013-01-01
In the behavioral and social sciences, structural equation models (SEMs) have become widely accepted as a modeling tool for the relation between latent and observed variables. SEMs can be seen as a unification of several multivariate analysis techniques. SEM Trees combine the strengths of SEMs and the decision tree paradigm by building tree…
Fitting and Testing Conditional Multinormal Partial Credit Models
ERIC Educational Resources Information Center
Hessen, David J.
2012-01-01
A multinormal partial credit model for factor analysis of polytomously scored items with ordered response categories is derived using an extension of the Dutch Identity (Holland in "Psychometrika" 55:5-18, 1990). In the model, latent variables are assumed to have a multivariate normal distribution conditional on unweighted sums of item…
Kinderman, Peter; Schwannauer, Matthias; Pontin, Eleanor; Tai, Sara
2013-01-01
Background Despite widespread acceptance of the ‘biopsychosocial model’, the aetiology of mental health problems has provoked debate amongst researchers and practitioners for decades. The role of psychological factors in the development of mental health problems remains particularly contentious, and to date there has not been a large enough dataset to conduct the necessary multivariate analysis of whether psychological factors influence, or are influenced by, mental health. This study reports on the first empirical, multivariate, test of the relationships between the key elements of the biospychosocial model of mental ill-health. Methods and Findings Participants were 32,827 (age 18–85 years) self-selected respondents from the general population who completed an open-access online battery of questionnaires hosted by the BBC. An initial confirmatory factor analysis was performed to assess the adequacy of the proposed factor structure and the relationships between latent and measured variables. The predictive path model was then tested whereby the latent variables of psychological processes were positioned as mediating between the causal latent variables (biological, social and circumstantial) and the outcome latent variables of mental health problems and well-being. This revealed an excellent fit to the data, S-B χ2 (3199, N = 23,397) = 126654·8, p<·001; RCFI = ·97; RMSEA = ·04 (·038–·039). As hypothesised, a family history of mental health difficulties, social deprivation, and traumatic or abusive life-experiences all strongly predicted higher levels of anxiety and depression. However, these relationships were strongly mediated by psychological processes; specifically lack of adaptive coping, rumination and self-blame. Conclusion These results support a significant revision of the biopsychosocial model, as psychological processes determine the causal impact of biological, social, and circumstantial risk factors on mental health. This has clear implications for policy, education and clinical practice as psychological processes such as rumination and self-blame are amenable to evidence-based psychological therapies. PMID:24146890
Kinderman, Peter; Schwannauer, Matthias; Pontin, Eleanor; Tai, Sara
2013-01-01
Despite widespread acceptance of the 'biopsychosocial model', the aetiology of mental health problems has provoked debate amongst researchers and practitioners for decades. The role of psychological factors in the development of mental health problems remains particularly contentious, and to date there has not been a large enough dataset to conduct the necessary multivariate analysis of whether psychological factors influence, or are influenced by, mental health. This study reports on the first empirical, multivariate, test of the relationships between the key elements of the biospychosocial model of mental ill-health. Participants were 32,827 (age 18-85 years) self-selected respondents from the general population who completed an open-access online battery of questionnaires hosted by the BBC. An initial confirmatory factor analysis was performed to assess the adequacy of the proposed factor structure and the relationships between latent and measured variables. The predictive path model was then tested whereby the latent variables of psychological processes were positioned as mediating between the causal latent variables (biological, social and circumstantial) and the outcome latent variables of mental health problems and well-being. This revealed an excellent fit to the data, S-B χ(2) (3199, N = 23,397) = 126654.8, p<.001; RCFI = .97; RMSEA = .04 (.038-.039). As hypothesised, a family history of mental health difficulties, social deprivation, and traumatic or abusive life-experiences all strongly predicted higher levels of anxiety and depression. However, these relationships were strongly mediated by psychological processes; specifically lack of adaptive coping, rumination and self-blame. These results support a significant revision of the biopsychosocial model, as psychological processes determine the causal impact of biological, social, and circumstantial risk factors on mental health. This has clear implications for policy, education and clinical practice as psychological processes such as rumination and self-blame are amenable to evidence-based psychological therapies.
Multivariate analysis of fears in dental phobic patients according to a reduced FSS-II scale.
Hakeberg, M; Gustafsson, J E; Berggren, U; Carlsson, S G
1995-10-01
This study analyzed and assessed dimensions of a questionnaire developed to measure general fears and phobias. A previous factor analysis among 109 dental phobics had revealed a five-factor structure with 22 items and an explained total variance of 54%. The present study analyzed the same material using a multivariate statistical procedure (LISREL) to reveal structural latent variables. The LISREL analysis, based on the correlation matrix, yielded a chi-square of 216.6 with 195 degrees of freedom (P = 0.138) and showed a model with seven latent variables. One was a general fear factor correlated to all 22 items. The other six factors concerned "Illness & Death" (5 items), "Failures & Embarrassment" (5 items), "Social situations" (5 items), "Physical injuries" (4 items), "Animals & Natural phenomena" (4 items). One item (opposite sex) was included in both "Failures & Embarrassment" and "Social situations". The last factor, "Social interaction", combined all the items in "Failures & Embarrassment" and "Social situations" (9 items). In conclusion, this multivariate statistical analysis (LISREL) revealed and confirmed a factor structure similar to our previous study, but added two important dimensions not shown with a traditional factor analysis. This reduced FSS-II version measures general fears and phobias and may be used on a routine clinical basis as well as in dental phobia research.
Hierarchical Multinomial Processing Tree Models: A Latent-Trait Approach
ERIC Educational Resources Information Center
Klauer, Karl Christoph
2010-01-01
Multinomial processing tree models are widely used in many areas of psychology. A hierarchical extension of the model class is proposed, using a multivariate normal distribution of person-level parameters with the mean and covariance matrix to be estimated from the data. The hierarchical model allows one to take variability between persons into…
Stability of Teacher Value-Added Rankings across Measurement Model and Scaling Conditions
ERIC Educational Resources Information Center
Hawley, Leslie R.; Bovaird, James A.; Wu, ChaoRong
2017-01-01
Value-added assessment methods have been criticized by researchers and policy makers for a number of reasons. One issue includes the sensitivity of model results across different outcome measures. This study examined the utility of incorporating multivariate latent variable approaches within a traditional value-added framework. We evaluated the…
A Semi-parametric Multivariate Gap-filling Model for Eddy Covariance Latent Heat Flux
NASA Astrophysics Data System (ADS)
Li, M.; Chen, Y.
2010-12-01
Quantitative descriptions of latent heat fluxes are important to study the water and energy exchanges between terrestrial ecosystems and the atmosphere. The eddy covariance approaches have been recognized as the most reliable technique for measuring surface fluxes over time scales ranging from hours to years. However, unfavorable micrometeorological conditions, instrument failures, and applicable measurement limitations may cause inevitable flux gaps in time series data. Development and application of suitable gap-filling techniques are crucial to estimate long term fluxes. In this study, a semi-parametric multivariate gap-filling model was developed to fill latent heat flux gaps for eddy covariance measurements. Our approach combines the advantages of a multivariate statistical analysis (principal component analysis, PCA) and a nonlinear interpolation technique (K-nearest-neighbors, KNN). The PCA method was first used to resolve the multicollinearity relationships among various hydrometeorological factors, such as radiation, soil moisture deficit, LAI, and wind speed. The KNN method was then applied as a nonlinear interpolation tool to estimate the flux gaps as the weighted sum latent heat fluxes with the K-nearest distances in the PCs’ domain. Two years, 2008 and 2009, of eddy covariance and hydrometeorological data from a subtropical mixed evergreen forest (the Lien-Hua-Chih Site) were collected to calibrate and validate the proposed approach with artificial gaps after standard QC/QA procedures. The optimal K values and weighting factors were determined by the maximum likelihood test. The results of gap-filled latent heat fluxes conclude that developed model successful preserving energy balances of daily, monthly, and yearly time scales. Annual amounts of evapotranspiration from this study forest were 747 mm and 708 mm for 2008 and 2009, respectively. Nocturnal evapotranspiration was estimated with filled gaps and results are comparable with other studies. Seasonal and daily variability of latent heat fluxes were also discussed.
Association between latent toxoplasmosis and cognition in adults: a cross-sectional study.
Gale, S D; Brown, B L; Erickson, L D; Berrett, A; Hedges, D W
2015-04-01
Latent infection from Toxoplasma gondii (T. gondii) is widespread worldwide and has been associated with cognitive deficits in some but not all animal models and in humans. We tested the hypothesis that latent toxoplasmosis is associated with decreased cognitive function in a large cross-sectional dataset, the National Health and Nutrition Examination Survey (NHANES). There were 4178 participants aged 20-59 years, of whom 19.1% had IgG antibodies against T. gondii. Two ordinary least squares (OLS) regression models adjusted for the NHANES complex sampling design and weighted to represent the US population were estimated for simple reaction time, processing speed and short-term memory or attention. The first model included only main effects of latent toxoplasmosis and demographic control variables, and the second added interaction terms between latent toxoplasmosis and the poverty-to-income ratio (PIR), educational attainment and race-ethnicity. We also used multivariate models to assess all three cognitive outcomes in the same model. Although the models evaluating main effects only demonstrated no association between latent toxoplasmosis and the cognitive outcomes, significant interactions between latent toxoplasmosis and the PIR, between latent toxoplasmosis and educational attainment, and between latent toxoplasmosis and race-ethnicity indicated that latent toxoplasmosis may adversely affect cognitive function in certain groups.
Long-Term Stability of Core Language Skill in Children with Contrasting Language Skills
Bornstein, Marc H.; Hahn, Chun-Shin; Putnick, Diane L.
2016-01-01
This four-wave longitudinal study evaluated stability of core language skill in 421 European American and African American children, half of whom were identified as low (n = 201) and half of whom were average-to-high (n = 220) in later language skill. Structural equation modeling supported loadings of multivariate age-appropriate multisource measures of child language on single latent variables of core language skill at 15 and 25 months and 5 and 11 years. Significant stability coefficients were obtained between language latent variables for children of low and average-to-high language skill, even accounting for child positive social interaction and nonverbal intelligence, maternal education and language, and family home environment. Prospects for children with different language skills and intervention implications are discussed. PMID:26998572
Implementing Restricted Maximum Likelihood Estimation in Structural Equation Models
ERIC Educational Resources Information Center
Cheung, Mike W.-L.
2013-01-01
Structural equation modeling (SEM) is now a generic modeling framework for many multivariate techniques applied in the social and behavioral sciences. Many statistical models can be considered either as special cases of SEM or as part of the latent variable modeling framework. One popular extension is the use of SEM to conduct linear mixed-effects…
The Recoverability of P-Technique Factor Analysis
ERIC Educational Resources Information Center
Molenaar, Peter C. M.; Nesselroade, John R.
2009-01-01
It seems that just when we are about to lay P-technique factor analysis finally to rest as obsolete because of newer, more sophisticated multivariate time-series models using latent variables--dynamic factor models--it rears its head to inform us that an obituary may be premature. We present the results of some simulations demonstrating that even…
A Systematic Comparison between Classical Optimal Scaling and the Two-Parameter IRT Model
ERIC Educational Resources Information Center
Warrens, Matthijs J.; de Gruijter, Dato N. M.; Heiser, Willem J.
2007-01-01
In this article, the relationship between two alternative methods for the analysis of multivariate categorical data is systematically explored. It is shown that the person score of the first dimension of classical optimal scaling correlates strongly with the latent variable for the two-parameter item response theory (IRT) model. Next, under the…
Using structural equation modeling to investigate relationships among ecological variables
Malaeb, Z.A.; Kevin, Summers J.; Pugesek, B.H.
2000-01-01
Structural equation modeling is an advanced multivariate statistical process with which a researcher can construct theoretical concepts, test their measurement reliability, hypothesize and test a theory about their relationships, take into account measurement errors, and consider both direct and indirect effects of variables on one another. Latent variables are theoretical concepts that unite phenomena under a single term, e.g., ecosystem health, environmental condition, and pollution (Bollen, 1989). Latent variables are not measured directly but can be expressed in terms of one or more directly measurable variables called indicators. For some researchers, defining, constructing, and examining the validity of latent variables may be the end task of itself. For others, testing hypothesized relationships of latent variables may be of interest. We analyzed the correlation matrix of eleven environmental variables from the U.S. Environmental Protection Agency's (USEPA) Environmental Monitoring and Assessment Program for Estuaries (EMAP-E) using methods of structural equation modeling. We hypothesized and tested a conceptual model to characterize the interdependencies between four latent variables-sediment contamination, natural variability, biodiversity, and growth potential. In particular, we were interested in measuring the direct, indirect, and total effects of sediment contamination and natural variability on biodiversity and growth potential. The model fit the data well and accounted for 81% of the variability in biodiversity and 69% of the variability in growth potential. It revealed a positive total effect of natural variability on growth potential that otherwise would have been judged negative had we not considered indirect effects. That is, natural variability had a negative direct effect on growth potential of magnitude -0.3251 and a positive indirect effect mediated through biodiversity of magnitude 0.4509, yielding a net positive total effect of 0.1258. Natural variability had a positive direct effect on biodiversity of magnitude 0.5347 and a negative indirect effect mediated through growth potential of magnitude -0.1105 yielding a positive total effects of magnitude 0.4242. Sediment contamination had a negative direct effect on biodiversity of magnitude -0.1956 and a negative indirect effect on growth potential via biodiversity of magnitude -0.067. Biodiversity had a positive effect on growth potential of magnitude 0.8432, and growth potential had a positive effect on biodiversity of magnitude 0.3398. The correlation between biodiversity and growth potential was estimated at 0.7658 and that between sediment contamination and natural variability at -0.3769.
Kelava, Augustin; Muma, Michael; Deja, Marlene; Dagdagan, Jack Y.; Zoubir, Abdelhak M.
2015-01-01
Emotion eliciting situations are accompanied by changes of multiple variables associated with subjective, physiological and behavioral responses. The quantification of the overall simultaneous synchrony of psychophysiological reactions plays a major role in emotion theories and has received increased attention in recent years. From a psychometric perspective, the reactions represent multivariate non-stationary intra-individual time series. In this paper, a new time-frequency based latent variable approach for the quantification of the synchrony of the responses is presented. The approach is applied to empirical data, collected during an emotion eliciting situation. The results are compared with a complementary inter-individual approach of Hsieh et al. (2011). Finally, the proposed approach is discussed in the context of emotion theories, and possible future applications and limitations are provided. PMID:25653624
Hayashi, Yoshihiro; Oshima, Etsuko; Maeda, Jin; Onuki, Yoshinori; Obata, Yasuko; Takayama, Kozo
2012-01-01
A multivariate statistical technique was applied to the design of an orally disintegrating tablet and to clarify the causal correlation among variables of the manufacturing process and pharmaceutical responses. Orally disintegrating tablets (ODTs) composed mainly of mannitol were prepared via the wet-granulation method using crystal transition from the δ to the β form of mannitol. Process parameters (water amounts (X(1)), kneading time (X(2)), compression force (X(3)), and amounts of magnesium stearate (X(4))) were optimized using a nonlinear response surface method (RSM) incorporating a thin plate spline interpolation (RSM-S). The results of a verification study revealed that the experimental responses, such as tensile strength and disintegration time, coincided well with the predictions. A latent structure analysis of the pharmaceutical formulations of the tablet performed using a Bayesian network led to the clear visualization of a causal connection among variables of the manufacturing process and tablet characteristics. The quantity of β-mannitol in the granules (Q(β)) was affected by X(2) and influenced all granule properties. The specific surface area of the granules was affected by X(1) and Q(β) and had an effect on all tablet characteristics. Moreover, the causal relationships among the variables were clarified by inferring conditional probability distributions. These techniques provide a better understanding of the complicated latent structure among variables of the manufacturing process and tablet characteristics.
A Database Approach for Predicting and Monitoring Baked Anode Properties
NASA Astrophysics Data System (ADS)
Lauzon-Gauthier, Julien; Duchesne, Carl; Tessier, Jayson
2012-11-01
The baked anode quality control strategy currently used by most carbon plants based on testing anode core samples in the laboratory is inadequate for facing increased raw material variability. The low core sampling rate limited by lab capacity and the common practice of reporting averaged properties based on some anode population mask a significant amount of individual anode variability. In addition, lab results are typically available a few weeks after production and the anodes are often already set in the reduction cells preventing early remedial actions when necessary. A database approach is proposed in this work to develop a soft-sensor for predicting individual baked anode properties at the end of baking cycle. A large historical database including raw material properties, process operating parameters and anode core data was collected from a modern Alcoa plant. A multivariate latent variable PLS regression method was used for analyzing the large database and building the soft-sensor model. It is shown that the general low frequency trends in most anode physical and mechanical properties driven by raw material changes are very well captured by the model. Improvements in the data infrastructure (instrumentation, sampling frequency and location) will be necessary for predicting higher frequency variations in individual baked anode properties. This paper also demonstrates how multivariate latent variable models can be interpreted against process knowledge and used for real-time process monitoring of carbon plants, and detection of faults and abnormal operation.
Multivariate control of plant species richness and community biomass in blackland prairie
Weiher, E.; Forbes, S.; Schauwecker, T.; Grace, J.B.
2004-01-01
Recent studies have shown that patterns of plant species richness and community biomass are best understood in a multivariate context. The objective of this study was to develop and evaluate a multivariate hypothesis about how herbaceous biomass and richness relate to gradients in soil conditions and woody plant cover in blackland prairies. Structural equation modeling was used to investigate how soil characteristics and shade by scattered Juniperus virginiana trees relate to standing biomass and species richness in 99 0.25 m2 quadrats collected in eastern Mississippi, USA. Analysis proceeded in two stages. In the first stage, we evaluated the hypothesis that correlations among soil parameters could be represented by two underlying (latent) soil factors, mineral content and organic content. In the second stage, we evaluated the hypothesis that richness and biomass were related to (1) soil properties, (2) tree canopy extent, and (3) each other (i.e. reciprocal effects between richness and biomass). With some modification to the details of the original model, it was found that soil properties could be represented as two latent variables. In the overall model, 51% and 53% of the observed variation in richness and biomass were explained. The order of importance for variables explaining variations in richness was (1) soil organic content, (2) soil mineral content, (3) community biomass, and (4) tree canopy extent. The order of importance for variables explaining biomass was (1) tree canopy and (2) soil organic content, with neither soil mineral content nor species richness explaining significant variation in biomass. Based on these findings, we conclude that variations in richness are uniquely related to both variations in soil conditions and variations in herbaceous biomass. We further conclude that there is no evidence in these data for effects of species richness on biomass.
Bayesian Estimation of Multivariate Latent Regression Models: Gauss versus Laplace
ERIC Educational Resources Information Center
Culpepper, Steven Andrew; Park, Trevor
2017-01-01
A latent multivariate regression model is developed that employs a generalized asymmetric Laplace (GAL) prior distribution for regression coefficients. The model is designed for high-dimensional applications where an approximate sparsity condition is satisfied, such that many regression coefficients are near zero after accounting for all the model…
A General Multivariate Latent Growth Model with Applications to Student Achievement
ERIC Educational Resources Information Center
Bianconcini, Silvia; Cagnone, Silvia
2012-01-01
The evaluation of the formative process in the University system has been assuming an ever increasing importance in the European countries. Within this context, the analysis of student performance and capabilities plays a fundamental role. In this work, the authors propose a multivariate latent growth model for studying the performances of a…
Belo, Celso; Naidoo, Saloshni
2017-06-08
Healthcare workers in high tuberculosis burdened countries are occupationally exposed to the tuberculosis disease with uncomplicated and complicated tuberculosis on the increase among them. Most of them acquire Mycobacterium tuberculosis but do not progress to the active disease - latent tuberculosis infection. The objective of this study was to assess the prevalence and risk factors associated with latent tuberculosis infection among healthcare workers in Nampula Central Hospital, Mozambique. This cross-sectional study of healthcare workers was conducted between 2014 and 2015. Participants (n = 209) were administered a questionnaire on demographics and occupational tuberculosis exposure and had a tuberculin skin test administered. Multivariate linear and logistic regression tested for associations between independent variables and dependent outcomes (tuberculin skin test induration and latent tuberculosis infection status). The prevalence of latent tuberculosis infection was 34.4%. Latent tuberculosis infection was highest in those working for more than eight years (39.3%), those who had no BCG vaccination (39.6%) and were immunocompromised (78.1%). Being immunocompromised was significantly associated with latent tuberculosis infection (OR 5.97 [95% CI 1.89; 18.87]). Positive but non-significant associations occurred with working in the medical domain (OR 1.02 [95% CI 0.17; 6.37]), length of employment > eight years (OR 1.97 [95% CI 0.70; 5.53]) and occupational contact with tuberculosis patients (OR 1.24 [95% CI 0.47; 3.27]). Personal and occupational factors were positively associated with latent tuberculosis infection among healthcare workers in Mozambique.
Structural Equation Model Trees
Brandmaier, Andreas M.; von Oertzen, Timo; McArdle, John J.; Lindenberger, Ulman
2015-01-01
In the behavioral and social sciences, structural equation models (SEMs) have become widely accepted as a modeling tool for the relation between latent and observed variables. SEMs can be seen as a unification of several multivariate analysis techniques. SEM Trees combine the strengths of SEMs and the decision tree paradigm by building tree structures that separate a data set recursively into subsets with significantly different parameter estimates in a SEM. SEM Trees provide means for finding covariates and covariate interactions that predict differences in structural parameters in observed as well as in latent space and facilitate theory-guided exploration of empirical data. We describe the methodology, discuss theoretical and practical implications, and demonstrate applications to a factor model and a linear growth curve model. PMID:22984789
An integrated phenomic approach to multivariate allelic association
Medland, Sarah Elizabeth; Neale, Michael Churton
2010-01-01
The increased feasibility of genome-wide association has resulted in association becoming the primary method used to localize genetic variants that cause phenotypic variation. Much attention has been focused on the vast multiple testing problems arising from analyzing large numbers of single nucleotide polymorphisms. However, the inflation of experiment-wise type I error rates through testing numerous phenotypes has received less attention. Multivariate analyses can be used to detect both pleiotropic effects that influence a latent common factor, and monotropic effects that operate at a variable-specific levels, whilst controlling for non-independence between phenotypes. In this study, we present a maximum likelihood approach, which combines both latent and variable-specific tests and which may be used with either individual or family data. Simulation results indicate that in the presence of factor-level association, the combined multivariate (CMV) analysis approach performs well with a minimal loss of power as compared with a univariate analysis of a factor or sum score (SS). As the deviation between the pattern of allelic effects and the factor loadings increases, the power of univariate analyses of both factor and SSs decreases dramatically, whereas the power of the CMV approach is maintained. We show the utility of the approach by examining the association between dopamine receptor D2 TaqIA and the initiation of marijuana, tranquilizers and stimulants in data from the Add Health Study. Perl scripts that takes ped and dat files as input and produces Mx scripts and data for running the CMV approach can be downloaded from www.vipbg.vcu.edu/~sarahme/WriteMx. PMID:19707246
Zhou, Yan; Wang, Pei; Wang, Xianlong; Zhu, Ji; Song, Peter X-K
2017-01-01
The multivariate regression model is a useful tool to explore complex associations between two kinds of molecular markers, which enables the understanding of the biological pathways underlying disease etiology. For a set of correlated response variables, accounting for such dependency can increase statistical power. Motivated by integrative genomic data analyses, we propose a new methodology-sparse multivariate factor analysis regression model (smFARM), in which correlations of response variables are assumed to follow a factor analysis model with latent factors. This proposed method not only allows us to address the challenge that the number of association parameters is larger than the sample size, but also to adjust for unobserved genetic and/or nongenetic factors that potentially conceal the underlying response-predictor associations. The proposed smFARM is implemented by the EM algorithm and the blockwise coordinate descent algorithm. The proposed methodology is evaluated and compared to the existing methods through extensive simulation studies. Our results show that accounting for latent factors through the proposed smFARM can improve sensitivity of signal detection and accuracy of sparse association map estimation. We illustrate smFARM by two integrative genomics analysis examples, a breast cancer dataset, and an ovarian cancer dataset, to assess the relationship between DNA copy numbers and gene expression arrays to understand genetic regulatory patterns relevant to the disease. We identify two trans-hub regions: one in cytoband 17q12 whose amplification influences the RNA expression levels of important breast cancer genes, and the other in cytoband 9q21.32-33, which is associated with chemoresistance in ovarian cancer. © 2016 WILEY PERIODICALS, INC.
Denis Valle; Benjamin Baiser; Christopher W. Woodall; Robin Chazdon; Jerome Chave
2014-01-01
We propose a novel multivariate method to analyse biodiversity data based on the Latent Dirichlet Allocation (LDA) model. LDA, a probabilistic model, reduces assemblages to sets of distinct component communities. It produces easily interpretable results, can represent abrupt and gradual changes in composition, accommodates missing data and allows for coherent estimates...
ERIC Educational Resources Information Center
Maslowsky, Julie; Jager, Justin; Hemken, Douglas
2015-01-01
Latent variables are common in psychological research. Research questions involving the interaction of two variables are likewise quite common. Methods for estimating and interpreting interactions between latent variables within a structural equation modeling framework have recently become available. The latent moderated structural equations (LMS)…
Yang, Jun-Ho; Yoh, Jack J
2018-01-01
A novel technique is reported for separating overlapping latent fingerprints using chemometric approaches that combine laser-induced breakdown spectroscopy (LIBS) and multivariate analysis. The LIBS technique provides the capability of real time analysis and high frequency scanning as well as the data regarding the chemical composition of overlapping latent fingerprints. These spectra offer valuable information for the classification and reconstruction of overlapping latent fingerprints by implementing appropriate statistical multivariate analysis. The current study employs principal component analysis and partial least square methods for the classification of latent fingerprints from the LIBS spectra. This technique was successfully demonstrated through a classification study of four distinct latent fingerprints using classification methods such as soft independent modeling of class analogy (SIMCA) and partial least squares discriminant analysis (PLS-DA). The novel method yielded an accuracy of more than 85% and was proven to be sufficiently robust. Furthermore, through laser scanning analysis at a spatial interval of 125 µm, the overlapping fingerprints were reconstructed as separate two-dimensional forms.
The algebraic theory of latent projectors in lambda matrices
NASA Technical Reports Server (NTRS)
Denman, E. D.; Leyva-Ramos, J.; Jeon, G. J.
1981-01-01
Multivariable systems such as a finite-element model of vibrating structures, control systems, and large-scale systems are often formulated in terms of differential equations which give rise to lambda matrices. The present investigation is concerned with the formulation of the algebraic theory of lambda matrices and the relationship of latent roots, latent vectors, and latent projectors to the eigenvalues, eigenvectors, and eigenprojectors of the companion form. The chain rule for latent projectors and eigenprojectors for the repeated latent root or eigenvalues is given.
Sharafi, Mastaneh; Rawal, Shristi; Fernandez, Maria Luz; Huedo-Medina, Tania B; Duffy, Valerie B
2018-05-08
Sensations from foods and beverages drive dietary choices, which in turn, affect risk of diet-related diseases. Perception of these sensation varies with environmental and genetic influences. This observational study aimed to examine associations between chemosensory phenotype, diet and cardiovascular disease (CVD) risk. Reportedly healthy women (n = 110, average age 45 ± 9 years) participated in laboratory-based measures of chemosensory phenotype (taste and smell function, propylthiouracil (PROP) bitterness) and CVD risk factors (waist circumference, blood pressure, serum lipids). Diet variables included preference and intake of sweet/high-fat foods, dietary restraint, and diet quality based on reported preference (Healthy Eating Preference Index-HEPI) and intake (Healthy Eating Index-HEI). We found that females who reported high preference yet low consumption of sweet/high-fat foods had the highest dietary restraint and depressed quinine taste function. PROP nontasters were more likely to report lower diet quality; PROP supertasters more likely to consume but not like a healthy diet. Multivariate structural models were fitted to identify predictors of CVD risk factors. Reliable latent taste (quinine taste function, PROP tasting) and smell (odor intensity) variables were identified, with taste explaining more variance in the CVD risk factors. Lower bitter taste perception was associated with elevated risk. In multivariate models, the HEPI completely mediated the taste-adiposity and taste-HDL associations and partially mediated the taste-triglyceride or taste-systolic blood pressure associations. The taste-LDL pathway was significant and direct. The HEI could not replace HEPI in adequate models. However, using a latent diet quality variable with HEPI and HEI, increased the strength of association between diet quality and adiposity or CVD risk factors. In conclusion, bitter taste phenotype was associated with CVD risk factors via diet quality, particularly when assessed by level of food liking/disliking. Copyright © 2018 Elsevier Inc. All rights reserved.
Bayesian Adaptive Lasso for Ordinal Regression with Latent Variables
ERIC Educational Resources Information Center
Feng, Xiang-Nan; Wu, Hao-Tian; Song, Xin-Yuan
2017-01-01
We consider an ordinal regression model with latent variables to investigate the effects of observable and latent explanatory variables on the ordinal responses of interest. Each latent variable is characterized by correlated observed variables through a confirmatory factor analysis model. We develop a Bayesian adaptive lasso procedure to conduct…
Avoiding and Correcting Bias in Score-Based Latent Variable Regression with Discrete Manifest Items
ERIC Educational Resources Information Center
Lu, Irene R. R.; Thomas, D. Roland
2008-01-01
This article considers models involving a single structural equation with latent explanatory and/or latent dependent variables where discrete items are used to measure the latent variables. Our primary focus is the use of scores as proxies for the latent variables and carrying out ordinary least squares (OLS) regression on such scores to estimate…
Graphical Models for Ordinal Data
Guo, Jian; Levina, Elizaveta; Michailidis, George; Zhu, Ji
2014-01-01
A graphical model for ordinal variables is considered, where it is assumed that the data are generated by discretizing the marginal distributions of a latent multivariate Gaussian distribution. The relationships between these ordinal variables are then described by the underlying Gaussian graphical model and can be inferred by estimating the corresponding concentration matrix. Direct estimation of the model is computationally expensive, but an approximate EM-like algorithm is developed to provide an accurate estimate of the parameters at a fraction of the computational cost. Numerical evidence based on simulation studies shows the strong performance of the algorithm, which is also illustrated on data sets on movie ratings and an educational survey. PMID:26120267
Bayesian Semiparametric Structural Equation Models with Latent Variables
ERIC Educational Resources Information Center
Yang, Mingan; Dunson, David B.
2010-01-01
Structural equation models (SEMs) with latent variables are widely useful for sparse covariance structure modeling and for inferring relationships among latent variables. Bayesian SEMs are appealing in allowing for the incorporation of prior information and in providing exact posterior distributions of unknowns, including the latent variables. In…
Infrared Spectroscopic Imaging of Latent Fingerprints and Associated Forensic Evidence
Chen, Tsoching; Schultz, Zachary D.; Levin, Ira W.
2011-01-01
Fingerprints reflecting a specific chemical history, such as exposure to explosives, are clearly distinguished from overlapping, and interfering latent fingerprints using infrared spectroscopic imaging techniques and multivariate analysis. PMID:19684917
Zhang, Zhenzhen; O'Neill, Marie S; Sánchez, Brisa N
2016-04-01
Factor analysis is a commonly used method of modelling correlated multivariate exposure data. Typically, the measurement model is assumed to have constant factor loadings. However, from our preliminary analyses of the Environmental Protection Agency's (EPA's) PM 2.5 fine speciation data, we have observed that the factor loadings for four constituents change considerably in stratified analyses. Since invariance of factor loadings is a prerequisite for valid comparison of the underlying latent variables, we propose a factor model that includes non-constant factor loadings that change over time and space using P-spline penalized with the generalized cross-validation (GCV) criterion. The model is implemented using the Expectation-Maximization (EM) algorithm and we select the multiple spline smoothing parameters by minimizing the GCV criterion with Newton's method during each iteration of the EM algorithm. The algorithm is applied to a one-factor model that includes four constituents. Through bootstrap confidence bands, we find that the factor loading for total nitrate changes across seasons and geographic regions.
Defining a Family of Cognitive Diagnosis Models Using Log-Linear Models with Latent Variables
ERIC Educational Resources Information Center
Henson, Robert A.; Templin, Jonathan L.; Willse, John T.
2009-01-01
This paper uses log-linear models with latent variables (Hagenaars, in "Loglinear Models with Latent Variables," 1993) to define a family of cognitive diagnosis models. In doing so, the relationship between many common models is explicitly defined and discussed. In addition, because the log-linear model with latent variables is a general model for…
ERIC Educational Resources Information Center
Bauer, Daniel J.; Curran, Patrick J.
2004-01-01
Structural equation mixture modeling (SEMM) integrates continuous and discrete latent variable models. Drawing on prior research on the relationships between continuous and discrete latent variable models, the authors identify 3 conditions that may lead to the estimation of spurious latent classes in SEMM: misspecification of the structural model,…
Latent mnemonic strengths are latent: a comment on Mickes, Wixted, and Wais (2007).
Rouder, Jeffrey N; Pratte, Michael S; Morey, Richard D
2010-06-01
Mickes, Wixted, and Wais (2007) proposed a simple test of latent strength variability in recognition memory. They asked participants to rate their confidence using either a 20-point or a 99-point strength scale and plotted distributions of the resulting ratings. They found 25% more variability in ratings for studied than for new items, which they interpreted as providing evidence that latent mnemonic strength distributions are 25% more variable for studied than for new items. We show here that this conclusion is critically dependent on assumptions--so much so that these assumptions determine the conclusions. In fact, opposite conclusions, such that study does not affect the variability of latent strength, may be reached by making different but equally plausible assumptions. Because all measurements of mnemonic strength variability are critically dependent on untestable assumptions, all are arbitrary. Hence, there is no principled method for assessing the relative variability of latent mnemonic strength distributions.
Guyon, Hervé; Falissard, Bruno; Kop, Jean-Luc
2017-01-01
Network Analysis is considered as a new method that challenges Latent Variable models in inferring psychological attributes. With Network Analysis, psychological attributes are derived from a complex system of components without the need to call on any latent variables. But the ontological status of psychological attributes is not adequately defined with Network Analysis, because a psychological attribute is both a complex system and a property emerging from this complex system. The aim of this article is to reappraise the legitimacy of latent variable models by engaging in an ontological and epistemological discussion on psychological attributes. Psychological attributes relate to the mental equilibrium of individuals embedded in their social interactions, as robust attractors within complex dynamic processes with emergent properties, distinct from physical entities located in precise areas of the brain. Latent variables thus possess legitimacy, because the emergent properties can be conceptualized and analyzed on the sole basis of their manifestations, without exploring the upstream complex system. However, in opposition with the usual Latent Variable models, this article is in favor of the integration of a dynamic system of manifestations. Latent Variables models and Network Analysis thus appear as complementary approaches. New approaches combining Latent Network Models and Network Residuals are certainly a promising new way to infer psychological attributes, placing psychological attributes in an inter-subjective dynamic approach. Pragmatism-realism appears as the epistemological framework required if we are to use latent variables as representations of psychological attributes. PMID:28572780
A Latent Variable Approach to the Simple View of Reading
ERIC Educational Resources Information Center
Kershaw, Sarah; Schatschneider, Chris
2012-01-01
The present study utilized a latent variable modeling approach to examine the Simple View of Reading in a sample of students from 3rd, 7th, and 10th grades (N = 215, 188, and 180, respectively). Latent interaction modeling and other latent variable models were employed to investigate (a) the functional form of the relationship between decoding and…
The distance between Mars and Venus: measuring global sex differences in personality.
Del Giudice, Marco; Booth, Tom; Irwing, Paul
2012-01-01
Sex differences in personality are believed to be comparatively small. However, research in this area has suffered from significant methodological limitations. We advance a set of guidelines for overcoming those limitations: (a) measure personality with a higher resolution than that afforded by the Big Five; (b) estimate sex differences on latent factors; and (c) assess global sex differences with multivariate effect sizes. We then apply these guidelines to a large, representative adult sample, and obtain what is presently the best estimate of global sex differences in personality. Personality measures were obtained from a large US sample (N = 10,261) with the 16PF Questionnaire. Multigroup latent variable modeling was used to estimate sex differences on individual personality dimensions, which were then aggregated to yield a multivariate effect size (Mahalanobis D). We found a global effect size D = 2.71, corresponding to an overlap of only 10% between the male and female distributions. Even excluding the factor showing the largest univariate ES, the global effect size was D = 1.71 (24% overlap). These are extremely large differences by psychological standards. The idea that there are only minor differences between the personality profiles of males and females should be rejected as based on inadequate methodology.
The Distance Between Mars and Venus: Measuring Global Sex Differences in Personality
Del Giudice, Marco; Booth, Tom; Irwing, Paul
2012-01-01
Background Sex differences in personality are believed to be comparatively small. However, research in this area has suffered from significant methodological limitations. We advance a set of guidelines for overcoming those limitations: (a) measure personality with a higher resolution than that afforded by the Big Five; (b) estimate sex differences on latent factors; and (c) assess global sex differences with multivariate effect sizes. We then apply these guidelines to a large, representative adult sample, and obtain what is presently the best estimate of global sex differences in personality. Methodology/Principal Findings Personality measures were obtained from a large US sample (N = 10,261) with the 16PF Questionnaire. Multigroup latent variable modeling was used to estimate sex differences on individual personality dimensions, which were then aggregated to yield a multivariate effect size (Mahalanobis D). We found a global effect size D = 2.71, corresponding to an overlap of only 10% between the male and female distributions. Even excluding the factor showing the largest univariate ES, the global effect size was D = 1.71 (24% overlap). These are extremely large differences by psychological standards. Significance The idea that there are only minor differences between the personality profiles of males and females should be rejected as based on inadequate methodology. PMID:22238596
Carroll, Rachel; Lawson, Andrew B; Kirby, Russell S; Faes, Christel; Aregay, Mehreteab; Watjou, Kevin
2017-01-01
Many types of cancer have an underlying spatiotemporal distribution. Spatiotemporal mixture modeling can offer a flexible approach to risk estimation via the inclusion of latent variables. In this article, we examine the application and benefits of using four different spatiotemporal mixture modeling methods in the modeling of cancer of the lung and bronchus as well as "other" respiratory cancer incidences in the state of South Carolina. Of the methods tested, no single method outperforms the other methods; which method is best depends on the cancer under consideration. The lung and bronchus cancer incidence outcome is best described by the univariate modeling formulation, whereas the "other" respiratory cancer incidence outcome is best described by the multivariate modeling formulation. Spatiotemporal multivariate mixture methods can aid in the modeling of cancers with small and sparse incidences when including information from a related, more common type of cancer. Copyright © 2016 Elsevier Inc. All rights reserved.
Dabkiewicz, Vanessa Emídio; de Mello Pereira Abrantes, Shirley; Cassella, Ricardo Jorgensen
2018-08-05
Near infrared spectroscopy (NIR) with diffuse reflectance associated to multivariate calibration has as main advantage the replacement of the physical separation of interferents by the mathematical separation of their signals, rapidly with no need for reagent consumption, chemical waste production or sample manipulation. Seeking to optimize quality control analyses, this spectroscopic analytical method was shown to be a viable alternative to the classical Kjeldahl method for the determination of protein nitrogen in yellow fever vaccine. The most suitable multivariate calibration was achieved by the partial least squares method (PLS) with multiplicative signal correction (MSC) treatment and data mean centering (MC), using a minimum number of latent variables (LV) equal to 1, with the lower value of the square root of the mean squared prediction error (0.00330) associated with the highest percentage value (91%) of samples. Accuracy ranged 95 to 105% recovery in the 4000-5184 cm -1 region. Copyright © 2018 Elsevier B.V. All rights reserved.
Latent Transition Analysis with a Mixture Item Response Theory Measurement Model
ERIC Educational Resources Information Center
Cho, Sun-Joo; Cohen, Allan S.; Kim, Seock-Ho; Bottge, Brian
2010-01-01
A latent transition analysis (LTA) model was described with a mixture Rasch model (MRM) as the measurement model. Unlike the LTA, which was developed with a latent class measurement model, the LTA-MRM permits within-class variability on the latent variable, making it more useful for measuring treatment effects within latent classes. A simulation…
Person Re-Identification via Distance Metric Learning With Latent Variables.
Sun, Chong; Wang, Dong; Lu, Huchuan
2017-01-01
In this paper, we propose an effective person re-identification method with latent variables, which represents a pedestrian as the mixture of a holistic model and a number of flexible models. Three types of latent variables are introduced to model uncertain factors in the re-identification problem, including vertical misalignments, horizontal misalignments and leg posture variations. The distance between two pedestrians can be determined by minimizing a given distance function with respect to latent variables, and then be used to conduct the re-identification task. In addition, we develop a latent metric learning method for learning the effective metric matrix, which can be solved via an iterative manner: once latent information is specified, the metric matrix can be obtained based on some typical metric learning methods; with the computed metric matrix, the latent variables can be determined by searching the state space exhaustively. Finally, extensive experiments are conducted on seven databases to evaluate the proposed method. The experimental results demonstrate that our method achieves better performance than other competing algorithms.
Adolescent cigarette smoking: health-related behavior or normative transgression?
Turbin, M S; Jessor, R; Costa, F M
2000-09-01
Relations among measures of adolescent behavior were examined to determine whether cigarette smoking fits into a structure of problem behaviors-behaviors that involve normative transgression-or a structure of health-related behaviors, or both. In an ethnically and socioeconomically diverse sample of 1782 male and female high school adolescents, four first-order problem behavior latent variables-sexual intercourse experience, alcohol abuse, illicit drug use, and delinquency-were established and together were shown to reflect a second-order latent variable of problem behavior. Four first-order latent variables of health-related behaviors-unhealthy dietary habits, sedentary behavior, unsafe behavior, and poor dental hygiene-were also established and together were shown to reflect a second-order latent variable of health-compromising behavior. The structure of relations among those latent variables was modeled. Cigarette smoking had a significant and substantial loading only on the problem-behavior latent variable; its loading on the health-compromising behavior latent variable was essentially zero. Adolescent cigarette smoking relates strongly and directly to problem behaviors and only indirectly, if at all, to health-compromising behaviors. Interventions to prevent or reduce adolescent smoking should attend more to factors that influence problem behaviors.
Quantitative Ultrasound Using Texture Analysis of Myofascial Pain Syndrome in the Trapezius.
Kumbhare, Dinesh A; Ahmed, Sara; Behr, Michael G; Noseworthy, Michael D
2018-01-01
Objective-The objective of this study is to assess the discriminative ability of textural analyses to assist in the differentiation of the myofascial trigger point (MTrP) region from normal regions of skeletal muscle. Also, to measure the ability to reliably differentiate between three clinically relevant groups: healthy asymptomatic, latent MTrPs, and active MTrP. Methods-18 and 19 patients were identified with having active and latent MTrPs in the trapezius muscle, respectively. We included 24 healthy volunteers. Images were obtained by research personnel, who were blinded with respect to the clinical status of the study participant. Histograms provided first-order parameters associated with image grayscale. Haralick, Galloway, and histogram-related features were used in texture analysis. Blob analysis was conducted on the regions of interest (ROIs). Principal component analysis (PCA) was performed followed by multivariate analysis of variance (MANOVA) to determine the statistical significance of the features. Results-92 texture features were analyzed for factorability using Bartlett's test of sphericity, which was significant. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.94. PCA demonstrated rotated eigenvalues of the first eight components (each comprised of multiple texture features) explained 94.92% of the cumulative variance in the ultrasound image characteristics. The 24 features identified by PCA were included in the MANOVA as dependent variables, and the presence of a latent or active MTrP or healthy muscle were independent variables. Conclusion-Texture analysis techniques can discriminate between the three clinically relevant groups.
Öhlén, Joakim; Russell, Lara; Håkanson, Cecilia; Alvariza, Anette; Fürst, Carl Johan; Årestedt, Kristofer; Sawatzky, Richard
2017-01-01
Symptom relief is a key goal of palliative care. There is a need to consider complexities in symptom relief patterns for groups of people to understand and evaluate symptom relief as an indicator of quality of care at end of life. The aims of this study were to distinguish classes of patients who have different symptom relief patterns during the last week of life and to identify predictors of these classes in an adult register population. In a cross-sectional retrospective design, data were used from 87,026 decedents with expected deaths registered in the Swedish Register of Palliative Care in 2011 and 2012. Study variables were structured into patient characteristics, and processes and outcomes of quality of care. A latent class analysis was used to identify symptom relief patterns. Multivariate multinomial regression analyses were used to identify predictors of class membership. Five latent classes were generated: "relieved pain," "relieved pain and rattles," "relieved pain and anxiety," "partly relieved shortness of breath, rattles and anxiety," and "partly relieved pain, anxiety and confusion." Important predictors of class membership were age, sex, cause of death, and having someone present at death, individual prescriptions as needed (PRN) and expert consultations. Interindividual variability and complexity in symptom relief patterns may inform quality of care and its evaluation for dying people across care settings. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Ploubidis, G B; Edwards, P; Kendrick, D
2015-12-15
This paper reports the development and testing of a construct measuring parental fire safety behaviours for planning escape from a house fire. Latent variable modelling of data on parental-reported fire safety behaviours and plans for escaping from a house fire and multivariable logistic regression to quantify the association between groups defined by the latent variable modelling and parental-report of having a plan for escaping from a house fire. Data comes from 1112 participants in a cluster randomised controlled trial set in children's centres in 4 study centres in the UK. A two class model provided the best fit to the data, combining responses to five fire safety planning behaviours. The first group ('more behaviours for escaping from a house fire') comprised 86% of participants who were most likely to have a torch, be aware of how their smoke alarm sounds, to have external door and window keys accessible, and exits clear. The second group ('fewer behaviours for escaping from a house fire') comprised 14% of participants who were less likely to report these five behaviours. After adjusting for potential confounders, participants allocated to the 'more behaviours for escaping from a house fire group were 2.5 times more likely to report having an escape plan (OR 2.48; 95% CI 1.59-3.86) than those in the "fewer behaviours for escaping from a house fire" group. Multiple fire safety behaviour questions can be combined into a single binary summary measure of fire safety behaviours for escaping from a house fire. Our findings will be useful to future studies wishing to use a single measure of fire safety planning behaviour as measures of outcome or exposure. NCT 01452191. Date of registration 13/10/2011.
A Latent Class Analysis of Family Characteristics Linked to Youth Offending Outcomes.
Chng, Grace S; Chu, Chi Meng; Zeng, Gerald; Li, Dongdong; Ting, Ming Hwa
2016-11-01
There were two aims to this study: firstly, to identify family subtypes of Singaporean youth offenders based on eight family variables. Secondly, the associations of these family subtypes with youth offending outcomes were tested. With a sample of 3,744 youth, a latent class analysis was first conducted based on eight family variables. Multivariate analyses and a Cox regression were subsequently performed to analyze the associations of the family classes with age at first arrest, age at first charge, and recidivism. A three-class solution was found to have the best fit to the data: (1) intact functioning families had little family risk; (2) families with criminality had higher probabilities of family criminality, of drug/alcohol abuse, and of being nonintact; and (3) poorly managed families received the poorest parenting and were more likely to be nonintact. Youth offenders from the latter two classes were arrested and charged at younger ages. Additionally, they reoffended at a quicker rate. Family backgrounds matter for youth offending outcomes. Interventions have to be multifaceted and targeted at the family in order to mitigate the risk of young offenders from developing into pathological adult criminals.
A Latent Class Analysis of Family Characteristics Linked to Youth Offending Outcomes
Chu, Chi Meng; Zeng, Gerald; Li, Dongdong; Ting, Ming Hwa
2016-01-01
Objectives: There were two aims to this study: firstly, to identify family subtypes of Singaporean youth offenders based on eight family variables. Secondly, the associations of these family subtypes with youth offending outcomes were tested. Methods: With a sample of 3,744 youth, a latent class analysis was first conducted based on eight family variables. Multivariate analyses and a Cox regression were subsequently performed to analyze the associations of the family classes with age at first arrest, age at first charge, and recidivism. Results: A three-class solution was found to have the best fit to the data: (1) intact functioning families had little family risk; (2) families with criminality had higher probabilities of family criminality, of drug/alcohol abuse, and of being nonintact; and (3) poorly managed families received the poorest parenting and were more likely to be nonintact. Youth offenders from the latter two classes were arrested and charged at younger ages. Additionally, they reoffended at a quicker rate. Conclusions: Family backgrounds matter for youth offending outcomes. Interventions have to be multifaceted and targeted at the family in order to mitigate the risk of young offenders from developing into pathological adult criminals. PMID:28736458
A Framework for Multifaceted Evaluation of Student Models
ERIC Educational Resources Information Center
Huang, Yun; González-Brenes, José P.; Kumar, Rohit; Brusilovsky, Peter
2015-01-01
Latent variable models, such as the popular Knowledge Tracing method, are often used to enable adaptive tutoring systems to personalize education. However, finding optimal model parameters is usually a difficult non-convex optimization problem when considering latent variable models. Prior work has reported that latent variable models obtained…
The Latent Variable Approach as Applied to Transitive Reasoning
ERIC Educational Resources Information Center
Bouwmeester, Samantha; Vermunt, Jeroen K.; Sijtsma, Klaas
2012-01-01
We discuss the limitations of hypothesis testing using (quasi-) experiments in the study of cognitive development and suggest latent variable modeling as a viable alternative to experimentation. Latent variable models allow testing a theory as a whole, incorporating individual differences with respect to developmental processes or abilities in the…
Much Ado about Nothing--Or at Best, Very Little
ERIC Educational Resources Information Center
Widaman, Keith F.
2014-01-01
Latent variable structural equation modeling has become the analytic method of choice in many domains of research in psychology and allied social sciences. One important aspect of a latent variable model concerns the relations hypothesized to hold between latent variables and their indicators. The most common specification of structural equation…
A Bayesian Semiparametric Latent Variable Model for Mixed Responses
ERIC Educational Resources Information Center
Fahrmeir, Ludwig; Raach, Alexander
2007-01-01
In this paper we introduce a latent variable model (LVM) for mixed ordinal and continuous responses, where covariate effects on the continuous latent variables are modelled through a flexible semiparametric Gaussian regression model. We extend existing LVMs with the usual linear covariate effects by including nonparametric components for nonlinear…
Yasuda, Akihito; Onuki, Yoshinori; Obata, Yasuko; Takayama, Kozo
2015-01-01
The "quality by design" concept in pharmaceutical formulation development requires the establishment of a science-based rationale and design space. In this article, we integrate thin-plate spline (TPS) interpolation, Kohonen's self-organizing map (SOM) and a Bayesian network (BN) to visualize the latent structure underlying causal factors and pharmaceutical responses. As a model pharmaceutical product, theophylline tablets were prepared using a standard formulation. We measured the tensile strength and disintegration time as response variables and the compressibility, cohesion and dispersibility of the pretableting blend as latent variables. We predicted these variables quantitatively using nonlinear TPS, generated a large amount of data on pretableting blends and tablets and clustered these data into several clusters using a SOM. Our results show that we are able to predict the experimental values of the latent and response variables with a high degree of accuracy and are able to classify the tablet data into several distinct clusters. In addition, to visualize the latent structure between the causal and latent factors and the response variables, we applied a BN method to the SOM clustering results. We found that despite having inserted latent variables between the causal factors and response variables, their relation is equivalent to the results for the SOM clustering, and thus we are able to explain the underlying latent structure. Consequently, this technique provides a better understanding of the relationships between causal factors and pharmaceutical responses in theophylline tablet formulation.
Latent variable models are network models.
Molenaar, Peter C M
2010-06-01
Cramer et al. present an original and interesting network perspective on comorbidity and contrast this perspective with a more traditional interpretation of comorbidity in terms of latent variable theory. My commentary focuses on the relationship between the two perspectives; that is, it aims to qualify the presumed contrast between interpretations in terms of networks and latent variables.
Examining Parallelism of Sets of Psychometric Measures Using Latent Variable Modeling
ERIC Educational Resources Information Center
Raykov, Tenko; Patelis, Thanos; Marcoulides, George A.
2011-01-01
A latent variable modeling approach that can be used to examine whether several psychometric tests are parallel is discussed. The method consists of sequentially testing the properties of parallel measures via a corresponding relaxation of parameter constraints in a saturated model or an appropriately constructed latent variable model. The…
Predictive Inference Using Latent Variables with Covariates*
Schofield, Lynne Steuerle; Junker, Brian; Taylor, Lowell J.; Black, Dan A.
2014-01-01
Plausible Values (PVs) are a standard multiple imputation tool for analysis of large education survey data that measures latent proficiency variables. When latent proficiency is the dependent variable, we reconsider the standard institutionally-generated PV methodology and find it applies with greater generality than shown previously. When latent proficiency is an independent variable, we show that the standard institutional PV methodology produces biased inference because the institutional conditioning model places restrictions on the form of the secondary analysts’ model. We offer an alternative approach that avoids these biases based on the mixed effects structural equations (MESE) model of Schofield (2008). PMID:25231627
ERIC Educational Resources Information Center
Henson, James M.; Reise, Steven P.; Kim, Kevin H.
2007-01-01
The accuracy of structural model parameter estimates in latent variable mixture modeling was explored with a 3 (sample size) [times] 3 (exogenous latent mean difference) [times] 3 (endogenous latent mean difference) [times] 3 (correlation between factors) [times] 3 (mixture proportions) factorial design. In addition, the efficacy of several…
Burri, Andrea; Spector, Tim; Rahman, Qazi
2015-04-01
Homosexuality is a stable population-level trait in humans that lowers direct fitness and yet is substantially heritable, resulting in a so-called Darwinian "paradox." Evolutionary models have proposed that polymorphic genes influencing homosexuality confer a reproductive benefit to heterosexual carriers, thus offsetting the fitness costs associated with persistent homosexuality. This benefit may consist of a "sex typicality" intermediate phenotype. However, there are few empirical tests of this hypothesis using genetically informative data in humans. This study aimed to test the hypothesis that common genetic factors can explain the association between measures of sex typicality, mating success, and homosexuality in a Western (British) sample of female twins. Here, we used data from 996 female twins (498 twin pairs) comprising 242 full dizygotic pairs and 256 full monozygotic pairs (mean age 56.8) and 1,555 individuals whose co-twin did not participate. Measures of sexual orientation, sex typicality (recalled childhood gender nonconformity), and mating success (number of lifetime sexual partners) were completed. Variables were subject to multivariate variance component analysis. We found that masculine women are more likely to be nonheterosexual, report more sexual partners, and, when heterosexual, also report more sexual partners. Multivariate twin modeling showed that common genetic factors explained the relationship between sexual orientation, sex typicality, and mating success through a shared latent factor. Our findings suggest that genetic factors responsible for nonheterosexuality are shared with genetic factors responsible for the number of lifetime sexual partners via a latent sex typicality phenotype in human females. These results may have implications for evolutionary models of homosexuality but are limited by potential mediating variables (such as personality traits) and measurement issues. © 2015 International Society for Sexual Medicine.
Burri, Andrea; Cherkas, Lynn; Spector, Timothy; Rahman, Qazi
2011-01-01
Human sexual orientation is influenced by genetic and non-shared environmental factors as are two important psychological correlates--childhood gender typicality (CGT) and adult gender identity (AGI). However, researchers have been unable to resolve the genetic and non-genetic components that contribute to the covariation between these traits, particularly in women. Here we performed a multivariate genetic analysis in a large sample of British female twins (N = 4,426) who completed a questionnaire assessing sexual attraction, CGT and AGI. Univariate genetic models indicated modest genetic influences on sexual attraction (25%), AGI (11%) and CGT (31%). For the multivariate analyses, a common pathway model best fitted the data. This indicated that a single latent variable influenced by a genetic component and common non-shared environmental component explained the association between the three traits but there was substantial measurement error. These findings highlight common developmental factors affecting differences in sexual orientation.
Gene Variants Associated with Antisocial Behaviour: A Latent Variable Approach
ERIC Educational Resources Information Center
Bentley, Mary Jane; Lin, Haiqun; Fernandez, Thomas V.; Lee, Maria; Yrigollen, Carolyn M.; Pakstis, Andrew J.; Katsovich, Liliya; Olds, David L.; Grigorenko, Elena L.; Leckman, James F.
2013-01-01
Objective: The aim of this study was to determine if a latent variable approach might be useful in identifying shared variance across genetic risk alleles that is associated with antisocial behaviour at age 15 years. Methods: Using a conventional latent variable approach, we derived an antisocial phenotype in 328 adolescents utilizing data from a…
The Least-Squares Estimation of Latent Trait Variables.
ERIC Educational Resources Information Center
Tatsuoka, Kikumi
This paper presents a new method for estimating a given latent trait variable by the least-squares approach. The beta weights are obtained recursively with the help of Fourier series and expressed as functions of item parameters of response curves. The values of the latent trait variable estimated by this method and by maximum likelihood method…
Bayes Factor Covariance Testing in Item Response Models.
Fox, Jean-Paul; Mulder, Joris; Sinharay, Sandip
2017-12-01
Two marginal one-parameter item response theory models are introduced, by integrating out the latent variable or random item parameter. It is shown that both marginal response models are multivariate (probit) models with a compound symmetry covariance structure. Several common hypotheses concerning the underlying covariance structure are evaluated using (fractional) Bayes factor tests. The support for a unidimensional factor (i.e., assumption of local independence) and differential item functioning are evaluated by testing the covariance components. The posterior distribution of common covariance components is obtained in closed form by transforming latent responses with an orthogonal (Helmert) matrix. This posterior distribution is defined as a shifted-inverse-gamma, thereby introducing a default prior and a balanced prior distribution. Based on that, an MCMC algorithm is described to estimate all model parameters and to compute (fractional) Bayes factor tests. Simulation studies are used to show that the (fractional) Bayes factor tests have good properties for testing the underlying covariance structure of binary response data. The method is illustrated with two real data studies.
Selection of latent variables for multiple mixed-outcome models
ZHOU, LING; LIN, HUAZHEN; SONG, XINYUAN; LI, YI
2014-01-01
Latent variable models have been widely used for modeling the dependence structure of multiple outcomes data. However, the formulation of a latent variable model is often unknown a priori, the misspecification will distort the dependence structure and lead to unreliable model inference. Moreover, multiple outcomes with varying types present enormous analytical challenges. In this paper, we present a class of general latent variable models that can accommodate mixed types of outcomes. We propose a novel selection approach that simultaneously selects latent variables and estimates parameters. We show that the proposed estimator is consistent, asymptotically normal and has the oracle property. The practical utility of the methods is confirmed via simulations as well as an application to the analysis of the World Values Survey, a global research project that explores peoples’ values and beliefs and the social and personal characteristics that might influence them. PMID:27642219
ERIC Educational Resources Information Center
Kelava, Augustin; Nagengast, Benjamin
2012-01-01
Structural equation models with interaction and quadratic effects have become a standard tool for testing nonlinear hypotheses in the social sciences. Most of the current approaches assume normally distributed latent predictor variables. In this article, we present a Bayesian model for the estimation of latent nonlinear effects when the latent…
Gene variants associated with antisocial behaviour: A latent variable approach
Bentley, Mary Jane; Lin, Haiqun; Fernandez, Thomas V.; Lee, Maria; Yrigollen, Carolyn M.; Pakstis, Andrew J.; Katsovich, Liliya; Olds, David L.; Grigorenko, Elena L.; Leckman, James F.
2013-01-01
Objective The aim of this study was to determine if a latent variable approach might be useful in identifying shared variance across genetic risk alleles that is associated with antisocial behaviour at age 15 years. Methods Using a conventional latent variable approach, we derived an antisocial phenotype in 328 adolescents utilizing data from a 15-year follow-up of a randomized trial of a prenatal and infancy nurse-home visitation program in Elmira, New York. We then investigated, via a novel latent variable approach, 450 informative genetic polymorphisms in 71 genes previously associated with antisocial behaviour, drug use, affiliative behaviours, and stress response in 241 consenting individuals for whom DNA was available. Haplotype and Pathway analyses were also performed. Results Eight single-nucleotide polymorphisms (SNPs) from 8 genes contributed to the latent genetic variable that in turn accounted for 16.0% of the variance within the latent antisocial phenotype. The number of risk alleles was linearly related to the latent antisocial variable scores. Haplotypes that included the putative risk alleles for all 8 genes were also associated with higher latent antisocial variable scores. In addition, 33 SNPs from 63 of the remaining genes were also significant when added to the final model. Many of these genes interact on a molecular level, forming molecular networks. The results support a role for genes related to dopamine, norepinephrine, serotonin, glutamate, opioid, and cholinergic signaling as well as stress response pathways in mediating susceptibility to antisocial behaviour. Conclusions This preliminary study supports use of relevant behavioural indicators and latent variable approaches to study the potential “co-action” of gene variants associated with antisocial behaviour. It also underscores the cumulative relevance of common genetic variants for understanding the etiology of complex behaviour. If replicated in future studies, this approach may allow the identification of a ‘shared’ variance across genetic risk alleles associated with complex neuropsychiatric dimensional phenotypes using relatively small numbers of well-characterized research participants. PMID:23822756
ERIC Educational Resources Information Center
Henseler, Jorg; Chin, Wynne W.
2010-01-01
In social and business sciences, the importance of the analysis of interaction effects between manifest as well as latent variables steadily increases. Researchers using partial least squares (PLS) to analyze interaction effects between latent variables need an overview of the available approaches as well as their suitability. This article…
Accuracy of latent-variable estimation in Bayesian semi-supervised learning.
Yamazaki, Keisuke
2015-09-01
Hierarchical probabilistic models, such as Gaussian mixture models, are widely used for unsupervised learning tasks. These models consist of observable and latent variables, which represent the observable data and the underlying data-generation process, respectively. Unsupervised learning tasks, such as cluster analysis, are regarded as estimations of latent variables based on the observable ones. The estimation of latent variables in semi-supervised learning, where some labels are observed, will be more precise than that in unsupervised, and one of the concerns is to clarify the effect of the labeled data. However, there has not been sufficient theoretical analysis of the accuracy of the estimation of latent variables. In a previous study, a distribution-based error function was formulated, and its asymptotic form was calculated for unsupervised learning with generative models. It has been shown that, for the estimation of latent variables, the Bayes method is more accurate than the maximum-likelihood method. The present paper reveals the asymptotic forms of the error function in Bayesian semi-supervised learning for both discriminative and generative models. The results show that the generative model, which uses all of the given data, performs better when the model is well specified. Copyright © 2015 Elsevier Ltd. All rights reserved.
Cham, Heining; West, Stephen G.; Ma, Yue; Aiken, Leona S.
2012-01-01
A Monte Carlo simulation was conducted to investigate the robustness of four latent variable interaction modeling approaches (Constrained Product Indicator [CPI], Generalized Appended Product Indicator [GAPI], Unconstrained Product Indicator [UPI], and Latent Moderated Structural Equations [LMS]) under high degrees of non-normality of the observed exogenous variables. Results showed that the CPI and LMS approaches yielded biased estimates of the interaction effect when the exogenous variables were highly non-normal. When the violation of non-normality was not severe (normal; symmetric with excess kurtosis < 1), the LMS approach yielded the most efficient estimates of the latent interaction effect with the highest statistical power. In highly non-normal conditions, the GAPI and UPI approaches with ML estimation yielded unbiased latent interaction effect estimates, with acceptable actual Type-I error rates for both the Wald and likelihood ratio tests of interaction effect at N ≥ 500. An empirical example illustrated the use of the four approaches in testing a latent variable interaction between academic self-efficacy and positive family role models in the prediction of academic performance. PMID:23457417
Rodriguez-Seijas, Craig; Stohl, Malki; Hasin, Deborah S; Eaton, Nicholas R
2015-07-01
Multivariable comorbidity research indicates that many common mental disorders are manifestations of 2 latent transdiagnostic factors, internalizing and externalizing. Environmental stressors are known to increase the risk for experiencing particular mental disorders, but their relationships with transdiagnostic disorder constructs are unknown. The present study investigated one such stressor, perceived racial discrimination, which is robustly associated with a variety of mental disorders. To examine the direct and indirect associations between perceived racial discrimination and common forms of psychopathology. Quantitative analysis of 12 common diagnoses that were previously assessed in a nationally representative sample (N = 5191) of African American and Afro-Caribbean adults in the United States, taken from the National Survey of American Life, and used to test the possibility that transdiagnostic factors mediate the effects of discrimination on disorders. The data were obtained from February 2001 to March 2003. Latent variable measurement models, including factor analysis, and indirect effect models were used in the study. Mental health diagnoses from reliable and valid structured interviews and perceived race-based discrimination. While perceived discrimination was positively associated with all examined forms of psychopathology and substance use disorders, latent variable indirect effects modeling revealed that almost all of these associations were significantly mediated by the transdiagnostic factors. For social anxiety disorder and attention-deficit/hyperactivity disorder, complete mediation was found. The pathways linking perceived discrimination to psychiatric disorders were not direct but indirect (via transdiagnostic factors). Therefore, perceived discrimination may be associated with risk for myriad psychiatric disorders due to its association with transdiagnostic factors.
ERIC Educational Resources Information Center
Samejima, Fumiko
In latent trait theory the latent space, or space of the hypothetical construct, is usually represented by some unidimensional or multi-dimensional continuum of real numbers. Like the latent space, the item response can either be treated as a discrete variable or as a continuous variable. Latent trait theory relates the item response to the latent…
Zhang, Miaomiao; Wells, William M; Golland, Polina
2017-10-01
We present an efficient probabilistic model of anatomical variability in a linear space of initial velocities of diffeomorphic transformations and demonstrate its benefits in clinical studies of brain anatomy. To overcome the computational challenges of the high dimensional deformation-based descriptors, we develop a latent variable model for principal geodesic analysis (PGA) based on a low dimensional shape descriptor that effectively captures the intrinsic variability in a population. We define a novel shape prior that explicitly represents principal modes as a multivariate complex Gaussian distribution on the initial velocities in a bandlimited space. We demonstrate the performance of our model on a set of 3D brain MRI scans from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. Our model yields a more compact representation of group variation at substantially lower computational cost than the state-of-the-art method such as tangent space PCA (TPCA) and probabilistic principal geodesic analysis (PPGA) that operate in the high dimensional image space. Copyright © 2017 Elsevier B.V. All rights reserved.
Squires, Janet E; Estabrooks, Carole A; Newburn-Cook, Christine V; Gierl, Mark
2011-05-19
There is a lack of acceptable, reliable, and valid survey instruments to measure conceptual research utilization (CRU). In this study, we investigated the psychometric properties of a newly developed scale (the CRU Scale). We used the Standards for Educational and Psychological Testing as a validation framework to assess four sources of validity evidence: content, response processes, internal structure, and relations to other variables. A panel of nine international research utilization experts performed a formal content validity assessment. To determine response process validity, we conducted a series of one-on-one scale administration sessions with 10 healthcare aides. Internal structure and relations to other variables validity was examined using CRU Scale response data from a sample of 707 healthcare aides working in 30 urban Canadian nursing homes. Principal components analysis and confirmatory factor analyses were conducted to determine internal structure. Relations to other variables were examined using: (1) bivariate correlations; (2) change in mean values of CRU with increasing levels of other kinds of research utilization; and (3) multivariate linear regression. Content validity index scores for the five items ranged from 0.55 to 1.00. The principal components analysis predicted a 5-item 1-factor model. This was inconsistent with the findings from the confirmatory factor analysis, which showed best fit for a 4-item 1-factor model. Bivariate associations between CRU and other kinds of research utilization were statistically significant (p < 0.01) for the latent CRU scale score and all five CRU items. The CRU scale score was also shown to be significant predictor of overall research utilization in multivariate linear regression. The CRU scale showed acceptable initial psychometric properties with respect to responses from healthcare aides in nursing homes. Based on our validity, reliability, and acceptability analyses, we recommend using a reduced (four-item) version of the CRU scale to yield sound assessments of CRU by healthcare aides. Refinement to the wording of one item is also needed. Planned future research will include: latent scale scoring, identification of variables that predict and are outcomes to conceptual research use, and longitudinal work to determine CRU Scale sensitivity to change.
Neighborhood environment profiles for physical activity among older adults.
Adams, Marc A; Sallis, James F; Conway, Terry L; Frank, Lawrence D; Saelens, Brian E; Kerr, Jacqueline; Cain, Kelli L; King, Abby C
2012-11-01
To explore among older adults whether multivariate neighborhood profiles were associated with physical activity (PA) and BMI. Adults (66-97 years) were recruited from Baltimore-Washington, DC (n=360), and Seattle-King County, Washington (n=368), regions. Latent profile analyses were conducted using the Neighborhood Environment Walkability Scale. ANCOVA models tested for criterion validity of profiles by examining relationships to PA and BMI. Neighborhood profiles differed significantly by as much as 10 minutes/day for moderate-to-vigorous PA, 1.1 hours/week for walking for errands, and almost 50 minutes/week for leisure PA. Environmental variables resulted in meaningful neighborhood patterns that explained large differences in seniors' health outcomes.
Replicates in high dimensions, with applications to latent variable graphical models.
Tan, Kean Ming; Ning, Yang; Witten, Daniela M; Liu, Han
2016-12-01
In classical statistics, much thought has been put into experimental design and data collection. In the high-dimensional setting, however, experimental design has been less of a focus. In this paper, we stress the importance of collecting multiple replicates for each subject in this setting. We consider learning the structure of a graphical model with latent variables, under the assumption that these variables take a constant value across replicates within each subject. By collecting multiple replicates for each subject, we are able to estimate the conditional dependence relationships among the observed variables given the latent variables. To test the null hypothesis of conditional independence between two observed variables, we propose a pairwise decorrelated score test. Theoretical guarantees are established for parameter estimation and for this test. We show that our proposal is able to estimate latent variable graphical models more accurately than some existing proposals, and apply the proposed method to a brain imaging dataset.
Saunders, Kate; Bilderbeck, Amy; Palmius, Niclas; Goodwin, Guy; De Vos, Maarten
2017-01-01
Background We recently described a new questionnaire to monitor mood called mood zoom (MZ). MZ comprises 6 items assessing mood symptoms on a 7-point Likert scale; we had previously used standard principal component analysis (PCA) to tentatively understand its properties, but the presence of multiple nonzero loadings obstructed the interpretation of its latent variables. Objective The aim of this study was to rigorously investigate the internal properties and latent variables of MZ using an algorithmic approach which may lead to more interpretable results than PCA. Additionally, we explored three other widely used psychiatric questionnaires to investigate latent variable structure similarities with MZ: (1) Altman self-rating mania scale (ASRM), assessing mania; (2) quick inventory of depressive symptomatology (QIDS) self-report, assessing depression; and (3) generalized anxiety disorder (7-item) (GAD-7), assessing anxiety. Methods We elicited responses from 131 participants: 48 bipolar disorder (BD), 32 borderline personality disorder (BPD), and 51 healthy controls (HC), collected longitudinally (median [interquartile range, IQR]: 363 [276] days). Participants were requested to complete ASRM, QIDS, and GAD-7 weekly (all 3 questionnaires were completed on the Web) and MZ daily (using a custom-based smartphone app). We applied sparse PCA (SPCA) to determine the latent variables for the four questionnaires, where a small subset of the original items contributes toward each latent variable. Results We found that MZ had great consistency across the three cohorts studied. Three main principal components were derived using SPCA, which can be tentatively interpreted as (1) anxiety and sadness, (2) positive affect, and (3) irritability. The MZ principal component comprising anxiety and sadness explains most of the variance in BD and BPD, whereas the positive affect of MZ explains most of the variance in HC. The latent variables in ASRM were identical for the patient groups but different for HC; nevertheless, the latent variables shared common items across both the patient group and HC. On the contrary, QIDS had overall very different principal components across groups; sleep was a key element in HC and BD but was absent in BPD. In GAD-7, nervousness was the principal component explaining most of the variance in BD and HC. Conclusions This study has important implications for understanding self-reported mood. MZ has a consistent, intuitively interpretable latent variable structure and hence may be a good instrument for generic mood assessment. Irritability appears to be the key distinguishing latent variable between BD and BPD and might be useful for differential diagnosis. Anxiety and sadness are closely interlinked, a finding that might inform treatment effects to jointly address these covarying symptoms. Anxiety and nervousness appear to be amongst the cardinal latent variable symptoms in BD and merit close attention in clinical practice. PMID:28546141
Time Series Modelling of Syphilis Incidence in China from 2005 to 2012
Zhang, Xingyu; Zhang, Tao; Pei, Jiao; Liu, Yuanyuan; Li, Xiaosong; Medrano-Gracia, Pau
2016-01-01
Background The infection rate of syphilis in China has increased dramatically in recent decades, becoming a serious public health concern. Early prediction of syphilis is therefore of great importance for heath planning and management. Methods In this paper, we analyzed surveillance time series data for primary, secondary, tertiary, congenital and latent syphilis in mainland China from 2005 to 2012. Seasonality and long-term trend were explored with decomposition methods. Autoregressive integrated moving average (ARIMA) was used to fit a univariate time series model of syphilis incidence. A separate multi-variable time series for each syphilis type was also tested using an autoregressive integrated moving average model with exogenous variables (ARIMAX). Results The syphilis incidence rates have increased three-fold from 2005 to 2012. All syphilis time series showed strong seasonality and increasing long-term trend. Both ARIMA and ARIMAX models fitted and estimated syphilis incidence well. All univariate time series showed highest goodness-of-fit results with the ARIMA(0,0,1)×(0,1,1) model. Conclusion Time series analysis was an effective tool for modelling the historical and future incidence of syphilis in China. The ARIMAX model showed superior performance than the ARIMA model for the modelling of syphilis incidence. Time series correlations existed between the models for primary, secondary, tertiary, congenital and latent syphilis. PMID:26901682
Ward, David D; Summers, Mathew J; Saunders, Nichole L; Vickers, James C
2015-04-01
Cognitive reserve (CR) is a protective factor that supports cognition by increasing the resilience of an individual's cognitive function to the deleterious effects of cerebral lesions. A single environmental proxy indicator is often used to estimate CR (e.g. education), possibly resulting in a loss of the accuracy and predictive power of the investigation. Furthermore, while estimates of an individual's prior CR can be made, no operational measure exists to estimate dynamic change in CR resulting from exposure to new life experiences. We aimed to develop two latent measures of CR through factor analysis: prior and current, in a sample of 467 healthy older adults. The prior CR measure combined proxy measures traditionally associated with CR, while the current CR measure combined variables that had the potential to reflect dynamic change in CR due to new life experiences. Our main finding was that the analyses uncovered latent variables in hypothesized prior and current models of CR. The prior CR model supports multivariate estimation of pre-existing CR and may be applied to more accurately estimate CR in the absence of neuropathological data. The current CR model may be applied to evaluate and explore the potential benefits of CR-based interventions prior to dementia onset.
Time Series Modelling of Syphilis Incidence in China from 2005 to 2012.
Zhang, Xingyu; Zhang, Tao; Pei, Jiao; Liu, Yuanyuan; Li, Xiaosong; Medrano-Gracia, Pau
2016-01-01
The infection rate of syphilis in China has increased dramatically in recent decades, becoming a serious public health concern. Early prediction of syphilis is therefore of great importance for heath planning and management. In this paper, we analyzed surveillance time series data for primary, secondary, tertiary, congenital and latent syphilis in mainland China from 2005 to 2012. Seasonality and long-term trend were explored with decomposition methods. Autoregressive integrated moving average (ARIMA) was used to fit a univariate time series model of syphilis incidence. A separate multi-variable time series for each syphilis type was also tested using an autoregressive integrated moving average model with exogenous variables (ARIMAX). The syphilis incidence rates have increased three-fold from 2005 to 2012. All syphilis time series showed strong seasonality and increasing long-term trend. Both ARIMA and ARIMAX models fitted and estimated syphilis incidence well. All univariate time series showed highest goodness-of-fit results with the ARIMA(0,0,1)×(0,1,1) model. Time series analysis was an effective tool for modelling the historical and future incidence of syphilis in China. The ARIMAX model showed superior performance than the ARIMA model for the modelling of syphilis incidence. Time series correlations existed between the models for primary, secondary, tertiary, congenital and latent syphilis.
Bohnert, Amy S B; German, Danielle; Knowlton, Amy R; Latkin, Carl A
2010-03-01
Social support is a multi-dimensional construct that is important to drug use cessation. The present study identified types of supportive friends among the social network members in a community-based sample and examined the relationship of supporter-type classes with supporter, recipient, and supporter-recipient relationship characteristics. We hypothesized that the most supportive network members and their support recipients would be less likely to be current heroin/cocaine users. Participants (n=1453) were recruited from low-income neighborhoods with a high prevalence of drug use. Participants identified their friends via a network inventory, and all nominated friends were included in a latent class analysis and grouped based on their probability of providing seven types of support. These latent classes were included as the dependent variable in a multi-level regression of supporter drug use, recipient drug use, and other characteristics. The best-fitting latent class model identified five support patterns: friends who provided Little/No Support, Low/Moderate Support, High Support, Socialization Support, and Financial Support. In bivariate models, friends in the High, Low/Moderate, and Financial Support were less likely to use heroin or cocaine and had less conflict with and were more trusted by the support recipient than friends in the Low/No Support class. Individuals with supporters in those same support classes compared to the Low/No Support class were less likely to use heroin or cocaine, or to be homeless or female. Multivariable models suggested similar trends. Those with current heroin/cocaine use were less likely to provide or receive comprehensive support from friends. Published by Elsevier Ireland Ltd.
ERIC Educational Resources Information Center
Pek, Jolynn; Losardo, Diane; Bauer, Daniel J.
2011-01-01
Compared to parametric models, nonparametric and semiparametric approaches to modeling nonlinearity between latent variables have the advantage of recovering global relationships of unknown functional form. Bauer (2005) proposed an indirect application of finite mixtures of structural equation models where latent components are estimated in the…
ERIC Educational Resources Information Center
Cham, Heining; West, Stephen G.; Ma, Yue; Aiken, Leona S.
2012-01-01
A Monte Carlo simulation was conducted to investigate the robustness of 4 latent variable interaction modeling approaches (Constrained Product Indicator [CPI], Generalized Appended Product Indicator [GAPI], Unconstrained Product Indicator [UPI], and Latent Moderated Structural Equations [LMS]) under high degrees of nonnormality of the observed…
Dynamic Latent Trait Models with Mixed Hidden Markov Structure for Mixed Longitudinal Outcomes.
Zhang, Yue; Berhane, Kiros
2016-01-01
We propose a general Bayesian joint modeling approach to model mixed longitudinal outcomes from the exponential family for taking into account any differential misclassification that may exist among categorical outcomes. Under this framework, outcomes observed without measurement error are related to latent trait variables through generalized linear mixed effect models. The misclassified outcomes are related to the latent class variables, which represent unobserved real states, using mixed hidden Markov models (MHMM). In addition to enabling the estimation of parameters in prevalence, transition and misclassification probabilities, MHMMs capture cluster level heterogeneity. A transition modeling structure allows the latent trait and latent class variables to depend on observed predictors at the same time period and also on latent trait and latent class variables at previous time periods for each individual. Simulation studies are conducted to make comparisons with traditional models in order to illustrate the gains from the proposed approach. The new approach is applied to data from the Southern California Children Health Study (CHS) to jointly model questionnaire based asthma state and multiple lung function measurements in order to gain better insight about the underlying biological mechanism that governs the inter-relationship between asthma state and lung function development.
NASA Astrophysics Data System (ADS)
Anekawati, Anik; Widjanarko Otok, Bambang; Purhadi; Sutikno
2017-06-01
Research in education often involves a latent variable. Statistical analysis technique that has the ability to analyze the pattern of relationship among latent variables as well as between latent variables and their indicators is Structural Equation Modeling (SEM). SEM partial least square (PLS) was developed as an alternative if these conditions are met: the theory that underlying the design of the model is weak, does not assume a certain scale measurement, the sample size should not be large and the data does not have the multivariate normal distribution. The purpose of this paper is to compare the results of modeling of the educational quality in high school level (SMA/MA) in Sumenep Regency with structural equation modeling approach partial least square with three schemes estimation of score factors. This paper is a result of explanatory research using secondary data from Sumenep Education Department and Badan Pusat Statistik (BPS) Sumenep which was data of Sumenep in the Figures and the District of Sumenep in the Figures for the year 2015. The unit of observation in this study were districts in Sumenep that consists of 18 districts on the mainland and 9 districts in the islands. There were two endogenous variables and one exogenous variable. Endogenous variables are the quality of education level of SMA/MA (Y1) and school infrastructure (Y2), whereas exogenous variable is socio-economic condition (X1). In this study, There is one improved model which represented by model from path scheme because this model is a consistent, all of its indicators are valid and its the value of R-square increased which is: Y1=0.651Y2. In this model, the quality of education influenced only by the school infrastructure (0.651). The socio-economic condition did not affect neither the school infrastructure nor the quality of education. If the school infrastructure increased 1 point, then the quality of education increased 0.651 point. The quality of education had an R2 of 0.418, which indicates that 41.8 percent of variance in the quality of education is explained by the school infrastructure, the remaining 58.2% is explained by the other factors which were not investigated in this work.
Prevalence and associated risk factors of latent tuberculosis infection in a Spanish prison.
López de Goicoechea-Saiz, M E; Sternberg, F; Portilla-Sogorb, J
2018-01-01
To determine the prevalence of latent tuberculosis infection (LTI) in a Spanish prison, analyze the main sociodemographic and clinical variables associated with this condition and estimate the percentage of individuals with LTI who have received chemoprophylactic treatment. Cross-sectional study including inmates hosted in the Madrid VI Prison on 16/07/2016. Exclusion criteria: history of tuberculosis; non-updated tuberculin test according to the Tuberculosis Prevention and Control Program in Prisons protocol. Information of the variables was collected from SANIT and SIP programs, and by checking the clinical records of inmates. Description of the participant population and comparison between the frequency of distribution of the independent variables in LTI present and absent groups were performed, the last calculating the p value with Ji2 and Mann-Whitney U tests. Bivariate and multivariate analysis have been carried out with a logistic regression model. 936 individuals have been included. The prevalence of LTI in prison is 54.6%. This condition has been linked to the sociodemographic variables age, sex and nationality of origin, being age the one that has shown the strongest association. Among the other factors analyzed, only HCV infection behaves as a predictor of LTI. 30.3% of the individuals with LTI have completed or are receiving chemoprophylactic treatment in the moment of the study. LTI prevalence is high in the Spanish current prison population. The results of the study emphasize the relevance of the LTI screening in the prison setting, specially among high risk groups, and point out the need of a greater effort in the indication and completion of the chemoprophylactic treatment.
Application of Local Linear Embedding to Nonlinear Exploratory Latent Structure Analysis
ERIC Educational Resources Information Center
Wang, Haonan; Iyer, Hari
2007-01-01
In this paper we discuss the use of a recent dimension reduction technique called Locally Linear Embedding, introduced by Roweis and Saul, for performing an exploratory latent structure analysis. The coordinate variables from the locally linear embedding describing the manifold on which the data reside serve as the latent variable scores. We…
Introduction to Latent Class Analysis with Applications
ERIC Educational Resources Information Center
Porcu, Mariano; Giambona, Francesca
2017-01-01
Latent class analysis (LCA) is a statistical method used to group individuals (cases, units) into classes (categories) of an unobserved (latent) variable on the basis of the responses made on a set of nominal, ordinal, or continuous observed variables. In this article, we introduce LCA in order to demonstrate its usefulness to early adolescence…
Mixture Distribution Latent State-Trait Analysis: Basic Ideas and Applications
ERIC Educational Resources Information Center
Courvoisier, Delphine S.; Eid, Michael; Nussbeck, Fridtjof W.
2007-01-01
Extensions of latent state-trait models for continuous observed variables to mixture latent state-trait models with and without covariates of change are presented that can separate individuals differing in their occasion-specific variability. An empirical application to the repeated measurement of mood states (N = 501) revealed that a model with 2…
Inácio, Maria Raquel Cavalcanti; de Lima, Kássio Michell Gomes; Lopes, Valquiria Garcia; Pessoa, José Dalton Cruz; de Almeida Teixeira, Gustavo Henrique
2013-02-15
The aim of this study was to evaluate near-infrared reflectance spectroscopy (NIR), and multivariate calibration potential as a rapid method to determinate anthocyanin content in intact fruit (açaí and palmitero-juçara). Several multivariate calibration techniques, including partial least squares (PLS), interval partial least squares, genetic algorithm, successive projections algorithm, and net analyte signal were compared and validated by establishing figures of merit. Suitable results were obtained with the PLS model (four latent variables and 5-point smoothing) with a detection limit of 6.2 g kg(-1), limit of quantification of 20.7 g kg(-1), accuracy estimated as root mean square error of prediction of 4.8 g kg(-1), mean selectivity of 0.79 g kg(-1), sensitivity of 5.04×10(-3) g kg(-1), precision of 27.8 g kg(-1), and signal-to-noise ratio of 1.04×10(-3) g kg(-1). These results suggest NIR spectroscopy and multivariate calibration can be effectively used to determine anthocyanin content in intact açaí and palmitero-juçara fruit. Copyright © 2012 Elsevier Ltd. All rights reserved.
Mannarini, Stefania; Boffo, Marilisa; Rossi, Alessandro; Balottin, Laura
2017-01-01
Background: Although scientific research on the etiology of mental disorders has improved the knowledge of biogenetic and psychosocial aspects related to the onset of mental illness, stigmatizing attitudes and behaviors are still very prevalent and pose a significant social problem. Aim: The aim of this study was to deepen the knowledge of how attitudes toward people with mental illness are affected by specific personal beliefs and characteristics, such as culture and religion of the perceiver. More precisely, the main purpose is the definition of a structure of variables, namely perceived dangerousness, social closeness, and avoidance of the ill person, together with the beliefs about the best treatment to be undertaken and the sick person' gender, capable of describing the complexity of the stigma construct in particular as far as schizophrenia is concerned. Method: The study involved 305 university students, 183 from the University of Padua, Italy, and 122 from the University of Haifa, Israel. For the analyses, a latent class analysis (LCA) approach was chosen to identify a latent categorical structure accounting for the covariance between the observed variables. Such a latent structure was expected to be moderated by cultural background (Italy versus Israel) and religious beliefs, whereas causal beliefs, recommended treatment, dangerousness, social closeness, and public avoidance were the manifest variables, namely the observed indicators of the latent variable. Results: Two sets of results were obtained. First, the relevance of the manifest variables as indicators of the hypothesized latent variable was highlighted. Second, a two-latent-class categorical dimension represented by prejudicial attitudes, causal beliefs, and treatments concerning schizophrenia was found. Specifically, the differential effects of the two cultures and the religious beliefs on the latent structure and their relations highlighted the relevance of the observed variables as indicators of the expected latent variable. Conclusion: The present study contributes to the improvement of the understanding of how attitudes toward people with mental illness are affected by specific personal beliefs and characteristics of the perceiver. The definition of a structure of variables capable of describing the complexity of the stigma construct in particular as far as schizophrenia is concerned was achieved from a cross-cultural perspective.
ERIC Educational Resources Information Center
Wall, Melanie M.; Guo, Jia; Amemiya, Yasuo
2012-01-01
Mixture factor analysis is examined as a means of flexibly estimating nonnormally distributed continuous latent factors in the presence of both continuous and dichotomous observed variables. A simulation study compares mixture factor analysis with normal maximum likelihood (ML) latent factor modeling. Different results emerge for continuous versus…
ERIC Educational Resources Information Center
Holm-Denoma, Jill M.; Richey, J. Anthony; Joiner, Thomas E., Jr.
2010-01-01
Although the latent structure of various eating disorders has been explored in previous studies, no published studies have examined the latent structure of theoretically relevant variables that have been shown to cut across eating disorder diagnoses. The current study examined 3 such variables (dietary restraint, body dissatisfaction, and drive…
HIV-related sexual risk behavior among African American adolescent girls.
Danielson, Carla Kmett; Walsh, Kate; McCauley, Jenna; Ruggiero, Kenneth J; Brown, Jennifer L; Sales, Jessica M; Rose, Eve; Wingood, Gina M; Diclemente, Ralph J
2014-05-01
Latent class analysis (LCA) is a useful statistical tool that can be used to enhance understanding of how various patterns of combined sexual behavior risk factors may confer differential levels of HIV infection risk and to identify subtypes among African American adolescent girls. Data for this analysis is derived from baseline assessments completed prior to randomization in an HIV prevention trial. Participants were African American girls (n=701) aged 14-20 years presenting to sexual health clinics. Girls completed an audio computer-assisted self-interview, which assessed a range of variables regarding sexual history and current and past sexual behavior. Two latent classes were identified with the probability statistics for the two groups in this model being 0.89 and 0.88, respectively. In the final multivariate model, class 1 (the "higher risk" group; n=331) was distinguished by a higher likelihood of >5 lifetime sexual partners, having sex while high on alcohol/drugs, less frequent condom use, and history of sexually transmitted diseases (STDs), when compared with class 2 (the "lower risk" group; n=370). The derived model correctly classified 85.3% of participants into the two groups and accounted for 71% of the variance in the latent HIV-related sexual behavior risk variable. The higher risk class also had worse scores on all hypothesized correlates (e.g., self-esteem, history of sexual assault or physical abuse) relative to the lower risk class. Sexual health clinics represent a unique point of access for HIV-related sexual risk behavior intervention delivery by capitalizing on contact with adolescent girls when they present for services. Four empirically supported risk factors differentiated higher versus lower HIV risk. Replication of these findings is warranted and may offer an empirical basis for parsimonious screening recommendations for girls presenting for sexual healthcare services.
Nock, Nl; Zhang, Lx
2011-11-29
Methods that can evaluate aggregate effects of rare and common variants are limited. Therefore, we applied a two-stage approach to evaluate aggregate gene effects in the 1000 Genomes Project data, which contain 24,487 single-nucleotide polymorphisms (SNPs) in 697 unrelated individuals from 7 populations. In stage 1, we identified potentially interesting genes (PIGs) as those having at least one SNP meeting Bonferroni correction using univariate, multiple regression models. In stage 2, we evaluate aggregate PIG effects on trait, Q1, by modeling each gene as a latent construct, which is defined by multiple common and rare variants, using the multivariate statistical framework of structural equation modeling (SEM). In stage 1, we found that PIGs varied markedly between a randomly selected replicate (replicate 137) and 100 other replicates, with the exception of FLT1. In stage 1, collapsing rare variants decreased false positives but increased false negatives. In stage 2, we developed a good-fitting SEM model that included all nine genes simulated to affect Q1 (FLT1, KDR, ARNT, ELAV4, FLT4, HIF1A, HIF3A, VEGFA, VEGFC) and found that FLT1 had the largest effect on Q1 (βstd = 0.33 ± 0.05). Using replicate 137 estimates as population values, we found that the mean relative bias in the parameters (loadings, paths, residuals) and their standard errors across 100 replicates was on average, less than 5%. Our latent variable SEM approach provides a viable framework for modeling aggregate effects of rare and common variants in multiple genes, but more elegant methods are needed in stage 1 to minimize type I and type II error.
Four not six: Revealing culturally common facial expressions of emotion.
Jack, Rachael E; Sun, Wei; Delis, Ioannis; Garrod, Oliver G B; Schyns, Philippe G
2016-06-01
As a highly social species, humans generate complex facial expressions to communicate a diverse range of emotions. Since Darwin's work, identifying among these complex patterns which are common across cultures and which are culture-specific has remained a central question in psychology, anthropology, philosophy, and more recently machine vision and social robotics. Classic approaches to addressing this question typically tested the cross-cultural recognition of theoretically motivated facial expressions representing 6 emotions, and reported universality. Yet, variable recognition accuracy across cultures suggests a narrower cross-cultural communication supported by sets of simpler expressive patterns embedded in more complex facial expressions. We explore this hypothesis by modeling the facial expressions of over 60 emotions across 2 cultures, and segregating out the latent expressive patterns. Using a multidisciplinary approach, we first map the conceptual organization of a broad spectrum of emotion words by building semantic networks in 2 cultures. For each emotion word in each culture, we then model and validate its corresponding dynamic facial expression, producing over 60 culturally valid facial expression models. We then apply to the pooled models a multivariate data reduction technique, revealing 4 latent and culturally common facial expression patterns that each communicates specific combinations of valence, arousal, and dominance. We then reveal the face movements that accentuate each latent expressive pattern to create complex facial expressions. Our data questions the widely held view that 6 facial expression patterns are universal, instead suggesting 4 latent expressive patterns with direct implications for emotion communication, social psychology, cognitive neuroscience, and social robotics. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
The Houdini Transformation: True, but Illusory.
Bentler, Peter M; Molenaar, Peter C M
2012-01-01
Molenaar (2003, 2011) showed that a common factor model could be transformed into an equivalent model without factors, involving only observed variables and residual errors. He called this invertible transformation the Houdini transformation. His derivation involved concepts from time series and state space theory. This paper verifies the Houdini transformation on a general latent variable model using algebraic methods. The results show that the Houdini transformation is illusory, in the sense that the Houdini transformed model remains a latent variable model. Contrary to common knowledge, a model that is a path model with only observed variables and residual errors may, in fact, be a latent variable model.
The Houdini Transformation: True, but Illusory
Bentler, Peter M.; Molenaar, Peter C. M.
2012-01-01
Molenaar (2003, 2011) showed that a common factor model could be transformed into an equivalent model without factors, involving only observed variables and residual errors. He called this invertible transformation the Houdini transformation. His derivation involved concepts from time series and state space theory. This paper verifies the Houdini transformation on a general latent variable model using algebraic methods. The results show that the Houdini transformation is illusory, in the sense that the Houdini transformed model remains a latent variable model. Contrary to common knowledge, a model that is a path model with only observed variables and residual errors may, in fact, be a latent variable model. PMID:23180888
Rotation in the Dynamic Factor Modeling of Multivariate Stationary Time Series.
ERIC Educational Resources Information Center
Molenaar, Peter C. M.; Nesselroade, John R.
2001-01-01
Proposes a special rotation procedure for the exploratory dynamic factor model for stationary multivariate time series. The rotation procedure applies separately to each univariate component series of a q-variate latent factor series and transforms such a component, initially represented as white noise, into a univariate moving-average.…
IRT-ZIP Modeling for Multivariate Zero-Inflated Count Data
ERIC Educational Resources Information Center
Wang, Lijuan
2010-01-01
This study introduces an item response theory-zero-inflated Poisson (IRT-ZIP) model to investigate psychometric properties of multiple items and predict individuals' latent trait scores for multivariate zero-inflated count data. In the model, two link functions are used to capture two processes of the zero-inflated count data. Item parameters are…
Mikulich-Gilbertson, Susan K; Wagner, Brandie D; Grunwald, Gary K; Riggs, Paula D; Zerbe, Gary O
2018-01-01
Medical research is often designed to investigate changes in a collection of response variables that are measured repeatedly on the same subjects. The multivariate generalized linear mixed model (MGLMM) can be used to evaluate random coefficient associations (e.g. simple correlations, partial regression coefficients) among outcomes that may be non-normal and differently distributed by specifying a multivariate normal distribution for their random effects and then evaluating the latent relationship between them. Empirical Bayes predictors are readily available for each subject from any mixed model and are observable and hence, plotable. Here, we evaluate whether second-stage association analyses of empirical Bayes predictors from a MGLMM, provide a good approximation and visual representation of these latent association analyses using medical examples and simulations. Additionally, we compare these results with association analyses of empirical Bayes predictors generated from separate mixed models for each outcome, a procedure that could circumvent computational problems that arise when the dimension of the joint covariance matrix of random effects is large and prohibits estimation of latent associations. As has been shown in other analytic contexts, the p-values for all second-stage coefficients that were determined by naively assuming normality of empirical Bayes predictors provide a good approximation to p-values determined via permutation analysis. Analyzing outcomes that are interrelated with separate models in the first stage and then associating the resulting empirical Bayes predictors in a second stage results in different mean and covariance parameter estimates from the maximum likelihood estimates generated by a MGLMM. The potential for erroneous inference from using results from these separate models increases as the magnitude of the association among the outcomes increases. Thus if computable, scatterplots of the conditionally independent empirical Bayes predictors from a MGLMM are always preferable to scatterplots of empirical Bayes predictors generated by separate models, unless the true association between outcomes is zero.
Burri, Andrea; Cherkas, Lynn; Spector, Timothy; Rahman, Qazi
2011-01-01
Background Human sexual orientation is influenced by genetic and non-shared environmental factors as are two important psychological correlates – childhood gender typicality (CGT) and adult gender identity (AGI). However, researchers have been unable to resolve the genetic and non-genetic components that contribute to the covariation between these traits, particularly in women. Methodology/Principal Findings Here we performed a multivariate genetic analysis in a large sample of British female twins (N = 4,426) who completed a questionnaire assessing sexual attraction, CGT and AGI. Univariate genetic models indicated modest genetic influences on sexual attraction (25%), AGI (11%) and CGT (31%). For the multivariate analyses, a common pathway model best fitted the data. Conclusions/Significance This indicated that a single latent variable influenced by a genetic component and common non-shared environmental component explained the association between the three traits but there was substantial measurement error. These findings highlight common developmental factors affecting differences in sexual orientation. PMID:21760939
ERIC Educational Resources Information Center
Rupp, Andre A.
2012-01-01
In the focus article of this issue, von Davier, Naemi, and Roberts essentially coupled: (1) a short methodological review of structural similarities of latent variable models with discrete and continuous latent variables; and (2) 2 short empirical case studies that show how these models can be applied to real, rather than simulated, large-scale…
ERIC Educational Resources Information Center
Kelava, Augustin; Werner, Christina S.; Schermelleh-Engel, Karin; Moosbrugger, Helfried; Zapf, Dieter; Ma, Yue; Cham, Heining; Aiken, Leona S.; West, Stephen G.
2011-01-01
Interaction and quadratic effects in latent variable models have to date only rarely been tested in practice. Traditional product indicator approaches need to create product indicators (e.g., x[superscript 2] [subscript 1], x[subscript 1]x[subscript 4]) to serve as indicators of each nonlinear latent construct. These approaches require the use of…
ERIC Educational Resources Information Center
von Davier, Matthias
2014-01-01
Diagnostic models combine multiple binary latent variables in an attempt to produce a latent structure that provides more information about test takers' performance than do unidimensional latent variable models. Recent developments in diagnostic modeling emphasize the possibility that multiple skills may interact in a conjunctive way within the…
Nespolo, Roberto F; Castañeda, Luis E; Roff, Derek A
2005-08-01
Energy metabolism in animals has been largely studied in relation to exogenous sources of variation. However, because they give insight into the relationship between whole metabolism and lower organizational levels such as organs and tissues, examination of endogenous determinants of metabolism other than body mass is itself very important. We studied the multivariate association of body parts and several aspects of energy metabolism in an insect, the nymphs of the sand cricket, Gryllus firmus. By using a variety of both univariate and multivariate techniques, we explored the resultant variance-covariance matrix to build a path diagram with latent variables. After controlling for body mass, we found a significant canonical correlation between metabolism and morphology. According to the factor loadings and path coefficients, the most important contributions of morphology to the correlation were thorax and abdomen size measures, whereas the most important metabolic contribution was resting metabolism. Activity metabolism was mostly explained by body mass rather than body parts, which could be a result of resting rates being chronic consequences of the functioning of the metabolic machinery that the insect must maintain.
Measurement Model Specification Error in LISREL Structural Equation Models.
ERIC Educational Resources Information Center
Baldwin, Beatrice; Lomax, Richard
This LISREL study examines the robustness of the maximum likelihood estimates under varying degrees of measurement model misspecification. A true model containing five latent variables (two endogenous and three exogenous) and two indicator variables per latent variable was used. Measurement model misspecification considered included errors of…
Hoyle, R H
1991-02-01
Indirect measures of psychological constructs are vital to clinical research. On occasion, however, the meaning of indirect measures of psychological constructs is obfuscated by statistical procedures that do not account for the complex relations between items and latent variables and among latent variables. Covariance structure analysis (CSA) is a statistical procedure for testing hypotheses about the relations among items that indirectly measure a psychological construct and relations among psychological constructs. This article introduces clinical researchers to the strengths and limitations of CSA as a statistical procedure for conceiving and testing structural hypotheses that are not tested adequately with other statistical procedures. The article is organized around two empirical examples that illustrate the use of CSA for evaluating measurement models with correlated error terms, higher-order factors, and measured and latent variables.
Unfinished Business in Clarifying Causal Measurement: Commentary on Bainter and Bollen
ERIC Educational Resources Information Center
Markus, Keith A.
2014-01-01
In a series of articles and comments, Kenneth Bollen and his collaborators have incrementally refined an account of structural equation models that (a) model a latent variable as the effect of several observed variables and (b) carry an interpretation of the observed variables as, in some sense, measures of the latent variable that they cause.…
Process connectivity reveals ecohydrologic sensitivity to drought and rainfall pulses
NASA Astrophysics Data System (ADS)
Goodwell, A. E.; Kumar, P.
2017-12-01
Ecohydrologic fluxes within atmosphere, canopy and soil systems exhibit complex and joint variability. This complexity arises from direct and indirect forcing and feedback interactions that can cause fluctuations to propagate between water, energy, and nutrient fluxes at various time scales. When an ecosystem is perturbed in the form of a single storm event, an accumulating drought, or changes in climate and land cover, this aspect of joint variability may dictate responsiveness and resilience of the entire system. A characterization of the time-dependent and multivariate connectivity between processes, fluxes, and states is necessary to identify and understand these aspects of ecohydrologic systems. We construct Temporal Information Partitioning Networks (TIPNets), based on information theory measures, to identify time-dependencies between variables measured at flux towers along elevation and climate gradients in relation to their responses to moisture-related perturbations. Along a flux tower transect in the Reynolds Creek Critical Zone Observatory (CZO) in Idaho, we detect a significant network response to a large 2015 dry season rainfall event that enhances microbial respiration and latent heat fluxes. At a transect in the Southern Sierra CZO in California, we explore network properties in relation to drought responses from 2011 to 2015. We find that both high and low elevation sites exhibit decreased connectivity between atmospheric and soil variables and latent heat fluxes, but the higher elevation site is less sensitive to this altered connectivity in terms of average monthly heat fluxes. Through a novel approach to gage the responsiveness of ecosystem fluxes to shifts in connectivity, this study aids our understanding of ecohydrologic sensitivity to short-term rainfall events and longer term droughts. This study is relevant to ecosystem resilience under a changing climate, and can lead to a greater understanding of shifting behaviors in many types of complex systems.
Interexaminer variation of minutia markup on latent fingerprints.
Ulery, Bradford T; Hicklin, R Austin; Roberts, Maria Antonia; Buscaglia, JoAnn
2016-07-01
Latent print examiners often differ in the number of minutiae they mark during analysis of a latent, and also during comparison of a latent with an exemplar. Differences in minutia counts understate interexaminer variability: examiners' markups may have similar minutia counts but differ greatly in which specific minutiae were marked. We assessed variability in minutia markup among 170 volunteer latent print examiners. Each provided detailed markup documenting their examinations of 22 latent-exemplar pairs of prints randomly assigned from a pool of 320 pairs. An average of 12 examiners marked each latent. The primary factors associated with minutia reproducibility were clarity, which regions of the prints examiners chose to mark, and agreement on value or comparison determinations. In clear areas (where the examiner was "certain of the location, presence, and absence of all minutiae"), median reproducibility was 82%; in unclear areas, median reproducibility was 46%. Differing interpretations regarding which regions should be marked (e.g., when there is ambiguity in the continuity of a print) contributed to variability in minutia markup: especially in unclear areas, marked minutiae were often far from the nearest minutia marked by a majority of examiners. Low reproducibility was also associated with differences in value or comparison determinations. Lack of standardization in minutia markup and unfamiliarity with test procedures presumably contribute to the variability we observed. We have identified factors accounting for interexaminer variability; implementing standards for detailed markup as part of documentation and focusing future training efforts on these factors may help to facilitate transparency and reduce subjectivity in the examination process. Published by Elsevier Ireland Ltd.
Multimethod latent class analysis
Nussbeck, Fridtjof W.; Eid, Michael
2015-01-01
Correct and, hence, valid classifications of individuals are of high importance in the social sciences as these classifications are the basis for diagnoses and/or the assignment to a treatment. The via regia to inspect the validity of psychological ratings is the multitrait-multimethod (MTMM) approach. First, a latent variable model for the analysis of rater agreement (latent rater agreement model) will be presented that allows for the analysis of convergent validity between different measurement approaches (e.g., raters). Models of rater agreement are transferred to the level of latent variables. Second, the latent rater agreement model will be extended to a more informative MTMM latent class model. This model allows for estimating (i) the convergence of ratings, (ii) method biases in terms of differential latent distributions of raters and differential associations of categorizations within raters (specific rater bias), and (iii) the distinguishability of categories indicating if categories are satisfyingly distinct from each other. Finally, an empirical application is presented to exemplify the interpretation of the MTMM latent class model. PMID:26441714
Exploring heterogeneity in clinical trials with latent class analysis
Abarda, Abdallah; Contractor, Ateka A.; Wang, Juan; Dayton, C. Mitchell
2018-01-01
Case-mix is common in clinical trials and treatment effect can vary across different subgroups. Conventionally, a subgroup analysis is performed by dividing the overall study population by one or two grouping variables. It is usually impossible to explore complex high-order intersections among confounding variables. Latent class analysis (LCA) provides a framework to identify latent classes by observed manifest variables. Distal clinical outcomes and treatment effect can be different across these classes. This paper provides a step-by-step tutorial on how to perform LCA with R. A simulated dataset is generated to illustrate the process. In the example, the classify-analyze approach is employed to explore the differential treatment effects on distal outcomes across latent classes. PMID:29955579
2011-01-01
Background There is a lack of acceptable, reliable, and valid survey instruments to measure conceptual research utilization (CRU). In this study, we investigated the psychometric properties of a newly developed scale (the CRU Scale). Methods We used the Standards for Educational and Psychological Testing as a validation framework to assess four sources of validity evidence: content, response processes, internal structure, and relations to other variables. A panel of nine international research utilization experts performed a formal content validity assessment. To determine response process validity, we conducted a series of one-on-one scale administration sessions with 10 healthcare aides. Internal structure and relations to other variables validity was examined using CRU Scale response data from a sample of 707 healthcare aides working in 30 urban Canadian nursing homes. Principal components analysis and confirmatory factor analyses were conducted to determine internal structure. Relations to other variables were examined using: (1) bivariate correlations; (2) change in mean values of CRU with increasing levels of other kinds of research utilization; and (3) multivariate linear regression. Results Content validity index scores for the five items ranged from 0.55 to 1.00. The principal components analysis predicted a 5-item 1-factor model. This was inconsistent with the findings from the confirmatory factor analysis, which showed best fit for a 4-item 1-factor model. Bivariate associations between CRU and other kinds of research utilization were statistically significant (p < 0.01) for the latent CRU scale score and all five CRU items. The CRU scale score was also shown to be significant predictor of overall research utilization in multivariate linear regression. Conclusions The CRU scale showed acceptable initial psychometric properties with respect to responses from healthcare aides in nursing homes. Based on our validity, reliability, and acceptability analyses, we recommend using a reduced (four-item) version of the CRU scale to yield sound assessments of CRU by healthcare aides. Refinement to the wording of one item is also needed. Planned future research will include: latent scale scoring, identification of variables that predict and are outcomes to conceptual research use, and longitudinal work to determine CRU Scale sensitivity to change. PMID:21595888
On the Asymptotic Relative Efficiency of Planned Missingness Designs.
Rhemtulla, Mijke; Savalei, Victoria; Little, Todd D
2016-03-01
In planned missingness (PM) designs, certain data are set a priori to be missing. PM designs can increase validity and reduce cost; however, little is known about the loss of efficiency that accompanies these designs. The present paper compares PM designs to reduced sample (RN) designs that have the same total number of data points concentrated in fewer participants. In 4 studies, we consider models for both observed and latent variables, designs that do or do not include an "X set" of variables with complete data, and a full range of between- and within-set correlation values. All results are obtained using asymptotic relative efficiency formulas, and thus no data are generated; this novel approach allows us to examine whether PM designs have theoretical advantages over RN designs removing the impact of sampling error. Our primary findings are that (a) in manifest variable regression models, estimates of regression coefficients have much lower relative efficiency in PM designs as compared to RN designs, (b) relative efficiency of factor correlation or latent regression coefficient estimates is maximized when the indicators of each latent variable come from different sets, and (c) the addition of an X set improves efficiency in manifest variable regression models only for the parameters that directly involve the X-set variables, but it substantially improves efficiency of most parameters in latent variable models. We conclude that PM designs can be beneficial when the model of interest is a latent variable model; recommendations are made for how to optimize such a design.
Toma, Luiza; Mathijs, Erik
2007-04-01
This paper aims to identify the factors underlying farmers' propensity to participate in organic farming programmes in a Romanian rural region that confronts non-point source pollution. For this, we employ structural equation modelling with latent variables using a specific data set collected through an agri-environmental farm survey in 2001. The model includes one 'behavioural intention' latent variable ('propensity to participate in organic farming programmes') and five 'attitude' and 'socio-economic' latent variables ('socio-demographic characteristics', 'economic characteristics', 'agri-environmental information access', 'environmental risk perception' and 'general environmental concern'). The results indicate that, overall, the model has an adequate fit to the data. All loadings are statistically significant, supporting the theoretical basis for assignment of indicators for each latent variable. The significance tests for the structural model parameters show 'environmental risk perception' as the strongest determinant of farmers' propensity to participate in organic farming programmes.
ERIC Educational Resources Information Center
Kriston, Levente; Melchior, Hanne; Hergert, Anika; Bergelt, Corinna; Watzke, Birgit; Schulz, Holger; von Wolff, Alessa
2011-01-01
The aim of our study was to develop a graphical tool that can be used in addition to standard statistical criteria to support decisions on the number of classes in explorative categorical latent variable modeling for rehabilitation research. Data from two rehabilitation research projects were used. In the first study, a latent profile analysis was…
Data on the interexaminer variation of minutia markup on latent fingerprints.
Ulery, Bradford T; Hicklin, R Austin; Roberts, Maria Antonia; Buscaglia, JoAnn
2016-09-01
The data in this article supports the research paper entitled "Interexaminer variation of minutia markup on latent fingerprints" [1]. The data in this article describes the variability in minutia markup during both analysis of the latents and comparison between latents and exemplars. The data was collected in the "White Box Latent Print Examiner Study," in which each of 170 volunteer latent print examiners provided detailed markup documenting their examinations of latent-exemplar pairs of prints randomly assigned from a pool of 320 pairs. Each examiner examined 22 latent-exemplar pairs; an average of 12 examiners marked each latent.
Xu, Man K.; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J. S.; Croudace, Tim J.; Barnett, Jennifer H.; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B.
2017-01-01
Very few molecular genetic studies of personality traits have used longitudinal phenotypic data, therefore molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene (MAOA) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = −0.167; CI: −0.289, −0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although, we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to x-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits. PMID:29075213
Xu, Man K; Gaysina, Darya; Tsonaka, Roula; Morin, Alexandre J S; Croudace, Tim J; Barnett, Jennifer H; Houwing-Duistermaat, Jeanine; Richards, Marcus; Jones, Peter B
2017-01-01
Very few molecular genetic studies of personality traits have used longitudinal phenotypic data, therefore molecular basis for developmental change and stability of personality remains to be explored. We examined the role of the monoamine oxidase A gene ( MAOA ) on extraversion and neuroticism from adolescence to adulthood, using modern latent variable methods. A sample of 1,160 male and 1,180 female participants with complete genotyping data was drawn from a British national birth cohort, the MRC National Survey of Health and Development (NSHD). The predictor variable was based on a latent variable representing genetic variations of the MAOA gene measured by three SNPs (rs3788862, rs5906957, and rs979606). Latent phenotype variables were constructed using psychometric methods to represent cross-sectional and longitudinal phenotypes of extraversion and neuroticism measured at ages 16 and 26. In males, the MAOA genetic latent variable (AAG) was associated with lower extraversion score at age 16 (β = -0.167; CI: -0.289, -0.045; p = 0.007, FDRp = 0.042), as well as greater increase in extraversion score from 16 to 26 years (β = 0.197; CI: 0.067, 0.328; p = 0.003, FDRp = 0.036). No genetic association was found for neuroticism after adjustment for multiple testing. Although, we did not find statistically significant associations after multiple testing correction in females, this result needs to be interpreted with caution due to issues related to x-inactivation in females. The latent variable method is an effective way of modeling phenotype- and genetic-based variances and may therefore improve the methodology of molecular genetic studies of complex psychological traits.
Group Comparisons in the Presence of Missing Data Using Latent Variable Modeling Techniques
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2010-01-01
A latent variable modeling approach for examining population similarities and differences in observed variable relationship and mean indexes in incomplete data sets is discussed. The method is based on the full information maximum likelihood procedure of model fitting and parameter estimation. The procedure can be employed to test group identities…
Stochastic Approximation Methods for Latent Regression Item Response Models
ERIC Educational Resources Information Center
von Davier, Matthias; Sinharay, Sandip
2010-01-01
This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…
Estimation and Model Selection for Finite Mixtures of Latent Interaction Models
ERIC Educational Resources Information Center
Hsu, Jui-Chen
2011-01-01
Latent interaction models and mixture models have received considerable attention in social science research recently, but little is known about how to handle if unobserved population heterogeneity exists in the endogenous latent variables of the nonlinear structural equation models. The current study estimates a mixture of latent interaction…
TENSOR DECOMPOSITIONS AND SPARSE LOG-LINEAR MODELS
Johndrow, James E.; Bhattacharya, Anirban; Dunson, David B.
2017-01-01
Contingency table analysis routinely relies on log-linear models, with latent structure analysis providing a common alternative. Latent structure models lead to a reduced rank tensor factorization of the probability mass function for multivariate categorical data, while log-linear models achieve dimensionality reduction through sparsity. Little is known about the relationship between these notions of dimensionality reduction in the two paradigms. We derive several results relating the support of a log-linear model to nonnegative ranks of the associated probability tensor. Motivated by these findings, we propose a new collapsed Tucker class of tensor decompositions, which bridge existing PARAFAC and Tucker decompositions, providing a more flexible framework for parsimoniously characterizing multivariate categorical data. Taking a Bayesian approach to inference, we illustrate empirical advantages of the new decompositions. PMID:29332971
Toward DSM-V: mapping the alcohol use disorder continuum in college students.
Hagman, Brett T; Cohn, Amy M
2011-11-01
The present study examined the dimensionality of DSM-IV Alcohol Use Disorder (AUD) criteria using Item Response Theory (IRT) methods and tested the validity of the proposed DSM-V AUD guidelines in a sample of college students. Participants were 396 college students who reported any alcohol use in the past 90 days and were aged 18 years or older. We conducted factor analyses to determine whether a one- or two-factor model provided a better fit to the AUD criteria. IRT analyses estimated item severity and discrimination parameters for each criterion. Multivariate analyses examined differences among the DSM-V diagnostic cut-off (AUD vs. No AUD) and severity qualifiers (no diagnosis, moderate, severe) across several validating measures of alcohol use. A dominant single-factor model provided the best fit to the AUD criteria. IRT analyses indicated that abuse and dependence criteria were intermixed along the latent continuum. The "legal problems" criterion had the highest severity parameter and the tolerance criterion had the lowest severity parameter. The abuse criterion "social/interpersonal problems" and dependence criterion "activities to obtain alcohol" had the highest discrimination parameter estimates. Multivariate analysis indicated that the DSM-V cut-off point, and severity qualifier groups were distinguishable on several measures of alcohol consumption, drinking consequences, and drinking restraint. Findings suggest that the AUD criteria reflect a latent variable that represents a primary disorder and provide support for the proposed DSM-V AUD criteria in a sample of college students. Continued research in other high-risk samples of college students is needed. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Latent variable method for automatic adaptation to background states in motor imagery BCI
NASA Astrophysics Data System (ADS)
Dagaev, Nikolay; Volkova, Ksenia; Ossadtchi, Alexei
2018-02-01
Objective. Brain-computer interface (BCI) systems are known to be vulnerable to variabilities in background states of a user. Usually, no detailed information on these states is available even during the training stage. Thus there is a need in a method which is capable of taking background states into account in an unsupervised way. Approach. We propose a latent variable method that is based on a probabilistic model with a discrete latent variable. In order to estimate the model’s parameters, we suggest to use the expectation maximization algorithm. The proposed method is aimed at assessing characteristics of background states without any corresponding data labeling. In the context of asynchronous motor imagery paradigm, we applied this method to the real data from twelve able-bodied subjects with open/closed eyes serving as background states. Main results. We found that the latent variable method improved classification of target states compared to the baseline method (in seven of twelve subjects). In addition, we found that our method was also capable of background states recognition (in six of twelve subjects). Significance. Without any supervised information on background states, the latent variable method provides a way to improve classification in BCI by taking background states into account at the training stage and then by making decisions on target states weighted by posterior probabilities of background states at the prediction stage.
Latent class instrumental variables: A clinical and biostatistical perspective
Baker, Stuart G.; Kramer, Barnett S.; Lindeman, Karen S.
2015-01-01
In some two-arm randomized trials, some participants receive the treatment assigned to the other arm as a result of technical problems, refusal of a treatment invitation, or a choice of treatment in an encouragement design. In some before-and-after studies, the availability of a new treatment changes from one time period to this next. Under assumptions that are often reasonable, the latent class instrumental variable (IV) method estimates the effect of treatment received in the aforementioned scenarios involving all-or-none compliance and all-or-none availability. Key aspects are four initial latent classes (sometimes called principal strata) based on treatment received if in each randomization group or time period, the exclusion restriction assumption (in which randomization group or time period is an instrumental variable), the monotonicity assumption (which drops an implausible latent class from the analysis), and the estimated effect of receiving treatment in one latent class (sometimes called efficacy, the local average treatment effect, or the complier average causal effect). Since its independent formulations in the biostatistics and econometrics literatures, the latent class IV method (which has no well-established name) has gained increasing popularity. We review the latent class IV method from a clinical and biostatistical perspective, focusing on underlying assumptions, methodological extensions, and applications in our fields of obstetrics and cancer research. PMID:26239275
A Mulitivariate Statistical Model Describing the Compound Nature of Soil Moisture Drought
NASA Astrophysics Data System (ADS)
Manning, Colin; Widmann, Martin; Bevacqua, Emanuele; Maraun, Douglas; Van Loon, Anne; Vrac, Mathieu
2017-04-01
Soil moisture in Europe acts to partition incoming energy into sensible and latent heat fluxes, thereby exerting a large influence on temperature variability. Soil moisture is predominantly controlled by precipitation and evapotranspiration. When these meteorological variables are accumulated over different timescales, their joint multivariate distribution and dependence structure can be used to provide information of soil moisture. We therefore consider soil moisture drought as a compound event of meteorological drought (deficits of precipitation) and heat waves, or more specifically, periods of high Potential Evapotraspiration (PET). We present here a statistical model of soil moisture based on Pair Copula Constructions (PCC) that can describe the dependence amongst soil moisture and its contributing meteorological variables. The model is designed in such a way that it can account for concurrences of meteorological drought and heat waves and describe the dependence between these conditions at a local level. The model is composed of four variables; daily soil moisture (h); a short term and a long term accumulated precipitation variable (Y1 and Y_2) that account for the propagation of meteorological drought to soil moisture drought; and accumulated PET (Y_3), calculated using the Penman Monteith equation, which can represent the effect of a heat wave on soil conditions. Copula are multivariate distribution functions that allow one to model the dependence structure of given variables separately from their marginal behaviour. PCCs then allow in theory for the formulation of a multivariate distribution of any dimension where the multivariate distribution is decomposed into a product of marginal probability density functions and two-dimensional copula, of which some are conditional. We apply PCC here in such a way that allows us to provide estimates of h and their uncertainty through conditioning on the Y in the form h=h|y_1,y_2,y_3 (1) Applying the model to various Fluxnet sites across Europe, we find the model has good skill and can particularly capture periods of low soil moisture well. We illustrate the relevance of the dependence structure of these Y variables to soil moisture and show how it may be generalised to offer information of soil moisture on a widespread scale where few observations of soil moisture exist. We then present results from a validation study of a selection of EURO CORDEX climate models where we demonstrate the skill of these models in representing these dependencies and so offer insight into the skill seen in the representation of soil moisture in these models.
Yeung, Wing-Fai; Chung, Ka-Fai; Zhang, Nevin Lian-Wen; Zhang, Shi Ping; Yung, Kam-Ping; Chen, Pei-Xian; Ho, Yan-Yee
2016-01-01
Chinese medicine (CM) syndrome (zheng) differentiation is based on the co-occurrence of CM manifestation profiles, such as signs and symptoms, and pulse and tongue features. Insomnia is a symptom that frequently occurs in major depressive disorder despite adequate antidepressant treatment. This study aims to identify co-occurrence patterns in participants with persistent insomnia and major depressive disorder from clinical feature data using latent tree analysis, and to compare the latent variables with relevant CM syndromes. One hundred and forty-two participants with persistent insomnia and a history of major depressive disorder completed a standardized checklist (the Chinese Medicine Insomnia Symptom Checklist) specially developed for CM syndrome classification of insomnia. The checklist covers symptoms and signs, including tongue and pulse features. The clinical features assessed by the checklist were analyzed using Lantern software. CM practitioners with relevant experience compared the clinical feature variables under each latent variable with reference to relevant CM syndromes, based on a previous review of CM syndromes. The symptom data were analyzed to build the latent tree model and the model with the highest Bayes information criterion score was regarded as the best model. This model contained 18 latent variables, each of which divided participants into two clusters. Six clusters represented more than 50 % of the sample. The clinical feature co-occurrence patterns of these six clusters were interpreted as the CM syndromes Liver qi stagnation transforming into fire, Liver fire flaming upward, Stomach disharmony, Hyperactivity of fire due to yin deficiency, Heart-kidney noninteraction, and Qi deficiency of the heart and gallbladder. The clinical feature variables that contributed significant cumulative information coverage (at least 95 %) were identified. Latent tree model analysis on a sample of depressed participants with insomnia revealed 13 clinical feature co-occurrence patterns, four mutual-exclusion patterns, and one pattern with a single clinical feature variable.
Katseanes, Chelsea K; Chappell, Mark A; Hopkins, Bryan G; Durham, Brian D; Price, Cynthia L; Porter, Beth E; Miller, Lesley F
2016-11-01
After nearly a century of use in numerous munition platforms, TNT and RDX contamination has turned up largely in the environment due to ammunition manufacturing or as part of releases from low-order detonations during training activities. Although the basic knowledge governing the environmental fate of TNT and RDX are known, accurate predictions of TNT and RDX persistence in soil remain elusive, particularly given the universal heterogeneity of pedomorphic soil types. In this work, we proposed a new solution for modeling the sorption and persistence of these munition constituents as multivariate mathematical functions correlating soil attribute data over a variety of taxonomically distinct soil types to contaminant behavior, instead of a single constant or parameter of a specific absolute value. To test this idea, we conducted experiments measuring the sorption of TNT and RDX on taxonomically different soil types that were extensively physical and chemically characterized. Statistical decomposition of the log-transformed, and auto-scaled soil characterization data using the dimension-reduction technique PCA (principal component analysis) revealed a strong latent structure based in the multiple pairwise correlations among the soil properties. TNT and RDX sorption partitioning coefficients (KD-TNT and KD-RDX) were regressed against this latent structure using partial least squares regression (PLSR), generating a 3-factor, multivariate linear functions. Here, PLSR models predicted KD-TNT and KD-RDX values based on attributes contributing to endogenous alkaline/calcareous and soil fertility criteria, respectively, exhibited among the different soil types: We hypothesized that the latent structure arising from the strong covariance of full multivariate geochemical matrix describing taxonomically distinguished soil types may provide the means for potentially predicting complex phenomena in soils. The development of predictive multivariate models tuned to a local soil's taxonomic designation would have direct benefit to military range managers seeking to anticipate the environmental risks of training activities on impact sites. Published by Elsevier Ltd.
Development of lifetime comorbidity in the WHO World Mental Health (WMH) Surveys
Kessler, Ronald C.; Ormel, Johan; Petukhova, Maria; McLaughlin, Katie A.; Green, Jennifer Greif; Russo, Leo J.; Stein, Dan J.; Zaslavsky, Alan M; Aguilar-Gaxiola, Sergio; Alonso, Jordi; Andrade, Laura; Benjet, Corina; de Girolamo, Giovanni; de Graaf, Ron; Demyttenaere, Koen; Fayyad, John; Haro, Josep Maria; Hu, Chi yi; Karam, Aimee; Lee, Sing; Lepine, Jean-Pierre; Matchsinger, Herbert; Mihaescu-Pintia, Constanta; Posada-Villa, Jose; Sagar, Rajesh; Üstün, T. Bedirhan
2010-01-01
CONTEXT Although numerous studies have examined the role of latent variables in the structure of comorbidity among mental disorders, none has examined their role in the development of comorbidity. OBJECTIVE To study the role of latent variables in the development of comorbidity among 18 lifetime DSM-IV disorders in the WHO World Mental Health (WMH) surveys. SETTING/PARTICIPANTS Nationally or regionally representative community surveys in 14 countries with a total of 21,229 respondents. MAIN OUTCOME MEASURES First onset of 18 lifetime DSM-IV anxiety, mood, behavior, and substance disorders assessed retrospectively in the WHO Composite International Diagnostic Interview (CIDI). RESULTS Separate internalizing (anxiety and mood disorders) and externalizing (behavior and substance disorders) factors were found in exploratory factor analysis of lifetime disorders. Consistently significant positive time-lagged associations were found in survival analyses for virtually all temporally primary lifetime disorders predicting subsequent onset of other disorders. Within-domain (i.e., internalizing or externalizing) associations were generally stronger than between-domain associations. The vast majority of time-lagged associations were explained by a model that assumed the existence of mediating latent internalizing and externalizing variables. Specific phobia and obsessive-compulsive disorder (internalizing) and hyperactivity disorder and oppositional-defiant disorder (externalizing) were the most important predictors. A small number of residual associations remained significant after controlling the latent variables. CONCLUSIONS The good fit of the latent variable model suggests that common causal pathways account for most of the comorbidity among the disorders considered here. These common pathways should be the focus of future research on the development of comorbidity, although several important pair-wise associations that cannot be accounted for by latent variables also exist that warrant further focused study. PMID:21199968
Lamont, Andrea E.; Vermunt, Jeroen K.; Van Horn, M. Lee
2016-01-01
Regression mixture models are increasingly used as an exploratory approach to identify heterogeneity in the effects of a predictor on an outcome. In this simulation study, we test the effects of violating an implicit assumption often made in these models – i.e., independent variables in the model are not directly related to latent classes. Results indicated that the major risk of failing to model the relationship between predictor and latent class was an increase in the probability of selecting additional latent classes and biased class proportions. Additionally, this study tests whether regression mixture models can detect a piecewise relationship between a predictor and outcome. Results suggest that these models are able to detect piecewise relations, but only when the relationship between the latent class and the predictor is included in model estimation. We illustrate the implications of making this assumption through a re-analysis of applied data examining heterogeneity in the effects of family resources on academic achievement. We compare previous results (which assumed no relation between independent variables and latent class) to the model where this assumption is lifted. Implications and analytic suggestions for conducting regression mixture based on these findings are noted. PMID:26881956
A descriptivist approach to trait conceptualization and inference.
Jonas, Katherine G; Markon, Kristian E
2016-01-01
In their recent article, How Functionalist and Process Approaches to Behavior Can Explain Trait Covariation, Wood, Gardner, and Harms (2015) underscore the need for more process-based understandings of individual differences. At the same time, the article illustrates a common error in the use and interpretation of latent variable models: namely, the misuse of models to arbitrate issues of causation and the nature of latent variables. Here, we explain how latent variables can be understood simply as parsimonious summaries of data, and how statistical inference can be based on choosing those summaries that minimize information required to represent the data using the model. Although Wood, Gardner, and Harms acknowledge this perspective, they underestimate its significance, including its importance to modeling and the conceptualization of psychological measurement. We believe this perspective has important implications for understanding individual differences in a number of domains, including current debates surrounding the role of formative versus reflective latent variables. (c) 2015 APA, all rights reserved).
ERIC Educational Resources Information Center
von Davier, Matthias; Sinharay, Sandip
2009-01-01
This paper presents an application of a stochastic approximation EM-algorithm using a Metropolis-Hastings sampler to estimate the parameters of an item response latent regression model. Latent regression models are extensions of item response theory (IRT) to a 2-level latent variable model in which covariates serve as predictors of the…
A General Approach to Defining Latent Growth Components
ERIC Educational Resources Information Center
Mayer, Axel; Steyer, Rolf; Mueller, Horst
2012-01-01
We present a 3-step approach to defining latent growth components. In the first step, a measurement model with at least 2 indicators for each time point is formulated to identify measurement error variances and obtain latent variables that are purged from measurement error. In the second step, we use contrast matrices to define the latent growth…
Latent class instrumental variables: a clinical and biostatistical perspective.
Baker, Stuart G; Kramer, Barnett S; Lindeman, Karen S
2016-01-15
In some two-arm randomized trials, some participants receive the treatment assigned to the other arm as a result of technical problems, refusal of a treatment invitation, or a choice of treatment in an encouragement design. In some before-and-after studies, the availability of a new treatment changes from one time period to this next. Under assumptions that are often reasonable, the latent class instrumental variable (IV) method estimates the effect of treatment received in the aforementioned scenarios involving all-or-none compliance and all-or-none availability. Key aspects are four initial latent classes (sometimes called principal strata) based on treatment received if in each randomization group or time period, the exclusion restriction assumption (in which randomization group or time period is an instrumental variable), the monotonicity assumption (which drops an implausible latent class from the analysis), and the estimated effect of receiving treatment in one latent class (sometimes called efficacy, the local average treatment effect, or the complier average causal effect). Since its independent formulations in the biostatistics and econometrics literatures, the latent class IV method (which has no well-established name) has gained increasing popularity. We review the latent class IV method from a clinical and biostatistical perspective, focusing on underlying assumptions, methodological extensions, and applications in our fields of obstetrics and cancer research. Copyright © 2015 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Bressan, Lucas P.; do Nascimento, Paulo Cícero; Schmidt, Marcella E. P.; Faccin, Henrique; de Machado, Leandro Carvalho; Bohrer, Denise
2017-02-01
A novel method was developed to determine low molecular weight polycyclic aromatic hydrocarbons in aqueous leachates from soils and sediments using a salting-out assisted liquid-liquid extraction, synchronous fluorescence spectrometry and a multivariate calibration technique. Several experimental parameters were controlled and the optimum conditions were: sodium carbonate as the salting-out agent at concentration of 2 mol L- 1, 3 mL of acetonitrile as extraction solvent, 6 mL of aqueous leachate, vortexing for 5 min and centrifuging at 4000 rpm for 5 min. The partial least squares calibration was optimized to the lowest values of root mean squared error and five latent variables were chosen for each of the targeted compounds. The regression coefficients for the true versus predicted concentrations were higher than 0.99. Figures of merit for the multivariate method were calculated, namely sensitivity, multivariate detection limit and multivariate quantification limit. The selectivity was also evaluated and other polycyclic aromatic hydrocarbons did not interfere in the analysis. Likewise, high performance liquid chromatography was used as a comparative methodology, and the regression analysis between the methods showed no statistical difference (t-test). The proposed methodology was applied to soils and sediments of a Brazilian river and the recoveries ranged from 74.3% to 105.8%. Overall, the proposed methodology was suitable for the targeted compounds, showing that the extraction method can be applied to spectrofluorometric analysis and that the multivariate calibration is also suitable for these compounds in leachates from real samples.
Westman, Eric; Aguilar, Carlos; Muehlboeck, J-Sebastian; Simmons, Andrew
2013-01-01
Automated structural magnetic resonance imaging (MRI) processing pipelines are gaining popularity for Alzheimer's disease (AD) research. They generate regional volumes, cortical thickness measures and other measures, which can be used as input for multivariate analysis. It is not clear which combination of measures and normalization approach are most useful for AD classification and to predict mild cognitive impairment (MCI) conversion. The current study includes MRI scans from 699 subjects [AD, MCI and controls (CTL)] from the Alzheimer's disease Neuroimaging Initiative (ADNI). The Freesurfer pipeline was used to generate regional volume, cortical thickness, gray matter volume, surface area, mean curvature, gaussian curvature, folding index and curvature index measures. 259 variables were used for orthogonal partial least square to latent structures (OPLS) multivariate analysis. Normalisation approaches were explored and the optimal combination of measures determined. Results indicate that cortical thickness measures should not be normalized, while volumes should probably be normalized by intracranial volume (ICV). Combining regional cortical thickness measures (not normalized) with cortical and subcortical volumes (normalized with ICV) using OPLS gave a prediction accuracy of 91.5 % when distinguishing AD versus CTL. This model prospectively predicted future decline from MCI to AD with 75.9 % of converters correctly classified. Normalization strategy did not have a significant effect on the accuracies of multivariate models containing multiple MRI measures for this large dataset. The appropriate choice of input for multivariate analysis in AD and MCI is of great importance. The results support the use of un-normalised cortical thickness measures and volumes normalised by ICV.
Whiteway, Matthew R; Butts, Daniel A
2017-03-01
The activity of sensory cortical neurons is not only driven by external stimuli but also shaped by other sources of input to the cortex. Unlike external stimuli, these other sources of input are challenging to experimentally control, or even observe, and as a result contribute to variability of neural responses to sensory stimuli. However, such sources of input are likely not "noise" and may play an integral role in sensory cortex function. Here we introduce the rectified latent variable model (RLVM) in order to identify these sources of input using simultaneously recorded cortical neuron populations. The RLVM is novel in that it employs nonnegative (rectified) latent variables and is much less restrictive in the mathematical constraints on solutions because of the use of an autoencoder neural network to initialize model parameters. We show that the RLVM outperforms principal component analysis, factor analysis, and independent component analysis, using simulated data across a range of conditions. We then apply this model to two-photon imaging of hundreds of simultaneously recorded neurons in mouse primary somatosensory cortex during a tactile discrimination task. Across many experiments, the RLVM identifies latent variables related to both the tactile stimulation as well as nonstimulus aspects of the behavioral task, with a majority of activity explained by the latter. These results suggest that properly identifying such latent variables is necessary for a full understanding of sensory cortical function and demonstrate novel methods for leveraging large population recordings to this end. NEW & NOTEWORTHY The rapid development of neural recording technologies presents new opportunities for understanding patterns of activity across neural populations. Here we show how a latent variable model with appropriate nonlinear form can be used to identify sources of input to a neural population and infer their time courses. Furthermore, we demonstrate how these sources are related to behavioral contexts outside of direct experimental control. Copyright © 2017 the American Physiological Society.
Grosse Frie, Kirstin; Janssen, Christian
2009-01-01
Based on the theoretical and empirical approach of Pierre Bourdieu, a multivariate non-linear method is introduced as an alternative way to analyse the complex relationships between social determinants and health. The analysis is based on face-to-face interviews with 695 randomly selected respondents aged 30 to 59. Variables regarding socio-economic status, life circumstances, lifestyles, health-related behaviour and health were chosen for the analysis. In order to determine whether the respondents can be differentiated and described based on these variables, a non-linear canonical correlation analysis (OVERALS) was performed. The results can be described on three dimensions; Eigenvalues add up to the fit of 1.444, which can be interpreted as approximately 50 % of explained variance. The three-dimensional space illustrates correspondences between variables and provides a framework for interpretation based on latent dimensions, which can be described by age, education, income and gender. Using non-linear canonical correlation analysis, health characteristics can be analysed in conjunction with socio-economic conditions and lifestyles. Based on Bourdieus theoretical approach, the complex correlations between these variables can be more substantially interpreted and presented.
ERIC Educational Resources Information Center
Park, Jungkyu; Yu, Hsiu-Ting
2016-01-01
The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze nested structure data structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…
Measuring individual differences in responses to date-rape vignettes using latent variable models.
Tuliao, Antover P; Hoffman, Lesa; McChargue, Dennis E
2017-01-01
Vignette methodology can be a flexible and powerful way to examine individual differences in response to dangerous real-life scenarios. However, most studies underutilize the usefulness of such methodology by analyzing only one outcome, which limits the ability to track event-related changes (e.g., vacillation in risk perception). The current study was designed to illustrate the dynamic influence of risk perception on exit point from a date-rape vignette. Our primary goal was to provide an illustrative example of how to use latent variable models for vignette methodology, including latent growth curve modeling with piecewise slopes, as well as latent variable measurement models. Through the combination of a step-by-step exposition in this text and corresponding model syntax available electronically, we detail an alternative statistical "blueprint" to enhance future violence research efforts using vignette methodology. Aggr. Behav. 43:60-73, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
TPSLVM: a dimensionality reduction algorithm based on thin plate splines.
Jiang, Xinwei; Gao, Junbin; Wang, Tianjiang; Shi, Daming
2014-10-01
Dimensionality reduction (DR) has been considered as one of the most significant tools for data analysis. One type of DR algorithms is based on latent variable models (LVM). LVM-based models can handle the preimage problem easily. In this paper we propose a new LVM-based DR model, named thin plate spline latent variable model (TPSLVM). Compared to the well-known Gaussian process latent variable model (GPLVM), our proposed TPSLVM is more powerful especially when the dimensionality of the latent space is low. Also, TPSLVM is robust to shift and rotation. This paper investigates two extensions of TPSLVM, i.e., the back-constrained TPSLVM (BC-TPSLVM) and TPSLVM with dynamics (TPSLVM-DM) as well as their combination BC-TPSLVM-DM. Experimental results show that TPSLVM and its extensions provide better data visualization and more efficient dimensionality reduction compared to PCA, GPLVM, ISOMAP, etc.
Eastwood, John Graeme; Jalaludin, Bin Badrudin; Kemp, Lynn Ann; Phung, Hai Ngoc
2014-01-01
We have previously reported in this journal on an ecological study of perinatal depressive symptoms in South Western Sydney. In that article, we briefly reported on a factor analysis that was utilized to identify empirical indicators for analysis. In this article, we report on the mixed method approach that was used to identify those latent variables. Social epidemiology has been slow to embrace a latent variable approach to the study of social, political, economic, and cultural structures and mechanisms, partly for philosophical reasons. Critical realist ontology and epistemology have been advocated as an appropriate methodological approach to both theory building and theory testing in the health sciences. We describe here an emergent mixed method approach that uses qualitative methods to identify latent constructs followed by factor analysis using empirical indicators chosen to measure identified qualitative codes. Comparative analysis of the findings is reported together with a limited description of realist approaches to abstract reasoning.
Using SAS PROC CALIS to fit Level-1 error covariance structures of latent growth models.
Ding, Cherng G; Jane, Ten-Der
2012-09-01
In the present article, we demonstrates the use of SAS PROC CALIS to fit various types of Level-1 error covariance structures of latent growth models (LGM). Advantages of the SEM approach, on which PROC CALIS is based, include the capabilities of modeling the change over time for latent constructs, measured by multiple indicators; embedding LGM into a larger latent variable model; incorporating measurement models for latent predictors; and better assessing model fit and the flexibility in specifying error covariance structures. The strength of PROC CALIS is always accompanied with technical coding work, which needs to be specifically addressed. We provide a tutorial on the SAS syntax for modeling the growth of a manifest variable and the growth of a latent construct, focusing the documentation on the specification of Level-1 error covariance structures. Illustrations are conducted with the data generated from two given latent growth models. The coding provided is helpful when the growth model has been well determined and the Level-1 error covariance structure is to be identified.
Heteroscedastic Latent Trait Models for Dichotomous Data.
Molenaar, Dylan
2015-09-01
Effort has been devoted to account for heteroscedasticity with respect to observed or latent moderator variables in item or test scores. For instance, in the multi-group generalized linear latent trait model, it could be tested whether the observed (polychoric) covariance matrix differs across the levels of an observed moderator variable. In the case that heteroscedasticity arises across the latent trait itself, existing models commonly distinguish between heteroscedastic residuals and a skewed trait distribution. These models have valuable applications in intelligence, personality and psychopathology research. However, existing approaches are only limited to continuous and polytomous data, while dichotomous data are common in intelligence and psychopathology research. Therefore, in present paper, a heteroscedastic latent trait model is presented for dichotomous data. The model is studied in a simulation study, and applied to data pertaining alcohol use and cognitive ability.
Social activity, cognitive decline and dementia risk: a 20-year prospective cohort study.
Marioni, Riccardo E; Proust-Lima, Cecile; Amieva, Helene; Brayne, Carol; Matthews, Fiona E; Dartigues, Jean-Francois; Jacqmin-Gadda, Helene
2015-10-24
Identifying modifiable lifestyle correlates of cognitive decline and risk of dementia is complex, particularly as few population-based longitudinal studies jointly model these interlinked processes. Recent methodological developments allow us to examine statistically defined sub-populations with separate cognitive trajectories and dementia risks. Engagement in social, physical, or intellectual pursuits, social network size, self-perception of feeling well understood, and degree of satisfaction with social relationships were assessed in 2854 participants from the Paquid cohort (mean baseline age 77 years) and related to incident dementia and cognitive change over 20-years of follow-up. Multivariate repeated cognitive information was exploited by defining the global cognitive functioning as the latent common factor underlying the tests. In addition, three latent homogeneous sub-populations of cognitive change and dementia were identified and contrasted according to social environment variables. In the whole population, we found associations between increased engagement in social, physical, or intellectual pursuits and increased cognitive ability (but not decline) and decreased risk of incident dementia, and between feeling understood and slower cognitive decline. There was evidence for three sub-populations of cognitive aging: fast, medium, and no cognitive decline. The social-environment measures at baseline did not help explain the heterogeneity of cognitive decline and incident dementia diagnosis between these sub-populations. We observed a complex series of relationships between social-environment variables and cognitive decline and dementia. In the whole population, factors such as increased engagement in social, physical, or intellectual pursuits were related to a decreased risk of dementia. However, in a sub-population analysis, the social-environment variables were not linked to the heterogeneous patterns of cognitive decline and dementia risk that defined the sub-groups.
Maximum Likelihood Estimation of Nonlinear Structural Equation Models with Ignorable Missing Data
ERIC Educational Resources Information Center
Lee, Sik-Yum; Song, Xin-Yuan; Lee, John C. K.
2003-01-01
The existing maximum likelihood theory and its computer software in structural equation modeling are established on the basis of linear relationships among latent variables with fully observed data. However, in social and behavioral sciences, nonlinear relationships among the latent variables are important for establishing more meaningful models…
Estimating and Visualizing Nonlinear Relations among Latent Variables: A Semiparametric Approach
ERIC Educational Resources Information Center
Pek, Jolynn; Sterba, Sonya K.; Kok, Bethany E.; Bauer, Daniel J.
2009-01-01
The graphical presentation of any scientific finding enhances its description, interpretation, and evaluation. Research involving latent variables is no exception, especially when potential nonlinear effects are suspect. This article has multiple aims. First, it provides a nontechnical overview of a semiparametric approach to modeling nonlinear…
Generalized Structured Component Analysis with Latent Interactions
ERIC Educational Resources Information Center
Hwang, Heungsun; Ho, Moon-Ho Ringo; Lee, Jonathan
2010-01-01
Generalized structured component analysis (GSCA) is a component-based approach to structural equation modeling. In practice, researchers may often be interested in examining the interaction effects of latent variables. However, GSCA has been geared only for the specification and testing of the main effects of variables. Thus, an extension of GSCA…
Behavioral Scale Reliability and Measurement Invariance Evaluation Using Latent Variable Modeling
ERIC Educational Resources Information Center
Raykov, Tenko
2004-01-01
A latent variable modeling approach to reliability and measurement invariance evaluation for multiple-component measuring instruments is outlined. An initial discussion deals with the limitations of coefficient alpha, a frequently used index of composite reliability. A widely and readily applicable structural modeling framework is next described…
Multilevel and Latent Variable Modeling with Composite Links and Exploded Likelihoods
ERIC Educational Resources Information Center
Rabe-Hesketh, Sophia; Skrondal, Anders
2007-01-01
Composite links and exploded likelihoods are powerful yet simple tools for specifying a wide range of latent variable models. Applications considered include survival or duration models, models for rankings, small area estimation with census information, models for ordinal responses, item response models with guessing, randomized response models,…
Evaluation of Validity and Reliability for Hierarchical Scales Using Latent Variable Modeling
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2012-01-01
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
Meta-Analysis of Scale Reliability Using Latent Variable Modeling
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2013-01-01
A latent variable modeling approach is outlined that can be used for meta-analysis of reliability coefficients of multicomponent measuring instruments. Important limitations of efforts to combine composite reliability findings across multiple studies are initially pointed out. A reliability synthesis procedure is discussed that is based on…
Diagnostic Procedures for Detecting Nonlinear Relationships between Latent Variables
ERIC Educational Resources Information Center
Bauer, Daniel J.; Baldasaro, Ruth E.; Gottfredson, Nisha C.
2012-01-01
Structural equation models are commonly used to estimate relationships between latent variables. Almost universally, the fitted models specify that these relationships are linear in form. This assumption is rarely checked empirically, largely for lack of appropriate diagnostic techniques. This article presents and evaluates two procedures that can…
Chen, Jinsong; Zhang, Dake; Choi, Jaehwa
2015-12-01
It is common to encounter latent variables with ordinal data in social or behavioral research. Although a mediated effect of latent variables (latent mediated effect, or LME) with ordinal data may appear to be a straightforward combination of LME with continuous data and latent variables with ordinal data, the methodological challenges to combine the two are not trivial. This research covers model structures as complex as LME and formulates both point and interval estimates of LME for ordinal data using the Bayesian full-information approach. We also combine weighted least squares (WLS) estimation with the bias-corrected bootstrapping (BCB; Efron Journal of the American Statistical Association, 82, 171-185, 1987) method or the traditional delta method as the limited-information approach. We evaluated the viability of these different approaches across various conditions through simulation studies, and provide an empirical example to illustrate the approaches. We found that the Bayesian approach with reasonably informative priors is preferred when both point and interval estimates are of interest and the sample size is 200 or above.
Williams, L. Keoki; Buu, Anne
2017-01-01
We propose a multivariate genome-wide association test for mixed continuous, binary, and ordinal phenotypes. A latent response model is used to estimate the correlation between phenotypes with different measurement scales so that the empirical distribution of the Fisher’s combination statistic under the null hypothesis is estimated efficiently. The simulation study shows that our proposed correlation estimation methods have high levels of accuracy. More importantly, our approach conservatively estimates the variance of the test statistic so that the type I error rate is controlled. The simulation also shows that the proposed test maintains the power at the level very close to that of the ideal analysis based on known latent phenotypes while controlling the type I error. In contrast, conventional approaches–dichotomizing all observed phenotypes or treating them as continuous variables–could either reduce the power or employ a linear regression model unfit for the data. Furthermore, the statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that conducting a multivariate test on multiple phenotypes can increase the power of identifying markers that may not be, otherwise, chosen using marginal tests. The proposed method also offers a new approach to analyzing the Fagerström Test for Nicotine Dependence as multivariate phenotypes in genome-wide association studies. PMID:28081206
Pousset, M; Tremblay, R E; Falissard, B
2011-06-01
The aim of this study was to contribute to clarification of the relations between antisocial personality disorder (APD) and its potential risk factors in a population of 560 French male prisoners. Adverse childhood was assessed as a latent variable determined by several traumatic events. APD (MINI), character and temperament (Cloninger's model), WAIS®-III similarities subtest and psychosocial characteristics were assessed by two clinicians. The WAIS®-III subtest accounts for verbal and cognitive performance. We used a structural model to determine the weight of the different pathways between adverse childhood and APD. Study confirmed the major and direct role of adverse childhood (standardized coefficient=0.48). An intermediate effect mediated by character (considered as a global variable) and novelty-seeking was also shown, confirming previous results from the literature. This study emphasizes the role of adverse childhood in APD, suggesting the potential benefit of early intervention in the prevention of antisocial behaviours. Copyright © 2011. Published by Elsevier Masson SAS.
TØ, Bechshøft; Sonne, C; Dietz, R; Born, EW; Muir, DCG; Letcher, RJ; Novak, MA; Henchey, E; Meyer, JS; Jenssen, BM; Villanger, GD
2012-01-01
The multivariate relationship between hair cortisol, whole blood thyroid hormones, and the complex mixtures of organohalogen contaminant (OHC) levels measured in subcutaneous adipose of 23 East Greenland polar bears (eight males and 15 females, all sampled between the years 1999 and 2001) was analyzed using projection to latent structure (PLS) regression modeling. In the resulting PLS model, most important variables with a negative influence on cortisol levels were particularly BDE-99, but also CB-180, -201, BDE-153, and CB-170/190. The most important variables with a positive influence on cortisol were CB-66/95, α-HCH, TT3, as well as heptachlor epoxide, dieldrin, BDE-47, p,p′-DDD. Although statistical modeling does not necessarily fully explain biological cause-effect relationships, relationships indicate that (1) the hypothalamic-pituitary-adrenal (HPA) axis in East Greenland polar bears is likely to be affected by OHC-contaminants and (2) the association between OHCs and cortisol may be linked with the hypothalamus-pituitary-thyroid (HPT) axis. PMID:22575327
Park, Ji Sook; Gangopadhyay, Ishanti; Davidson, Meghan M.; Weismer, Susan Ellis
2017-01-01
Purpose We aimed to outline the latent variables approach for measuring nonverbal executive function (EF) skills in school-age children, and to examine the relationship between nonverbal EF skills and language performance in this age group. Method Seventy-one typically developing children, ages 8 through 11, participated in the study. Three EF components, inhibition, updating, and task-shifting, were each indexed using 2 nonverbal tasks. A latent variables approach was used to extract latent scores that represented each EF construct. Children were also administered common standardized language measures. Multiple regression analyses were conducted to examine the relationship between EF and language skills. Results Nonverbal updating was associated with the Receptive Language Index on the Clinical Evaluation of Language Fundamentals–Fourth Edition (CELF-4). When composites denoting lexical–semantic and syntactic abilities were derived, nonverbal inhibition (but not shifting or updating) was found to predict children's syntactic abilities. These relationships held when the effects of age, IQ, and socioeconomic status were controlled. Conclusions The study makes a methodological contribution by explicating a method by which researchers can use the latent variables approach when measuring EF performance in school-age children. The study makes a theoretical and a clinical contribution by suggesting that language performance may be related to domain-general EFs. PMID:28306755
Kaushanskaya, Margarita; Park, Ji Sook; Gangopadhyay, Ishanti; Davidson, Meghan M; Weismer, Susan Ellis
2017-04-14
We aimed to outline the latent variables approach for measuring nonverbal executive function (EF) skills in school-age children, and to examine the relationship between nonverbal EF skills and language performance in this age group. Seventy-one typically developing children, ages 8 through 11, participated in the study. Three EF components, inhibition, updating, and task-shifting, were each indexed using 2 nonverbal tasks. A latent variables approach was used to extract latent scores that represented each EF construct. Children were also administered common standardized language measures. Multiple regression analyses were conducted to examine the relationship between EF and language skills. Nonverbal updating was associated with the Receptive Language Index on the Clinical Evaluation of Language Fundamentals-Fourth Edition (CELF-4). When composites denoting lexical-semantic and syntactic abilities were derived, nonverbal inhibition (but not shifting or updating) was found to predict children's syntactic abilities. These relationships held when the effects of age, IQ, and socioeconomic status were controlled. The study makes a methodological contribution by explicating a method by which researchers can use the latent variables approach when measuring EF performance in school-age children. The study makes a theoretical and a clinical contribution by suggesting that language performance may be related to domain-general EFs.
Generating Multivariate Ordinal Data via Entropy Principles.
Lee, Yen; Kaplan, David
2018-03-01
When conducting robustness research where the focus of attention is on the impact of non-normality, the marginal skewness and kurtosis are often used to set the degree of non-normality. Monte Carlo methods are commonly applied to conduct this type of research by simulating data from distributions with skewness and kurtosis constrained to pre-specified values. Although several procedures have been proposed to simulate data from distributions with these constraints, no corresponding procedures have been applied for discrete distributions. In this paper, we present two procedures based on the principles of maximum entropy and minimum cross-entropy to estimate the multivariate observed ordinal distributions with constraints on skewness and kurtosis. For these procedures, the correlation matrix of the observed variables is not specified but depends on the relationships between the latent response variables. With the estimated distributions, researchers can study robustness not only focusing on the levels of non-normality but also on the variations in the distribution shapes. A simulation study demonstrates that these procedures yield excellent agreement between specified parameters and those of estimated distributions. A robustness study concerning the effect of distribution shape in the context of confirmatory factor analysis shows that shape can affect the robust [Formula: see text] and robust fit indices, especially when the sample size is small, the data are severely non-normal, and the fitted model is complex.
Kirby, James B.; Bollen, Kenneth A.
2009-01-01
Structural Equation Modeling with latent variables (SEM) is a powerful tool for social and behavioral scientists, combining many of the strengths of psychometrics and econometrics into a single framework. The most common estimator for SEM is the full-information maximum likelihood estimator (ML), but there is continuing interest in limited information estimators because of their distributional robustness and their greater resistance to structural specification errors. However, the literature discussing model fit for limited information estimators for latent variable models is sparse compared to that for full information estimators. We address this shortcoming by providing several specification tests based on the 2SLS estimator for latent variable structural equation models developed by Bollen (1996). We explain how these tests can be used to not only identify a misspecified model, but to help diagnose the source of misspecification within a model. We present and discuss results from a Monte Carlo experiment designed to evaluate the finite sample properties of these tests. Our findings suggest that the 2SLS tests successfully identify most misspecified models, even those with modest misspecification, and that they provide researchers with information that can help diagnose the source of misspecification. PMID:20419054
Matrix completion by deep matrix factorization.
Fan, Jicong; Cheng, Jieyu
2018-02-01
Conventional methods of matrix completion are linear methods that are not effective in handling data of nonlinear structures. Recently a few researchers attempted to incorporate nonlinear techniques into matrix completion but there still exists considerable limitations. In this paper, a novel method called deep matrix factorization (DMF) is proposed for nonlinear matrix completion. Different from conventional matrix completion methods that are based on linear latent variable models, DMF is on the basis of a nonlinear latent variable model. DMF is formulated as a deep-structure neural network, in which the inputs are the low-dimensional unknown latent variables and the outputs are the partially observed variables. In DMF, the inputs and the parameters of the multilayer neural network are simultaneously optimized to minimize the reconstruction errors for the observed entries. Then the missing entries can be readily recovered by propagating the latent variables to the output layer. DMF is compared with state-of-the-art methods of linear and nonlinear matrix completion in the tasks of toy matrix completion, image inpainting and collaborative filtering. The experimental results verify that DMF is able to provide higher matrix completion accuracy than existing methods do and DMF is applicable to large matrices. Copyright © 2017 Elsevier Ltd. All rights reserved.
Stein, Judith A; Nyamathi, Adeline; Ullman, Jodie B; Bentler, Peter M
2007-01-01
Studies among normative samples generally demonstrate a positive impact of marriage on health behaviors and other related attitudes. In this study, we examine the impact of marriage on HIV/AIDS risk behaviors and attitudes among impoverished, highly stressed, homeless couples, many with severe substance abuse problems. A multilevel analysis of 368 high-risk sexually intimate married and unmarried heterosexual couples assessed individual and couple-level effects on social support, substance use problems, HIV/AIDS knowledge, perceived HIV/AIDS risk, needle-sharing, condom use, multiple sex partners, and HIV/AIDS testing. More variance was explained in the protective and risk variables by couple-level latent variable predictors than by individual latent variable predictors, although some gender effects were found (e.g., more alcohol problems among men). The couple-level variable of marriage predicted lower perceived risk, less deviant social support, and fewer sex partners but predicted more needle-sharing.
Guest, Rebecca; Craig, Ashley; Nicholson Perry, Kathryn; Tran, Yvonne; Ephraums, Catherine; Hales, Alison; Dezarnaulds, Annalisa; Crino, Rocco; Middleton, James
2015-11-01
To examine change in resilience in people with spinal cord injury (SCI) when group cognitive behavior therapy (GCBT) was added to routine psychosocial rehabilitation (RPR). A prospective repeated-measures cohort design was used to determine the efficacy of the addition of GCBT (n = 50). The control group consisted of individuals receiving RPR, which included access to individual CBT (ICBT) when required (n = 38). Groups were assessed on 3 occasions: soon after admission, within 2 weeks of discharge, and 6-months postdischarge. Measures included sociodemographic, injury, and psychosocial factors. The outcome variable was resilience, considered an important outcome measure for recovery. To adjust for baseline differences in self-efficacy, depressive mood and anxiety between the 2 groups, these factors were entered into a repeated measures multivariate analysis of covariance (MANCOVA) as covariates. Latent class analysis was used to determine the best-fitting model of resilience trajectories for both groups. The MANCOVA indicated that the addition of GCBT to psychosocial rehabilitation did not result in improved resilience compared with the ICBT group. Trajectory data indicated over 60% were demonstrating acceptable resilience irrespective of group. Changes in resilience mean scores suggest the addition of GCBT adds little to resilience outcomes. Latent class modeling indicated both groups experienced similar trajectories of improvement and deterioration. Results highlight the importance of conducting multivariate modeling analysis that isolates subgroups of related cases over time to understand complex trajectories. Further research is needed to clarify individual differences in CBT intervention preference as well as other factors which impact on resilience. (c) 2015 APA, all rights reserved).
Plis, Sergey M; Sui, Jing; Lane, Terran; Roy, Sushmita; Clark, Vincent P; Potluru, Vamsi K; Huster, Rene J; Michael, Andrew; Sponheim, Scott R; Weisend, Michael P; Calhoun, Vince D
2013-01-01
Identifying the complex activity relationships present in rich, modern neuroimaging data sets remains a key challenge for neuroscience. The problem is hard because (a) the underlying spatial and temporal networks may be nonlinear and multivariate and (b) the observed data may be driven by numerous latent factors. Further, modern experiments often produce data sets containing multiple stimulus contexts or tasks processed by the same subjects. Fusing such multi-session data sets may reveal additional structure, but raises further statistical challenges. We present a novel analysis method for extracting complex activity networks from such multifaceted imaging data sets. Compared to previous methods, we choose a new point in the trade-off space, sacrificing detailed generative probability models and explicit latent variable inference in order to achieve robust estimation of multivariate, nonlinear group factors (“network clusters”). We apply our method to identify relationships of task-specific intrinsic networks in schizophrenia patients and control subjects from a large fMRI study. After identifying network-clusters characterized by within- and between-task interactions, we find significant differences between patient and control groups in interaction strength among networks. Our results are consistent with known findings of brain regions exhibiting deviations in schizophrenic patients. However, we also find high-order, nonlinear interactions that discriminate groups but that are not detected by linear, pair-wise methods. We additionally identify high-order relationships that provide new insights into schizophrenia but that have not been found by traditional univariate or second-order methods. Overall, our approach can identify key relationships that are missed by existing analysis methods, without losing the ability to find relationships that are known to be important. PMID:23876245
Abstract: Inference and Interval Estimation for Indirect Effects With Latent Variable Models.
Falk, Carl F; Biesanz, Jeremy C
2011-11-30
Models specifying indirect effects (or mediation) and structural equation modeling are both popular in the social sciences. Yet relatively little research has compared methods that test for indirect effects among latent variables and provided precise estimates of the effectiveness of different methods. This simulation study provides an extensive comparison of methods for constructing confidence intervals and for making inferences about indirect effects with latent variables. We compared the percentile (PC) bootstrap, bias-corrected (BC) bootstrap, bias-corrected accelerated (BC a ) bootstrap, likelihood-based confidence intervals (Neale & Miller, 1997), partial posterior predictive (Biesanz, Falk, and Savalei, 2010), and joint significance tests based on Wald tests or likelihood ratio tests. All models included three reflective latent variables representing the independent, dependent, and mediating variables. The design included the following fully crossed conditions: (a) sample size: 100, 200, and 500; (b) number of indicators per latent variable: 3 versus 5; (c) reliability per set of indicators: .7 versus .9; (d) and 16 different path combinations for the indirect effect (α = 0, .14, .39, or .59; and β = 0, .14, .39, or .59). Simulations were performed using a WestGrid cluster of 1680 3.06GHz Intel Xeon processors running R and OpenMx. Results based on 1,000 replications per cell and 2,000 resamples per bootstrap method indicated that the BC and BC a bootstrap methods have inflated Type I error rates. Likelihood-based confidence intervals and the PC bootstrap emerged as methods that adequately control Type I error and have good coverage rates.
Psychometrics in Psychological Research: Role Model or Partner in Science?
ERIC Educational Resources Information Center
Sijtsma, Klaas
2006-01-01
This is a reaction to Borsboom's (2006) discussion paper on the issue that psychology takes so little notice of the modern developments in psychometrics, in particular, latent variable methods. Contrary to Borsboom, it is argued that latent variables are summaries of interesting data properties, that construct validation should involve studying…
An Alternative Approach for Nonlinear Latent Variable Models
ERIC Educational Resources Information Center
Mooijaart, Ab; Bentler, Peter M.
2010-01-01
In the last decades there has been an increasing interest in nonlinear latent variable models. Since the seminal paper of Kenny and Judd, several methods have been proposed for dealing with these kinds of models. This article introduces an alternative approach. The methodology involves fitting some third-order moments in addition to the means and…
Using Structural Equation Models with Latent Variables to Study Student Growth and Development.
ERIC Educational Resources Information Center
Pike, Gary R.
1991-01-01
Analysis of data on freshman-to-senior developmental gains in 722 University of Tennessee-Knoxville students provides evidence of the advantages of structural equation modeling with latent variables and suggests that the group differences identified by traditional analysis of variance and covariance techniques may be an artifact of measurement…
Bayesian Analysis of Structural Equation Models with Nonlinear Covariates and Latent Variables
ERIC Educational Resources Information Center
Song, Xin-Yuan; Lee, Sik-Yum
2006-01-01
In this article, we formulate a nonlinear structural equation model (SEM) that can accommodate covariates in the measurement equation and nonlinear terms of covariates and exogenous latent variables in the structural equation. The covariates can come from continuous or discrete distributions. A Bayesian approach is developed to analyze the…
Aptitude, Achievement and Competence in Medicine: A Latent Variable Path Model
ERIC Educational Resources Information Center
Collin, V. Terri; Violato, Claudio; Hecker, Kent
2009-01-01
To develop and test a latent variable path model of general achievement, aptitude for medicine and competence in medicine employing data from the Medical College Admission Test (MCAT), pre-medical undergraduate grade point average (UGPA) and demographic characteristics for competence in pre-clinical and measures of competence (United States…
Evaluation of Reliability Coefficients for Two-Level Models via Latent Variable Analysis
ERIC Educational Resources Information Center
Raykov, Tenko; Penev, Spiridon
2010-01-01
A latent variable analysis procedure for evaluation of reliability coefficients for 2-level models is outlined. The method provides point and interval estimates of group means' reliability, overall reliability of means, and conditional reliability. In addition, the approach can be used to test simple hypotheses about these parameters. The…
Evaluation of Scale Reliability with Binary Measures Using Latent Variable Modeling
ERIC Educational Resources Information Center
Raykov, Tenko; Dimitrov, Dimiter M.; Asparouhov, Tihomir
2010-01-01
A method for interval estimation of scale reliability with discrete data is outlined. The approach is applicable with multi-item instruments consisting of binary measures, and is developed within the latent variable modeling methodology. The procedure is useful for evaluation of consistency of single measures and of sum scores from item sets…
ERIC Educational Resources Information Center
Weissman, Alexander
2013-01-01
Convergence of the expectation-maximization (EM) algorithm to a global optimum of the marginal log likelihood function for unconstrained latent variable models with categorical indicators is presented. The sufficient conditions under which global convergence of the EM algorithm is attainable are provided in an information-theoretic context by…
A Comparison of Methods for Estimating Quadratic Effects in Nonlinear Structural Equation Models
ERIC Educational Resources Information Center
Harring, Jeffrey R.; Weiss, Brandi A.; Hsu, Jui-Chen
2012-01-01
Two Monte Carlo simulations were performed to compare methods for estimating and testing hypotheses of quadratic effects in latent variable regression models. The methods considered in the current study were (a) a 2-stage moderated regression approach using latent variable scores, (b) an unconstrained product indicator approach, (c) a latent…
ERIC Educational Resources Information Center
Raykov, Tenko
2011-01-01
Interval estimation of intraclass correlation coefficients in hierarchical designs is discussed within a latent variable modeling framework. A method accomplishing this aim is outlined, which is applicable in two-level studies where participants (or generally lower-order units) are clustered within higher-order units. The procedure can also be…
Evaluation of Weighted Scale Reliability and Criterion Validity: A Latent Variable Modeling Approach
ERIC Educational Resources Information Center
Raykov, Tenko
2007-01-01
A method is outlined for evaluating the reliability and criterion validity of weighted scales based on sets of unidimensional measures. The approach is developed within the framework of latent variable modeling methodology and is useful for point and interval estimation of these measurement quality coefficients in counseling and education…
Multilevel Latent Class Analysis: Parametric and Nonparametric Models
ERIC Educational Resources Information Center
Finch, W. Holmes; French, Brian F.
2014-01-01
Latent class analysis is an analytic technique often used in educational and psychological research to identify meaningful groups of individuals within a larger heterogeneous population based on a set of variables. This technique is flexible, encompassing not only a static set of variables but also longitudinal data in the form of growth mixture…
Discriminant Validity Assessment: Use of Fornell & Larcker criterion versus HTMT Criterion
NASA Astrophysics Data System (ADS)
Hamid, M. R. Ab; Sami, W.; Mohmad Sidek, M. H.
2017-09-01
Assessment of discriminant validity is a must in any research that involves latent variables for the prevention of multicollinearity issues. Fornell and Larcker criterion is the most widely used method for this purpose. However, a new method has emerged for establishing the discriminant validity assessment through heterotrait-monotrait (HTMT) ratio of correlations method. Therefore, this article presents the results of discriminant validity assessment using these methods. Data from previous study was used that involved 429 respondents for empirical validation of value-based excellence model in higher education institutions (HEI) in Malaysia. From the analysis, the convergent, divergent and discriminant validity were established and admissible using Fornell and Larcker criterion. However, the discriminant validity is an issue when employing the HTMT criterion. This shows that the latent variables under study faced the issue of multicollinearity and should be looked into for further details. This also implied that the HTMT criterion is a stringent measure that could detect the possible indiscriminant among the latent variables. In conclusion, the instrument which consisted of six latent variables was still lacking in terms of discriminant validity and should be explored further.
Choi, Eunhee; Tang, Fengyan; Kim, Sung-Geun; Turk, Phillip
2016-10-01
This study examined the longitudinal relationships between functional health in later years and three types of productive activities: volunteering, full-time, and part-time work. Using the data from five waves (2000-2008) of the Health and Retirement Study, we applied multivariate latent growth curve modeling to examine the longitudinal relationships among individuals 50 or over. Functional health was measured by limitations in activities of daily living. Individuals who volunteered, worked either full time or part time exhibited a slower decline in functional health than nonparticipants. Significant associations were also found between initial functional health and longitudinal changes in productive activity participation. This study provides additional support for the benefits of productive activities later in life; engagement in volunteering and employment are indeed associated with better functional health in middle and old age. © The Author(s) 2016.
ERIC Educational Resources Information Center
Dolan, Conor V.; Molenaar, Peter C. M.
1994-01-01
In multigroup covariance structure analysis with structured means, the traditional latent selection model is formulated as a special case of phenotypic selection. Illustrations with real and simulated data demonstrate how one can test specific hypotheses concerning selection on latent variables. (SLD)
Spurious Latent Classes in the Mixture Rasch Model
ERIC Educational Resources Information Center
Alexeev, Natalia; Templin, Jonathan; Cohen, Allan S.
2011-01-01
Mixture Rasch models have been used to study a number of psychometric issues such as goodness of fit, response strategy differences, strategy shifts, and multidimensionality. Although these models offer the potential for improving understanding of the latent variables being measured, under some conditions overextraction of latent classes may…
Piecewise Linear-Linear Latent Growth Mixture Models with Unknown Knots
ERIC Educational Resources Information Center
Kohli, Nidhi; Harring, Jeffrey R.; Hancock, Gregory R.
2013-01-01
Latent growth curve models with piecewise functions are flexible and useful analytic models for investigating individual behaviors that exhibit distinct phases of development in observed variables. As an extension of this framework, this study considers a piecewise linear-linear latent growth mixture model (LGMM) for describing segmented change of…
Brunwasser, Steven M; Gebretsadik, Tebeb; Gold, Diane R; Turi, Kedir N; Stone, Cosby A; Datta, Soma; Gern, James E; Hartert, Tina V
2018-01-01
The International Study of Asthma and Allergies in Children (ISAAC) Wheezing Module is commonly used to characterize pediatric asthma in epidemiological studies, including nearly all airway cohorts participating in the Environmental Influences on Child Health Outcomes (ECHO) consortium. However, there is no consensus model for operationalizing wheezing severity with this instrument in explanatory research studies. Severity is typically measured using coarsely-defined categorical variables, reducing power and potentially underestimating etiological associations. More precise measurement approaches could improve testing of etiological theories of wheezing illness. We evaluated a continuous latent variable model of pediatric wheezing severity based on four ISAAC Wheezing Module items. Analyses included subgroups of children from three independent cohorts whose parents reported past wheezing: infants ages 0-2 in the INSPIRE birth cohort study (Cohort 1; n = 657), 6-7-year-old North American children from Phase One of the ISAAC study (Cohort 2; n = 2,765), and 5-6-year-old children in the EHAAS birth cohort study (Cohort 3; n = 102). Models were estimated using structural equation modeling. In all cohorts, covariance patterns implied by the latent variable model were consistent with the observed data, as indicated by non-significant χ2 goodness of fit tests (no evidence of model misspecification). Cohort 1 analyses showed that the latent factor structure was stable across time points and child sexes. In both cohorts 1 and 3, the latent wheezing severity variable was prospectively associated with wheeze-related clinical outcomes, including physician asthma diagnosis, acute corticosteroid use, and wheeze-related outpatient medical visits when adjusting for confounders. We developed an easily applicable continuous latent variable model of pediatric wheezing severity based on items from the well-validated ISAAC Wheezing Module. This model prospectively associates with asthma morbidity, as demonstrated in two ECHO birth cohort studies, and provides a more statistically powerful method of testing etiologic hypotheses of childhood wheezing illness and asthma.
Young, Bonnie N; Rendón, Adrian; Rosas-Taraco, Adrian; Baker, Jack; Healy, Meghan; Gross, Jessica M; Long, Jeffrey; Burgos, Marcos; Hunley, Keith L
2014-01-01
Diverse socioeconomic and clinical factors influence susceptibility to tuberculosis (TB) disease in Mexico. The role of genetic factors, particularly those that differ between the parental groups that admixed in Mexico, is unclear. The objectives of this study are to identify the socioeconomic and clinical predictors of the transition from latent TB infection (LTBI) to pulmonary TB disease in an urban population in northeastern Mexico, and to examine whether genetic ancestry plays an independent role in this transition. We recruited 97 pulmonary TB disease patients and 97 LTBI individuals from a public hospital in Monterrey, Nuevo León. Socioeconomic and clinical variables were collected from interviews and medical records, and genetic ancestry was estimated for a subset of 142 study participants from 291,917 single nucleotide polymorphisms (SNPs). We examined crude associations between the variables and TB disease status. Significant predictors from crude association tests were analyzed using multivariable logistic regression. We also compared genetic ancestry between LTBI individuals and TB disease patients at 1,314 SNPs in 273 genes from the TB biosystem in the NCBI BioSystems database. In crude association tests, 12 socioeconomic and clinical variables were associated with TB disease. Multivariable logistic regression analyses indicated that marital status, diabetes, and smoking were independently associated with TB status. Genetic ancestry was not associated with TB disease in either crude or multivariable analyses. Separate analyses showed that LTBI individuals recruited from hospital staff had significantly higher European genetic ancestry than LTBI individuals recruited from the clinics and waiting rooms. Genetic ancestry differed between individuals with LTBI and TB disease at SNPs located in two genes in the TB biosystem. These results indicate that Monterrey may be structured with respect to genetic ancestry, and that genetic differences in TB susceptibility in parental populations may contribute to variation in disease susceptibility in the region.
Young, Bonnie N.; Rendón, Adrian; Rosas-Taraco, Adrian; Baker, Jack; Healy, Meghan; Gross, Jessica M.; Long, Jeffrey; Burgos, Marcos; Hunley, Keith L.
2014-01-01
Diverse socioeconomic and clinical factors influence susceptibility to tuberculosis (TB) disease in Mexico. The role of genetic factors, particularly those that differ between the parental groups that admixed in Mexico, is unclear. The objectives of this study are to identify the socioeconomic and clinical predictors of the transition from latent TB infection (LTBI) to pulmonary TB disease in an urban population in northeastern Mexico, and to examine whether genetic ancestry plays an independent role in this transition. We recruited 97 pulmonary TB disease patients and 97 LTBI individuals from a public hospital in Monterrey, Nuevo León. Socioeconomic and clinical variables were collected from interviews and medical records, and genetic ancestry was estimated for a subset of 142 study participants from 291,917 single nucleotide polymorphisms (SNPs). We examined crude associations between the variables and TB disease status. Significant predictors from crude association tests were analyzed using multivariable logistic regression. We also compared genetic ancestry between LTBI individuals and TB disease patients at 1,314 SNPs in 273 genes from the TB biosystem in the NCBI BioSystems database. In crude association tests, 12 socioeconomic and clinical variables were associated with TB disease. Multivariable logistic regression analyses indicated that marital status, diabetes, and smoking were independently associated with TB status. Genetic ancestry was not associated with TB disease in either crude or multivariable analyses. Separate analyses showed that LTBI individuals recruited from hospital staff had significantly higher European genetic ancestry than LTBI individuals recruited from the clinics and waiting rooms. Genetic ancestry differed between individuals with LTBI and TB disease at SNPs located in two genes in the TB biosystem. These results indicate that Monterrey may be structured with respect to genetic ancestry, and that genetic differences in TB susceptibility in parental populations may contribute to variation in disease susceptibility in the region. PMID:24728409
Three Cs in measurement models: causal indicators, composite indicators, and covariates.
Bollen, Kenneth A; Bauldry, Shawn
2011-09-01
In the last 2 decades attention to causal (and formative) indicators has grown. Accompanying this growth has been the belief that one can classify indicators into 2 categories: effect (reflective) indicators and causal (formative) indicators. We argue that the dichotomous view is too simple. Instead, there are effect indicators and 3 types of variables on which a latent variable depends: causal indicators, composite (formative) indicators, and covariates (the "Three Cs"). Causal indicators have conceptual unity, and their effects on latent variables are structural. Covariates are not concept measures, but are variables to control to avoid bias in estimating the relations between measures and latent variables. Composite (formative) indicators form exact linear combinations of variables that need not share a concept. Their coefficients are weights rather than structural effects, and composites are a matter of convenience. The failure to distinguish the Three Cs has led to confusion and questions, such as, Are causal and formative indicators different names for the same indicator type? Should an equation with causal or formative indicators have an error term? Are the coefficients of causal indicators less stable than effect indicators? Distinguishing between causal and composite indicators and covariates goes a long way toward eliminating this confusion. We emphasize the key role that subject matter expertise plays in making these distinctions. We provide new guidelines for working with these variable types, including identification of models, scaling latent variables, parameter estimation, and validity assessment. A running empirical example on self-perceived health illustrates our major points.
A Second-Order Conditionally Linear Mixed Effects Model with Observed and Latent Variable Covariates
ERIC Educational Resources Information Center
Harring, Jeffrey R.; Kohli, Nidhi; Silverman, Rebecca D.; Speece, Deborah L.
2012-01-01
A conditionally linear mixed effects model is an appropriate framework for investigating nonlinear change in a continuous latent variable that is repeatedly measured over time. The efficacy of the model is that it allows parameters that enter the specified nonlinear time-response function to be stochastic, whereas those parameters that enter in a…
ERIC Educational Resources Information Center
Kaushanskaya, Margarita; Park, Ji Sook; Gangopadhyay, Ishanti; Davidson, Meghan M.; Weismer, Susan Ellis
2017-01-01
Purpose: We aimed to outline the latent variables approach for measuring nonverbal executive function (EF) skills in school-age children, and to examine the relationship between nonverbal EF skills and language performance in this age group. Method: Seventy-one typically developing children, ages 8 through 11, participated in the study. Three EF…
ERIC Educational Resources Information Center
Seo, Hyojeong; Shaw, Leslie A.; Shogren, Karrie A.; Lang, Kyle M.; Little, Todd D.
2017-01-01
This article demonstrates the use of structural equation modeling to develop norms for a translated version of a standardized scale, the Supports Intensity Scale-Children's Version (SIS-C). The latent variable norming method proposed is useful when the standardization sample for a translated version is relatively small to derive norms…
Interrater Agreement Evaluation: A Latent Variable Modeling Approach
ERIC Educational Resources Information Center
Raykov, Tenko; Dimitrov, Dimiter M.; von Eye, Alexander; Marcoulides, George A.
2013-01-01
A latent variable modeling method for evaluation of interrater agreement is outlined. The procedure is useful for point and interval estimation of the degree of agreement among a given set of judges evaluating a group of targets. In addition, the approach allows one to test for identity in underlying thresholds across raters as well as to identify…
ERIC Educational Resources Information Center
Choi, Kilchan
2011-01-01
This report explores a new latent variable regression 4-level hierarchical model for monitoring school performance over time using multisite multiple-cohorts longitudinal data. This kind of data set has a 4-level hierarchical structure: time-series observation nested within students who are nested within different cohorts of students. These…
Standard Errors of Estimated Latent Variable Scores with Estimated Structural Parameters
ERIC Educational Resources Information Center
Hoshino, Takahiro; Shigemasu, Kazuo
2008-01-01
The authors propose a concise formula to evaluate the standard error of the estimated latent variable score when the true values of the structural parameters are not known and must be estimated. The formula can be applied to factor scores in factor analysis or ability parameters in item response theory, without bootstrap or Markov chain Monte…
ERIC Educational Resources Information Center
Preßler, Anna-Lena; Könen, Tanja; Hasselhorn, Marcus; Krajewski, Kristin
2014-01-01
The aim of the present study was to empirically disentangle the interdependencies of the impact of nonverbal intelligence, working memory capacities, and phonological processing skills on early reading decoding and spelling within a latent variable approach. In a sample of 127 children, these cognitive preconditions were assessed before the onset…
An Alternative Two Stage Least Squares (2SLS) Estimator for Latent Variable Equations.
ERIC Educational Resources Information Center
Bollen, Kenneth A.
1996-01-01
An alternative two-stage least squares (2SLS) estimator of the parameters in LISREL type models is proposed and contrasted with existing estimators. The new 2SLS estimator allows observed and latent variables to originate from nonnormal distributions, is consistent, has a known asymptotic covariance matrix, and can be estimated with standard…
Classical Item Analysis Using Latent Variable Modeling: A Note on a Direct Evaluation Procedure
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2011-01-01
A directly applicable latent variable modeling procedure for classical item analysis is outlined. The method allows one to point and interval estimate item difficulty, item correlations, and item-total correlations for composites consisting of categorical items. The approach is readily employed in empirical research and as a by-product permits…
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2015-01-01
A direct approach to point and interval estimation of Cronbach's coefficient alpha for multiple component measuring instruments is outlined. The procedure is based on a latent variable modeling application with widely circulated software. As a by-product, using sample data the method permits ascertaining whether the population discrepancy…
Introduction to the special section on mixture modeling in personality assessment.
Wright, Aidan G C; Hallquist, Michael N
2014-01-01
Latent variable models offer a conceptual and statistical framework for evaluating the underlying structure of psychological constructs, including personality and psychopathology. Complex structures that combine or compare categorical and dimensional latent variables can be accommodated using mixture modeling approaches, which provide a powerful framework for testing nuanced theories about psychological structure. This special series includes introductory primers on cross-sectional and longitudinal mixture modeling, in addition to empirical examples applying these techniques to real-world data collected in clinical settings. This group of articles is designed to introduce personality assessment scientists and practitioners to a general latent variable framework that we hope will stimulate new research and application of mixture models to the assessment of personality and its pathology.
Sartipi, Majid; Nedjat, Saharnaz; Mansournia, Mohammad Ali; Baigi, Vali; Fotouhi, Akbar
2016-11-01
Some variables like Socioeconomic Status (SES) cannot be directly measured, instead, so-called 'latent variables' are measured indirectly through calculating tangible items. There are different methods for measuring latent variables such as data reduction methods e.g. Principal Components Analysis (PCA) and Latent Class Analysis (LCA). The purpose of our study was to measure assets index- as a representative of SES- through two methods of Non-Linear PCA (NLPCA) and LCA, and to compare them for choosing the most appropriate model. This was a cross sectional study in which 1995 respondents filled the questionnaires about their assets in Tehran. The data were analyzed by SPSS 19 (CATPCA command) and SAS 9.2 (PROC LCA command) to estimate their socioeconomic status. The results were compared based on the Intra-class Correlation Coefficient (ICC). The 6 derived classes from LCA based on BIC, were highly consistent with the 6 classes from CATPCA (Categorical PCA) (ICC = 0.87, 95%CI: 0.86 - 0.88). There is no gold standard to measure SES. Therefore, it is not possible to definitely say that a specific method is better than another one. LCA is a complicated method that presents detailed information about latent variables and required one assumption (local independency), while NLPCA is a simple method, which requires more assumptions. Generally, NLPCA seems to be an acceptable method of analysis because of its simplicity and high agreement with LCA.
Variable-Length Computerized Adaptive Testing Using the Higher Order DINA Model
ERIC Educational Resources Information Center
Hsu, Chia-Ling; Wang, Wen-Chung
2015-01-01
Cognitive diagnosis models provide profile information about a set of latent binary attributes, whereas item response models yield a summary report on a latent continuous trait. To utilize the advantages of both models, higher order cognitive diagnosis models were developed in which information about both latent binary attributes and latent…
Testing Manifest Monotonicity Using Order-Constrained Statistical Inference
ERIC Educational Resources Information Center
Tijmstra, Jesper; Hessen, David J.; van der Heijden, Peter G. M.; Sijtsma, Klaas
2013-01-01
Most dichotomous item response models share the assumption of latent monotonicity, which states that the probability of a positive response to an item is a nondecreasing function of a latent variable intended to be measured. Latent monotonicity cannot be evaluated directly, but it implies manifest monotonicity across a variety of observed scores,…
ERIC Educational Resources Information Center
Pek, Jolynn; Chalmers, R. Philip; Kok, Bethany E.; Losardo, Diane
2015-01-01
Structural equation mixture models (SEMMs), when applied as a semiparametric model (SPM), can adequately recover potentially nonlinear latent relationships without their specification. This SPM is useful for exploratory analysis when the form of the latent regression is unknown. The purpose of this article is to help users familiar with structural…
Szekér, Szabolcs; Vathy-Fogarassy, Ágnes
2018-01-01
Logistic regression based propensity score matching is a widely used method in case-control studies to select the individuals of the control group. This method creates a suitable control group if all factors affecting the output variable are known. However, if relevant latent variables exist as well, which are not taken into account during the calculations, the quality of the control group is uncertain. In this paper, we present a statistics-based research in which we try to determine the relationship between the accuracy of the logistic regression model and the uncertainty of the dependent variable of the control group defined by propensity score matching. Our analyses show that there is a linear correlation between the fit of the logistic regression model and the uncertainty of the output variable. In certain cases, a latent binary explanatory variable can result in a relative error of up to 70% in the prediction of the outcome variable. The observed phenomenon calls the attention of analysts to an important point, which must be taken into account when deducting conclusions.
Three Cs in Measurement Models: Causal Indicators, Composite Indicators, and Covariates
Bollen, Kenneth A.; Bauldry, Shawn
2013-01-01
In the last two decades attention to causal (and formative) indicators has grown. Accompanying this growth has been the belief that we can classify indicators into two categories, effect (reflective) indicators and causal (formative) indicators. This paper argues that the dichotomous view is too simple. Instead, there are effect indicators and three types of variables on which a latent variable depends: causal indicators, composite (formative) indicators, and covariates (the “three Cs”). Causal indicators have conceptual unity and their effects on latent variables are structural. Covariates are not concept measures, but are variables to control to avoid bias in estimating the relations between measures and latent variable(s). Composite (formative) indicators form exact linear combinations of variables that need not share a concept. Their coefficients are weights rather than structural effects and composites are a matter of convenience. The failure to distinguish the “three Cs” has led to confusion and questions such as: are causal and formative indicators different names for the same indicator type? Should an equation with causal or formative indicators have an error term? Are the coefficients of causal indicators less stable than effect indicators? Distinguishing between causal and composite indicators and covariates goes a long way toward eliminating this confusion. We emphasize the key role that subject matter expertise plays in making these distinctions. We provide new guidelines for working with these variable types, including identification of models, scaling latent variables, parameter estimation, and validity assessment. A running empirical example on self-perceived health illustrates our major points. PMID:21767021
Hertzog, Christopher; Dixon, Roger A; Hultsch, David F; MacDonald, Stuart W S
2003-12-01
The authors used 6-year longitudinal data from the Victoria Longitudinal Study (VLS) to investigate individual differences in amount of episodic memory change. Latent change models revealed reliable individual differences in cognitive change. Changes in episodic memory were significantly correlated with changes in other cognitive variables, including speed and working memory. A structural equation model for the latent change scores showed that changes in speed and working memory predicted changes in episodic memory, as expected by processing resource theory. However, these effects were best modeled as being mediated by changes in induction and fact retrieval. Dissociations were detected between cross-sectional ability correlations and longitudinal changes. Shuffling the tasks used to define the Working Memory latent variable altered patterns of change correlations.
Growth Modeling with Non-Ignorable Dropout: Alternative Analyses of the STAR*D Antidepressant Trial
Muthén, Bengt; Asparouhov, Tihomir; Hunter, Aimee; Leuchter, Andrew
2011-01-01
This paper uses a general latent variable framework to study a series of models for non-ignorable missingness due to dropout. Non-ignorable missing data modeling acknowledges that missingness may depend on not only covariates and observed outcomes at previous time points as with the standard missing at random (MAR) assumption, but also on latent variables such as values that would have been observed (missing outcomes), developmental trends (growth factors), and qualitatively different types of development (latent trajectory classes). These alternative predictors of missing data can be explored in a general latent variable framework using the Mplus program. A flexible new model uses an extended pattern-mixture approach where missingness is a function of latent dropout classes in combination with growth mixture modeling using latent trajectory classes. A new selection model allows not only an influence of the outcomes on missingness, but allows this influence to vary across latent trajectory classes. Recommendations are given for choosing models. The missing data models are applied to longitudinal data from STAR*D, the largest antidepressant clinical trial in the U.S. to date. Despite the importance of this trial, STAR*D growth model analyses using non-ignorable missing data techniques have not been explored until now. The STAR*D data are shown to feature distinct trajectory classes, including a low class corresponding to substantial improvement in depression, a minority class with a U-shaped curve corresponding to transient improvement, and a high class corresponding to no improvement. The analyses provide a new way to assess drug efficiency in the presence of dropout. PMID:21381817
Persistence and amplitude of cigarette demand in relation to quit intentions and attempts
O’Connor, Richard J.; Heckman, Bryan W.; Adkison, Sarah E.; Rees, Vaughan W.; Hatsukami, Dorothy K.; Bickel, Warren K.; Cummings, K. Michael
2016-01-01
INTRODUCTION The cigarette purchase task (CPT) is a method that can be used to assess relative value of cigarettes. Based on cigarettes purchased across a price range, five derived metrics (Omax, Pmax, breakpoint, intensity, elasticity) can assess cigarette demand. A study with adolescent smokers found that these could be reduced to two latent factors: Persistence (price insensitivity) and Amplitude (volumetric consumption). We sought to replicate this structure with adult smokers, and examine how these variables relate to cessation efforts. METHOD Web-based survey conducted in 2014 among adult (18+) current daily cigarette smokers (N=1194). Participants completed the CPT, Fagerstrom Test for Nicotine Dependence (FTND), reported past-year quit attempts, and future quit intentions. We included published scales assessing perceived prevalence of smoking, social reactivity, smoker identity, and risk perception. RESULTS Our analysis supported two latent variables, Persistence and Amplitude, which correlated positively with FTND. Persistence correlated with several psychosocial factors, and was higher among those intending to quit very soon, but did not vary by number of past-year quit attempts. Amplitude differed across quit attempts and intention (p’s <.001), and in multivariable models was significantly associated with lower 30-day quit intention [OR=0.76, p=.001]. CONCLUSIONS Persistence and Amplitude factors characterized CPT data in adults, discriminated known groups (e.g., smokers by intentions to quit), and were positively associated with nicotine dependence. Factor scores also appear to relate to certain psychosocial factors, such as smoker identity and perceptions of risk. Future research should examine the predictive validity of these constructs. PMID:27048156
Persistence and amplitude of cigarette demand in relation to quit intentions and attempts.
O'Connor, Richard J; Heckman, Bryan W; Adkison, Sarah E; Rees, Vaughan W; Hatsukami, Dorothy K; Bickel, Warren K; Cummings, K Michael
2016-06-01
The cigarette purchase task (CPT) is a method that can be used to assess the relative value of cigarettes. Based on cigarettes purchased across a price range, five derived metrics (Omax, Pmax, breakpoint, intensity, and elasticity) can assess cigarette demand. A study with adolescent smokers found that these could be reduced to two latent factors: persistence (price insensitivity) and amplitude (volumetric consumption). We sought to replicate this structure with adult smokers and examine how these variables relate to cessation efforts. Web-based survey conducted in 2014 among adult (18 years and above) current daily cigarette smokers (N = 1194). Participants completed the CPT, Fagerstrom Test for Nicotine Dependence (FTND), reported past-year quit attempts, and future quit intentions. We included published scales assessing perceived prevalence of smoking, social reactivity, smoker identity, and risk perception. Our analysis supported two latent variables, persistence and amplitude, which correlated positively with FTND. Persistence was correlated with several psychosocial factors and was higher among those intending to quit very soon, but did not vary by number of past-year quit attempts. Amplitude differed across quit attempts and intention (p < 0.001) and, in multivariable models, was significantly associated with lower 30-day quit intention (OR = 0.76, p = 0.001). Persistence and amplitude factors characterized CPT data in adults, discriminated known groups (e.g., smokers by intentions to quit), and were positively associated with nicotine dependence. Factor scores also appear to relate to certain psychosocial factors, such as smoker identity and perceptions of risk. Future research should examine the predictive validity of these constructs.
Tao, Yebin; Sánchez, Brisa N; Mukherjee, Bhramar
2015-03-30
Many existing cohort studies designed to investigate health effects of environmental exposures also collect data on genetic markers. The Early Life Exposures in Mexico to Environmental Toxicants project, for instance, has been genotyping single nucleotide polymorphisms on candidate genes involved in mental and nutrient metabolism and also in potentially shared metabolic pathways with the environmental exposures. Given the longitudinal nature of these cohort studies, rich exposure and outcome data are available to address novel questions regarding gene-environment interaction (G × E). Latent variable (LV) models have been effectively used for dimension reduction, helping with multiple testing and multicollinearity issues in the presence of correlated multivariate exposures and outcomes. In this paper, we first propose a modeling strategy, based on LV models, to examine the association between repeated outcome measures (e.g., child weight) and a set of correlated exposure biomarkers (e.g., prenatal lead exposure). We then construct novel tests for G × E effects within the LV framework to examine effect modification of outcome-exposure association by genetic factors (e.g., the hemochromatosis gene). We consider two scenarios: one allowing dependence of the LV models on genes and the other assuming independence between the LV models and genes. We combine the two sets of estimates by shrinkage estimation to trade off bias and efficiency in a data-adaptive way. Using simulations, we evaluate the properties of the shrinkage estimates, and in particular, we demonstrate the need for this data-adaptive shrinkage given repeated outcome measures, exposure measures possibly repeated and time-varying gene-environment association. Copyright © 2014 John Wiley & Sons, Ltd.
Empirical Bayes Approaches to Multivariate Fuzzy Partitions.
ERIC Educational Resources Information Center
Woodbury, Max A.; Manton, Kenneth G.
1991-01-01
An empirical Bayes-maximum likelihood estimation procedure is presented for the application of fuzzy partition models in describing high dimensional discrete response data. The model describes individuals in terms of partial membership in multiple latent categories that represent bounded discrete spaces. (SLD)
High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics
Carvalho, Carlos M.; Chang, Jeffrey; Lucas, Joseph E.; Nevins, Joseph R.; Wang, Quanli; West, Mike
2010-01-01
We describe studies in molecular profiling and biological pathway analysis that use sparse latent factor and regression models for microarray gene expression data. We discuss breast cancer applications and key aspects of the modeling and computational methodology. Our case studies aim to investigate and characterize heterogeneity of structure related to specific oncogenic pathways, as well as links between aggregate patterns in gene expression profiles and clinical biomarkers. Based on the metaphor of statistically derived “factors” as representing biological “subpathway” structure, we explore the decomposition of fitted sparse factor models into pathway subcomponents and investigate how these components overlay multiple aspects of known biological activity. Our methodology is based on sparsity modeling of multivariate regression, ANOVA, and latent factor models, as well as a class of models that combines all components. Hierarchical sparsity priors address questions of dimension reduction and multiple comparisons, as well as scalability of the methodology. The models include practically relevant non-Gaussian/nonparametric components for latent structure, underlying often quite complex non-Gaussianity in multivariate expression patterns. Model search and fitting are addressed through stochastic simulation and evolutionary stochastic search methods that are exemplified in the oncogenic pathway studies. Supplementary supporting material provides more details of the applications, as well as examples of the use of freely available software tools for implementing the methodology. PMID:21218139
ERIC Educational Resources Information Center
Schweizer, Karl
2006-01-01
A model with fixed relations between manifest and latent variables is presented for investigating choice reaction time data. The numbers for fixation originate from the polynomial function. Two options are considered: the component-based (1 latent variable for each component of the polynomial function) and composite-based options (1 latent…
ERIC Educational Resources Information Center
Yang, Ji Seung; Cai, Li
2014-01-01
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM). Results indicate that the MH-RM algorithm can produce estimates and standard…
ERIC Educational Resources Information Center
Seo, Hyojeong; Little, Todd D.; Shogren, Karrie A.; Lang, Kyle M.
2016-01-01
Structural equation modeling (SEM) is a powerful and flexible analytic tool to model latent constructs and their relations with observed variables and other constructs. SEM applications offer advantages over classical models in dealing with statistical assumptions and in adjusting for measurement error. So far, however, SEM has not been fully used…
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.; Tong, Bing
2016-01-01
A latent variable modeling procedure is discussed that can be used to test if two or more homogeneous multicomponent instruments with distinct components are measuring the same underlying construct. The method is widely applicable in scale construction and development research and can also be of special interest in construct validation studies.…
Hu, Chuanpu; Randazzo, Bruce; Sharma, Amarnath; Zhou, Honghui
2017-10-01
Exposure-response modeling plays an important role in optimizing dose and dosing regimens during clinical drug development. The modeling of multiple endpoints is made possible in part by recent progress in latent variable indirect response (IDR) modeling for ordered categorical endpoints. This manuscript aims to investigate the level of improvement achievable by jointly modeling two such endpoints in the latent variable IDR modeling framework through the sharing of model parameters. This is illustrated with an application to the exposure-response of guselkumab, a human IgG1 monoclonal antibody in clinical development that blocks IL-23. A Phase 2b study was conducted in 238 patients with psoriasis for which disease severity was assessed using Psoriasis Area and Severity Index (PASI) and Physician's Global Assessment (PGA) scores. A latent variable Type I IDR model was developed to evaluate the therapeutic effect of guselkumab dosing on 75, 90 and 100% improvement of PASI scores from baseline and PGA scores, with placebo effect empirically modeled. The results showed that the joint model is able to describe the observed data better with fewer parameters compared with the common approach of separately modeling the endpoints.
Ciampi, Antonio; Dyachenko, Alina; Cole, Martin; McCusker, Jane
2011-12-01
The study of mental disorders in the elderly presents substantial challenges due to population heterogeneity, coexistence of different mental disorders, and diagnostic uncertainty. While reliable tools have been developed to collect relevant data, new approaches to study design and analysis are needed. We focus on a new analytic approach. Our framework is based on latent class analysis and hidden Markov chains. From repeated measurements of a multivariate disease index, we extract the notion of underlying state of a patient at a time point. The course of the disorder is then a sequence of transitions among states. States and transitions are not observable; however, the probability of being in a state at a time point, and the transition probabilities from one state to another over time can be estimated. Data from 444 patients with and without diagnosis of delirium and dementia were available from a previous study. The Delirium Index was measured at diagnosis, and at 2 and 6 months from diagnosis. Four latent classes were identified: fairly healthy, moderately ill, clearly sick, and very sick. Dementia and delirium could not be separated on the basis of these data alone. Indeed, as the probability of delirium increased, so did the probability of decline of mental functions. Eight most probable courses were identified, including good and poor stable courses, and courses exhibiting various patterns of improvement. Latent class analysis and hidden Markov chains offer a promising tool for studying mental disorders in the elderly. Its use may show its full potential as new data become available.
Levant, Ronald F; Hall, Rosalie J; Weigold, Ingrid K; McCurdy, Eric R
2016-10-01
The construct validity of the Male Role Norms Inventory-Short Form (MRNI-SF) was assessed using a latent variable approach implemented with structural equation modeling (SEM). The MRNI-SF was specified as having a bifactor structure, and validation scales were also specified as latent variables. The latent variable approach had the advantages of separating effects of general and specific factors and controlling for some sources of measurement error. Data (N = 484) were from a diverse sample (38.8% men of color, 22.3% men of diverse sexualities) of community-dwelling and college men who responded to an online survey. The construct validity of the MRNI-SF General Traditional Masculinity Ideology factor was supported for all 4 of the proposed latent correlations with: (a) Male Role Attitudes Scale; (b) general factor of Conformity to Masculine Norms Inventory-46; (c) higher-order factor of Gender Role Conflict Scale; and (d) Personal Attributes Questionnaire-Masculinity Scale. Significant correlations with relevant other latent factors provided concurrent validity evidence for the MRNI-SF specific factors of Negativity toward Sexual Minorities, Importance of Sex, Restrictive Emotionality, and Toughness, with all 8 of the hypothesized relationships supported. However, 3 relationships concerning Dominance were not supported. (The construct validity of the remaining 2 MRNI-SF specific factors-Avoidance of Femininity and Self-Reliance through Mechanical Skills was not assessed.) Comparisons were made, and meaningful differences noted, between the latent correlations emphasized in this study and their raw variable counterparts. Results are discussed in terms of the advantages of an SEM approach and the unique characteristics of the bifactor model. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Using Design-Based Latent Growth Curve Modeling with Cluster-Level Predictor to Address Dependency
ERIC Educational Resources Information Center
Wu, Jiun-Yu; Kwok, Oi-Man; Willson, Victor L.
2014-01-01
The authors compared the effects of using the true Multilevel Latent Growth Curve Model (MLGCM) with single-level regular and design-based Latent Growth Curve Models (LGCM) with or without the higher-level predictor on various criterion variables for multilevel longitudinal data. They found that random effect estimates were biased when the…
A Vernacular for Linear Latent Growth Models
ERIC Educational Resources Information Center
Hancock, Gregory R.; Choi, Jaehwa
2006-01-01
In its most basic form, latent growth modeling (latent curve analysis) allows an assessment of individuals' change in a measured variable X over time. For simple linear models, as with other growth models, parameter estimates associated with the a construct (amount of X at a chosen temporal reference point) and b construct (growth in X per unit…
A Latent Transition Analysis Model for Assessing Change in Cognitive Skills
ERIC Educational Resources Information Center
Li, Feiming; Cohen, Allan; Bottge, Brian; Templin, Jonathan
2016-01-01
Latent transition analysis (LTA) was initially developed to provide a means of measuring change in dynamic latent variables. In this article, we illustrate the use of a cognitive diagnostic model, the DINA model, as the measurement model in a LTA, thereby demonstrating a means of analyzing change in cognitive skills over time. An example is…
ERIC Educational Resources Information Center
Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Li, Tatyana; Menold, Natalja
2018-01-01
A latent variable modeling method for studying measurement invariance when evaluating latent constructs with multiple binary or binary scored items with no guessing is outlined. The approach extends the continuous indicator procedure described by Raykov and colleagues, utilizes similarly the false discovery rate approach to multiple testing, and…
Mannarini, Stefania; Balottin, Laura; Toldo, Irene; Gatta, Michela
2016-10-01
The study, conducted on Italian preadolscents aged 11 to 13 belonging to the general population, aims to investigate the relationship between the emotional functioning, namely, alexithymia, and the risk of developing behavioral and emotional problems measured using the Strength and Difficulty Questionnaire. The latent class analysis approach allowed to identify two latent variables, accounting for the internalizing (emotional symptoms and difficulties in emotional awareness) and for the externalizing problems (conduct problems and hyperactivity, problematic relationships with peers, poor prosocial behaviors and externally oriented thinking). The two latent variables featured two latent classes: the difficulty in dealing with problems and the strength to face problems that was representative of most of the healthy participants with specific gender differences. Along with the analysis of psychopathological behaviors, the study of resilience and strengths can prove to be a key step in order to develop valuable preventive approaches to tackle psychiatric disorders. © 2016 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
Grace, J.B.; Bollen, K.A.
2008-01-01
Structural equation modeling (SEM) holds the promise of providing natural scientists the capacity to evaluate complex multivariate hypotheses about ecological systems. Building on its predecessors, path analysis and factor analysis, SEM allows for the incorporation of both observed and unobserved (latent) variables into theoretically-based probabilistic models. In this paper we discuss the interface between theory and data in SEM and the use of an additional variable type, the composite. In simple terms, composite variables specify the influences of collections of other variables and can be helpful in modeling heterogeneous concepts of the sort commonly of interest to ecologists. While long recognized as a potentially important element of SEM, composite variables have received very limited use, in part because of a lack of theoretical consideration, but also because of difficulties that arise in parameter estimation when using conventional solution procedures. In this paper we present a framework for discussing composites and demonstrate how the use of partially-reduced-form models can help to overcome some of the parameter estimation and evaluation problems associated with models containing composites. Diagnostic procedures for evaluating the most appropriate and effective use of composites are illustrated with an example from the ecological literature. It is argued that an ability to incorporate composite variables into structural equation models may be particularly valuable in the study of natural systems, where concepts are frequently multifaceted and the influence of suites of variables are often of interest. ?? Springer Science+Business Media, LLC 2007.
Error propagation of partial least squares for parameters optimization in NIR modeling.
Du, Chenzhao; Dai, Shengyun; Qiao, Yanjiang; Wu, Zhisheng
2018-03-05
A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type І and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models. Copyright © 2017. Published by Elsevier B.V.
Error propagation of partial least squares for parameters optimization in NIR modeling
NASA Astrophysics Data System (ADS)
Du, Chenzhao; Dai, Shengyun; Qiao, Yanjiang; Wu, Zhisheng
2018-03-01
A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type І and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models.
Eastwood, John Graeme; Kemp, Lynn Ann; Jalaludin, Bin Badrudin; Phung, Hai Ngoc
2013-01-01
The aim of the study reported here is to explore ecological covariate and latent variable associations with perinatal depressive symptoms in South Western Sydney for the purpose of informing subsequent theory generation of perinatal context, depression, and the developmental origins of health and disease. Mothers (n = 15,389) delivering in 2002 and 2003 were assessed at two to three weeks after delivery for risk factors for depressive symptoms. The binary outcome variables were Edinburgh Postnatal Depression Scale (EPDS)> 9 and > 12. Aggregated EPDS > 9 was analyzed for 101 suburbs. Suburb-level variables were drawn from the 2001 Australian Census, New South Wales Crime Statistics, and aggregated individual-level risk factors. Analysis included exploratory factor analysis, univariate and multivariate likelihood, and Bayesian linear regression with conditional autoregressive components. The exploratory factor analysis identified six factors: neighborhood adversity, social cohesion, health behaviors, housing quality, social services, and support networks. Variables associated with neighborhood adversity, social cohesion, social networks, and ethnic diversity were consistently associated with aggregated depressive symptoms. The findings support the theoretical proposition that neighborhood adversity causes maternal psychological distress and depression within the context of social buffers including social networks, social cohesion, and social services.
NASA Astrophysics Data System (ADS)
Ahmed, S.; Abdul-Aziz, O. I.
2015-12-01
We used a systematic data-analytics approach to analyze and quantify relative linkages of four stream water quality indicators (total nitrogen, TN; total phosphorus, TP; chlorophyll-a, Chla; and dissolved oxygen, DO) with six land use and four hydrologic variables, along with the potential external (upstream in-land and downstream coastal) controls in highly complex coastal urban watersheds of southeast Florida, U.S.A. Multivariate pattern recognition techniques of principle component and factor analyses, in concert with Pearson correlation analysis, were applied to map interrelations and identify latent patterns of the participatory variables. Relative linkages of the in-stream water quality variables with their associated drivers were then quantified by developing dimensionless partial least squares (PLS) regression model based on standardized data. Model fitting efficiency (R2=0.71-0.87) and accuracy (ratio of root-mean-square error to the standard deviation of the observations, RSR=0.35-0.53) suggested good predictions of the water quality variables in both wet and dry seasons. Agricultural land and groundwater exhibited substantial controls on surface water quality. In-stream TN concentration appeared to be mostly contributed by the upstream water entering from Everglades in both wet and dry seasons. In contrast, watershed land uses had stronger linkages with TP and Chla than that of the watershed hydrologic and upstream (Everglades) components for both seasons. Both land use and hydrologic components showed strong linkages with DO in wet season; however, the land use linkage appeared to be less in dry season. The data-analytics method provided a comprehensive empirical framework to achieve crucial mechanistic insights into the urban stream water quality processes. Our study quantitatively identified dominant drivers of water quality, indicating key management targets to maintain healthy stream ecosystems in complex urban-natural environments near the coast.
Hounkpatin, Hilda Osafo; Boyce, Christopher J; Dunn, Graham; Wood, Alex M
2017-09-18
A number of structural equation models have been developed to examine change in 1 variable or the longitudinal association between 2 variables. The most common of these are the latent growth model, the autoregressive cross-lagged model, the autoregressive latent trajectory model, and the latent change score model. The authors first overview each of these models through evaluating their different assumptions surrounding the nature of change and how these assumptions may result in different data interpretations. They then, to elucidate these issues in an empirical example, examine the longitudinal association between personality traits and life satisfaction. In a representative Dutch sample (N = 8,320), with participants providing data on both personality and life satisfaction measures every 2 years over an 8-year period, the authors reproduce findings from previous research. However, some of the structural equation models overviewed have not previously been applied to the personality-life satisfaction relation. The extended empirical examination suggests intraindividual changes in life satisfaction predict subsequent intraindividual changes in personality traits. The availability of data sets with 3 or more assessment waves allows the application of more advanced structural equation models such as the autoregressive latent trajectory or the extended latent change score model, which accounts for the complex dynamic nature of change processes and allows stronger inferences on the nature of the association between variables. However, the choice of model should be determined by theories of change processes in the variables being studied. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
ERIC Educational Resources Information Center
von Davier, Matthias
2016-01-01
This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step of the parallel-E parallel-M algorithm. Examples presented in this report include item response…
ERIC Educational Resources Information Center
van der Maas, Han L. J.; Molenaar, Dylan; Maris, Gunter; Kievit, Rogier A.; Borsboom, Denny
2011-01-01
This article analyzes latent variable models from a cognitive psychology perspective. We start by discussing work by Tuerlinckx and De Boeck (2005), who proved that a diffusion model for 2-choice response processes entails a 2-parameter logistic item response theory (IRT) model for individual differences in the response data. Following this line…
ERIC Educational Resources Information Center
Seo, Hyojeong; Little, Todd D.; Shogren, Karrie A.; Lang, Kyle M.
2016-01-01
Structural equation modeling (SEM) is a powerful and flexible analytic tool to model latent constructs and their relations with observed variables and other constructs. SEM applications offer advantages over classical models in dealing with statistical assumptions and in adjusting for measurement error. So far, however, SEM has not been fully used…
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.; Akaeze, Hope O.
2017-01-01
This note is concerned with examining the relationship between within-group and between-group variances in two-level nested designs. A latent variable modeling approach is outlined that permits point and interval estimation of their ratio and allows their comparison in a multilevel study. The procedure can also be used to test various hypotheses…
ERIC Educational Resources Information Center
Frisby, Craig L.; Wang, Ze
2016-01-01
Data from the standardization sample of the Woodcock-Johnson Psychoeducational Battery--Third Edition (WJ III) Cognitive standard battery and Test Session Observation Checklist items were analyzed to understand the relationship between g (general mental ability) and test session behavior (TSB; n = 5,769). Latent variable modeling methods were used…
ERIC Educational Resources Information Center
Olatunji, Bunmi O.; Cole, David A.
2009-01-01
In an 8-wave, 4-year longitudinal study, 787 children (Grades 3-6) completed the Revised Children's Manifest Anxiety Scale (C. R. Reynolds & B. O. Richmond, 1985), a measure of the Physiological Reactivity, Worry-Oversensitivity, and Social Alienation dimensions of anxiety. A latent variable (trait-state-occasion) model and a latent growth curve…
Causal Indicators Can Help to Interpret Factors
ERIC Educational Resources Information Center
Bentler, Peter M.
2016-01-01
The latent factor in a causal indicator model is no more than the latent factor of the factor part of the model. However, if the causal indicator variables are well-understood and help to improve the prediction of individuals' factor scores, they can help to interpret the meaning of the latent factor. Aguirre-Urreta, Rönkkö, and Marakas (2016)…
Dadousis, C; Cipolat-Gotet, C; Bittante, G; Cecchinato, A
2018-02-01
We studied the genetics of cheese-related latent variables (factors; Fs) for application in dairy cattle breeding. In total, 26 traits, recorded in 1264 Brown Swiss cows, were analyzed through multivariate factor analysis (MFA). Traits analyzed were descriptors of milk quality and yield (including protein fractions) and measures of coagulation, curd firmness (CF), cheese yields (%CY) and nutrient recoveries in the curd (REC). A total of 10 Fs (mutual orthogonal with a varimax rotation) were obtained. To assess the practical use of the Fs into breeding, we inferred their genetic parameters using single and bivariate animal models under a Bayesian framework. Heritability estimates (intra-herd) varied between 0.11 and 0.72 (F3: Yield and F7: κ-β-CN, respectively). The Fs underlined basic characteristics of the cheese-making process, milk components and udder health, while retaining 74% of the original variability. The first two Fs were indicators of the CY percentage (F1: %CY) and the CF process (F2: CF t ), and presented similar heritability estimates: 0.268 and 0.295, respectively. The third factor was associated with the yield of milk and solids (F3: Yield) characterized by a low heritability (0.108) and the fourth with the cheese nitrogen (N) (F4: Cheese N) that conversely appeared to be characterized by a high heritability (0.618). Three Fs were associated with the proportion of the basic milk caseins on total milk protein (F5: as1-β-CN, F7: κ-β-CN, F8: as2-CN), also highly heritable (0.565, 0.723 and 0.397, respectively) and 1 factor with the phosphorylated form of the as1-CN (F9: as1-CN-Ph; 0.318). Moreover, 1 factor was linked to the whey protein α-LA (F10: α-LA; 0.147). An indicator factor of a cow's udder health (F6: Udder health) was also obtained and showed a moderate heritability (0.204). Although the Fs were phenotypically uncorrelated, considerable additive genetic correlations existed among them, with highest values observed between F10: α-LA and F6: Udder health (-0.67) as well as between F9: as1-CN-Ph and F3: Yield (-0.60). Our results show the usefulness of MFA in dairy cattle breeding. The ability to replace a large number of variables with a few latent indicators of the same biological meaning marks MFA as a valuable tool for developing breeding strategies to improve cow's cheese-related traits.
The Interface Between Theory and Data in Structural Equation Models
Grace, James B.; Bollen, Kenneth A.
2006-01-01
Structural equation modeling (SEM) holds the promise of providing natural scientists the capacity to evaluate complex multivariate hypotheses about ecological systems. Building on its predecessors, path analysis and factor analysis, SEM allows for the incorporation of both observed and unobserved (latent) variables into theoretically based probabilistic models. In this paper we discuss the interface between theory and data in SEM and the use of an additional variable type, the composite, for representing general concepts. In simple terms, composite variables specify the influences of collections of other variables and can be helpful in modeling general relationships of the sort commonly of interest to ecologists. While long recognized as a potentially important element of SEM, composite variables have received very limited use, in part because of a lack of theoretical consideration, but also because of difficulties that arise in parameter estimation when using conventional solution procedures. In this paper we present a framework for discussing composites and demonstrate how the use of partially reduced form models can help to overcome some of the parameter estimation and evaluation problems associated with models containing composites. Diagnostic procedures for evaluating the most appropriate and effective use of composites are illustrated with an example from the ecological literature. It is argued that an ability to incorporate composite variables into structural equation models may be particularly valuable in the study of natural systems, where concepts are frequently multifaceted and the influences of suites of variables are often of interest.
Microcomputer-based classification of environmental data in municipal areas
NASA Astrophysics Data System (ADS)
Thiergärtner, H.
1995-10-01
Multivariate data-processing methods used in mineral resource identification can be used to classify urban regions. Using elements of expert systems, geographical information systems, as well as known classification and prognosis systems, it is possible to outline a single model that consists of resistant and of temporary parts of a knowledge base including graphical input and output treatment and of resistant and temporary elements of a bank of methods and algorithms. Whereas decision rules created by experts will be stored in expert systems directly, powerful classification rules in form of resistant but latent (implicit) decision algorithms may be implemented in the suggested model. The latent functions will be transformed into temporary explicit decision rules by learning processes depending on the actual task(s), parameter set(s), pixels selection(s), and expert control(s). This takes place both at supervised and nonsupervised classification of multivariately described pixel sets representing municipal subareas. The model is outlined briefly and illustrated by results obtained in a target area covering a part of the city of Berlin (Germany).
Bechshøft, T Ø; Sonne, C; Dietz, R; Born, E W; Muir, D C G; Letcher, R J; Novak, M A; Henchey, E; Meyer, J S; Jenssen, B M; Villanger, G D
2012-07-01
The multivariate relationship between hair cortisol, whole blood thyroid hormones, and the complex mixtures of organohalogen contaminant (OHC) levels measured in subcutaneous adipose of 23 East Greenland polar bears (eight males and 15 females, all sampled between the years 1999 and 2001) was analyzed using projection to latent structure (PLS) regression modeling. In the resulting PLS model, most important variables with a negative influence on cortisol levels were particularly BDE-99, but also CB-180, -201, BDE-153, and CB-170/190. The most important variables with a positive influence on cortisol were CB-66/95, α-HCH, TT3, as well as heptachlor epoxide, dieldrin, BDE-47, p,p'-DDD. Although statistical modeling does not necessarily fully explain biological cause-effect relationships, relationships indicate that (1) the hypothalamic-pituitary-adrenal (HPA) axis in East Greenland polar bears is likely to be affected by OHC-contaminants and (2) the association between OHCs and cortisol may be linked with the hypothalamus-pituitary-thyroid (HPT) axis. Copyright © 2012 Elsevier Inc. All rights reserved.
Konold, Timothy R; Cornell, Dewey
2015-12-01
This study tested a conceptual model of school climate in which two key elements of an authoritative school, structure and support variables, are associated with student engagement in school and lower levels of peer aggression. Multilevel multivariate structural modeling was conducted in a statewide sample of 48,027 students in 323 public high schools who completed the Authoritative School Climate Survey. As hypothesized, two measures of structure (Disciplinary Structure and Academic Expectations) and two measures of support (Respect for Students and Willingness to Seek Help) were associated with higher student engagement (Affective Engagement and Cognitive Engagement) and lower peer aggression (Prevalence of Teasing and Bullying) on both student and school levels of analysis, controlling for the effects of school demographics (school size, percentage of minority students, and percentage of low income students). These results support the extension of authoritative school climate model to high school and guide further research on the conditions for a positive school climate. Copyright © 2015 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.
Effects of additional data on Bayesian clustering.
Yamazaki, Keisuke
2017-10-01
Hierarchical probabilistic models, such as mixture models, are used for cluster analysis. These models have two types of variables: observable and latent. In cluster analysis, the latent variable is estimated, and it is expected that additional information will improve the accuracy of the estimation of the latent variable. Many proposed learning methods are able to use additional data; these include semi-supervised learning and transfer learning. However, from a statistical point of view, a complex probabilistic model that encompasses both the initial and additional data might be less accurate due to having a higher-dimensional parameter. The present paper presents a theoretical analysis of the accuracy of such a model and clarifies which factor has the greatest effect on its accuracy, the advantages of obtaining additional data, and the disadvantages of increasing the complexity. Copyright © 2017 Elsevier Ltd. All rights reserved.
Thomas, Philipp; Rammsayer, Thomas; Schweizer, Karl; Troche, Stefan
2015-01-01
Numerous studies reported a strong link between working memory capacity (WMC) and fluid intelligence (Gf), although views differ in respect to how close these two constructs are related to each other. In the present study, we used a WMC task with five levels of task demands to assess the relationship between WMC and Gf by means of a new methodological approach referred to as fixed-links modeling. Fixed-links models belong to the family of confirmatory factor analysis (CFA) and are of particular interest for experimental, repeated-measures designs. With this technique, processes systematically varying across task conditions can be disentangled from processes unaffected by the experimental manipulation. Proceeding from the assumption that experimental manipulation in a WMC task leads to increasing demands on WMC, the processes systematically varying across task conditions can be assumed to be WMC-specific. Processes not varying across task conditions, on the other hand, are probably independent of WMC. Fixed-links models allow for representing these two kinds of processes by two independent latent variables. In contrast to traditional CFA where a common latent variable is derived from the different task conditions, fixed-links models facilitate a more precise or purified representation of the WMC-related processes of interest. By using fixed-links modeling to analyze data of 200 participants, we identified a non-experimental latent variable, representing processes that remained constant irrespective of the WMC task conditions, and an experimental latent variable which reflected processes that varied as a function of experimental manipulation. This latter variable represents the increasing demands on WMC and, hence, was considered a purified measure of WMC controlled for the constant processes. Fixed-links modeling showed that both the purified measure of WMC (β = .48) as well as the constant processes involved in the task (β = .45) were related to Gf. Taken together, these two latent variables explained the same portion of variance of Gf as a single latent variable obtained by traditional CFA (β = .65) indicating that traditional CFA causes an overestimation of the effective relationship between WMC and Gf. Thus, fixed-links modeling provides a feasible method for a more valid investigation of the functional relationship between specific constructs.
Childhood malnutrition in Egypt using geoadditive Gaussian and latent variable models.
Khatab, Khaled
2010-04-01
Major progress has been made over the last 30 years in reducing the prevalence of malnutrition amongst children less than 5 years of age in developing countries. However, approximately 27% of children under the age of 5 in these countries are still malnourished. This work focuses on the childhood malnutrition in one of the biggest developing countries, Egypt. This study examined the association between bio-demographic and socioeconomic determinants and the malnutrition problem in children less than 5 years of age using the 2003 Demographic and Health survey data for Egypt. In the first step, we use separate geoadditive Gaussian models with the continuous response variables stunting (height-for-age), underweight (weight-for-age), and wasting (weight-for-height) as indicators of nutritional status in our case study. In a second step, based on the results of the first step, we apply the geoadditive Gaussian latent variable model for continuous indicators in which the 3 measurements of the malnutrition status of children are assumed as indicators for the latent variable "nutritional status".
ERIC Educational Resources Information Center
Lippke, Sonia; Nigg, Claudio R.; Maddock, Jay E.
2007-01-01
This is the first study to test whether the stages of change of the transtheoretical model are qualitatively different through exploring discontinuity patterns in theory of planned behavior (TPB) variables using latent multigroup structural equation modeling (MSEM) with AMOS. Discontinuity patterns in terms of latent means and prediction patterns…
Dynamic Factor Analysis of Nonstationary Multivariate Time Series.
ERIC Educational Resources Information Center
Molenaar, Peter C. M.; And Others
1992-01-01
The dynamic factor model proposed by P. C. Molenaar (1985) is exhibited, and a dynamic nonstationary factor model (DNFM) is constructed with latent factor series that have time-varying mean functions. The use of a DNFM is illustrated using data from a television viewing habits study. (SLD)
On Latent Growth Models for Composites and Their Constituents.
Hancock, Gregory R; Mao, Xiulin; Kher, Hemant
2013-09-01
Over the last decade and a half, latent growth modeling has become an extremely popular and versatile technique for evaluating longitudinal change and its determinants. Most common among the models applied are those for a single measured variable over time. This model has been extended in a variety of ways, most relevant for the current work being the multidomain and the second-order latent growth models. Whereas the former allows for growth function characteristics to be modeled for multiple outcomes simultaneously, with the degree of growth characteristics' relations assessed within the model (e.g., cross-domain slope factor correlations), the latter models growth in latent outcomes, each of which has effect indicators repeated over time. But what if one has an outcome that is believed to be formative relative to its indicator variables rather than latent? In this case, where the outcome is a composite of multiple constituents, modeling change over time is less straightforward. This article provides analytical and applied details for simultaneously modeling growth in composites and their constituent elements, including a real data example using a general computer self-efficacy questionnaire.
Cowley, Benjamin R.; Kaufman, Matthew T.; Butler, Zachary S.; Churchland, Mark M.; Ryu, Stephen I.; Shenoy, Krishna V.; Yu, Byron M.
2014-01-01
Objective Analyzing and interpreting the activity of a heterogeneous population of neurons can be challenging, especially as the number of neurons, experimental trials, and experimental conditions increases. One approach is to extract a set of latent variables that succinctly captures the prominent co-fluctuation patterns across the neural population. A key problem is that the number of latent variables needed to adequately describe the population activity is often greater than three, thereby preventing direct visualization of the latent space. By visualizing a small number of 2-d projections of the latent space or each latent variable individually, it is easy to miss salient features of the population activity. Approach To address this limitation, we developed a Matlab graphical user interface (called DataHigh) that allows the user to quickly and smoothly navigate through a continuum of different 2-d projections of the latent space. We also implemented a suite of additional visualization tools (including playing out population activity timecourses as a movie and displaying summary statistics, such as covariance ellipses and average timecourses) and an optional tool for performing dimensionality reduction. Main results To demonstrate the utility and versatility of DataHigh, we used it to analyze single-trial spike count and single-trial timecourse population activity recorded using a multi-electrode array, as well as trial-averaged population activity recorded using single electrodes. Significance DataHigh was developed to fulfill a need for visualization in exploratory neural data analysis, which can provide intuition that is critical for building scientific hypotheses and models of population activity. PMID:24216250
NASA Astrophysics Data System (ADS)
Cowley, Benjamin R.; Kaufman, Matthew T.; Butler, Zachary S.; Churchland, Mark M.; Ryu, Stephen I.; Shenoy, Krishna V.; Yu, Byron M.
2013-12-01
Objective. Analyzing and interpreting the activity of a heterogeneous population of neurons can be challenging, especially as the number of neurons, experimental trials, and experimental conditions increases. One approach is to extract a set of latent variables that succinctly captures the prominent co-fluctuation patterns across the neural population. A key problem is that the number of latent variables needed to adequately describe the population activity is often greater than 3, thereby preventing direct visualization of the latent space. By visualizing a small number of 2-d projections of the latent space or each latent variable individually, it is easy to miss salient features of the population activity. Approach. To address this limitation, we developed a Matlab graphical user interface (called DataHigh) that allows the user to quickly and smoothly navigate through a continuum of different 2-d projections of the latent space. We also implemented a suite of additional visualization tools (including playing out population activity timecourses as a movie and displaying summary statistics, such as covariance ellipses and average timecourses) and an optional tool for performing dimensionality reduction. Main results. To demonstrate the utility and versatility of DataHigh, we used it to analyze single-trial spike count and single-trial timecourse population activity recorded using a multi-electrode array, as well as trial-averaged population activity recorded using single electrodes. Significance. DataHigh was developed to fulfil a need for visualization in exploratory neural data analysis, which can provide intuition that is critical for building scientific hypotheses and models of population activity.
Cowley, Benjamin R; Kaufman, Matthew T; Butler, Zachary S; Churchland, Mark M; Ryu, Stephen I; Shenoy, Krishna V; Yu, Byron M
2013-12-01
Analyzing and interpreting the activity of a heterogeneous population of neurons can be challenging, especially as the number of neurons, experimental trials, and experimental conditions increases. One approach is to extract a set of latent variables that succinctly captures the prominent co-fluctuation patterns across the neural population. A key problem is that the number of latent variables needed to adequately describe the population activity is often greater than 3, thereby preventing direct visualization of the latent space. By visualizing a small number of 2-d projections of the latent space or each latent variable individually, it is easy to miss salient features of the population activity. To address this limitation, we developed a Matlab graphical user interface (called DataHigh) that allows the user to quickly and smoothly navigate through a continuum of different 2-d projections of the latent space. We also implemented a suite of additional visualization tools (including playing out population activity timecourses as a movie and displaying summary statistics, such as covariance ellipses and average timecourses) and an optional tool for performing dimensionality reduction. To demonstrate the utility and versatility of DataHigh, we used it to analyze single-trial spike count and single-trial timecourse population activity recorded using a multi-electrode array, as well as trial-averaged population activity recorded using single electrodes. DataHigh was developed to fulfil a need for visualization in exploratory neural data analysis, which can provide intuition that is critical for building scientific hypotheses and models of population activity.
Inverse Ising problem in continuous time: A latent variable approach
NASA Astrophysics Data System (ADS)
Donner, Christian; Opper, Manfred
2017-12-01
We consider the inverse Ising problem: the inference of network couplings from observed spin trajectories for a model with continuous time Glauber dynamics. By introducing two sets of auxiliary latent random variables we render the likelihood into a form which allows for simple iterative inference algorithms with analytical updates. The variables are (1) Poisson variables to linearize an exponential term which is typical for point process likelihoods and (2) Pólya-Gamma variables, which make the likelihood quadratic in the coupling parameters. Using the augmented likelihood, we derive an expectation-maximization (EM) algorithm to obtain the maximum likelihood estimate of network parameters. Using a third set of latent variables we extend the EM algorithm to sparse couplings via L1 regularization. Finally, we develop an efficient approximate Bayesian inference algorithm using a variational approach. We demonstrate the performance of our algorithms on data simulated from an Ising model. For data which are simulated from a more biologically plausible network with spiking neurons, we show that the Ising model captures well the low order statistics of the data and how the Ising couplings are related to the underlying synaptic structure of the simulated network.
Oberg, Tomas
2004-01-01
Halogenated aliphatic compounds have many technical uses, but substances within this group are also ubiquitous environmental pollutants that can affect the ozone layer and contribute to global warming. The establishment of quantitative structure-property relationships is of interest not only to fill in gaps in the available database but also to validate experimental data already acquired. The three-dimensional structures of 240 compounds were modeled with molecular mechanics prior to the generation of empirical descriptors. Two bilinear projection methods, principal component analysis (PCA) and partial-least-squares regression (PLSR), were used to identify outliers. PLSR was subsequently used to build a multivariate calibration model by extracting the latent variables that describe most of the covariation between the molecular structure and the boiling point. Boiling points were also estimated with an extension of the group contribution method of Stein and Brown.
Assessment Practices of Child Clinicians.
Cook, Jonathan R; Hausman, Estee M; Jensen-Doss, Amanda; Hawley, Kristin M
2017-03-01
Assessment is an integral component of treatment. However, prior surveys indicate clinicians may not use standardized assessment strategies. We surveyed 1,510 clinicians and used multivariate analysis of variance to explore group differences in specific measure use. Clinicians used unstandardized measures more frequently than standardized measures, although psychologists used standardized measures more frequently than nonpsychologists. We also used latent profile analysis to classify clinicians based on their overall approach to assessment and examined associations between clinician-level variables and assessment class or profile membership. A four-profile model best fit the data. The largest profile consisted of clinicians who primarily used unstandardized assessments (76.7%), followed by broad-spectrum assessors who regularly use both standardized and unstandardized assessment (11.9%), and two smaller profiles of minimal (6.0%) and selective assessors (5.5%). Compared with broad-spectrum assessors, unstandardized and minimal assessors were less likely to report having adequate standardized measures training. Implications for clinical practice and training are discussed.
Measuring Disparities: Bias in the SF-36v2 among Spanish-speaking Medical Patients
Sudano, Joseph J.; Perzynski, Adam; Love, Thomas E.; Lewis, Steven A.; Murray, Patrick M.; Huber, Gail; Ruo, Bernice; Baker, David W.
2011-01-01
Background Many national surveys have found substantial differences in self-reported overall health (SROH) between Spanish-speaking Hispanics and other racial/ethnic groups. However, because cultural and language differences may create measurement bias, it is unclear whether observed differences in SROH reflect true differences in health. Objectives This study uses a cross-sectional survey to investigate psychometric properties of the SF-36v2 for subjects across four racial/ethnic and language groups. Multi-group latent variable modeling was used to test increasingly stringent criteria for measurement equivalence. Subjects Our sample (N = 1281) included 383 non-Hispanic whites, 368 non-Hispanic blacks, 206 Hispanics interviewed in English and 324 Hispanics interviewed in Spanish recruited from outpatient medical clinics in two large urban areas. Results We found weak factorial invariance across the four groups. However, there was no strong factorial invariance. The overall fit of the model was substantially worse (change in CFI > .02, RMSEA change > .003) after requiring equal intercepts across all groups. Further comparisons established that the equality constraints on the intercepts for Spanish-speaking Hispanics were responsible for the decrement to model fit. Conclusions Observed differences between SF-36v2 scores for Spanish speaking Hispanics are systematically biased relative to the other three groups. The lack of strong invariance suggests the need for caution when comparing SF-36v2 mean scores of Spanish-speaking Hispanics with those of other groups. However, measurement equivalence testing for this study supports correlational or multivariate latent variable analyses of SF-36v2 responses across all four subgroups, since these analyses require only weak factorial invariance. PMID:21430580
Verbal Neuropsychological Functions in Aphasia: An Integrative Model
ERIC Educational Resources Information Center
Vigliecca, Nora Silvana; Báez, Sandra
2015-01-01
A theoretical framework which considers the verbal functions of the brain under a multivariate and comprehensive cognitive model was statistically analyzed. A confirmatory factor analysis was performed to verify whether some recognized aphasia constructs can be hierarchically integrated as latent factors from a homogenously verbal test. The Brief…
Racial Variation in Vocational Rehabilitation Outcomes: A Structural Equation Modeling Approach
ERIC Educational Resources Information Center
Martin, Frank H.
2010-01-01
Numerous studies have indicated racial and ethnic disparities in the vocational rehabilitation (VR) system, including differences in acceptance, services provided, closure types, and employment outcomes. Few of these studies, however, have used advanced multivariate techniques or latent constructs to measure quality of employment outcomes (QEO) or…
ERIC Educational Resources Information Center
McDonald, Roderick P.
2011-01-01
A distinction is proposed between measures and predictors of latent variables. The discussion addresses the consequences of the distinction for the true-score model, the linear factor model, Structural Equation Models, longitudinal and multilevel models, and item-response models. A distribution-free treatment of calibration and…
Massof, Robert W
2014-10-01
A simple theoretical framework explains patient responses to items in rating scale questionnaires. Fixed latent variables position each patient and each item on the same linear scale. Item responses are governed by a set of fixed category thresholds, one for each ordinal response category. A patient's item responses are magnitude estimates of the difference between the patient variable and the patient's estimate of the item variable, relative to his/her personally defined response category thresholds. Differences between patients in their personal estimates of the item variable and in their personal choices of category thresholds are represented by random variables added to the corresponding fixed variables. Effects of intervention correspond to changes in the patient variable, the patient's response bias, and/or latent item variables for a subset of items. Intervention effects on patients' item responses were simulated by assuming the random variables are normally distributed with a constant scalar covariance matrix. Rasch analysis was used to estimate latent variables from the simulated responses. The simulations demonstrate that changes in the patient variable and changes in response bias produce indistinguishable effects on item responses and manifest as changes only in the estimated patient variable. Changes in a subset of item variables manifest as intervention-specific differential item functioning and as changes in the estimated person variable that equals the average of changes in the item variables. Simulations demonstrate that intervention-specific differential item functioning produces inefficiencies and inaccuracies in computer adaptive testing. © The Author(s) 2013 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Stockbridge, Erica L; Miller, Thaddeus L; Carlson, Erin K; Ho, Christine
2018-01-01
To determine whether latent tuberculosis infection risk factors are associated with an increased likelihood of latent tuberculosis infection testing in the US private healthcare sector. A national sample of medical and pharmacy claims representing services rendered January 2011 through December 2013 for 3,997,986 commercially insured individuals in the US who were 0 to 64 years of age. We used multivariable logistic regression models to determine whether TB/LTBI risk factors were associated with an increased likelihood of Interferon-Gamma Release Assay (IGRA) or Tuberculin Skin Test (TST) testing in the private sector. 4.31% (4.27-4.34%) received at least one TST/IGRA test between 2011 and 2013 while 1.69% (1.67-1.72%) received a TST/IGRA test in 2013. Clinical risk factors associated with a significantly increased likelihood of testing included HIV, immunosuppressive therapy, exposure to tuberculosis, a history of tuberculosis, diabetes, tobacco use, end stage renal disease, and alcohol use disorder. Other significant variables included gender, age, asthma, the state tuberculosis rate, population density, and percent of foreign-born persons in a county. Private sector TST/IGRA testing is not uncommon and testing varies with clinical risk indicators. Thus, the private sector can be a powerful resource in the fight against tuberculosis. Analyses of administrative data can inform how best to leverage private sector healthcare toward tuberculosis prevention activities.
NASA Astrophysics Data System (ADS)
Chen, Yi-Ying; Chu, Chia-Ren; Li, Ming-Hsu
2012-10-01
SummaryIn this paper we present a semi-parametric multivariate gap-filling model for tower-based measurement of latent heat flux (LE). Two statistical techniques, the principal component analysis (PCA) and a nonlinear interpolation approach were integrated into this LE gap-filling model. The PCA was first used to resolve the multicollinearity relationships among various environmental variables, including radiation, soil moisture deficit, leaf area index, wind speed, etc. Two nonlinear interpolation methods, multiple regressions (MRS) and the K-nearest neighbors (KNNs) were examined with random selected flux gaps for both clear sky and nighttime/cloudy data to incorporate into this LE gap-filling model. Experimental results indicated that the KNN interpolation approach is able to provide consistent LE estimations while MRS presents over estimations during nighttime/cloudy. Rather than using empirical regression parameters, the KNN approach resolves the nonlinear relationship between the gap-filled LE flux and principal components with adaptive K values under different atmospheric states. The developed LE gap-filling model (PCA with KNN) works with a RMSE of 2.4 W m-2 (˜0.09 mm day-1) at a weekly time scale by adding 40% artificial flux gaps into original dataset. Annual evapotranspiration at this study site were estimated at 736 mm (1803 MJ) and 728 mm (1785 MJ) for year 2008 and 2009, respectively.
Coly, A; Morisky, D
2004-06-01
Two health clinics in Los Angeles County, California. To identify factors associated with completion of care among foreign-born adolescents treated for latent tuberculosis infection (LTBI). A total of 766 low-income adolescents (79% participation rate), including 610 foreign-born, were recruited. In prospective face-to-face interviews, data were obtained on socio-demographic and lifestyle characteristics, psychosocial factors and clinic-related variables. Medical chart data were abstracted regarding clinic appointment keeping and completion of treatment. Univariate and multivariate logistic regression analyses were performed to identify factors associated with completion of care. Foreign-born adolescents were more likely to complete care than US-born adolescents, with 82% completion of care rate. In logistic regression analyses after controlling for age, medication taking behavior (OR 1.26, 95%CI 1.15-1.39), living with both parents (OR 1.74, 95%CI 1.02-2.97), sexual intercourse (OR 0.66, 95%CI 0.36-1.19) and speaking mostly or only English with parents (OR 0.39, 95%CI 0.15-1.03) were independently associated with completion of care. These findings contribute to our understanding of the factors that may explain why some adolescents complete care whereas others do not. They provide supportive evidence that tailored intervention programs should be developed to support the screening and completion of treatment of foreign-born adolescents.
Latent lifestyle preferences and household location decisions
NASA Astrophysics Data System (ADS)
Walker, Joan L.; Li, Jieping
2007-04-01
Lifestyle, indicating preferences towards a particular way of living, is a key driver of the decision of where to live. We employ latent class choice models to represent this behavior, where the latent classes are the lifestyles and the choice model is the choice of residential location. Thus, we simultaneously estimate lifestyle groups and how lifestyle impacts location decisions. Empirical results indicate three latent lifestyle segments: suburban dwellers, urban dwellers, and transit-riders. The suggested lifestyle segments have intriguing policy implications. Lifecycle characteristics are used to predict lifestyle preferences, although there remain significant aspects that cannot be explained by observable variables.
NASA Astrophysics Data System (ADS)
Mathew, Sneha Susan; Kumar, Karanam Kishore
2018-05-01
The latent heat released in the clouds over the tropics plays a vital role in driving the Hadley circulation (HC). The present study discusses the influence of latent heating (LH) on the HC parameters viz., centre, strength and total width by using precipitation LH profiles derived from the space-borne observations of the Precipitation Radar (PR) onboard Tropical Rain Measuring Mission (TRMM) and meridional stream function (MSF) derived from ECMWF-Interim reanalysis. The latitude of peak latent heating, width of the latent heating distribution and the total LH released within the ascending limb of the HC are estimated and their influence on the HC centre, strength and width is quantified, for the first time. The present results show that the latitude of peak LH significantly influences the position of the HC centre with correlation coefficient of 0.90. This high correlation between these two quantities seems to be due to their co-variability with the apparent motion of the Sun across the latitudes. The intensity of the HC in the NH as well as SH shows high correlation with the latitude of peak LH with coefficients - 0.85 and - 0.78, respectively. These results indicate that farther the latitude of peak LH from the equator in the summer hemisphere, stronger is the HC intensity in the winter hemisphere. The present analysis also reveals that the total LH released within the ascending limb of HC substantially influence the total width of the HC, with correlation coefficient 0.52, as compared to the other two LH parameters. This observation can be attributed to the fact that the HC is sensitive to the latent heat release in the mid-tropospheric levels in the tropics. An attempt is also made to investigate the degree of variability of these parameters after deseasonalization and results are discussed in the light of present understanding. The significance of the present study lies in providing the observational evidence for the influence of latent heating on the HC strength/width variability, quantitatively, for the first time using TRMM observations of precipitation latent heating.
Dalvand, Sahar; Koohpayehzadeh, Jalil; Karimlou, Masoud; Asgari, Fereshteh; Rafei, Ali; Seifi, Behjat; Niksima, Seyed Hassan; Bakhshi, Enayatollah
2015-01-01
Because the use of BMI (Body Mass Index) alone as a measure of adiposity has been criticized, in the present study our aim was to fit a latent variable model to simultaneously examine the factors that affect waist circumference (continuous outcome) and obesity (binary outcome) among Iranian adults. Data included 18,990 Iranian individuals aged 20-65 years that are derived from the third National Survey of Noncommunicable Diseases Risk Factors in Iran. Using latent variable model, we estimated the relation of two correlated responses (waist circumference and obesity) with independent variables including age, gender, PR (Place of Residence), PA (physical activity), smoking status, SBP (Systolic Blood Pressure), DBP (Diastolic Blood Pressure), CHOL (cholesterol), FBG (Fasting Blood Glucose), diabetes, and FHD (family history of diabetes). All variables were related to both obesity and waist circumference (WC). Older age, female sex, being an urban resident, physical inactivity, nonsmoking, hypertension, hypercholesterolemia, hyperglycemia, diabetes, and having family history of diabetes were significant risk factors that increased WC and obesity. Findings from this study of Iranian adult settings offer more insights into factors associated with high WC and high prevalence of obesity in this population.
Geiser, Christian; Keller, Brian T.; Lockhart, Ginger; Eid, Michael; Cole, David A.; Koch, Tobias
2014-01-01
Researchers analyzing longitudinal data often want to find out whether the process they study is characterized by (1) short-term state variability, (2) long-term trait change, or (3) a combination of state variability and trait change. Classical latent state-trait (LST) models are designed to measure reversible state variability around a fixed set-point or trait, whereas latent growth curve (LGC) models focus on long-lasting and often irreversible trait changes. In the present paper, we contrast LST and LGC models from the perspective of measurement invariance (MI) testing. We show that establishing a pure state-variability process requires (a) the inclusion of a mean structure and (b) establishing strong factorial invariance in LST analyses. Analytical derivations and simulations demonstrate that LST models with non-invariant parameters can mask the fact that a trait-change or hybrid process has generated the data. Furthermore, the inappropriate application of LST models to trait change or hybrid data can lead to bias in the estimates of consistency and occasion-specificity, which are typically of key interest in LST analyses. Four tips for the proper application of LST models are provided. PMID:24652650
Sun, Fei; Xu, Bing; Zhang, Yi; Dai, Shengyun; Shi, Xinyuan; Qiao, Yanjiang
2017-01-01
ABSTRACT The dissolution is one of the critical quality attributes (CQAs) of oral solid dosage forms because it relates to the absorption of drug. In this paper, the influence of raw materials, granules and process parameters on the dissolution of paracetamol tablet was analyzed using latent variable modeling methods. The variability in raw materials and granules was understood based on the principle component analysis (PCA), respectively. A multi-block partial least squares (MBPLS) model was used to determine the critical factors affecting the dissolution. The results showed that the binder amount, the post granulation time, the API content in granule, the fill depth and the punch tip separation distance were the critical factors with variable importance in the projection (VIP) values larger than 1. The importance of each unit of the whole process was also ranked using the block importance in the projection (BIP) index. It was concluded that latent variable models (LVMs) were very useful tools to extract information from the available data and improve the understanding on dissolution behavior of paracetamol tablet. The obtained LVMs were also helpful to propose the process design space and to design control strategies in the further research. PMID:27689242
Multiple indicators, multiple causes measurement error models
Tekwe, Carmen D.; Carter, Randy L.; Cullings, Harry M.; ...
2014-06-25
Multiple indicators, multiple causes (MIMIC) models are often employed by researchers studying the effects of an unobservable latent variable on a set of outcomes, when causes of the latent variable are observed. There are times, however, when the causes of the latent variable are not observed because measurements of the causal variable are contaminated by measurement error. The objectives of this study are as follows: (i) to develop a novel model by extending the classical linear MIMIC model to allow both Berkson and classical measurement errors, defining the MIMIC measurement error (MIMIC ME) model; (ii) to develop likelihood-based estimation methodsmore » for the MIMIC ME model; and (iii) to apply the newly defined MIMIC ME model to atomic bomb survivor data to study the impact of dyslipidemia and radiation dose on the physical manifestations of dyslipidemia. Finally, as a by-product of our work, we also obtain a data-driven estimate of the variance of the classical measurement error associated with an estimate of the amount of radiation dose received by atomic bomb survivors at the time of their exposure.« less
Multiple Indicators, Multiple Causes Measurement Error Models
Tekwe, Carmen D.; Carter, Randy L.; Cullings, Harry M.; Carroll, Raymond J.
2014-01-01
Multiple Indicators, Multiple Causes Models (MIMIC) are often employed by researchers studying the effects of an unobservable latent variable on a set of outcomes, when causes of the latent variable are observed. There are times however when the causes of the latent variable are not observed because measurements of the causal variable are contaminated by measurement error. The objectives of this paper are: (1) to develop a novel model by extending the classical linear MIMIC model to allow both Berkson and classical measurement errors, defining the MIMIC measurement error (MIMIC ME) model, (2) to develop likelihood based estimation methods for the MIMIC ME model, (3) to apply the newly defined MIMIC ME model to atomic bomb survivor data to study the impact of dyslipidemia and radiation dose on the physical manifestations of dyslipidemia. As a by-product of our work, we also obtain a data-driven estimate of the variance of the classical measurement error associated with an estimate of the amount of radiation dose received by atomic bomb survivors at the time of their exposure. PMID:24962535
Multiple indicators, multiple causes measurement error models
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tekwe, Carmen D.; Carter, Randy L.; Cullings, Harry M.
Multiple indicators, multiple causes (MIMIC) models are often employed by researchers studying the effects of an unobservable latent variable on a set of outcomes, when causes of the latent variable are observed. There are times, however, when the causes of the latent variable are not observed because measurements of the causal variable are contaminated by measurement error. The objectives of this study are as follows: (i) to develop a novel model by extending the classical linear MIMIC model to allow both Berkson and classical measurement errors, defining the MIMIC measurement error (MIMIC ME) model; (ii) to develop likelihood-based estimation methodsmore » for the MIMIC ME model; and (iii) to apply the newly defined MIMIC ME model to atomic bomb survivor data to study the impact of dyslipidemia and radiation dose on the physical manifestations of dyslipidemia. Finally, as a by-product of our work, we also obtain a data-driven estimate of the variance of the classical measurement error associated with an estimate of the amount of radiation dose received by atomic bomb survivors at the time of their exposure.« less
Wang, Peng-Wei; Yen, Cheng-Fang
2017-12-08
Adolescent suicidal behavior may consist of different symptoms, including suicidal ideation, suicidal planning and suicidal attempts. Adolescent substance use behavior may contribute to adolescent suicidal behavior. However, research on the relationships between specific substance use and individual suicidal behavior is insufficient, as adolescents may not use only one substance or develop only one facet of suicidal behavior. Latent variables permit us to describe the relationships between clusters of related behaviors more accurately than studying the relationships between specific behaviors. Thus, the aim of this study was to explore how adolescent substance use behavior contributes to suicidal behavior using latent variables representing adolescent suicidal and substance use behaviors. A total of 13,985 adolescents were recruited using a stratified random sampling strategy. The participants indicated whether they had experienced suicidal ideation, planning and attempts and reported their cigarette, alcohol, ketamine and MDMA use during the past year. Latent analysis was used to examine the relationship between substance use and suicidal behavior. Adolescents who used any one of the above substances exhibited more suicidal behavior. The results of latent variables analysis revealed that adolescent substance use contributed to suicidal behavior and that boys exhibited more severe substance use behavior than girls. However, there was no gender difference in the association between substance use and suicidal behavior. Substance use behavior in adolescents is related to more suicidal behavior. In addition, the contribution of substance use to suicidal behavior does not differ between genders.
Chen, Jiabo; Li, Fayun; Fan, Zhiping; Wang, Yanjie
2016-01-01
Source apportionment of river water pollution is critical in water resource management and aquatic conservation. Comprehensive application of various GIS-based multivariate statistical methods was performed to analyze datasets (2009–2011) on water quality in the Liao River system (China). Cluster analysis (CA) classified the 12 months of the year into three groups (May–October, February–April and November–January) and the 66 sampling sites into three groups (groups A, B and C) based on similarities in water quality characteristics. Discriminant analysis (DA) determined that temperature, dissolved oxygen (DO), pH, chemical oxygen demand (CODMn), 5-day biochemical oxygen demand (BOD5), NH4+–N, total phosphorus (TP) and volatile phenols were significant variables affecting temporal variations, with 81.2% correct assignments. Principal component analysis (PCA) and positive matrix factorization (PMF) identified eight potential pollution factors for each part of the data structure, explaining more than 61% of the total variance. Oxygen-consuming organics from cropland and woodland runoff were the main latent pollution factor for group A. For group B, the main pollutants were oxygen-consuming organics, oil, nutrients and fecal matter. For group C, the evaluated pollutants primarily included oxygen-consuming organics, oil and toxic organics. PMID:27775679
Punzo, Antonio; Ingrassia, Salvatore; Maruotti, Antonello
2018-04-22
A time-varying latent variable model is proposed to jointly analyze multivariate mixed-support longitudinal data. The proposal can be viewed as an extension of hidden Markov regression models with fixed covariates (HMRMFCs), which is the state of the art for modelling longitudinal data, with a special focus on the underlying clustering structure. HMRMFCs are inadequate for applications in which a clustering structure can be identified in the distribution of the covariates, as the clustering is independent from the covariates distribution. Here, hidden Markov regression models with random covariates are introduced by explicitly specifying state-specific distributions for the covariates, with the aim of improving the recovering of the clusters in the data with respect to a fixed covariates paradigm. The hidden Markov regression models with random covariates class is defined focusing on the exponential family, in a generalized linear model framework. Model identifiability conditions are sketched, an expectation-maximization algorithm is outlined for parameter estimation, and various implementation and operational issues are discussed. Properties of the estimators of the regression coefficients, as well as of the hidden path parameters, are evaluated through simulation experiments and compared with those of HMRMFCs. The method is applied to physical activity data. Copyright © 2018 John Wiley & Sons, Ltd.
Kim, Hyungsuk; Park, Young-Jae; Park, Young-Bae
2013-01-01
Individuals may perceive the concepts in Korean medicine pattern classification differently because it is performed according to the integration of a variety of information. Therefore, analysis about individual perspective is very important for examining the cross-sectional perspective state of Korean medicine concepts and developing both the clinical guideline including diagnosis and the curriculum of Korean medicine colleges. Moreover, because this conceptual difference is thought to begin with college education, it is worthwhile to observe students' viewpoints. So, we suggested multivariate analysis to explore the dimensional structure of Korean medicine students' conceptual perceptions regarding phlegm pattern. We surveyed 326 students divided into 5 groups based on their year of study. Data were analyzed using multidimensional scaling and factor analysis. Within-group difference was the smallest for third-year students, who have received Korean medicine education in full for the first time. With the exception of first-year students, the conceptual map revealed that each group's mean perceptions of phlegm pattern were distributed in almost linear fashion. To determine the effect of education, we investigated the preference rankings and scores of each symptom. We also extracted factors to identify latent variables and to compare the between-group conceptual characteristics regarding phlegm pattern. PMID:24062789
Benson, Nicholas F; Kranzler, John H; Floyd, Randy G
2016-10-01
Prior research examining cognitive ability and academic achievement relations have been based on different theoretical models, have employed both latent variables as well as observed variables, and have used a variety of analytic methods. Not surprisingly, results have been inconsistent across studies. The aims of this study were to (a) examine how relations between psychometric g, Cattell-Horn-Carroll (CHC) broad abilities, and academic achievement differ across higher-order and bifactor models; (b) examine how well various types of observed scores corresponded with latent variables; and (c) compare two types of observed scores (i.e., refined and non-refined factor scores) as predictors of academic achievement. Results suggest that cognitive-achievement relations vary across theoretical models and that both types of factor scores tend to correspond well with the models on which they are based. However, orthogonal refined factor scores (derived from a bifactor model) have the advantage of controlling for multicollinearity arising from the measurement of psychometric g across all measures of cognitive abilities. Results indicate that the refined factor scores provide more precise representations of their targeted constructs than non-refined factor scores and maintain close correspondence with the cognitive-achievement relations observed for latent variables. Thus, we argue that orthogonal refined factor scores provide more accurate representations of the relations between CHC broad abilities and achievement outcomes than non-refined scores do. Further, the use of refined factor scores addresses calls for the application of scores based on latent variable models. Copyright © 2016 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Gottfried, Adele Eskeles; Marcoulides, George A.; Gottfried, Allen W.; Oliver, Pamella H.; Guerin, Diana Wright
2007-01-01
Research has established that academic intrinsic motivation, enjoyment of school learning without receipt of external rewards, significantly declines across childhood through adolescence. Math intrinsic motivation evidences the most severe decline compared with other subject areas. This study addresses this developmental decline in math intrinsic…
ERIC Educational Resources Information Center
Rupp, Andre A.; Templin, Jonathan L.
2008-01-01
"Diagnostic classification models" (DCM) are frequently promoted by psychometricians as important modelling alternatives for analyzing response data in situations where multivariate classifications of respondents are made on the basis of multiple postulated latent skills. In this review paper, a definitional boundary of the space of DCM…
Seeto, Mark
2017-01-01
Recent epidemiological data suggest the relation between hearing difficulty and depression is more evident in younger and middle-aged populations than in older adults. There are also suggestions that the relation may be more evident in specific subgroups; that is, other factors may influence a relationship between hearing and depression in different subgroups. Using cross-sectional data from the UK Biobank on 134,357 community-dwelling people and structural equation modelling, this study examined the potential mediating influence of social isolation and unemployment and the confounding influence of physical illness and cardiovascular conditions on the relation between a latent hearing variable and both a latent depressive episodes variable and a latent depressive symptoms variable. The models were stratified by age (40s, 50s, and 60s) and gender and further controlled for physical illness and professional support in associations involving social isolation and unemployment. The latent hearing variable was primarily defined by reported hearing difficulty in noise. For all subgroups, poor hearing was significantly related to both more depressive episodes and more depressive symptoms. In all models, the direct and generally small association exceeded the indirect associations via physical health and social interaction. Significant (depressive episodes) and near significant (depressive symptoms) higher direct associations were estimated for males in their 40s and 50s than for males in their 60s. There was at each age-group no significant difference in estimated associations across gender. Irrespective of the temporal order of variables, findings suggest that audiological services should facilitate psychosocial counselling. PMID:28752806
Keidser, Gitte; Seeto, Mark
2017-01-01
Recent epidemiological data suggest the relation between hearing difficulty and depression is more evident in younger and middle-aged populations than in older adults. There are also suggestions that the relation may be more evident in specific subgroups; that is, other factors may influence a relationship between hearing and depression in different subgroups. Using cross-sectional data from the UK Biobank on 134,357 community-dwelling people and structural equation modelling, this study examined the potential mediating influence of social isolation and unemployment and the confounding influence of physical illness and cardiovascular conditions on the relation between a latent hearing variable and both a latent depressive episodes variable and a latent depressive symptoms variable. The models were stratified by age (40s, 50s, and 60s) and gender and further controlled for physical illness and professional support in associations involving social isolation and unemployment. The latent hearing variable was primarily defined by reported hearing difficulty in noise. For all subgroups, poor hearing was significantly related to both more depressive episodes and more depressive symptoms. In all models, the direct and generally small association exceeded the indirect associations via physical health and social interaction. Significant (depressive episodes) and near significant (depressive symptoms) higher direct associations were estimated for males in their 40s and 50s than for males in their 60s. There was at each age-group no significant difference in estimated associations across gender. Irrespective of the temporal order of variables, findings suggest that audiological services should facilitate psychosocial counselling.
Piper, Megan E.; Bolt, Daniel M.; Kim, Su-Young; Japuntich, Sandra J.; Smith, Stevens S.; Niederdeppe, Jeff; Cannon, Dale S.; Baker, Timothy B.
2008-01-01
The construct of tobacco dependence is important from both scientific and public health perspectives, but it is poorly understood. The current research integrates person-centered analyses (e.g., latent profile analysis) and variable-centered analyses (e.g., exploratory factor analysis) to understand better the latent structure of dependence and to guide distillation of the phenotype. Using data from four samples of smokers (including treatment and non-treatment samples), latent profiles were derived using the Wisconsin Inventory of Smoking Dependence Motives (WISDM) subscale scores. Across all four samples, results revealed a unique latent profile that had relative elevations on four dependence motive subscales (Automaticity, Craving, Loss of Control, and Tolerance). Variable-centered analyses supported the uniqueness of these four subscales both as measures of a common factor distinct from that underlying the other nine subscales, and as the strongest predictors of relapse, withdrawal and other dependence criteria. Conversely, the remaining nine motives carried little unique predictive validity regarding dependence. Applications of a factor mixture model further support the presence of a unique class of smokers in relation to a common factor underlying the four subscales. The results illustrate how person-centered analyses may be useful as a supplement to variable-centered analyses for uncovering variables that are necessary and/or sufficient predictors of disorder criteria, as they may uncover small segments of a population in which the variables are uniquely distributed. The results also suggest that severe dependence is associated with a pattern of smoking that is heavy, pervasive, automatic and relatively unresponsive to instrumental contingencies. PMID:19025223
Sotgiu, Giovanni; Altet-Gómez, Neus; Tsolia, Maria; Ruga, Ezia; Velizarova, Svetlana; Kampmann, Beate
2012-01-01
Rationale: Interferon-γ (IFN-γ) release assays are widely used to diagnose latent infection with Mycobacterium tuberculosis in adults, but their performance in children remains incompletely evaluated to date. Objectives: To investigate factors influencing results of IFN-γ release assays in children using a large European data set. Methods: The Pediatric Tuberculosis Network European Trials group pooled and analyzed data from five sites across Europe comprising 1,128 children who were all investigated for latent tuberculosis infection by tuberculin skin test and at least one IFN-γ release assay. Multivariate analyses examined age, bacillus Calmette-Guérin (BCG) vaccination status, and sex as predictor variables of results. Subgroup analyses included children who were household contacts. Measurements and Main Results: A total of 1,093 children had a QuantiFERON-TB Gold In-Tube assay and 382 had a T-SPOT.TB IFN-γ release assay. Age was positively correlated with a positive blood result (QuantiFERON-TB Gold In-Tube: odds ratio [OR], 1.08 per year increasing age [P < 0.0001]; T-SPOT.TB: OR, 1.14 per year increasing age [P < 0.001]). A positive QuantiFERON-TB Gold In-Tube result was shown by 5.5% of children with a tuberculin skin test result less than 5 mm, by 14.8% if less than 10 mm, and by 20.2% if less than 15 mm. Prior BCG vaccination was associated with a negative IFN-γ release assay result (QuantiFERON-TB Gold In-Tube: OR, 0.41 [P < 0.001]; T-SPOT.TB: OR, 0.41 [P < 0.001]). Young age was a predictor of indeterminate IFN-γ release assay results, but indeterminate rates were low (3.6% in children < 5 yr, 1% in children > 5 yr). Conclusions: Our data show that BCG vaccination may be effective in protecting children against Mycobacterium tuberculosis infection. To restrict use of IFN-γ release assays to children with positive skin tests risks underestimating latent infection. PMID:22700862
Scale Reliability Evaluation with Heterogeneous Populations
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2015-01-01
A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…
Measurement of Psychological Disorders Using Cognitive Diagnosis Models
ERIC Educational Resources Information Center
Templin, Jonathan L.; Henson, Robert A.
2006-01-01
Cognitive diagnosis models are constrained (multiple classification) latent class models that characterize the relationship of questionnaire responses to a set of dichotomous latent variables. Having emanated from educational measurement, several aspects of such models seem well suited to use in psychological assessment and diagnosis. This article…
Clayton, Francina J; Sears, Claire; Davis, Alice; Hulme, Charles
2018-07-01
Paired-associate learning (PAL) tasks measure the ability to form a novel association between a stimulus and a response. Performance on such tasks is strongly associated with reading ability, and there is increasing evidence that verbal task demands may be critical in explaining this relationship. The current study investigated the relationships between different forms of PAL and reading ability. A total of 97 children aged 8-10 years completed a battery of reading assessments and six different PAL tasks (phoneme-phoneme, visual-phoneme, nonverbal-nonverbal, visual-nonverbal, nonword-nonword, and visual-nonword) involving both familiar phonemes and unfamiliar nonwords. A latent variable path model showed that PAL ability is captured by two correlated latent variables: auditory-articulatory and visual-articulatory. The auditory-articulatory latent variable was the stronger predictor of reading ability, providing support for a verbal account of the PAL-reading relationship. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
A multilevel model for comorbid outcomes: obesity and diabetes in the US.
Congdon, Peter
2010-02-01
Multilevel models are overwhelmingly applied to single health outcomes, but when two or more health conditions are closely related, it is important that contextual variation in their joint prevalence (e.g., variations over different geographic settings) is considered. A multinomial multilevel logit regression approach for analysing joint prevalence is proposed here that includes subject level risk factors (e.g., age, race, education) while also taking account of geographic context. Data from a US population health survey (the 2007 Behavioral Risk Factor Surveillance System or BRFSS) are used to illustrate the method, with a six category multinomial outcome defined by diabetic status and weight category (obese, overweight, normal). The influence of geographic context is partly represented by known geographic variables (e.g., county poverty), and partly by a model for latent area influences. In particular, a shared latent variable (common factor) approach is proposed to measure the impact of unobserved area influences on joint weight and diabetes status, with the latent variable being spatially structured to reflect geographic clustering in risk.
Latent Variable Modeling of Brain Gray Matter Volume and Psychopathy in Incarcerated Offenders
Baskin-Sommers, Arielle R.; Neumann, Craig S.; Cope, Lora M.; Kiehl, Kent A.
2016-01-01
Advanced statistical modeling has become a prominent feature in psychological science and can be a useful approach for representing the neural architecture linked to psychopathology. Psychopathy, a disorder characterized by dysfunction in interpersonal-affective and impulsive-antisocial domains, is associated with widespread neural abnormalities. Several imaging studies suggest that underlying structural deficits in paralimbic regions are associated with psychopathy. While these studies are useful, they make assumptions about the organization of the brain and its relevance to individuals displaying psychopathic features. Capitalizing on statistical modeling, the present study (N=254) used latent variable methods to examine the structure of gray matter volume in male offenders, and assessed the latent relations between psychopathy and gray matter factors reflecting paralimbic and non-paralimbic regions. Results revealed good fit for a four-factor gray matter paralimbic model and these first-order factors were accounted for by a super-ordinate paralimbic ‘system’ factor. Moreover, a super-ordinate psychopathy factor significantly predicted the paralimbic, but not the non-paralimbic factor. The latent variable paralimbic model, specifically linked with psychopathy, goes beyond understanding of single brain regions within the system and provides evidence for psychopathy-related gray matter volume reductions in the paralimbic system as a whole. PMID:27269123
Analyzing Longitudinal Item Response Data via the Pairwise Fitting Method
ERIC Educational Resources Information Center
Fu, Zhi-Hui; Tao, Jian; Shi, Ning-Zhong; Zhang, Ming; Lin, Nan
2011-01-01
Multidimensional item response theory (MIRT) models can be applied to longitudinal educational surveys where a group of individuals are administered different tests over time with some common items. However, computational problems typically arise as the dimension of the latent variables increases. This is especially true when the latent variable…
Cognitive Activities During Adulthood Are More Important than Education in Building Reserve
Reed, Bruce R.; Dowling, Maritza; Farias, Sarah Tomaszewski; Sonnen, Joshua; Strauss, Milton; Schneider, Julie A.; Bennett, David A.; Mungas, Dan
2012-01-01
Cognitive reserve is thought to reflect life experiences. Which experiences contribute to reserve and their relative importance is not understood. Subjects were 652 autopsied cases from the Rush Memory and Aging Project and the Religious Orders Study. Reserve was defined as the residual variance of the regressions of cognitive factors on brain pathology and was captured in a latent variable that was regressed on potential determinants of reserve. Neuropathology variables included Alzheimer’s disease markers, Lewy bodies, infarcts, microinfarcts, and brain weight. Cognition was measured with six cognitive domain scores. Determinants of reserve were socioeconomic status (SES), education, leisure cognitive activities at age 40 (CA40) and at study enrollment (CAbaseline) in late life. The four exogenous predictors of reserve were weakly to moderately inter-correlated. In a multivariate model, all except SES had statistically significant effects on Reserve, the strongest of which were CA40 (β= .31) and CAbaseline (β= .28). The Education effect was negative in the full model (β= −.25). Results suggest that leisure cognitive activities throughout adulthood are more important than education in determining reserve. Discrepancies between cognitive activity and education may be informative in estimating late life reserve. PMID:23131600
Cognitive activities during adulthood are more important than education in building reserve.
Reed, Bruce R; Dowling, Maritza; Tomaszewski Farias, Sarah; Sonnen, Joshua; Strauss, Milton; Schneider, Julie A; Bennett, David A; Mungas, Dan
2011-07-01
Cognitive reserve is thought to reflect life experiences. Which experiences contribute to reserve and their relative importance is not understood. Subjects were 652 autopsied cases from the Rush Memory and Aging Project and the Religious Orders Study. Reserve was defined as the residual variance of the regressions of cognitive factors on brain pathology and was captured in a latent variable that was regressed on potential determinants of reserve. Neuropathology variables included Alzheimer's disease markers, Lewy bodies, infarcts, microinfarcts, and brain weight. Cognition was measured with six cognitive domain scores. Determinants of reserve were socioeconomic status (SES), education, leisure cognitive activities at age 40 (CA40) and at study enrollment (CAbaseline) in late life. The four exogenous predictors of reserve were weakly to moderately inter-correlated. In a multivariate model, all except SES had statistically significant effects on Reserve, the strongest of which were CA40 (β = .31) and CAbaseline (β = .28). The Education effect was negative in the full model (β = -.25). Results suggest that leisure cognitive activities throughout adulthood are more important than education in determining reserve. Discrepancies between cognitive activity and education may be informative in estimating late life reserve.
2011-01-01
Background Thousands of children experience cardiac arrest events every year in pediatric intensive care units. Most of these children die. Cardiac arrest prediction tools are used as part of medical emergency team evaluations to identify patients in standard hospital beds that are at high risk for cardiac arrest. There are no models to predict cardiac arrest in pediatric intensive care units though, where the risk of an arrest is 10 times higher than for standard hospital beds. Current tools are based on a multivariable approach that does not characterize deterioration, which often precedes cardiac arrests. Characterizing deterioration requires a time series approach. The purpose of this study is to propose a method that will allow for time series data to be used in clinical prediction models. Successful implementation of these methods has the potential to bring arrest prediction to the pediatric intensive care environment, possibly allowing for interventions that can save lives and prevent disabilities. Methods We reviewed prediction models from nonclinical domains that employ time series data, and identified the steps that are necessary for building predictive models using time series clinical data. We illustrate the method by applying it to the specific case of building a predictive model for cardiac arrest in a pediatric intensive care unit. Results Time course analysis studies from genomic analysis provided a modeling template that was compatible with the steps required to develop a model from clinical time series data. The steps include: 1) selecting candidate variables; 2) specifying measurement parameters; 3) defining data format; 4) defining time window duration and resolution; 5) calculating latent variables for candidate variables not directly measured; 6) calculating time series features as latent variables; 7) creating data subsets to measure model performance effects attributable to various classes of candidate variables; 8) reducing the number of candidate features; 9) training models for various data subsets; and 10) measuring model performance characteristics in unseen data to estimate their external validity. Conclusions We have proposed a ten step process that results in data sets that contain time series features and are suitable for predictive modeling by a number of methods. We illustrated the process through an example of cardiac arrest prediction in a pediatric intensive care setting. PMID:22023778
Kennedy, Curtis E; Turley, James P
2011-10-24
Thousands of children experience cardiac arrest events every year in pediatric intensive care units. Most of these children die. Cardiac arrest prediction tools are used as part of medical emergency team evaluations to identify patients in standard hospital beds that are at high risk for cardiac arrest. There are no models to predict cardiac arrest in pediatric intensive care units though, where the risk of an arrest is 10 times higher than for standard hospital beds. Current tools are based on a multivariable approach that does not characterize deterioration, which often precedes cardiac arrests. Characterizing deterioration requires a time series approach. The purpose of this study is to propose a method that will allow for time series data to be used in clinical prediction models. Successful implementation of these methods has the potential to bring arrest prediction to the pediatric intensive care environment, possibly allowing for interventions that can save lives and prevent disabilities. We reviewed prediction models from nonclinical domains that employ time series data, and identified the steps that are necessary for building predictive models using time series clinical data. We illustrate the method by applying it to the specific case of building a predictive model for cardiac arrest in a pediatric intensive care unit. Time course analysis studies from genomic analysis provided a modeling template that was compatible with the steps required to develop a model from clinical time series data. The steps include: 1) selecting candidate variables; 2) specifying measurement parameters; 3) defining data format; 4) defining time window duration and resolution; 5) calculating latent variables for candidate variables not directly measured; 6) calculating time series features as latent variables; 7) creating data subsets to measure model performance effects attributable to various classes of candidate variables; 8) reducing the number of candidate features; 9) training models for various data subsets; and 10) measuring model performance characteristics in unseen data to estimate their external validity. We have proposed a ten step process that results in data sets that contain time series features and are suitable for predictive modeling by a number of methods. We illustrated the process through an example of cardiac arrest prediction in a pediatric intensive care setting.
Repeatability and Reproducibility of Decisions by Latent Fingerprint Examiners
Ulery, Bradford T.; Hicklin, R. Austin; Buscaglia, JoAnn; Roberts, Maria Antonia
2012-01-01
The interpretation of forensic fingerprint evidence relies on the expertise of latent print examiners. We tested latent print examiners on the extent to which they reached consistent decisions. This study assessed intra-examiner repeatability by retesting 72 examiners on comparisons of latent and exemplar fingerprints, after an interval of approximately seven months; each examiner was reassigned 25 image pairs for comparison, out of total pool of 744 image pairs. We compare these repeatability results with reproducibility (inter-examiner) results derived from our previous study. Examiners repeated 89.1% of their individualization decisions, and 90.1% of their exclusion decisions; most of the changed decisions resulted in inconclusive decisions. Repeatability of comparison decisions (individualization, exclusion, inconclusive) was 90.0% for mated pairs, and 85.9% for nonmated pairs. Repeatability and reproducibility were notably lower for comparisons assessed by the examiners as “difficult” than for “easy” or “moderate” comparisons, indicating that examiners' assessments of difficulty may be useful for quality assurance. No false positive errors were repeated (n = 4); 30% of false negative errors were repeated. One percent of latent value decisions were completely reversed (no value even for exclusion vs. of value for individualization). Most of the inter- and intra-examiner variability concerned whether the examiners considered the information available to be sufficient to reach a conclusion; this variability was concentrated on specific image pairs such that repeatability and reproducibility were very high on some comparisons and very low on others. Much of the variability appears to be due to making categorical decisions in borderline cases. PMID:22427888
Toward a Model-Based Approach to the Clinical Assessment of Personality Psychopathology
Eaton, Nicholas R.; Krueger, Robert F.; Docherty, Anna R.; Sponheim, Scott R.
2015-01-01
Recent years have witnessed tremendous growth in the scope and sophistication of statistical methods available to explore the latent structure of psychopathology, involving continuous, discrete, and hybrid latent variables. The availability of such methods has fostered optimism that they can facilitate movement from classification primarily crafted through expert consensus to classification derived from empirically-based models of psychopathological variation. The explication of diagnostic constructs with empirically supported structures can then facilitate the development of assessment tools that appropriately characterize these constructs. Our goal in this paper is to illustrate how new statistical methods can inform conceptualization of personality psychopathology and therefore its assessment. We use magical thinking as example, because both theory and earlier empirical work suggested the possibility of discrete aspects to the latent structure of personality psychopathology, particularly forms of psychopathology involving distortions of reality testing, yet other data suggest that personality psychopathology is generally continuous in nature. We directly compared the fit of a variety of latent variable models to magical thinking data from a sample enriched with clinically significant variation in psychotic symptomatology for explanatory purposes. Findings generally suggested a continuous latent variable model best represented magical thinking, but results varied somewhat depending on different indices of model fit. We discuss the implications of the findings for classification and applied personality assessment. We also highlight some limitations of this type of approach that are illustrated by these data, including the importance of substantive interpretation, in addition to use of model fit indices, when evaluating competing structural models. PMID:24007309
The job content questionnaire in various occupational contexts: applying a latent class model
Santos, Kionna Oliveira Bernardes; de Araújo, Tânia Maria; Karasek, Robert
2017-01-01
Objective To evaluate Job Content Questionnaire(JCQ) performance using the latent class model. Methods We analysed cross-sectional studies conducted in Brazil and examined three occupational categories: petroleum industry workers (n=489), teachers (n=4392) and primary healthcare workers (3078)and 1552 urban workers from a representative sample of the city of Feira de Santana in Bahia, Brazil. An appropriate number of latent classes was extracted and described each occupational category using latent class analysis, a multivariate method that evaluates constructs and takes into account the latent characteristics underlying the structure of measurement scales. The conditional probabilities of workers belonging to each class were then analysed graphically. Results Initially, the latent class analysis extracted four classes corresponding to the four job types (active, passive, low strain and high strain) proposed by the Job-Strain model (JSM) and operationalised by the JCQ. However, after taking into consideration the adequacy criteria to evaluate the number of extracted classes, three classes (active, low strain and high strain) were extracted from the studies of urban workers and teachers and four classes (active, passive, low strain and high strain) from the study of primary healthcare and petroleum industry workers. Conclusion The four job types proposed by the JSM were identified among primary healthcare and petroleum industry workers—groups with relatively high levels of skill discretion and decision authority. Three job types were identified for teachers and urban workers; however, passive job situations were not found within these groups. The latent class analysis enabled us to describe the conditional standard responses of the job types proposed by the model, particularly in relation to active jobs and high and low strain situations. PMID:28515185
Viewpoints: Interactive Exploration of Large Multivariate Earth and Space Science Data Sets
NASA Astrophysics Data System (ADS)
Levit, C.; Gazis, P. R.
2006-05-01
Analysis and visualization of extremely large and complex data sets may be one of the most significant challenges facing earth and space science investigators in the forthcoming decades. While advances in hardware speed and storage technology have roughly kept up with (indeed, have driven) increases in database size, the same is not of our abilities to manage the complexity of these data. Current missions, instruments, and simulations produce so much data of such high dimensionality that they outstrip the capabilities of traditional visualization and analysis software. This problem can only be expected to get worse as data volumes increase by orders of magnitude in future missions and in ever-larger supercomputer simulations. For large multivariate data (more than 105 samples or records with more than 5 variables per sample) the interactive graphics response of most existing statistical analysis, machine learning, exploratory data analysis, and/or visualization tools such as Torch, MLC++, Matlab, S++/R, and IDL stutters, stalls, or stops working altogether. Fortunately, the graphics processing units (GPUs) built in to all professional desktop and laptop computers currently on the market are capable of transforming, filtering, and rendering hundreds of millions of points per second. We present a prototype open-source cross-platform application which leverages much of the power latent in the GPU to enable smooth interactive exploration and analysis of large high- dimensional data using a variety of classical and recent techniques. The targeted application is the interactive analysis of large, complex, multivariate data sets, with dimensionalities that may surpass 100 and sample sizes that may exceed 106-108.
Assessment of water quality parameters using multivariate analysis for Klang River basin, Malaysia.
Mohamed, Ibrahim; Othman, Faridah; Ibrahim, Adriana I N; Alaa-Eldin, M E; Yunus, Rossita M
2015-01-01
This case study uses several univariate and multivariate statistical techniques to evaluate and interpret a water quality data set obtained from the Klang River basin located within the state of Selangor and the Federal Territory of Kuala Lumpur, Malaysia. The river drains an area of 1,288 km(2), from the steep mountain rainforests of the main Central Range along Peninsular Malaysia to the river mouth in Port Klang, into the Straits of Malacca. Water quality was monitored at 20 stations, nine of which are situated along the main river and 11 along six tributaries. Data was collected from 1997 to 2007 for seven parameters used to evaluate the status of the water quality, namely dissolved oxygen, biochemical oxygen demand, chemical oxygen demand, suspended solids, ammoniacal nitrogen, pH, and temperature. The data were first investigated using descriptive statistical tools, followed by two practical multivariate analyses that reduced the data dimensions for better interpretation. The analyses employed were factor analysis and principal component analysis, which explain 60 and 81.6% of the total variation in the data, respectively. We found that the resulting latent variables from the factor analysis are interpretable and beneficial for describing the water quality in the Klang River. This study presents the usefulness of several statistical methods in evaluating and interpreting water quality data for the purpose of monitoring the effectiveness of water resource management. The results should provide more straightforward data interpretation as well as valuable insight for managers to conceive optimum action plans for controlling pollution in river water.
Lindberg, Ann-Sofie; Oksa, Juha; Antti, Henrik; Malm, Christer
2015-01-01
Physical capacity has previously been deemed important for firefighters physical work capacity, and aerobic fitness, muscular strength, and muscular endurance are the most frequently investigated parameters of importance. Traditionally, bivariate and multivariate linear regression statistics have been used to study relationships between physical capacities and work capacities among firefighters. An alternative way to handle datasets consisting of numerous correlated variables is to use multivariate projection analyses, such as Orthogonal Projection to Latent Structures. The first aim of the present study was to evaluate the prediction and predictive power of field and laboratory tests, respectively, on firefighters' physical work capacity on selected work tasks. Also, to study if valid predictions could be achieved without anthropometric data. The second aim was to externally validate selected models. The third aim was to validate selected models on firefighters' and on civilians'. A total of 38 (26 men and 12 women) + 90 (38 men and 52 women) subjects were included in the models and the external validation, respectively. The best prediction (R2) and predictive power (Q2) of Stairs, Pulling, Demolition, Terrain, and Rescue work capacities included field tests (R2 = 0.73 to 0.84, Q2 = 0.68 to 0.82). The best external validation was for Stairs work capacity (R2 = 0.80) and worst for Demolition work capacity (R2 = 0.40). In conclusion, field and laboratory tests could equally well predict physical work capacities for firefighting work tasks, and models excluding anthropometric data were valid. The predictive power was satisfactory for all included work tasks except Demolition.
Embodiment Feels Better: Girls' Body Objectification and Well-Being across Adolescence
ERIC Educational Resources Information Center
Impett, Emily A.; Henson, James M.; Breines, Juliana G.; Schooler, Deborah; Tolman, Deborah L.
2011-01-01
In a five-year longitudinal study, we investigated the role of body objectification in shaping girls' self-esteem and depressive symptoms over the course of adolescence. Multivariate Latent Growth Curve Modeling (MLGM) was used to test the association between body objectification and both self-esteem and depressive symptoms with data from 587…
ERIC Educational Resources Information Center
Guglielmi, R. Sergio
2012-01-01
The effectiveness of various strategies for educating the growing U.S. population of English language learners (ELLs) has attracted a great deal of controversy. Bilingual education theory posits that retention and continued development of native language (L1) skills facilitate academic achievement through two mediating mechanisms. First, L1…
ERIC Educational Resources Information Center
Kim, Sooyeon; Murry, Velma McBride; Brody, Gene H.
The functional relationships between developmental change in children's self-control and academic achievement were examined using longitudinal family data. Multivariate latent growth models (LGM) were specified to determine whether the rate of growth in academic achievement changes as a function of developmental change in self-control. Data came…
Hudson, Jennifer L; Kendall, Philip C; Chu, Brian C; Gosch, Elizabeth; Martin, Erin; Taylor, Alan; Knight, Ashleigh
2014-01-01
This study examined the relations between treatment process variables and child anxiety outcomes. Independent raters watched/listened to taped therapy sessions of 151 anxiety-disordered (6-14 yr-old; M = 10.71) children (43% boys) and assessed process variables (child alliance, therapist alliance, child involvement, therapist flexibility and therapist functionality) within a manual-based cognitive-behavioural treatment. Latent growth modelling examined three latent variables (intercept, slope, and quadratic) for each process variable. Child age, gender, family income and ethnicity were examined as potential antecedents. Outcome was analyzed using factorially derived clinician, mother, father, child and teacher scores from questionnaire and structured diagnostic interviews at pretreatment, posttreatment and 12-month follow-up. Latent growth models demonstrated a concave quadratic curve for child involvement and therapist flexibility over time. A predominantly linear, downward slope was observed for alliance, and functional flexibility remained consistent over time. Increased alliance, child involvement and therapist flexibility showed some albeit inconsistent, associations with positive treatment outcome. Findings support the notion that maintaining the initial high level of alliance or involvement is important for clinical improvement. There is some support that progressively increasing alliance/involvement also positively impacts on treatment outcome. These findings were not consistent across outcome measurement points or reporters. Copyright © 2013 Elsevier Ltd. All rights reserved.
Genetic and Environmental Influences of General Cognitive Ability: Is g a valid latent construct?
Panizzon, Matthew S.; Vuoksimaa, Eero; Spoon, Kelly M.; Jacobson, Kristen C.; Lyons, Michael J.; Franz, Carol E.; Xian, Hong; Vasilopoulos, Terrie; Kremen, William S.
2014-01-01
Despite an extensive literature, the “g” construct remains a point of debate. Different models explaining the observed relationships among cognitive tests make distinct assumptions about the role of g in relation to those tests and specific cognitive domains. Surprisingly, these different models and their corresponding assumptions are rarely tested against one another. In addition to the comparison of distinct models, a multivariate application of the twin design offers a unique opportunity to test whether there is support for g as a latent construct with its own genetic and environmental influences, or whether the relationships among cognitive tests are instead driven by independent genetic and environmental factors. Here we tested multiple distinct models of the relationships among cognitive tests utilizing data from the Vietnam Era Twin Study of Aging (VETSA), a study of middle-aged male twins. Results indicated that a hierarchical (higher-order) model with a latent g phenotype, as well as specific cognitive domains, was best supported by the data. The latent g factor was highly heritable (86%), and accounted for most, but not all, of the genetic effects in specific cognitive domains and elementary cognitive tests. By directly testing multiple competing models of the relationships among cognitive tests in a genetically-informative design, we are able to provide stronger support than in prior studies for g being a valid latent construct. PMID:24791031
Ordinal probability effect measures for group comparisons in multinomial cumulative link models.
Agresti, Alan; Kateri, Maria
2017-03-01
We consider simple ordinal model-based probability effect measures for comparing distributions of two groups, adjusted for explanatory variables. An "ordinal superiority" measure summarizes the probability that an observation from one distribution falls above an independent observation from the other distribution, adjusted for explanatory variables in a model. The measure applies directly to normal linear models and to a normal latent variable model for ordinal response variables. It equals Φ(β/2) for the corresponding ordinal model that applies a probit link function to cumulative multinomial probabilities, for standard normal cdf Φ and effect β that is the coefficient of the group indicator variable. For the more general latent variable model for ordinal responses that corresponds to a linear model with other possible error distributions and corresponding link functions for cumulative multinomial probabilities, the ordinal superiority measure equals exp(β)/[1+exp(β)] with the log-log link and equals approximately exp(β/2)/[1+exp(β/2)] with the logit link, where β is the group effect. Another ordinal superiority measure generalizes the difference of proportions from binary to ordinal responses. We also present related measures directly for ordinal models for the observed response that need not assume corresponding latent response models. We present confidence intervals for the measures and illustrate with an example. © 2016, The International Biometric Society.
2014-01-01
Background The Disaster Emergency Medical Personnel System (DEMPS) program provides a system of volunteers whereby active or retired Department of Veterans Affairs (VA) personnel can register to be deployed to support other VA facilities or the nation during national emergencies or disasters. Both early and ongoing volunteer training is required to participate. Methods This study aims to identify factors that impact willingness to deploy in the event of an emergency. This analysis was based on responses from 2,385 survey respondents (response rate, 29%). Latent variable path models were developed and tested using the EQS structural equations modeling program. Background demographic variables of education, age, minority ethnicity, and female gender were used as predictors of intervening latent variables of DEMPS Volunteer Experience, Positive Attitude about Training, and Stress. The model had acceptable fit statistics, and all three intermediate latent variables significantly predicted the outcome latent variable Readiness to Deploy. Results DEMPS Volunteer Experience and a Positive Attitude about Training were associated with Readiness to Deploy. Stress was associated with decreased Readiness to Deploy. Female gender was negatively correlated with Readiness to Deploy; however, there was an indirect relationship between female gender and Readiness to Deploy through Positive Attitude about Training. Conclusions These findings suggest that volunteer emergency management response programs such as DEMPS should consider how best to address the factors that may make women less ready to deploy than men in order to ensure adequate gender representation among emergency responders. The findings underscore the importance of training opportunities to ensure that gender-sensitive support is a strong component of emergency response, and may apply to other emergency response programs such as the Medical Reserve Corps and the American Red Cross. PMID:25038628
Zagelbaum, Nicole K; Heslin, Kevin C; Stein, Judith A; Ruzek, Josef; Smith, Robert E; Nyugen, Tam; Dobalian, Aram
2014-07-19
The Disaster Emergency Medical Personnel System (DEMPS) program provides a system of volunteers whereby active or retired Department of Veterans Affairs (VA) personnel can register to be deployed to support other VA facilities or the nation during national emergencies or disasters. Both early and ongoing volunteer training is required to participate. This study aims to identify factors that impact willingness to deploy in the event of an emergency. This analysis was based on responses from 2,385 survey respondents (response rate, 29%). Latent variable path models were developed and tested using the EQS structural equations modeling program. Background demographic variables of education, age, minority ethnicity, and female gender were used as predictors of intervening latent variables of DEMPS Volunteer Experience, Positive Attitude about Training, and Stress. The model had acceptable fit statistics, and all three intermediate latent variables significantly predicted the outcome latent variable Readiness to Deploy. DEMPS Volunteer Experience and a Positive Attitude about Training were associated with Readiness to Deploy. Stress was associated with decreased Readiness to Deploy. Female gender was negatively correlated with Readiness to Deploy; however, there was an indirect relationship between female gender and Readiness to Deploy through Positive Attitude about Training. These findings suggest that volunteer emergency management response programs such as DEMPS should consider how best to address the factors that may make women less ready to deploy than men in order to ensure adequate gender representation among emergency responders. The findings underscore the importance of training opportunities to ensure that gender-sensitive support is a strong component of emergency response, and may apply to other emergency response programs such as the Medical Reserve Corps and the American Red Cross.
Widaman, Keith F.; Grimm, Kevin J.; Early, Dawnté R.; Robins, Richard W.; Conger, Rand D.
2013-01-01
Difficulties arise in multiple-group evaluations of factorial invariance if particular manifest variables are missing completely in certain groups. Ad hoc analytic alternatives can be used in such situations (e.g., deleting manifest variables), but some common approaches, such as multiple imputation, are not viable. At least 3 solutions to this problem are viable: analyzing differing sets of variables across groups, using pattern mixture approaches, and a new method using random number generation. The latter solution, proposed in this article, is to generate pseudo-random normal deviates for all observations for manifest variables that are missing completely in a given sample and then to specify multiple-group models in a way that respects the random nature of these values. An empirical example is presented in detail comparing the 3 approaches. The proposed solution can enable quantitative comparisons at the latent variable level between groups using programs that require the same number of manifest variables in each group. PMID:24019738
van Wieringen, Wessel N; van de Wiel, Mark A
2011-05-01
Realizing that genes often operate together, studies into the molecular biology of cancer shift focus from individual genes to pathways. In order to understand the regulatory mechanisms of a pathway, one must study its genes at all molecular levels. To facilitate such study at the genomic level, we developed exploratory factor analysis for the characterization of the variability of a pathway's copy number data. A latent variable model that describes the call probability data of a pathway is introduced and fitted with an EM algorithm. In two breast cancer data sets, it is shown that the first two latent variables of GO nodes, which inherit a clear interpretation from the call probabilities, are often related to the proportion of aberrations and a contrast of the probabilities of a loss and of a gain. Linking the latent variables to the node's gene expression data suggests that they capture the "global" effect of genomic aberrations on these transcript levels. In all, the proposed method provides an possibly insightful characterization of pathway copy number data, which may be fruitfully exploited to study the interaction between the pathway's DNA copy number aberrations and data from other molecular levels like gene expression.
Senn, Theresa E.; Scott-Sheldon, Lori A. J.; Vanable, Peter A.; Carey, Michael P.
2011-01-01
Background The Information-Motivation-Behavioral Skills (IMB) model often guides sexual risk reduction programs even though no studies have examined covariation in the theory’s constructs in a dynamic fashion with longitudinal data. Purpose Using new developments in latent growth modeling, we explore how changes in information, motivation, and behavioral skills over 9 months relate to changes in condom use among STD clinic patients. Methods Participants (N = 1281, 50% female, 66% African American) completed measures of IMB constructs at three time points. We used parallel process latent growth modeling to examine associations among intercepts and slopes of IMB constructs. Results Initial levels of motivation, behavioral skills, and condom use were all positively associated, with behavioral skills partially mediating associations between motivation and condom use. Changes over time in behavioral skills positively related to changes in condom use. Conclusions Results support the key role of behavioral skills in sexual risk reduction, suggesting these skills should be targeted in HIV prevention interventions. PMID:21638196
Correcting Measurement Error in Latent Regression Covariates via the MC-SIMEX Method
ERIC Educational Resources Information Center
Rutkowski, Leslie; Zhou, Yan
2015-01-01
Given the importance of large-scale assessments to educational policy conversations, it is critical that subpopulation achievement is estimated reliably and with sufficient precision. Despite this importance, biased subpopulation estimates have been found to occur when variables in the conditioning model side of a latent regression model contain…
ERIC Educational Resources Information Center
Kaya, Yasemin; Leite, Walter L.
2017-01-01
Cognitive diagnosis models are diagnostic models used to classify respondents into homogenous groups based on multiple categorical latent variables representing the measured cognitive attributes. This study aims to present longitudinal models for cognitive diagnosis modeling, which can be applied to repeated measurements in order to monitor…
A Latent Variable Approach to Executive Control in Healthy Ageing
ERIC Educational Resources Information Center
Adrover-Roig, Daniel; Sese, Albert; Barcelo, Francisco; Palmer, Alfonso
2012-01-01
It is a well-established finding that the central executive is fractionated in at least three separable component processes: Updating, Shifting, and Inhibition of information (Miyake et al., 2000). However, the fractionation of the central executive among the elderly has been less well explored, and Miyake's et al. latent structure has not yet…
On the Relation between the Linear Factor Model and the Latent Profile Model
ERIC Educational Resources Information Center
Halpin, Peter F.; Dolan, Conor V.; Grasman, Raoul P. P. P.; De Boeck, Paul
2011-01-01
The relationship between linear factor models and latent profile models is addressed within the context of maximum likelihood estimation based on the joint distribution of the manifest variables. Although the two models are well known to imply equivalent covariance decompositions, in general they do not yield equivalent estimates of the…
ERIC Educational Resources Information Center
Fryer, Luke K.
2017-01-01
During the past decade, quantitative researchers have examined the first-year university experience from both variable-centred and person-centred perspectives. These studies have, however, generally been cross-sectional and therefore often failed to address how student learning changes during this transition. Furthermore, research has been…
A Comparison of Four Approaches to Account for Method Effects in Latent State-Trait Analyses
ERIC Educational Resources Information Center
Geiser, Christian; Lockhart, Ginger
2012-01-01
Latent state-trait (LST) analysis is frequently applied in psychological research to determine the degree to which observed scores reflect stable person-specific effects, effects of situations and/or person-situation interactions, and random measurement error. Most LST applications use multiple repeatedly measured observed variables as indicators…
A comparison of latent class, K-means, and K-median methods for clustering dichotomous data.
Brusco, Michael J; Shireman, Emilie; Steinley, Douglas
2017-09-01
The problem of partitioning a collection of objects based on their measurements on a set of dichotomous variables is a well-established problem in psychological research, with applications including clinical diagnosis, educational testing, cognitive categorization, and choice analysis. Latent class analysis and K-means clustering are popular methods for partitioning objects based on dichotomous measures in the psychological literature. The K-median clustering method has recently been touted as a potentially useful tool for psychological data and might be preferable to its close neighbor, K-means, when the variable measures are dichotomous. We conducted simulation-based comparisons of the latent class, K-means, and K-median approaches for partitioning dichotomous data. Although all 3 methods proved capable of recovering cluster structure, K-median clustering yielded the best average performance, followed closely by latent class analysis. We also report results for the 3 methods within the context of an application to transitive reasoning data, in which it was found that the 3 approaches can exhibit profound differences when applied to real data. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Unsworth, Nash
2009-09-01
A latent variable analysis was conducted to examine the nature of individual differences in the dynamics of free recall and cognitive abilities. Participants performed multiple measures of free recall, working memory capacity (WMC), and fluid intelligence (gF). For each free recall task, recall accuracy, recall latency, and number of intrusion errors were determined, and latent factors were derived for each. It was found that recall accuracy was negatively related to both recall latency and number of intrusions, and recall latency and number of intrusions were positively related. Furthermore, latent WMC and gF factors were positively related to recall accuracy, but negatively related to recall latency and number of intrusions. Finally, a cluster analysis revealed that subgroups of participants with deficits in focusing the search had deficits in recovering degraded representations or deficits in monitoring the products of retrieval. The results are consistent with the idea that variation in the dynamics of free recall, WMC, and gF are primarily due to differences in search set size, but differences in recovery and monitoring are also important.
From loss to loneliness: The relationship between bereavement and depressive symptoms.
Fried, Eiko I; Bockting, Claudi; Arjadi, Retha; Borsboom, Denny; Amshoff, Maximilian; Cramer, Angélique O J; Epskamp, Sacha; Tuerlinckx, Francis; Carr, Deborah; Stroebe, Margaret
2015-05-01
Spousal bereavement can cause a rise in depressive symptoms. This study empirically evaluates 2 competing explanations concerning how this causal effect is brought about: (a) a traditional latent variable explanation, in which loss triggers depression which then leads to symptoms; and (b) a novel network explanation, in which bereavement directly affects particular depression symptoms which then activate other symptoms. We used data from the Changing Lives of Older Couples (CLOC) study and compared depressive symptomatology, assessed via the 11-item Center for Epidemiologic Studies Depression Scale (CES-D), among those who lost their partner (N = 241) with a still-married control group (N = 274). We modeled the effect of partner loss on depressive symptoms either as an indirect effect through a latent variable, or as a direct effect in a network constructed through a causal search algorithm. Compared to the control group, widow(er)s' scores were significantly higher for symptoms of loneliness, sadness, depressed mood, and appetite loss, and significantly lower for happiness and enjoyed life. The effect of partner loss on these symptoms was not mediated by a latent variable. The network model indicated that bereavement mainly affected loneliness, which in turn activated other depressive symptoms. The direct effects of spousal loss on particular symptoms are inconsistent with the predictions of latent variable models, but can be explained from a network perspective. The findings support a growing body of literature showing that specific adverse life events differentially affect depressive symptomatology, and suggest that future studies should examine interventions that directly target such symptoms. (c) 2015 APA, all rights reserved).
Chung, Ill-Min; Kim, Jae-Kwang; Lee, Kyoung-Jin; Park, Sung-Kyu; Lee, Ji-Hee; Son, Na-Young; Jin, Yong-Ik; Kim, Seung-Hyun
2018-02-01
Rice (Oryza sativa L.) is the world's third largest food crop after wheat and corn. Geographic authentication of rice has recently emerged asan important issue for enhancing human health via food safety and quality assurance. Here, we aimed to discriminate rice of six Asian countries through geographic authentication using combinations of elemental/isotopic composition analysis and chemometric techniques. Principal components analysis could distinguish samples cultivated from most countries, except for those cultivated in the Philippines and Japan. Furthermore, orthogonal projection to latent structure-discriminant analysis provided clear discrimination between rice cultivated in Korea and other countries. The major common variables responsible for differentiation in these models were δ 34 S, Mn, and Mg. Our findings contribute to understanding the variations of elemental and isotopic compositions in rice depending on geographic origins, and offer valuable insight into the control of fraudulent labeling regarding the geographic origins of rice traded among Asian countries. Copyright © 2017 Elsevier Ltd. All rights reserved.
Rosenström, Tom; Ystrom, Eivind; Torvik, Fartein Ask; Czajkowski, Nikolai Olavi; Gillespie, Nathan A.; Aggen, Steven H.; Krueger, Robert F.; Kendler, Kenneth S; Reichborn-Kjennerud, Ted
2017-01-01
Results from previous studies on DSM-IV and DSM-5 Antisocial Personality Disorder (ASPD) have suggested that the construct is etiologically multidimensional. To our knowledge, however, the structure of genetic and environmental influences in ASPD has not been examined using an appropriate range of biometric models and diagnostic interviews. The 7 ASPD criteria (section A) were assessed in a population-based sample of 2794 Norwegian twins by a structured interview for DSM-IV personality disorders. Exploratory analyses were conducted at the phenotypic level. Multivariate biometric models, including both independent and common pathways, were compared. A single phenotypic factor was found, and the best-fitting biometric model was a single-factor common pathway model, with common-factor heritability of 51% (95% CI = 40–67%). In other words, both genetic and environmental correlations between the ASPD criteria could be accounted for by a single common latent variable. The findings support the validity of ASPD as a unidimensional diagnostic construct. PMID:28108863
Rosenström, Tom; Ystrom, Eivind; Torvik, Fartein Ask; Czajkowski, Nikolai Olavi; Gillespie, Nathan A; Aggen, Steven H; Krueger, Robert F; Kendler, Kenneth S; Reichborn-Kjennerud, Ted
2017-05-01
Results from previous studies on DSM-IV and DSM-5 Antisocial Personality Disorder (ASPD) have suggested that the construct is etiologically multidimensional. To our knowledge, however, the structure of genetic and environmental influences in ASPD has not been examined using an appropriate range of biometric models and diagnostic interviews. The 7 ASPD criteria (section A) were assessed in a population-based sample of 2794 Norwegian twins by a structured interview for DSM-IV personality disorders. Exploratory analyses were conducted at the phenotypic level. Multivariate biometric models, including both independent and common pathways, were compared. A single phenotypic factor was found, and the best-fitting biometric model was a single-factor common pathway model, with common-factor heritability of 51% (95% CI 40-67%). In other words, both genetic and environmental correlations between the ASPD criteria could be accounted for by a single common latent variable. The findings support the validity of ASPD as a unidimensional diagnostic construct.
Moayyeri, Alireza; Hart, Deborah J; Snieder, Harold; Hammond, Christopher J; Spector, Timothy D; Steves, Claire J
2016-02-01
Little is known about the extent to which aging trajectories of different body systems share common sources of variance. We here present a large twin study investigating the trajectories of change in five systems: cardiovascular, respiratory, skeletal, morphometric, and metabolic. Longitudinal clinical data were collected on 3,508 female twins in the TwinsUK registry (complete pairs:740 monozygotic (MZ), 986 dizygotic (DZ), mean age at entry 48.9 ± 10.4, range 18-75 years; mean follow-up 10.2 ± 2.8 years, range 4-17.8 years). Panel data on multiple age-related variables were used to estimate biological ages for each individual at each time point, in linear mixed effects models. A weighted average approach was used to combine variables within predefined body system groups. Aging trajectories for each system in each individual were then constructed using linear modeling. Multivariate structural equation modeling of these aging trajectories showed low genetic effects (heritability), ranging from 2% in metabolic aging to 22% in cardiovascular aging. However, we found a significant effect of shared environmental factors on the variations in aging trajectories in cardiovascular (54%), skeletal (34%), morphometric (53%), and metabolic systems (53%). The remainder was due to environmental factors unique to each individual plus error. Multivariate Cholesky decomposition showed that among aging trajectories for various body systems there were significant and substantial correlations between the unique environmental latent factors as well as shared environmental factors. However, there was no evidence for a single common factor for aging. This study, the first of its kind in aging, suggests that diverse organ systems share non-genetic sources of variance for aging trajectories. Confirmatory studies are needed using population-based twin cohorts and alternative methods of handling missing data.
Sáez, Carlos; Robles, Montserrat; García-Gómez, Juan M
2017-02-01
Biomedical data may be composed of individuals generated from distinct, meaningful sources. Due to possible contextual biases in the processes that generate data, there may exist an undesirable and unexpected variability among the probability distribution functions (PDFs) of the source subsamples, which, when uncontrolled, may lead to inaccurate or unreproducible research results. Classical statistical methods may have difficulties to undercover such variabilities when dealing with multi-modal, multi-type, multi-variate data. This work proposes two metrics for the analysis of stability among multiple data sources, robust to the aforementioned conditions, and defined in the context of data quality assessment. Specifically, a global probabilistic deviation and a source probabilistic outlyingness metrics are proposed. The first provides a bounded degree of the global multi-source variability, designed as an estimator equivalent to the notion of normalized standard deviation of PDFs. The second provides a bounded degree of the dissimilarity of each source to a latent central distribution. The metrics are based on the projection of a simplex geometrical structure constructed from the Jensen-Shannon distances among the sources PDFs. The metrics have been evaluated and demonstrated their correct behaviour on a simulated benchmark and with real multi-source biomedical data using the UCI Heart Disease data set. The biomedical data quality assessment based on the proposed stability metrics may improve the efficiency and effectiveness of biomedical data exploitation and research.
Sun, Fei; Xu, Bing; Zhang, Yi; Dai, Shengyun; Yang, Chan; Cui, Xianglong; Shi, Xinyuan; Qiao, Yanjiang
2016-01-01
The quality of Chinese herbal medicine tablets suffers from batch-to-batch variability due to a lack of manufacturing process understanding. In this paper, the Panax notoginseng saponins (PNS) immediate release tablet was taken as the research subject. By defining the dissolution of five active pharmaceutical ingredients and the tablet tensile strength as critical quality attributes (CQAs), influences of both the manipulated process parameters introduced by an orthogonal experiment design and the intermediate granules' properties on the CQAs were fully investigated by different chemometric methods, such as the partial least squares, the orthogonal projection to latent structures, and the multiblock partial least squares (MBPLS). By analyzing the loadings plots and variable importance in the projection indexes, the granule particle sizes and the minimal punch tip separation distance in tableting were identified as critical process parameters. Additionally, the MBPLS model suggested that the lubrication time in the final blending was also important in predicting tablet quality attributes. From the calculated block importance in the projection indexes, the tableting unit was confirmed to be the critical process unit of the manufacturing line. The results demonstrated that the combinatorial use of different multivariate modeling methods could help in understanding the complex process relationships as a whole. The output of this study can then be used to define a control strategy to improve the quality of the PNS immediate release tablet.
Multivariate analysis in thoracic research.
Mengual-Macenlle, Noemí; Marcos, Pedro J; Golpe, Rafael; González-Rivas, Diego
2015-03-01
Multivariate analysis is based in observation and analysis of more than one statistical outcome variable at a time. In design and analysis, the technique is used to perform trade studies across multiple dimensions while taking into account the effects of all variables on the responses of interest. The development of multivariate methods emerged to analyze large databases and increasingly complex data. Since the best way to represent the knowledge of reality is the modeling, we should use multivariate statistical methods. Multivariate methods are designed to simultaneously analyze data sets, i.e., the analysis of different variables for each person or object studied. Keep in mind at all times that all variables must be treated accurately reflect the reality of the problem addressed. There are different types of multivariate analysis and each one should be employed according to the type of variables to analyze: dependent, interdependence and structural methods. In conclusion, multivariate methods are ideal for the analysis of large data sets and to find the cause and effect relationships between variables; there is a wide range of analysis types that we can use.
A longitudinal study of mortality and air pollution for São Paulo, Brazil.
Botter, Denise A; Jørgensen, Bent; Peres, Antonieta A Q
2002-09-01
We study the effects of various air-pollution variables on the daily death counts for people over 65 years in São Paulo, Brazil, from 1991 to 1993, controlling for meteorological variables. We use a state space model where the air-pollution variables enter via the latent process, and the meteorological variables via the observation equation. The latent process represents the potential mortality due to air pollution, and is estimated by Kalman filter techniques. The effect of air pollution on mortality is found to be a function of the variation in the sulphur dioxide level for the previous 3 days, whereas the other air-pollution variables (total suspended particulates, nitrogen dioxide, carbon monoxide, ozone) are not significant when sulphur dioxide is in the equation. There are significant effects of humidity and up to lag 3 of temperature, and a significant seasonal variation.
Bentein, Kathleen; Vandenberghe, Christian; Vandenberg, Robert; Stinglhamber, Florence
2005-05-01
Through the use of affective, normative, and continuance commitment in a multivariate 2nd-order factor latent growth modeling approach, the authors observed linear negative trajectories that characterized the changes in individuals across time in both affective and normative commitment. In turn, an individual's intention to quit the organization was characterized by a positive trajectory. A significant association was also found between the change trajectories such that the steeper the decline in an individual's affective and normative commitments across time, the greater the rate of increase in that individual's intention to quit, and, further, the greater the likelihood that the person actually left the organization over the next 9 months. Findings regarding continuance commitment and its components were mixed.
Neufeld, Sharon; Jones, Peter B.; Fonagy, Peter; Bullmore, Edward T.; Dolan, Raymond J.; Moutoussis, Michael; Toseeb, Umar; Goodyer, Ian M.
2017-01-01
Little is known about the underlying relationships between self-reported mental health items measuring both positive and negative emotional and behavioural symptoms at the population level in young people. Improved measurement of the full range of mental well-being and mental illness may aid in understanding the aetiological substrates underlying the development of both mental wellness as well as specific psychiatric diagnoses. A general population sample aged 14 to 24 years completed self-report questionnaires on anxiety, depression, psychotic-like symptoms, obsessionality and well-being. Exploratory and confirmatory factor models for categorical data and latent profile analyses were used to evaluate the structure of both mental wellness and illness items. First order, second order and bifactor structures were evaluated on 118 self-reported items obtained from 2228 participants. A bifactor solution was the best fitting latent variable model with one general latent factor termed ‘distress’ and five ‘distress independent’ specific factors defined as self-confidence, antisocial behaviour, worry, aberrant thinking, and mood. Next, six distinct subgroups were derived from a person-centred latent profile analysis of the factor scores. Finally, concurrent validity was assessed using information on hazardous behaviours (alcohol use, substance misuse, self-harm) and treatment for mental ill health: both discriminated between the latent traits and latent profile subgroups. The findings suggest a complex, multidimensional mental health structure in the youth population rather than the previously assumed first or second order factor structure. Additionally, the analysis revealed a low hazardous behaviour/low mental illness risk subgroup not previously described. Population sub-groups show greater validity over single variable factors in revealing mental illness risks. In conclusion, our findings indicate that the structure of self reported mental health is multidimensional in nature and uniquely finds improved prediction to mental illness risk within person-centred subgroups derived from the multidimensional latent traits. PMID:28403164
St Clair, Michelle C; Neufeld, Sharon; Jones, Peter B; Fonagy, Peter; Bullmore, Edward T; Dolan, Raymond J; Moutoussis, Michael; Toseeb, Umar; Goodyer, Ian M
2017-01-01
Little is known about the underlying relationships between self-reported mental health items measuring both positive and negative emotional and behavioural symptoms at the population level in young people. Improved measurement of the full range of mental well-being and mental illness may aid in understanding the aetiological substrates underlying the development of both mental wellness as well as specific psychiatric diagnoses. A general population sample aged 14 to 24 years completed self-report questionnaires on anxiety, depression, psychotic-like symptoms, obsessionality and well-being. Exploratory and confirmatory factor models for categorical data and latent profile analyses were used to evaluate the structure of both mental wellness and illness items. First order, second order and bifactor structures were evaluated on 118 self-reported items obtained from 2228 participants. A bifactor solution was the best fitting latent variable model with one general latent factor termed 'distress' and five 'distress independent' specific factors defined as self-confidence, antisocial behaviour, worry, aberrant thinking, and mood. Next, six distinct subgroups were derived from a person-centred latent profile analysis of the factor scores. Finally, concurrent validity was assessed using information on hazardous behaviours (alcohol use, substance misuse, self-harm) and treatment for mental ill health: both discriminated between the latent traits and latent profile subgroups. The findings suggest a complex, multidimensional mental health structure in the youth population rather than the previously assumed first or second order factor structure. Additionally, the analysis revealed a low hazardous behaviour/low mental illness risk subgroup not previously described. Population sub-groups show greater validity over single variable factors in revealing mental illness risks. In conclusion, our findings indicate that the structure of self reported mental health is multidimensional in nature and uniquely finds improved prediction to mental illness risk within person-centred subgroups derived from the multidimensional latent traits.
The Effects of Model Misspecification and Sample Size on LISREL Maximum Likelihood Estimates.
ERIC Educational Resources Information Center
Baldwin, Beatrice
The robustness of LISREL computer program maximum likelihood estimates under specific conditions of model misspecification and sample size was examined. The population model used in this study contains one exogenous variable; three endogenous variables; and eight indicator variables, two for each latent variable. Conditions of model…
A Multilevel Model for Comorbid Outcomes: Obesity and Diabetes in the US
Congdon, Peter
2010-01-01
Multilevel models are overwhelmingly applied to single health outcomes, but when two or more health conditions are closely related, it is important that contextual variation in their joint prevalence (e.g., variations over different geographic settings) is considered. A multinomial multilevel logit regression approach for analysing joint prevalence is proposed here that includes subject level risk factors (e.g., age, race, education) while also taking account of geographic context. Data from a US population health survey (the 2007 Behavioral Risk Factor Surveillance System or BRFSS) are used to illustrate the method, with a six category multinomial outcome defined by diabetic status and weight category (obese, overweight, normal). The influence of geographic context is partly represented by known geographic variables (e.g., county poverty), and partly by a model for latent area influences. In particular, a shared latent variable (common factor) approach is proposed to measure the impact of unobserved area influences on joint weight and diabetes status, with the latent variable being spatially structured to reflect geographic clustering in risk. PMID:20616977
ERIC Educational Resources Information Center
Mimeau, Catherine; Dionne, Ginette; Feng, Bei; Brendgen, Mara; Vitaro, Frank; Tremblay, Richard E.; Boivin, Michel
2018-01-01
This twin study examined the genetic and environmental etiology of vocabulary, syntax, and their association in first graders. French-speaking same-sex twins (n = 555) completed two vocabulary tests, and two scores of syntax were calculated from their spontaneous speech at 7 years of age. Multivariate latent factor genetic analyses showed that…
ERIC Educational Resources Information Center
Leite, Walter L.; Zuo, Youzhen
2011-01-01
Among the many methods currently available for estimating latent variable interactions, the unconstrained approach is attractive to applied researchers because of its relatively easy implementation with any structural equation modeling (SEM) software. Using a Monte Carlo simulation study, we extended and evaluated the unconstrained approach to…
ERIC Educational Resources Information Center
Henry, Kimberly L.; Muthen, Bengt
2010-01-01
Latent class analysis (LCA) is a statistical method used to identify subtypes of related cases using a set of categorical or continuous observed variables. Traditional LCA assumes that observations are independent. However, multilevel data structures are common in social and behavioral research and alternative strategies are needed. In this…
Using the Graded Response Model to Control Spurious Interactions in Moderated Multiple Regression
ERIC Educational Resources Information Center
Morse, Brendan J.; Johanson, George A.; Griffeth, Rodger W.
2012-01-01
Recent simulation research has demonstrated that using simple raw score to operationalize a latent construct can result in inflated Type I error rates for the interaction term of a moderated statistical model when the interaction (or lack thereof) is proposed at the latent variable level. Rescaling the scores using an appropriate item response…
ERIC Educational Resources Information Center
Li, Tiandong
2012-01-01
In large-scale assessments, such as the National Assessment of Educational Progress (NAEP), plausible values based on Multiple Imputations (MI) have been used to estimate population characteristics for latent constructs under complex sample designs. Mislevy (1991) derived a closed-form analytic solution for a fixed-effect model in creating…
Fischer, H Felix; Rose, Matthias
2016-10-19
Recently, a growing number of Item-Response Theory (IRT) models has been published, which allow estimation of a common latent variable from data derived by different Patient Reported Outcomes (PROs). When using data from different PROs, direct estimation of the latent variable has some advantages over the use of sum score conversion tables. It requires substantial proficiency in the field of psychometrics to fit such models using contemporary IRT software. We developed a web application ( http://www.common-metrics.org ), which allows estimation of latent variable scores more easily using IRT models calibrating different measures on instrument independent scales. Currently, the application allows estimation using six different IRT models for Depression, Anxiety, and Physical Function. Based on published item parameters, users of the application can directly estimate latent trait estimates using expected a posteriori (EAP) for sum scores as well as for specific response patterns, Bayes modal (MAP), Weighted likelihood estimation (WLE) and Maximum likelihood (ML) methods and under three different prior distributions. The obtained estimates can be downloaded and analyzed using standard statistical software. This application enhances the usability of IRT modeling for researchers by allowing comparison of the latent trait estimates over different PROs, such as the Patient Health Questionnaire Depression (PHQ-9) and Anxiety (GAD-7) scales, the Center of Epidemiologic Studies Depression Scale (CES-D), the Beck Depression Inventory (BDI), PROMIS Anxiety and Depression Short Forms and others. Advantages of this approach include comparability of data derived with different measures and tolerance against missing values. The validity of the underlying models needs to be investigated in the future.
Incorporating imperfect detection into joint models of communites: A response to Warton et al.
Beissinger, Steven R.; Iknayan, Kelly J.; Guillera-Arroita, Gurutzeta; Zipkin, Elise; Dorazio, Robert; Royle, Andy; Kery, Marc
2016-01-01
Warton et al. [1] advance community ecology by describing a statistical framework that can jointly model abundances (or distributions) across many taxa to quantify how community properties respond to environmental variables. This framework specifies the effects of both measured and unmeasured (latent) variables on the abundance (or occurrence) of each species. Latent variables are random effects that capture the effects of both missing environmental predictors and correlations in parameter values among different species. As presented in Warton et al., however, the joint modeling framework fails to account for the common problem of detection or measurement errors that always accompany field sampling of abundance or occupancy, and are well known to obscure species- and community-level inferences.
What is significant about a single nursing session? An exploratory study.
Miller, Elizabeth M
2017-09-10
Researchers and clinicians specializing in breastfeeding often rely on measuring one nursing session to characterize the breastfeeding relationship. However, less is known about the descriptive or statistically predictive characteristics of one nursing session. The purposes of this study are twofold: (1) to explore the relationships between variables in a single nursing session; and (2) to study the association between variables in a single nursing session and infant length-for-age (LAZ) and weight-for-age (WAZ). In 63 nursing mother-infant pairs in the United States, anthropometric measurement and observation of a single nursing session revealed six nursing session variables: fore milk fat percent, hind milk fat percent, infant milk intake, duration of session, time since last session, and time of day of session. A principle factor analysis, undertaken to explore latent variables underlying the six session variables, revealed two factors: (1) loaded highly on fore and hind milk fat percentage, reflecting the overall fat percent in a feed; and (2) loaded highly on milk intake and hind milk fat percentage, indicating the process of breast emptying. In multivariate analyses of all session variables on infant LAZ and WAZ, only hind milk fat percentage was significantly negatively associated with LAZ (β = -0.14, P = .01 (two-tailed), R 2 = 0.070), confirmed by a significant negative association between LAZ and factor one (β = -0.32, P = .05 (two-tailed), R 2 = 0.090). This research describes the dynamics of a single nursing session, and has the potential to help explain variation in infant growth and nutrition. © 2017 Wiley Periodicals, Inc.
Geiser, Christian; Bishop, Jacob; Lockhart, Ginger; Shiffman, Saul; Grenard, Jerry L.
2013-01-01
Latent state-trait (LST) and latent growth curve (LGC) models are frequently used in the analysis of longitudinal data. Although it is well-known that standard single-indicator LGC models can be analyzed within either the structural equation modeling (SEM) or multilevel (ML; hierarchical linear modeling) frameworks, few researchers realize that LST and multivariate LGC models, which use multiple indicators at each time point, can also be specified as ML models. In the present paper, we demonstrate that using the ML-SEM rather than the SL-SEM framework to estimate the parameters of these models can be practical when the study involves (1) a large number of time points, (2) individually-varying times of observation, (3) unequally spaced time intervals, and/or (4) incomplete data. Despite the practical advantages of the ML-SEM approach under these circumstances, there are also some limitations that researchers should consider. We present an application to an ecological momentary assessment study (N = 158 youths with an average of 23.49 observations of positive mood per person) using the software Mplus (Muthén and Muthén, 1998–2012) and discuss advantages and disadvantages of using the ML-SEM approach to estimate the parameters of LST and multiple-indicator LGC models. PMID:24416023
Using structural equation modeling for network meta-analysis.
Tu, Yu-Kang; Wu, Yun-Chun
2017-07-14
Network meta-analysis overcomes the limitations of traditional pair-wise meta-analysis by incorporating all available evidence into a general statistical framework for simultaneous comparisons of several treatments. Currently, network meta-analyses are undertaken either within the Bayesian hierarchical linear models or frequentist generalized linear mixed models. Structural equation modeling (SEM) is a statistical method originally developed for modeling causal relations among observed and latent variables. As random effect is explicitly modeled as a latent variable in SEM, it is very flexible for analysts to specify complex random effect structure and to make linear and nonlinear constraints on parameters. The aim of this article is to show how to undertake a network meta-analysis within the statistical framework of SEM. We used an example dataset to demonstrate the standard fixed and random effect network meta-analysis models can be easily implemented in SEM. It contains results of 26 studies that directly compared three treatment groups A, B and C for prevention of first bleeding in patients with liver cirrhosis. We also showed that a new approach to network meta-analysis based on the technique of unrestricted weighted least squares (UWLS) method can also be undertaken using SEM. For both the fixed and random effect network meta-analysis, SEM yielded similar coefficients and confidence intervals to those reported in the previous literature. The point estimates of two UWLS models were identical to those in the fixed effect model but the confidence intervals were greater. This is consistent with results from the traditional pairwise meta-analyses. Comparing to UWLS model with common variance adjusted factor, UWLS model with unique variance adjusted factor has greater confidence intervals when the heterogeneity was larger in the pairwise comparison. The UWLS model with unique variance adjusted factor reflects the difference in heterogeneity within each comparison. SEM provides a very flexible framework for univariate and multivariate meta-analysis, and its potential as a powerful tool for advanced meta-analysis is still to be explored.
Multivariate Analysis of Ladle Vibration
NASA Astrophysics Data System (ADS)
Yenus, Jaefer; Brooks, Geoffrey; Dunn, Michelle
2016-08-01
The homogeneity of composition and uniformity of temperature of the steel melt before it is transferred to the tundish are crucial in making high-quality steel product. The homogenization process is performed by stirring the melt using inert gas in ladles. Continuous monitoring of this process is important to make sure the action of stirring is constant throughout the ladle. Currently, the stirring process is monitored by process operators who largely rely on visual and acoustic phenomena from the ladle. However, due to lack of measurable signals, the accuracy and suitability of this manual monitoring are problematic. The actual flow of argon gas to the ladle may not be same as the flow gage reading due to leakage along the gas line components. As a result, the actual degree of stirring may not be correctly known. Various researchers have used one-dimensional vibration, and sound and image signals measured from the ladle to predict the degree of stirring inside. They developed online sensors which are indeed to monitor the online stirring phenomena. In this investigation, triaxial vibration signals have been measured from a cold water model which is a model of an industrial ladle. Three flow rate ranges and varying bath heights were used to collect vibration signals. The Fast Fourier Transform was applied to the dataset before it has been analyzed using principal component analysis (PCA) and partial least squares (PLS). PCA was used to unveil the structure in the experimental data. PLS was mainly applied to predict the stirring from the vibration response. It was found that for each flow rate range considered in this study, the informative signals reside in different frequency ranges. The first latent variables in these frequency ranges explain more than 95 pct of the variation in the stirring process for the entire single layer and the double layer data collected from the cold model. PLS analysis in these identified frequency ranges demonstrated that the latent variables of the response and predictor variables are highly correlated. The predicted variable has shown linear relationship with the stirring energy and bath recirculation speed. This outcome can improve the predictability of the mixing status in ladle metallurgy and make the online control of the process easier. Industrial testing of this input will follow.
ERIC Educational Resources Information Center
Fagginger Auer, Marije F.; Hickendorff, Marian; Van Putten, Cornelis M.; Béguin, Anton A.; Heiser, Willem J.
2016-01-01
A first application of multilevel latent class analysis (MLCA) to educational large-scale assessment data is demonstrated. This statistical technique addresses several of the challenges that assessment data offers. Importantly, MLCA allows modeling of the often ignored teacher effects and of the joint influence of teacher and student variables.…
Consequences of Ignoring Guessing when Estimating the Latent Density in Item Response Theory
ERIC Educational Resources Information Center
Woods, Carol M.
2008-01-01
In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters. In extant Monte Carlo evaluations of RC-IRT, the item response function (IRF) used to fit the data is the same one used to generate the data. The present simulation study examines RC-IRT when the IRF is imperfectly…
Students' Views on Mathematics in Single-Sex and Coed Classrooms in Ghana
ERIC Educational Resources Information Center
Bofah, Emmanuel Adu-tutu; Hannula, Markku S.
2016-01-01
In this study, we investigated students' views on themselves as learners of mathematics as a function of school-by-sex (N = 2034, MAge = 18.49, SDAge = 1.25; 12th-grade; 58.2% girls). Using latent variable Structural Equation Modeling (SEM), the measurement and structural equivalence as well as the equality of latent means of scores across…
The Information a Test Provides on an Ability Parameter. Research Report. ETS RR-07-18
ERIC Educational Resources Information Center
Haberman, Shelby J.
2007-01-01
In item-response theory, if a latent-structure model has an ability variable, then elementary information theory may be employed to provide a criterion for evaluation of the information the test provides concerning ability. This criterion may be considered even in cases in which the latent-structure model is not valid, although interpretation of…
An introduction to mixture item response theory models.
De Ayala, R J; Santiago, S Y
2017-02-01
Mixture item response theory (IRT) allows one to address situations that involve a mixture of latent subpopulations that are qualitatively different but within which a measurement model based on a continuous latent variable holds. In this modeling framework, one can characterize students by both their location on a continuous latent variable as well as by their latent class membership. For example, in a study of risky youth behavior this approach would make it possible to estimate an individual's propensity to engage in risky youth behavior (i.e., on a continuous scale) and to use these estimates to identify youth who might be at the greatest risk given their class membership. Mixture IRT can be used with binary response data (e.g., true/false, agree/disagree, endorsement/not endorsement, correct/incorrect, presence/absence of a behavior), Likert response scales, partial correct scoring, nominal scales, or rating scales. In the following, we present mixture IRT modeling and two examples of its use. Data needed to reproduce analyses in this article are available as supplemental online materials at http://dx.doi.org/10.1016/j.jsp.2016.01.002. Copyright © 2016 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.
Latent Heating Structures Derived from TRMM
NASA Technical Reports Server (NTRS)
Tao, W.-K.; Smith, E. A.; Adler, R.; Hou, A.; Kakar, R.; Krishnamurti, T.; Kummerow, C.; Lang, S.; Olson, W.; Satoh, S.
2004-01-01
Rainfall is the fundamental variable within the Earth's hydrological cycle because it is both the main forcing term leading to variations in continental and oceanic surface water budgets. The vertical distribution of latent heat release, which is accompanied with rain, modulates large-scale meridional and zonal circulations within the tropics as well as modifying the energetic efficiency of mid-latitude weather systems. Latent heat release itself is a consequence of phase changes between the vapor, liquid, and frozen states of water.This paper focuses on the retrieval of latent heat release from satellite measurements generated by the Tropical Rainfall Measuring Mission 0. The TRMM observatory, whose development was a joint US-Japan space endeavor, was launched in November 1997. TRMM measurements provide an accurate account of rainfall over the global tropics, information which can be .used to estimate the four-dimensional structure of latent heating across the entire tropical and sub-tropical regions. Various algorithm methodologies for estimating latent heating based on rain rate measurements from TRMM observations are described. The strengths and weaknesses of these algorithms, as well as the latent heating products generated by these algorithms, are also discussed along with validation analyses of the products. The investigation paper provides an overview of how TRMM-derived latent heating information is currently being used in conjunction with global weather and climate models, and concludes with remarks designed to stimulate further research on latent heating retrieval
Estimation of diagnostic test accuracy without full verification: a review of latent class methods
Collins, John; Huynh, Minh
2014-01-01
The performance of a diagnostic test is best evaluated against a reference test that is without error. For many diseases, this is not possible, and an imperfect reference test must be used. However, diagnostic accuracy estimates may be biased if inaccurately verified status is used as the truth. Statistical models have been developed to handle this situation by treating disease as a latent variable. In this paper, we conduct a systematized review of statistical methods using latent class models for estimating test accuracy and disease prevalence in the absence of complete verification. PMID:24910172
Fall Risk, Supports and Services, and Falls Following a Nursing Home Discharge.
Noureldin, Marwa; Hass, Zachary; Abrahamson, Kathleen; Arling, Greg
2017-09-04
Falls are a major source of morbidity and mortality among older adults; however, little is known regarding fall occurrence during a nursing home (NH) to community transition. This study sought to examine whether the presence of supports and services impacts the relationship between fall-related risk factors and fall occurrence post NH discharge. Participants in the Minnesota Return to Community Initiative who were assisted in achieving a community discharge (N = 1459) comprised the study sample. The main outcome was fall occurrence within 30 days of discharge. Factor analyses were used to estimate latent models from variables of interest. A structural equation model (SEM) was estimated to determine the relationship between the emerging latent variables and falls. Fifteen percent of participants fell within 30 days of NH discharge. Factor analysis of fall-related risk factors produced three latent variables: fall concerns/history; activities of daily living impairments; and use of high-risk medications. A supports/services latent variable also emerged that included caregiver support frequency, medication management assistance, durable medical equipment use, discharge location, and receipt of home health or skilled nursing services. In the SEM model, high-risk medications use and fall concerns/history had direct positive effects on falling. Receiving supports/services did not affect falling directly; however, it reduced the effect of high-risk medication use on falling (p < .05). Within the context of a state-implemented transition program, findings highlight the importance of supports/services in mitigating against medication-related risk of falling post NH discharge. © The Author 2017. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
ERIC Educational Resources Information Center
Hayiou-Thomas, Marianna E.; Dale, Philip S.; Plomin, Robert
2012-01-01
The present study is the first long-term longitudinal examination of the etiology of individual differences in language from early childhood through to adolescence. We applied a multivariate latent factor genetic model to longitudinal data from the Twins Early Development Study in order to (a) compare the magnitude of genetic and environmental…
Correlative and multivariate analysis of increased radon concentration in underground laboratory.
Maletić, Dimitrije M; Udovičić, Vladimir I; Banjanac, Radomir M; Joković, Dejan R; Dragić, Aleksandar L; Veselinović, Nikola B; Filipović, Jelena
2014-11-01
The results of analysis using correlative and multivariate methods, as developed for data analysis in high-energy physics and implemented in the Toolkit for Multivariate Analysis software package, of the relations of the variation of increased radon concentration with climate variables in shallow underground laboratory is presented. Multivariate regression analysis identified a number of multivariate methods which can give a good evaluation of increased radon concentrations based on climate variables. The use of the multivariate regression methods will enable the investigation of the relations of specific climate variable with increased radon concentrations by analysis of regression methods resulting in 'mapped' underlying functional behaviour of radon concentrations depending on a wide spectrum of climate variables. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A general diagnostic model applied to language testing data.
von Davier, Matthias
2008-11-01
Probabilistic models with one or more latent variables are designed to report on a corresponding number of skills or cognitive attributes. Multidimensional skill profiles offer additional information beyond what a single test score can provide, if the reported skills can be identified and distinguished reliably. Many recent approaches to skill profile models are limited to dichotomous data and have made use of computationally intensive estimation methods such as Markov chain Monte Carlo, since standard maximum likelihood (ML) estimation techniques were deemed infeasible. This paper presents a general diagnostic model (GDM) that can be estimated with standard ML techniques and applies to polytomous response variables as well as to skills with two or more proficiency levels. The paper uses one member of a larger class of diagnostic models, a compensatory diagnostic model for dichotomous and partial credit data. Many well-known models, such as univariate and multivariate versions of the Rasch model and the two-parameter logistic item response theory model, the generalized partial credit model, as well as a variety of skill profile models, are special cases of this GDM. In addition to an introduction to this model, the paper presents a parameter recovery study using simulated data and an application to real data from the field test for TOEFL Internet-based testing.
Iliceto, Paolo; Pompili, Maurizio; Spencer-Thomas, Sally; Ferracuti, Stefano; Erbuto, Denise; Lester, David; Candilera, Gabriella; Girardi, Paolo
2013-03-01
Occupational stress is a multivariate process involving sources of pressure, psycho-physiological distress, locus of control, work dissatisfaction, depression, anxiety, mental health disorders, hopelessness, and suicide ideation. Healthcare professionals are known for higher rates of occupational-related distress (burnout and compassion fatigue) and higher rates of suicide. The purpose of this study was to explain the relationships between occupational stress and some psychopathological dimensions in a sample of health professionals. We investigated 156 nurses and physicians, 62 males and 94 females, who were administered self-report questionnaires to assess occupational stress [occupational stress inventory (OSI)], temperament (temperament evaluation of Memphis, Pisa, Paris, and San Diego autoquestionnaire), and hopelessness (Beck hopelessness scale). The best Multiple Indicators Multiple Causes model with five OSI predictors yielded the following results: χ2(9) = 14.47 (p = 0.11); χ2/df = 1.60; comparative fit index = 0.99; root mean square error of approximation = 0.05. This model provided a good fit to the empirical data, showing a strong direct influence of casual variables such as work dissatisfaction, absence of type A behavior, and especially external locus of control, psychological and physiological distress on latent variable psychopathology. Occupational stress is in a complex relationship with temperament and hopelessness and also common among healthcare professionals.
The job content questionnaire in various occupational contexts: applying a latent class model.
Santos, Kionna Oliveira Bernardes; Araújo, Tânia Maria de; Carvalho, Fernando Martins; Karasek, Robert
2017-05-17
To evaluate Job Content Questionnaire(JCQ) performance using the latent class model. We analysed cross-sectional studies conducted in Brazil and examined three occupational categories: petroleum industry workers (n=489), teachers (n=4392) and primary healthcare workers (3078)and 1552 urban workers from a representative sample of the city of Feira de Santana in Bahia, Brazil. An appropriate number of latent classes was extracted and described each occupational category using latent class analysis, a multivariate method that evaluates constructs and takes into accountthe latent characteristics underlying the structure of measurement scales. The conditional probabilities of workers belonging to each class were then analysed graphically. Initially, the latent class analysis extracted four classes corresponding to the four job types (active, passive, low strain and high strain) proposed by the Job-Strain model (JSM) and operationalised by the JCQ. However, after taking into consideration the adequacy criteria to evaluate the number of extracted classes, three classes (active, low strain and high strain) were extracted from the studies of urban workers and teachers and four classes (active, passive, low strain and high strain) from the study of primary healthcare and petroleum industry workers. The four job types proposed by the JSM were identified among primary healthcare and petroleum industry workers-groups with relatively high levels of skill discretion and decision authority. Three job types were identified for teachers and urban workers; however, passive job situations were not found within these groups. The latent class analysis enabled us to describe the conditional standard responses of the job types proposed by the model, particularly in relation to active jobs and high and low strain situations. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Efficacy of Doxycycline in the Treatment of Syphilis.
Dai, Ting; Qu, Rui; Liu, Jinfen; Zhou, Pingyu; Wang, Qianqiu
2017-01-01
Doxycycline is an alternative antibiotic drug for the treatment of syphilis, but data on its efficacy, especially data on its efficacy against late latent syphilis, are limited. A retrospective study was conducted to evaluate the effectiveness of doxycycline for the treatment of patients with different stages of syphilis. Patients who received doxycycline treatment between June 2011 and June 2014 were involved. The serological response to doxycycline was defined as either a negative toluidine red unheated serum test (TRUST) result or a ≥4-fold decrease in titer at 12 months following the treatment. Univariate and multivariate logistic regression analyses were performed to identify factors associated with the serological response. During the study period, a total of 163 syphilis patients were treated with doxycycline, and 118 patients completed doxycycline treatment and the 12-month follow-up. Among the 118 patients, the serological response rate at 12 months was 100.0% (7/7) in patients with primary syphilis, 96.9% (62/64) in patients with secondary syphilis, 91.3% (21/23) in patients with early latent syphilis, and 79.2% (19/24) in patients with late latent syphilis. The total serological response rates were 92.4% (109/118) for preprotocol (PP) patients and 66.9% (109/163) for all intention-to-treat (ITT) patients. In multivariate analysis, patients who serologically responded at 12 months following treatment were positively associated with a higher baseline TRUST titer and an earlier syphilis stage than nonresponders. Our study showed excellent treatment outcomes in patients with different stages of syphilis. Our data, along with those from other reports, support the usage of doxycycline as a good alternative therapeutic option in the treatment of syphilis. Copyright © 2016 American Society for Microbiology.
NASA Astrophysics Data System (ADS)
Thelen, Brian T.; Xique, Ismael J.; Burns, Joseph W.; Goley, G. Steven; Nolan, Adam R.; Benson, Jonathan W.
2017-04-01
With all of the new remote sensing modalities available, and with ever increasing capabilities and frequency of collection, there is a desire to fundamentally understand/quantify the information content in the collected image data relative to various exploitation goals, such as detection/classification. A fundamental approach for this is the framework of Bayesian decision theory, but a daunting challenge is to have significantly flexible and accurate multivariate models for the features and/or pixels that capture a wide assortment of distributions and dependen- cies. In addition, data can come in the form of both continuous and discrete representations, where the latter is often generated based on considerations of robustness to imaging conditions and occlusions/degradations. In this paper we propose a novel suite of "latent" models fundamentally based on multivariate Gaussian copula models that can be used for quantized data from SAR imagery. For this Latent Gaussian Copula (LGC) model, we derive an approximate, maximum-likelihood estimation algorithm and demonstrate very reasonable estimation performance even for the larger images with many pixels. However applying these LGC models to large dimen- sions/images within a Bayesian decision/classification theory is infeasible due to the computational/numerical issues in evaluating the true full likelihood, and we propose an alternative class of novel pseudo-likelihoood detection statistics that are computationally feasible. We show in a few simple examples that these statistics have the potential to provide very good and robust detection/classification performance. All of this framework is demonstrated on a simulated SLICY data set, and the results show the importance of modeling the dependencies, and of utilizing the pseudo-likelihood methods.
Lo, Po-Han; Tsou, Mei-Yung; Chang, Kuang-Yi
2015-09-01
Patient-controlled epidural analgesia (PCEA) is commonly used for pain relief after total knee arthroplasty (TKA). This study aimed to model the trajectory of analgesic demand over time after TKA and explore its influential factors using latent curve analysis. Data were retrospectively collected from 916 patients receiving unilateral or bilateral TKA and postoperative PCEA. PCEA demands during 12-hour intervals for 48 hours were directly retrieved from infusion pumps. Potentially influential factors of PCEA demand, including age, height, weight, body mass index, sex, and infusion pump settings, were also collected. A latent curve analysis with 2 latent variables, the intercept (baseline) and slope (trend), was applied to model the changes in PCEA demand over time. The effects of influential factors on these 2 latent variables were estimated to examine how these factors interacted with time to alter the trajectory of PCEA demand over time. On average, the difference in analgesic demand between the first and second 12-hour intervals was only 15% of that between the first and third 12-hour intervals. No significant difference in PCEA demand was noted between the third and fourth 12-hour intervals. Aging tended to decrease the baseline PCEA demand but body mass index and infusion rate were positively correlated with the baseline. Only sex significantly affected the trend parameter and male individuals tended to have a smoother decreasing trend of analgesic demands over time. Patients receiving bilateral procedures did not consume more analgesics than their unilateral counterparts. Goodness of fit analysis indicated acceptable model fit to the observed data. Latent curve analysis provided valuable information about how analgesic demand after TKA changed over time and how patient characteristics affected its trajectory.
Rössler, Wulf; Hengartner, Michael P; Ajdacic-Gross, Vladeta; Haker, Helene; Angst, Jules
2013-10-01
Our aim was to deconstruct the variance underlying the expression of sub-clinical psychosis symptoms into portions associated with latent time-dependent states and time-invariant traits. We analyzed data of 335 subjects from the general population of Zurich, Switzerland, who had been repeatedly measured between 1979 (age 20/21) and 2008 (age 49/50). We applied two measures of sub-clinical psychosis derived from the SCL-90-R, namely schizotypal signs (STS) and schizophrenia nuclear symptoms (SNS). Variance was decomposed with latent state-trait analysis and associations with covariates were examined with generalized linear models. At ages 19/20 and 49/50, the latent states underlying STS accounted for 48% and 51% of variance, whereas for SNS those estimates were 62% and 50%. Between those age classes, however, expression of sub-clinical psychosis was strongly associated with stable traits (75% and 89% of total variance in STS and SNS, respectively, at age 27/28). Latent states underlying variance in STS and SNS were particularly related to partnership problems over almost the entire observation period. STS was additionally related to employment problems, whereas drug-use was a strong predictor of states underlying both syndromes at age 19/20. The latent trait underlying expression of STS and SNS was particularly related to low sense of mastery and self-esteem and to high depressiveness. Although most psychosis symptoms are transient and episodic in nature, the variability in their expression is predominantly caused by stable traits. Those time-invariant and rather consistent effects are particularly influential around age 30, whereas the occasion-specific states appear to be particularly influential at ages 20 and 50. © 2013.
NASA Astrophysics Data System (ADS)
Levit, Creon; Gazis, P.
2006-06-01
The graphics processing units (GPUs) built in to all professional desktop and laptop computers currently on the market are capable of transforming, filtering, and rendering hundreds of millions of points per second. We present a prototype open-source cross-platform (windows, linux, Apple OSX) application which leverages some of the power latent in the GPU to enable smooth interactive exploration and analysis of large high-dimensional data using a variety of classical and recent techniques. The targeted application area is the interactive analysis of complex, multivariate space science and astrophysics data sets, with dimensionalities that may surpass 100 and sample sizes that may exceed 10^6-10^8.
ERIC Educational Resources Information Center
Aryadoust, Vahid
2015-01-01
The present study uses a mixture Rasch model to examine latent differential item functioning in English as a foreign language listening tests. Participants (n = 250) took a listening and lexico-grammatical test and completed the metacognitive awareness listening questionnaire comprising problem solving (PS), planning and evaluation (PE), mental…
An All-Fragments Grammar for Simple and Accurate Parsing
2012-03-21
Tsujii. Probabilistic CFG with latent annotations. In Proceedings of ACL, 2005. Slav Petrov and Dan Klein. Improved Inference for Unlexicalized Parsing. In...Proceedings of NAACL-HLT, 2007. Slav Petrov and Dan Klein. Sparse Multi-Scale Grammars for Discriminative Latent Variable Parsing. In Proceedings of...EMNLP, 2008. Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. Learning Accurate, Compact, and Interpretable Tree Annotation. In Proceedings
Stability of Language in Childhood: A Multi-Age, -Domain, -Measure, and -Source Study
Bornstein, Marc H.; Putnick, Diane L.
2011-01-01
The stability of language across childhood is traditionally assessed by exploring longitudinal relations between individual language measures. However, language encompasses many domains and varies with different sources (child speech, parental report, experimenter assessment). This study evaluated individual variation in multiple age-appropriate measures of child language derived from multiple sources and stability between their latent variables in 192 young children across more than 2 years. Structural equation modeling demonstrated the loading of multiple measures of child language from different sources on single latent variables of language at ages 20 and 48 months. A large stability coefficient (r = .84) obtained between the 2 language latent variables. This stability obtained even when accounting for family socioeconomic status, maternal verbal intelligence, education, speech, and tendency to respond in a socially desirable fashion, and child social competence. Stability was also equivalent for children in diverse childcare situations and for girls and boys. Across age, from the beginning of language acquisition to just before school entry, aggregating multiple age-appropriate methods and measures at each age and multiple reporters, children show strong stability of individual differences in general language development. PMID:22004343
Data-driven subtypes of major depressive disorder: a systematic review
2012-01-01
Background According to current classification systems, patients with major depressive disorder (MDD) may have very different combinations of symptoms. This symptomatic diversity hinders the progress of research into the causal mechanisms and treatment allocation. Theoretically founded subtypes of depression such as atypical, psychotic, and melancholic depression have limited clinical applicability. Data-driven analyses of symptom dimensions or subtypes of depression are scarce. In this systematic review, we examine the evidence for the existence of data-driven symptomatic subtypes of depression. Methods We undertook a systematic literature search of MEDLINE, PsycINFO and Embase in May 2012. We included studies analyzing the depression criteria of the Diagnostic and Statistical Manual of Mental Disorders, fourth edition (DSM-IV) of adults with MDD in latent variable analyses. Results In total, 1176 articles were retrieved, of which 20 satisfied the inclusion criteria. These reports described a total of 34 latent variable analyses: 6 confirmatory factor analyses, 6 exploratory factor analyses, 12 principal component analyses, and 10 latent class analyses. The latent class techniques distinguished 2 to 5 classes, which mainly reflected subgroups with different overall severity: 62 of 71 significant differences on symptom level were congruent with a latent class solution reflecting severity. The latent class techniques did not consistently identify specific symptom clusters. Latent factor techniques mostly found a factor explaining the variance in the symptoms depressed mood and interest loss (11 of 13 analyses), often complemented by psychomotor retardation or fatigue (8 of 11 analyses). However, differences in found factors and classes were substantial. Conclusions The studies performed to date do not provide conclusive evidence for the existence of depressive symptom dimensions or symptomatic subtypes. The wide diversity of identified factors and classes might result either from the absence of patterns to be found, or from the theoretical and modeling choices preceding analysis. PMID:23210727
Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R.; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F.
2014-01-01
ABSTRACT We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. IMPORTANCE This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed. PMID:24429365
Latent variable model for suicide risk in relation to social capital and socio-economic status.
Congdon, Peter
2012-08-01
There is little evidence on the association between suicide outcomes (ideation, attempts, self-harm) and social capital. This paper investigates such associations using a structural equation model based on health survey data, and allowing for both individual and contextual risk factors. Social capital and other major risk factors for suicide, namely socioeconomic status and social isolation, are modelled as latent variables that are proxied (or measured) by observed indicators or question responses for survey subjects. These latent scales predict suicide risk in the structural component of the model. Also relevant to explaining suicide risk are contextual variables, such as area deprivation and region of residence, as well as the subject's demographic status. The analysis is based on the 2007 Adult Psychiatric Morbidity Survey and includes 7,403 English subjects. A Bayesian modelling strategy is used. Models with and without social capital as a predictor of suicide risk are applied. A benefit to statistical fit is demonstrated when social capital is added as a predictor. Social capital varies significantly by geographic context variables (neighbourhood deprivation, region), and this impacts on the direct effects of these contextual variables on suicide risk. In particular, area deprivation is not confirmed as a distinct significant influence. The model develops a suicidality risk score incorporating social capital, and the success of this risk score in predicting actual suicide events is demonstrated. Social capital as reflected in neighbourhood perceptions is a significant factor affecting risks of different types of self-harm and may mediate the effects of other contextual variables such as area deprivation.
Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F; Luzuriaga, Katherine
2014-04-01
We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed.
Geiser, Christian; Griffin, Daniel; Shiffman, Saul
2016-01-01
Sometimes, researchers are interested in whether an intervention, experimental manipulation, or other treatment causes changes in intra-individual state variability. The authors show how multigroup-multiphase latent state-trait (MG-MP-LST) models can be used to examine treatment effects with regard to both mean differences and differences in state variability. The approach is illustrated based on a randomized controlled trial in which N = 338 smokers were randomly assigned to nicotine replacement therapy (NRT) vs. placebo prior to quitting smoking. We found that post quitting, smokers in both the NRT and placebo group had significantly reduced intra-individual affect state variability with respect to the affect items calm and content relative to the pre-quitting phase. This reduction in state variability did not differ between the NRT and placebo groups, indicating that quitting smoking may lead to a stabilization of individuals' affect states regardless of whether or not individuals receive NRT.
Geiser, Christian; Griffin, Daniel; Shiffman, Saul
2016-01-01
Sometimes, researchers are interested in whether an intervention, experimental manipulation, or other treatment causes changes in intra-individual state variability. The authors show how multigroup-multiphase latent state-trait (MG-MP-LST) models can be used to examine treatment effects with regard to both mean differences and differences in state variability. The approach is illustrated based on a randomized controlled trial in which N = 338 smokers were randomly assigned to nicotine replacement therapy (NRT) vs. placebo prior to quitting smoking. We found that post quitting, smokers in both the NRT and placebo group had significantly reduced intra-individual affect state variability with respect to the affect items calm and content relative to the pre-quitting phase. This reduction in state variability did not differ between the NRT and placebo groups, indicating that quitting smoking may lead to a stabilization of individuals' affect states regardless of whether or not individuals receive NRT. PMID:27499744
Vera, José Fernando; de Rooij, Mark; Heiser, Willem J
2014-11-01
In this paper we propose a latent class distance association model for clustering in the predictor space of large contingency tables with a categorical response variable. The rows of such a table are characterized as profiles of a set of explanatory variables, while the columns represent a single outcome variable. In many cases such tables are sparse, with many zero entries, which makes traditional models problematic. By clustering the row profiles into a few specific classes and representing these together with the categories of the response variable in a low-dimensional Euclidean space using a distance association model, a parsimonious prediction model can be obtained. A generalized EM algorithm is proposed to estimate the model parameters and the adjusted Bayesian information criterion statistic is employed to test the number of mixture components and the dimensionality of the representation. An empirical example highlighting the advantages of the new approach and comparing it with traditional approaches is presented. © 2014 The British Psychological Society.
Three subgroups of pain profiles identified in 227 women with arthritis: a latent class analysis.
de Luca, Katie; Parkinson, Lynne; Downie, Aron; Blyth, Fiona; Byles, Julie
2017-03-01
The objectives were to identify subgroups of women with arthritis based upon the multi-dimensional nature of their pain experience and to compare health and socio-demographic variables between subgroups. A latent class analysis of 227 women with self-reported arthritis was used to identify clusters of women based upon the sensory, affective, and cognitive dimensions of the pain experience. Multivariate multinomial logistic regression analysis was used to determine the relationship between cluster membership and health and sociodemographic characteristics. A three-class cluster model was most parsimonious. 39.5 % of women had a unidimensional pain profile; 38.6 % of women had moderate multidimensional pain profile that included additional pain symptomatology such as sensory qualities and pain catastrophizing; and 21.9 % of women had severe multidimensional pain profile that included prominent pain symptomatology such as sensory and affective qualities of pain, pain catastrophizing, and neuropathic pain. Women with severe multidimensional pain profile have a 30.5 % higher risk of poorer quality of life and a 7.3 % higher risk of suffering depression, and women with moderate multidimensional pain profile have a 6.4 % higher risk of poorer quality of life when compared to women with unidimensional pain. This study identified three distinct subgroups of pain profiles in older women with arthritis. Women had very different experiences of pain, and cluster membership impacted significantly on health-related quality of life. These preliminary findings provide a stronger understanding of profiles of pain and may contribute to the development of tailored treatment options in arthritis.
Latent Heating from TRMM Satellite Measurements
NASA Technical Reports Server (NTRS)
Tao, Wei-Kuo; Smith, E. A.; Adler, R.; Haddad, Z.; Hou, A.; Iguchi, T.; Kakar, R.; Krishnamurti, T.; Kummerow, C.; Lang, S.
2004-01-01
Rainfall production is the fundamental variable within the Earth's hydrological cycle because it is both the principal forcing term in surface water budgets and its energetics corollary, latent heating, is the principal source of atmospheric diabatic heating. Latent heat release itself is a consequence of phase changes between the vapor, liquid, and frozen states of water. The properties of the vertical distribution of latent heat release modulate large-scale meridional and zonal circulations within the tropics - as well as modifying the energetic efficiencies of midlatitude weather systems. This paper focuses on the retrieval of latent heat release from satellite measurements generated by the Tropical Rainfall Measuring Mission (TRMM) satellite observatory, which was launched in November 1997 as a joint American-Japanese space endeavor. Since then, TRMM measurements have been providing an accurate four-dimensional account of rainfall over the global tropics and sub-tropics, information which can be used to estimate the space-time structure of latent heating across the Earth's low latitudes. The paper examines how the observed TRMM distribution of rainfall has advanced an understanding of the global water and energy cycle and its consequent relationship to the atmospheric general circulation and climate via latent heat release. A set of algorithm methodologies that are being used to estimate latent heating based on rain rate retrievals from the TRMM observations are described. The characteristics of these algorithms and the latent heating products that can be generated from them are also described, along with validation analyses of the heating products themselves. Finally, the investigation provides an overview of how TRMM-derived latent heating information is currently being used in conjunction with global weather and climate models, concluding with remarks intended to stimulate further research on latent heating retrieval from satellites.
Spatial path models with multiple indicators and multiple causes: mental health in US counties.
Congdon, Peter
2011-06-01
This paper considers a structural model for the impact on area mental health outcomes (poor mental health, suicide) of spatially structured latent constructs: deprivation, social capital, social fragmentation and rurality. These constructs are measured by multiple observed effect indicators, with the constructs allowed to be correlated both between and within areas. However, in the scheme developed here, particular latent constructs may also be influenced by known variables, or, via path sequences, by other constructs, possibly nonlinearly. For example, area social capital may be measured by effect indicators (e.g. associational density, charitable activity), but influenced as causes by other constructs (e.g. area deprivation), and by observed features of the socio-ethnic structure of areas. A model incorporating these features is applied to suicide mortality and the prevalence of poor mental health in 3141 US counties, which are related to the latent spatial constructs and to observed variables (e.g. county ethnic mix). Copyright © 2011 Elsevier Ltd. All rights reserved.
A Bayesian Approach to More Stable Estimates of Group-Level Effects in Contextual Studies.
Zitzmann, Steffen; Lüdtke, Oliver; Robitzsch, Alexander
2015-01-01
Multilevel analyses are often used to estimate the effects of group-level constructs. However, when using aggregated individual data (e.g., student ratings) to assess a group-level construct (e.g., classroom climate), the observed group mean might not provide a reliable measure of the unobserved latent group mean. In the present article, we propose a Bayesian approach that can be used to estimate a multilevel latent covariate model, which corrects for the unreliable assessment of the latent group mean when estimating the group-level effect. A simulation study was conducted to evaluate the choice of different priors for the group-level variance of the predictor variable and to compare the Bayesian approach with the maximum likelihood approach implemented in the software Mplus. Results showed that, under problematic conditions (i.e., small number of groups, predictor variable with a small ICC), the Bayesian approach produced more accurate estimates of the group-level effect than the maximum likelihood approach did.
NASA Technical Reports Server (NTRS)
Shepherd, J. Marshall; Einaudi, Franco (Technical Monitor)
2000-01-01
The Tropical Rainfall Measuring Mission (TRMM) as a part of NASA's Earth System Enterprise is the first mission dedicated to measuring tropical rainfall through microwave and visible sensors, and includes the first spaceborne rain radar. Tropical rainfall comprises two-thirds of global rainfall. It is also the primary distributor of heat through the atmosphere's circulation. It is this circulation that defines Earth's weather and climate. Understanding rainfall and its variability is crucial to understanding and predicting global climate change. Weather and climate models need an accurate assessment of the latent heating released as tropical rainfall occurs. Currently, cloud model-based algorithms are used to derive latent heating based on rainfall structure. Ultimately, these algorithms can be applied to actual data from TRMM. This study investigates key underlying assumptions used in developing the latent heating algorithms. For example, the standard algorithm is highly dependent on a system's rainfall amount and structure. It also depends on an a priori database of model-derived latent heating profiles based on the aforementioned rainfall characteristics. Unanswered questions remain concerning the sensitivity of latent heating profiles to environmental conditions (both thermodynamic and kinematic), regionality, and seasonality. This study investigates and quantifies such sensitivities and seeks to determine the optimal latent heating profile database based on the results. Ultimately, the study seeks to produce an optimized latent heating algorithm based not only on rainfall structure but also hydrometeor profiles.
Jeon, Jihyoun; Hsu, Li; Gorfine, Malka
2012-07-01
Frailty models are useful for measuring unobserved heterogeneity in risk of failures across clusters, providing cluster-specific risk prediction. In a frailty model, the latent frailties shared by members within a cluster are assumed to act multiplicatively on the hazard function. In order to obtain parameter and frailty variate estimates, we consider the hierarchical likelihood (H-likelihood) approach (Ha, Lee and Song, 2001. Hierarchical-likelihood approach for frailty models. Biometrika 88, 233-243) in which the latent frailties are treated as "parameters" and estimated jointly with other parameters of interest. We find that the H-likelihood estimators perform well when the censoring rate is low, however, they are substantially biased when the censoring rate is moderate to high. In this paper, we propose a simple and easy-to-implement bias correction method for the H-likelihood estimators under a shared frailty model. We also extend the method to a multivariate frailty model, which incorporates complex dependence structure within clusters. We conduct an extensive simulation study and show that the proposed approach performs very well for censoring rates as high as 80%. We also illustrate the method with a breast cancer data set. Since the H-likelihood is the same as the penalized likelihood function, the proposed bias correction method is also applicable to the penalized likelihood estimators.
Mannarini, Stefania; Boffo, Marilisa
2015-01-01
Mental illness stigma is a serious societal problem and a critical impediment to treatment seeking for mentally ill people. To improve the understanding of mental illness stigma, this study focuses on the simultaneous analysis of people's aetiological beliefs, attitudes (i.e. perceived dangerousness and social distance), and recommended treatments related to several mental disorders by devising an over-arching latent structure that could explain the relations among these variables. Three hundred and sixty university students randomly received an unlabelled vignette depicting one of six mental disorders to be evaluated on the four variables on a Likert-type scale. A one-factor Latent Class Analysis (LCA) model was hypothesized, which comprised the four manifest variables as indicators and the mental disorder as external variable. The main findings were the following: (a) a one-factor LCA model was retrieved; (b) alcohol and drug addictions are the most strongly stigmatized; (c) a realistic opinion about the causes and treatment of schizophrenia, anxiety, bulimia, and depression was associated to lower prejudicial attitudes and social rejection. Beyond the general appraisal of mental illness an individual might have, the results generally point to the acknowledgement of the specific features of different diagnostic categories. The implications of the present results are discussed in the framework of a better understanding of mental illness stigma.
Archambeau, Cédric; Verleysen, Michel
2007-01-01
A new variational Bayesian learning algorithm for Student-t mixture models is introduced. This algorithm leads to (i) robust density estimation, (ii) robust clustering and (iii) robust automatic model selection. Gaussian mixture models are learning machines which are based on a divide-and-conquer approach. They are commonly used for density estimation and clustering tasks, but are sensitive to outliers. The Student-t distribution has heavier tails than the Gaussian distribution and is therefore less sensitive to any departure of the empirical distribution from Gaussianity. As a consequence, the Student-t distribution is suitable for constructing robust mixture models. In this work, we formalize the Bayesian Student-t mixture model as a latent variable model in a different way from Svensén and Bishop [Svensén, M., & Bishop, C. M. (2005). Robust Bayesian mixture modelling. Neurocomputing, 64, 235-252]. The main difference resides in the fact that it is not necessary to assume a factorized approximation of the posterior distribution on the latent indicator variables and the latent scale variables in order to obtain a tractable solution. Not neglecting the correlations between these unobserved random variables leads to a Bayesian model having an increased robustness. Furthermore, it is expected that the lower bound on the log-evidence is tighter. Based on this bound, the model complexity, i.e. the number of components in the mixture, can be inferred with a higher confidence.
Do gamblers eat more salt? Testing a latent trait model of covariance in consumption
Goodwin, Belinda C.; Browne, Matthew; Rockloff, Matthew; Donaldson, Phillip
2015-01-01
A diverse class of stimuli, including certain foods, substances, media, and economic behaviours, may be described as ‘reward-oriented’ in that they provide immediate reinforcement with little initial investment. Neurophysiological and personality concepts, including dopaminergic dysfunction, reward sensitivity and rash impulsivity, each predict the existence of a latent behavioural trait that leads to increased consumption of all stimuli in this class. Whilst bivariate relationships (co-morbidities) are often reported in the literature, to our knowledge, a multivariate investigation of this possible trait has not been done. We surveyed 1,194 participants (550 male) on their typical weekly consumption of 11 types of reward-oriented stimuli, including fast food, salt, caffeine, television, gambling products, and illicit drugs. Confirmatory factor analysis was used to compare models in a 3×3 structure, based on the definition of a single latent factor (none, fixed loadings, or estimated loadings), and assumed residual covariance structure (none, a-priori / literature based, or post-hoc / data-driven). The inclusion of a single latent behavioural ‘consumption’ factor significantly improved model fit in all cases. Also confirming theoretical predictions, estimated factor loadings on reward-oriented indicators were uniformly positive, regardless of assumptions regarding residual covariances. Additionally, the latent trait was found to be negatively correlated with the non-reward-oriented indicators of fruit and vegetable consumption. The findings support the notion of a single behavioural trait leading to increased consumption of reward-oriented stimuli across multiple modalities. We discuss implications regarding the concentration of negative lifestyle-related health behaviours. PMID:26551907
Do gamblers eat more salt? Testing a latent trait model of covariance in consumption.
Goodwin, Belinda C; Browne, Matthew; Rockloff, Matthew; Donaldson, Phillip
2015-09-01
A diverse class of stimuli, including certain foods, substances, media, and economic behaviours, may be described as 'reward-oriented' in that they provide immediate reinforcement with little initial investment. Neurophysiological and personality concepts, including dopaminergic dysfunction, reward sensitivity and rash impulsivity, each predict the existence of a latent behavioural trait that leads to increased consumption of all stimuli in this class. Whilst bivariate relationships (co-morbidities) are often reported in the literature, to our knowledge, a multivariate investigation of this possible trait has not been done. We surveyed 1,194 participants (550 male) on their typical weekly consumption of 11 types of reward-oriented stimuli, including fast food, salt, caffeine, television, gambling products, and illicit drugs. Confirmatory factor analysis was used to compare models in a 3×3 structure, based on the definition of a single latent factor (none, fixed loadings, or estimated loadings), and assumed residual covariance structure (none, a-priori / literature based, or post-hoc / data-driven). The inclusion of a single latent behavioural 'consumption' factor significantly improved model fit in all cases. Also confirming theoretical predictions, estimated factor loadings on reward-oriented indicators were uniformly positive, regardless of assumptions regarding residual covariances. Additionally, the latent trait was found to be negatively correlated with the non-reward-oriented indicators of fruit and vegetable consumption. The findings support the notion of a single behavioural trait leading to increased consumption of reward-oriented stimuli across multiple modalities. We discuss implications regarding the concentration of negative lifestyle-related health behaviours.
Katseanes, Chelsea K; Chappell, Mark A; Hopkins, Bryan G; Durham, Brian D; Price, Cynthia L; Porter, Beth E; Miller, Lesley F
2017-12-01
After nearly a century of use in numerous munition platforms, TNT and RDX contamination has turned up largely in the environment due to ammunition manufacturing or as part of releases from low-order detonations during training activities. Although the basic knowledge governing the environmental fate of TNT and RDX are known, accurate predictions of TNT and RDX persistence in soil remain elusive, particularly given the universal heterogeneity of pedomorphic soil types. In this work, we proposed overcoming this problem by considering the environmental persistence of these munition constituents (MC) as multivariate mathematical functions over a variety of taxonomically distinct soil types, instead of a single constant or parameter of a specific absolute value. To test this idea, we conducted experiments where the disappearance kinetics of TNT and RDX were measured over a >300 h period in taxonomically distinct soils. Classical fertility-based soil measurements were log-transformed, statistically decomposed, and correlated to TNT and RDX disappearance rates (k -TNT and k -RDX ) using multivariate dimension-reduction and correlation techniques. From these efforts, we generated multivariate linear functions for k parameters across different soil types based on a statistically reduced set of their chemical and physical properties: Calculations showed that the soil properties exhibited strong covariance, with a prominent latent structure emerging as the basis for relative comparisons of the samples in reduced space. Loadings describing TNT degradation were largely driven by properties associated with alkaline/calcareous soil characteristics, while the degradation of RDX was attributed to the soil organic matter content - reflective of an important soil fertility characteristic. In spite of the differing responses to the munitions, batch data suggested that the overall nutrient dynamics were consistent for each soil type, as well as readily distinguishable from the other soil types used in this study. Thus, we hypothesized that the latent structure arising from the strong covariance of full multivariate geochemical matrix describing taxonomically distinguished "soil types" may provide the means for potentially predicting complex phenomena in soils. Published by Elsevier Ltd.
An Analytic Approach to Modeling Land-Atmosphere Interaction: 1. Construct and Equilibrium Behavior
NASA Astrophysics Data System (ADS)
Brubaker, Kaye L.; Entekhabi, Dara
1995-03-01
A four-variable land-atmosphere model is developed to investigate the coupled exchanges of water and energy between the land surface and atmosphere and the role of these exchanges in the statistical behavior of continental climates. The land-atmosphere system is substantially simplified and formulated as a set of ordinary differential equations that, with the addition of random noise, are suitable for analysis in the form of the multivariate Îto equation. The model treats the soil layer and the near-surface atmosphere as reservoirs with storage capacities for heat and water. The transfers between these reservoirs are regulated by four states: soil saturation, soil temperature, air specific humidity, and air potential temperature. The atmospheric reservoir is treated as a turbulently mixed boundary layer of fixed depth. Heat and moisture advection, precipitation, and layer-top air entrainment are parameterized. The system is forced externally by solar radiation and the lateral advection of air and water mass. The remaining energy and water mass exchanges are expressed in terms of the state variables. The model development and equilibrium solutions are presented. Although comparisons between observed data and steady state model results re inexact, the model appears to do a reasonable job of partitioning net radiation into sensible and latent heat flux in appropriate proportions for bare-soil midlatitude summer conditions. Subsequent work will introduce randomness into the forcing terms to investigate the effect of water-energy coupling and land-atmosphere interaction on variability and persistence in the climatic system.
Sun, Fei; Xu, Bing; Zhang, Yi; Dai, Shengyun; Yang, Chan; Cui, Xianglong; Shi, Xinyuan; Qiao, Yanjiang
2016-01-01
The quality of Chinese herbal medicine tablets suffers from batch-to-batch variability due to a lack of manufacturing process understanding. In this paper, the Panax notoginseng saponins (PNS) immediate release tablet was taken as the research subject. By defining the dissolution of five active pharmaceutical ingredients and the tablet tensile strength as critical quality attributes (CQAs), influences of both the manipulated process parameters introduced by an orthogonal experiment design and the intermediate granules’ properties on the CQAs were fully investigated by different chemometric methods, such as the partial least squares, the orthogonal projection to latent structures, and the multiblock partial least squares (MBPLS). By analyzing the loadings plots and variable importance in the projection indexes, the granule particle sizes and the minimal punch tip separation distance in tableting were identified as critical process parameters. Additionally, the MBPLS model suggested that the lubrication time in the final blending was also important in predicting tablet quality attributes. From the calculated block importance in the projection indexes, the tableting unit was confirmed to be the critical process unit of the manufacturing line. The results demonstrated that the combinatorial use of different multivariate modeling methods could help in understanding the complex process relationships as a whole. The output of this study can then be used to define a control strategy to improve the quality of the PNS immediate release tablet. PMID:27932865
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2014-01-01
This research note contributes to the discussion of methods that can be used to identify useful auxiliary variables for analyses of incomplete data sets. A latent variable approach is discussed, which is helpful in finding auxiliary variables with the property that if included in subsequent maximum likelihood analyses they may enhance considerably…
Maximum Likelihood Analysis of Nonlinear Structural Equation Models with Dichotomous Variables
ERIC Educational Resources Information Center
Song, Xin-Yuan; Lee, Sik-Yum
2005-01-01
In this article, a maximum likelihood approach is developed to analyze structural equation models with dichotomous variables that are common in behavioral, psychological and social research. To assess nonlinear causal effects among the latent variables, the structural equation in the model is defined by a nonlinear function. The basic idea of the…
Causal Models with Unmeasured Variables: An Introduction to LISREL.
ERIC Educational Resources Information Center
Wolfle, Lee M.
Whenever one uses ordinary least squares regression, one is making an implicit assumption that all of the independent variables have been measured without error. Such an assumption is obviously unrealistic for most social data. One approach for estimating such regression models is to measure implied coefficients between latent variables for which…
Least Principal Components Analysis (LPCA): An Alternative to Regression Analysis.
ERIC Educational Resources Information Center
Olson, Jeffery E.
Often, all of the variables in a model are latent, random, or subject to measurement error, or there is not an obvious dependent variable. When any of these conditions exist, an appropriate method for estimating the linear relationships among the variables is Least Principal Components Analysis. Least Principal Components are robust, consistent,…
Fiske, Ian J.; Royle, J. Andrew; Gross, Kevin
2014-01-01
Ecologists and wildlife biologists increasingly use latent variable models to study patterns of species occurrence when detection is imperfect. These models have recently been generalized to accommodate both a more expansive description of state than simple presence or absence, and Markovian dynamics in the latent state over successive sampling seasons. In this paper, we write these multi-season, multi-state models as hidden Markov models to find both maximum likelihood estimates of model parameters and finite-sample estimators of the trajectory of the latent state over time. These estimators are especially useful for characterizing population trends in species of conservation concern. We also develop parametric bootstrap procedures that allow formal inference about latent trend. We examine model behavior through simulation, and we apply the model to data from the North American Amphibian Monitoring Program.
ERIC Educational Resources Information Center
Choi, Kilchan; Seltzer, Michael
2005-01-01
In studies of change in education and numerous other fields, interest often centers on how differences in the status of individuals at the start of a time period of substantive interest relate to differences in subsequent change. This report presents a fully Bayesian approach to estimating three-level hierarchical models in which latent variable…
Epilepsy and the Wnt Signaling Pathway
2015-06-01
status epilepticus (SE), head injury, infection or stroke). This is followed by a variable (months to years in humans) “latent period” followed by the...TERMS Status Epilepticus , Wnt Signaling, Epileptogenesis 16. SECURITY CLASSIFICATION OF: U 17. LIMITATION OF ABSTRACTU U 18. NUMBER OF PAGES 4...disease sub-type. In this grant, we will investigate the mechanisms of Status Epilepticus (SE) and the ensuing latent period in animal models of
Individual heterogeneity in reproductive rates and cost of reproduction in a long-lived vertebrate
Chambert, Thierry; Rotella, Jay J; Higgs, Megan D; Garrott, Robert A
2013-01-01
Individual variation in reproductive success is a key feature of evolution, but also has important implications for predicting population responses to variable environments. Although such individual variation in reproductive outcomes has been reported in numerous studies, most analyses to date have not considered whether these realized differences were due to latent individual heterogeneity in reproduction or merely random chance causing different outcomes among like individuals. Furthermore, latent heterogeneity in fitness components might be expressed differently in contrasted environmental conditions, an issue that has only rarely been investigated. Here, we assessed (i) the potential existence of latent individual heterogeneity and (ii) the nature of its expression (fixed vs. variable) in a population of female Weddell seals (Leptonychotes weddellii), using a hierarchical modeling approach on a 30-year mark–recapture data set consisting of 954 individual encounter histories. We found strong support for the existence of latent individual heterogeneity in the population, with “robust” individuals expected to produce twice as many pups as “frail” individuals. Moreover, the expression of individual heterogeneity appeared consistent, with only mild evidence that it might be amplified when environmental conditions are severe. Finally, the explicit modeling of individual heterogeneity allowed us to detect a substantial cost of reproduction that was not evidenced when the heterogeneity was ignored. PMID:23919151
Blanchin, Myriam; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Blanchard, Claire; Mirallié, Eric; Sébille, Véronique
2011-04-15
Health sciences frequently deal with Patient Reported Outcomes (PRO) data for the evaluation of concepts, in particular health-related quality of life, which cannot be directly measured and are often called latent variables. Two approaches are commonly used for the analysis of such data: Classical Test Theory (CTT) and Item Response Theory (IRT). Longitudinal data are often collected to analyze the evolution of an outcome over time. The most adequate strategy to analyze longitudinal latent variables, which can be either based on CTT or IRT models, remains to be identified. This strategy must take into account the latent characteristic of what PROs are intended to measure as well as the specificity of longitudinal designs. A simple and widely used IRT model is the Rasch model. The purpose of our study was to compare CTT and Rasch-based approaches to analyze longitudinal PRO data regarding type I error, power, and time effect estimation bias. Four methods were compared: the Score and Mixed models (SM) method based on the CTT approach, the Rasch and Mixed models (RM), the Plausible Values (PV), and the Longitudinal Rasch model (LRM) methods all based on the Rasch model. All methods have shown comparable results in terms of type I error, all close to 5 per cent. LRM and SM methods presented comparable power and unbiased time effect estimations, whereas RM and PV methods showed low power and biased time effect estimations. This suggests that RM and PV methods should be avoided to analyze longitudinal latent variables. Copyright © 2010 John Wiley & Sons, Ltd.
Stamovlasis, Dimitrios; Papageorgiou, George; Tsitsipis, Georgios; Tsikalas, Themistoklis; Vaiopoulou, Julie
2018-01-01
This paper illustrates two psychometric methods, latent class analysis (LCA) and taxometric analysis (TA) using empirical data from research probing children's mental representation in science learning. LCA is used to obtain a typology based on observed variables and to further investigate how the encountered classes might be related to external variables, where the effectiveness of classification process and the unbiased estimations of parameters become the main concern. In the step-wise LCA, the class membership is assigned and subsequently its relationship with covariates is established. This leading-edge modeling approach suffers from severe downward-biased estimations. The illustration of LCA is focused on alternative bias correction approaches and demonstrates the effect of modal and proportional class-membership assignment along with BCH and ML correction procedures. The illustration of LCA is presented with three covariates, which are psychometric variables operationalizing formal reasoning, divergent thinking and field dependence-independence, respectively. Moreover, taxometric analysis, a method designed to detect the type of the latent structural model, categorical or dimensional, is introduced, along with the relevant basic concepts and tools. TA was applied complementarily in the same data sets to answer the fundamental hypothesis about children's naïve knowledge on the matters under study and it comprises an additional asset in building theory which is fundamental for educational practices. Taxometric analysis provided results that were ambiguous as far as the type of the latent structure. This finding initiates further discussion and sets a problematization within this framework rethinking fundamental assumptions and epistemological issues. PMID:29713300
Riba Ruiz, Jordi-Roger; Canals, Trini; Cantero, Rosa
2017-01-01
Ethylene propylene diene monomer (EPDM) rubber is widely used in a diverse type of applications, such as the automotive, industrial and construction sectors among others. Due to its appealing features, the consumption of vulcanized EPDM rubber is growing significantly. However, environmental issues are forcing the application of devulcanization processes to facilitate recovery, which has led rubber manufacturers to implement strict quality controls. Consequently, it is important to develop methods for supervising the vulcanizing and recovery processes of such products. This paper deals with the supervision process of EPDM compounds by means of Fourier transform mid-infrared (FT-IR) spectroscopy and suitable multivariate statistical methods. An expedited and nondestructive classification approach was applied to a sufficient number of EPDM samples with different applied processes, that is, with and without application of vulcanizing agents, vulcanized samples, and microwave treated samples. First the FT-IR spectra of the samples is acquired and next it is processed by applying suitable feature extraction methods, i.e., principal component analysis and canonical variate analysis to obtain the latent variables to be used for classifying test EPDM samples. Finally, the k nearest neighbor algorithm was used in the classification stage. Experimental results prove the accuracy of the proposed method and the potential of FT-IR spectroscopy in this area, since the classification accuracy can be as high as 100%.
DOT National Transportation Integrated Search
2015-12-01
We develop an econometric framework for incorporating spatial dependence in integrated model systems of latent variables and multidimensional mixed data outcomes. The framework combines Bhats Generalized Heterogeneous Data Model (GHDM) with a spat...
Should "Multiple Imputations" Be Treated as "Multiple Indicators"?
ERIC Educational Resources Information Center
Mislevy, Robert J.
1993-01-01
Multiple imputations for latent variables are constructed so that analyses treating them as true variables have the correct expectations for population characteristics. Analyzing multiple imputations in accordance with their construction yields correct estimates of population characteristics, whereas analyzing them as multiple indicators generally…
Measurement Models for Reasoned Action Theory.
Hennessy, Michael; Bleakley, Amy; Fishbein, Martin
2012-03-01
Quantitative researchers distinguish between causal and effect indicators. What are the analytic problems when both types of measures are present in a quantitative reasoned action analysis? To answer this question, we use data from a longitudinal study to estimate the association between two constructs central to reasoned action theory: behavioral beliefs and attitudes toward the behavior. The belief items are causal indicators that define a latent variable index while the attitude items are effect indicators that reflect the operation of a latent variable scale. We identify the issues when effect and causal indicators are present in a single analysis and conclude that both types of indicators can be incorporated in the analysis of data based on the reasoned action approach.
Application of latent variable model in Rosenberg self-esteem scale.
Leung, Shing-On; Wu, Hui-Ping
2013-01-01
Latent Variable Models (LVM) are applied to Rosenberg Self-Esteem Scale (RSES). Parameter estimations automatically give negative signs hence no recoding is necessary for negatively scored items. Bad items can be located through parameter estimate, item characteristic curves and other measures. Two factors are extracted with one on self-esteem and the other on the degree to take moderate views, with the later not often being covered in previous studies. A goodness-of-fit measure based on two-way margins is used but more works are needed. Results show that scaling provided by models with more formal statistical ground correlated highly with conventional method, which may provide justification for usual practice.
A Study of Effects of MultiCollinearity in the Multivariable Analysis
Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; (Peter) He, Qinghua; Lillard, James W.
2015-01-01
A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, efficiency of multivariable analysis highly depends on correlation structure among predictive variables. When the covariates in the model are not independent one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the most simple collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables. PMID:25664257
A Study of Effects of MultiCollinearity in the Multivariable Analysis.
Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; Peter He, Qinghua; Lillard, James W
2014-10-01
A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, efficiency of multivariable analysis highly depends on correlation structure among predictive variables. When the covariates in the model are not independent one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the most simple collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables.
Bringas, Maria L.; Zaldivar, Marilyn; Rojas, Pedro A.; Martinez-Montes, Karelia; Chongo, Dora M.; Ortega, Maria A.; Galvizu, Reynaldo; Perez, Alba E.; Morales, Lilia M.; Maragoto, Carlos; Vera, Hector; Galan, Lidice; Besson, Mireille; Valdes-Sosa, Pedro A.
2015-01-01
This study was a two-armed parallel group design aimed at testing real world effectiveness of a music therapy (MT) intervention for children with severe neurological disorders. The control group received only the standard neurorestoration program and the experimental group received an additional MT “Auditory Attention plus Communication protocol” just before the usual occupational and speech therapy. Multivariate Item Response Theory (MIRT) identified a neuropsychological status-latent variable manifested in all children and which exhibited highly significant changes only in the experimental group. Changes in brain plasticity also occurred in the experimental group, as evidenced using a Mismatch Event Related paradigm which revealed significant post intervention positive responses in the latency range between 308 and 400 ms in frontal regions. LORETA EEG source analysis identified prefrontal and midcingulate regions as differentially activated by the MT in the experimental group. Taken together, our results showing improved attention and communication as well as changes in brain plasticity in children with severe neurological impairments, confirm the importance of MT for the rehabilitation of patients across a wide range of dysfunctions. PMID:26582974
Text mining factor analysis (TFA) in green tea patent data
NASA Astrophysics Data System (ADS)
Rahmawati, Sela; Suprijadi, Jadi; Zulhanif
2017-03-01
Factor analysis has become one of the most widely used multivariate statistical procedures in applied research endeavors across a multitude of domains. There are two main types of analyses based on factor analysis: Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA). Both EFA and CFA aim to observed relationships among a group of indicators with a latent variable, but they differ fundamentally, a priori and restrictions made to the factor model. This method will be applied to patent data technology sector green tea to determine the development technology of green tea in the world. Patent analysis is useful in identifying the future technological trends in a specific field of technology. Database patent are obtained from agency European Patent Organization (EPO). In this paper, CFA model will be applied to the nominal data, which obtain from the presence absence matrix. While doing processing, analysis CFA for nominal data analysis was based on Tetrachoric matrix. Meanwhile, EFA model will be applied on a title from sector technology dominant. Title will be pre-processing first using text mining analysis.
NASA Astrophysics Data System (ADS)
Shahlan, M. Z.; Sidek, A. A.; Suffian, S. A.; Hazza, M. H. F. A.; Daud, M. R. C.
2018-01-01
In this paper, climate change and global warming are the biggest current issues in the industrial sectors. The green supply chain managements (GSCM) is one of the crucial input to these issues. Effective GSCM can potentially secure the organization’s competitive advantage and improve the environmental performance of the network activities. In this study, the aim is to investigate and examine how a small and medium enterprises (SMEs) stakeholder pressure and top management influence green supply chain management practices. The study is further advance green supply chain management research in Malaysia focusing on SMEs manufacturing sector using structural equation modelling. Structural equation modelling is a multivariate statistical analysis technique used to examine structural relationship. It is the combination of factor analysis and multi regression analysis and used to analyse structural relationship between measure variable and latent factor. This research found that top management support and stakeholder pressure is the major influence for SMEs to adopt green supply chain management. The research also found that top management is fully mediate with the relationship between stakeholder pressure and monitoring supplier environmental performance.
Domain-Invariant Partial-Least-Squares Regression.
Nikzad-Langerodi, Ramin; Zellinger, Werner; Lughofer, Edwin; Saminger-Platz, Susanne
2018-05-11
Multivariate calibration models often fail to extrapolate beyond the calibration samples because of changes associated with the instrumental response, environmental condition, or sample matrix. Most of the current methods used to adapt a source calibration model to a target domain exclusively apply to calibration transfer between similar analytical devices, while generic methods for calibration-model adaptation are largely missing. To fill this gap, we here introduce domain-invariant partial-least-squares (di-PLS) regression, which extends ordinary PLS by a domain regularizer in order to align the source and target distributions in the latent-variable space. We show that a domain-invariant weight vector can be derived in closed form, which allows the integration of (partially) labeled data from the source and target domains as well as entirely unlabeled data from the latter. We test our approach on a simulated data set where the aim is to desensitize a source calibration model to an unknown interfering agent in the target domain (i.e., unsupervised model adaptation). In addition, we demonstrate unsupervised, semisupervised, and supervised model adaptation by di-PLS on two real-world near-infrared (NIR) spectroscopic data sets.
Bringas, Maria L; Zaldivar, Marilyn; Rojas, Pedro A; Martinez-Montes, Karelia; Chongo, Dora M; Ortega, Maria A; Galvizu, Reynaldo; Perez, Alba E; Morales, Lilia M; Maragoto, Carlos; Vera, Hector; Galan, Lidice; Besson, Mireille; Valdes-Sosa, Pedro A
2015-01-01
This study was a two-armed parallel group design aimed at testing real world effectiveness of a music therapy (MT) intervention for children with severe neurological disorders. The control group received only the standard neurorestoration program and the experimental group received an additional MT "Auditory Attention plus Communication protocol" just before the usual occupational and speech therapy. Multivariate Item Response Theory (MIRT) identified a neuropsychological status-latent variable manifested in all children and which exhibited highly significant changes only in the experimental group. Changes in brain plasticity also occurred in the experimental group, as evidenced using a Mismatch Event Related paradigm which revealed significant post intervention positive responses in the latency range between 308 and 400 ms in frontal regions. LORETA EEG source analysis identified prefrontal and midcingulate regions as differentially activated by the MT in the experimental group. Taken together, our results showing improved attention and communication as well as changes in brain plasticity in children with severe neurological impairments, confirm the importance of MT for the rehabilitation of patients across a wide range of dysfunctions.
ERIC Educational Resources Information Center
Ockey, Gary
2011-01-01
Drawing on current theories in personality, second-language (L2) oral ability, and psychometrics, this study investigates the extent to which self-consciousness and assertiveness are explanatory variables of L2 oral ability. Three hundred sixty first-year Japanese university students who were studying English as a foreign language participated in…
ERIC Educational Resources Information Center
Bae, Sung Man
2015-01-01
We examined how perceived parenting style, friendship satisfaction, and academic motivation influence the addictive use of smartphones longitudinally. We utilized the panel data (from 2010-2012) of Korean children and youth panel survey of the National Youth Policy Institute. Data were collected from 2,376 individuals in the first year (boys:…
NASA Astrophysics Data System (ADS)
Cannon, Alex J.
2018-01-01
Most bias correction algorithms used in climatology, for example quantile mapping, are applied to univariate time series. They neglect the dependence between different variables. Those that are multivariate often correct only limited measures of joint dependence, such as Pearson or Spearman rank correlation. Here, an image processing technique designed to transfer colour information from one image to another—the N-dimensional probability density function transform—is adapted for use as a multivariate bias correction algorithm (MBCn) for climate model projections/predictions of multiple climate variables. MBCn is a multivariate generalization of quantile mapping that transfers all aspects of an observed continuous multivariate distribution to the corresponding multivariate distribution of variables from a climate model. When applied to climate model projections, changes in quantiles of each variable between the historical and projection period are also preserved. The MBCn algorithm is demonstrated on three case studies. First, the method is applied to an image processing example with characteristics that mimic a climate projection problem. Second, MBCn is used to correct a suite of 3-hourly surface meteorological variables from the Canadian Centre for Climate Modelling and Analysis Regional Climate Model (CanRCM4) across a North American domain. Components of the Canadian Forest Fire Weather Index (FWI) System, a complicated set of multivariate indices that characterizes the risk of wildfire, are then calculated and verified against observed values. Third, MBCn is used to correct biases in the spatial dependence structure of CanRCM4 precipitation fields. Results are compared against a univariate quantile mapping algorithm, which neglects the dependence between variables, and two multivariate bias correction algorithms, each of which corrects a different form of inter-variable correlation structure. MBCn outperforms these alternatives, often by a large margin, particularly for annual maxima of the FWI distribution and spatiotemporal autocorrelation of precipitation fields.
Weikert, Madeline; Motl, Robert W; Suh, Yoojin; McAuley, Edward; Wynn, Daniel
2010-03-15
Motion sensors such as accelerometers have been recognized as an ideal measure of physical activity in persons with MS. This study examined the hypothesis that accelerometer movement counts represent a measure of both physical activity and walking mobility in individuals with MS. The sample included 269 individuals with a definite diagnosis of relapsing-remitting MS who completed the Godin Leisure-Time Exercise Questionnaire (GLTEQ), International Physical Activity Questionnaire (IPAQ), Multiple Sclerosis Walking Scale-12 (MSWS-12), Patient Determined Disease Steps (PDDS), and then wore an ActiGraph accelerometer for 7days. The data were analyzed using bivariate correlation and confirmatory factor analysis. The results indicated that (a) the GLTEQ and IPAQ scores were strongly correlated and loaded significantly on a physical activity latent variable, (b) the MSWS-12 and PDDS scores strongly correlated and loaded significantly on a walking mobility latent variable, and (c) the accelerometer movement counts correlated similarly with the scores from the four self-report questionnaires and cross-loaded on both physical activity and walking mobility latent variables. Our data suggest that accelerometers are measuring both physical activity and walking mobility in persons with MS, whereas self-report instruments are measuring either physical activity or walking mobility in this population.
Hardin, Andrew
2017-09-01
In this issue, Bollen and Diamantopoulos (2017) defend causal-formative indicators against several common criticisms leveled by scholars who oppose their use. In doing so, the authors make several convincing assertions: Constructs exist independently from their measures; theory determines whether indicators cause or measure latent variables; and reflective and causal-formative indicators are both subject to interpretational confounding. However, despite being a well-reasoned, comprehensive defense of causal-formative indicators, no single article can address all of the issues associated with this debate. Thus, Bollen and Diamantopoulos leave a few fundamental issues unresolved. For example, how can researchers establish the reliability of indicators that may include measurement error? Moreover, how should researchers interpret disturbance terms that capture sources of influence related to both the empirical definition of the latent variable and to the theoretical definition of the construct? Relatedly, how should researchers reconcile the requirement for a census of causal-formative indicators with the knowledge that indicators are likely missing from the empirically estimated latent variable? This commentary develops 6 related research questions to draw attention to these fundamental issues, and to call for future research that can lead to the development of theory to guide the use of causal-formative indicators. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Comparing hierarchical models via the marginalized deviance information criterion.
Quintero, Adrian; Lesaffre, Emmanuel
2018-07-20
Hierarchical models are extensively used in pharmacokinetics and longitudinal studies. When the estimation is performed from a Bayesian approach, model comparison is often based on the deviance information criterion (DIC). In hierarchical models with latent variables, there are several versions of this statistic: the conditional DIC (cDIC) that incorporates the latent variables in the focus of the analysis and the marginalized DIC (mDIC) that integrates them out. Regardless of the asymptotic and coherency difficulties of cDIC, this alternative is usually used in Markov chain Monte Carlo (MCMC) methods for hierarchical models because of practical convenience. The mDIC criterion is more appropriate in most cases but requires integration of the likelihood, which is computationally demanding and not implemented in Bayesian software. Therefore, we consider a method to compute mDIC by generating replicate samples of the latent variables that need to be integrated out. This alternative can be easily conducted from the MCMC output of Bayesian packages and is widely applicable to hierarchical models in general. Additionally, we propose some approximations in order to reduce the computational complexity for large-sample situations. The method is illustrated with simulated data sets and 2 medical studies, evidencing that cDIC may be misleading whilst mDIC appears pertinent. Copyright © 2018 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Solimun, Fernandes, Adji Achmad Rinaldo; Arisoesilaningsih, Endang
2017-12-01
Research in various fields generally investigates systems and involves latent variables. One method to analyze the model representing the system is path analysis. The data of latent variables measured using questionnaires by applying attitude scale model yields data in the form of score, before analyzed should be transformation so that it becomes data of scale. Path coefficient, is parameter estimator, calculated from scale data using method of successive interval (MSI) and summated rating scale (SRS). In this research will be identifying which data transformation method is better. Path coefficients have smaller varieties are said to be more efficient. The transformation method that produces scaled data and used in path analysis capable of producing path coefficients (parameter estimators) with smaller varieties is said to be better. The result of analysis using real data shows that on the influence of Attitude variable to Intention Entrepreneurship, has relative efficiency (ER) = 1, where it shows that the result of analysis using data transformation of MSI and SRS as efficient. On the other hand, for simulation data, at high correlation between items (0.7-0.9), MSI method is more efficient 1.3 times better than SRS method.
Liggett, Jacqueline; Sellbom, Martin; Carmichael, Kieran L C
2017-12-01
The current study examined the extent to which the trait-based operationalization of obsessive-compulsive personality disorder (OCPD) in Section III of the DSM-5 describes the same construct as the one described in Section II. A community sample of 313 adults completed a series of personality inventories indexing the DSM-5 Sections II and III diagnostic criteria for OCPD, in addition to a measure of functional impairment modelled after the criteria in Section III. Results indicated that latent constructs representing Section II and Section III OCPD overlapped substantially (r = .75, p < .001). Hierarchical latent regression models revealed that at least three of the four DSM-5 Section III facets (Rigid Perfectionism, Perseveration, and Intimacy Avoidance) uniquely accounted for a large proportion of variance (53%) in a latent Section II OCPD variable. Further, Anxiousness and (low) Impulsivity, as well as self and interpersonal impairment, augmented the prediction of latent OCPD scores.
Estimating Interaction Effects With Incomplete Predictor Variables
Enders, Craig K.; Baraldi, Amanda N.; Cham, Heining
2014-01-01
The existing missing data literature does not provide a clear prescription for estimating interaction effects with missing data, particularly when the interaction involves a pair of continuous variables. In this article, we describe maximum likelihood and multiple imputation procedures for this common analysis problem. We outline 3 latent variable model specifications for interaction analyses with missing data. These models apply procedures from the latent variable interaction literature to analyses with a single indicator per construct (e.g., a regression analysis with scale scores). We also discuss multiple imputation for interaction effects, emphasizing an approach that applies standard imputation procedures to the product of 2 raw score predictors. We thoroughly describe the process of probing interaction effects with maximum likelihood and multiple imputation. For both missing data handling techniques, we outline centering and transformation strategies that researchers can implement in popular software packages, and we use a series of real data analyses to illustrate these methods. Finally, we use computer simulations to evaluate the performance of the proposed techniques. PMID:24707955
Structural identifiability of cyclic graphical models of biological networks with latent variables.
Wang, Yulin; Lu, Na; Miao, Hongyu
2016-06-13
Graphical models have long been used to describe biological networks for a variety of important tasks such as the determination of key biological parameters, and the structure of graphical model ultimately determines whether such unknown parameters can be unambiguously obtained from experimental observations (i.e., the identifiability problem). Limited by resources or technical capacities, complex biological networks are usually partially observed in experiment, which thus introduces latent variables into the corresponding graphical models. A number of previous studies have tackled the parameter identifiability problem for graphical models such as linear structural equation models (SEMs) with or without latent variables. However, the limited resolution and efficiency of existing approaches necessarily calls for further development of novel structural identifiability analysis algorithms. An efficient structural identifiability analysis algorithm is developed in this study for a broad range of network structures. The proposed method adopts the Wright's path coefficient method to generate identifiability equations in forms of symbolic polynomials, and then converts these symbolic equations to binary matrices (called identifiability matrix). Several matrix operations are introduced for identifiability matrix reduction with system equivalency maintained. Based on the reduced identifiability matrices, the structural identifiability of each parameter is determined. A number of benchmark models are used to verify the validity of the proposed approach. Finally, the network module for influenza A virus replication is employed as a real example to illustrate the application of the proposed approach in practice. The proposed approach can deal with cyclic networks with latent variables. The key advantage is that it intentionally avoids symbolic computation and is thus highly efficient. Also, this method is capable of determining the identifiability of each single parameter and is thus of higher resolution in comparison with many existing approaches. Overall, this study provides a basis for systematic examination and refinement of graphical models of biological networks from the identifiability point of view, and it has a significant potential to be extended to more complex network structures or high-dimensional systems.
Multivariate stochastic simulation with subjective multivariate normal distributions
P. J. Ince; J. Buongiorno
1991-01-01
In many applications of Monte Carlo simulation in forestry or forest products, it may be known that some variables are correlated. However, for simplicity, in most simulations it has been assumed that random variables are independently distributed. This report describes an alternative Monte Carlo simulation technique for subjectively assesed multivariate normal...
"L"-Bivariate and "L"-Multivariate Association Coefficients. Research Report. ETS RR-08-40
ERIC Educational Resources Information Center
Kong, Nan; Lewis, Charles
2008-01-01
Given a system of multiple random variables, a new measure called the "L"-multivariate association coefficient is defined using (conditional) entropy. Unlike traditional correlation measures, the L-multivariate association coefficient measures the multiassociations or multirelations among the multiple variables in the given system; that…
NASA Astrophysics Data System (ADS)
Leauthaud, Crystele; Cappelaere, Bernard; Demarty, Jérôme; Guichard, Françoise; Velluet, Cécile; Kergoat, Laurent; Vischel, Théo; Grippa, Manuela; Mouhaimouni, Mohammed; Bouzou Moussa, Ibrahim; Mainassara, Ibrahim; Sultan, Benjamin
2017-04-01
The Sahel has experienced strong climate variability in the past decades. Understanding its implications for natural and cultivated ecosystems is pivotal in a context of high population growth and mainly agriculture-based livelihoods. However, efforts to model processes at the land-atmosphere interface are hindered, particularly when the multi-decadal timescale is targeted, as climatic data are scarce, largely incomplete and often unreliable. This study presents the generation of a long-term, high-temporal resolution, multivariate local climatic data set for Niamey, Central Sahel. The continuous series spans the period 1950-2009 at a 30-min timescale and includes ground station-based meteorological variables (precipitation, air temperature, relative and specific humidity, air pressure, wind speed, downwelling long- and short-wave radiation) as well as process-modelled surface fluxes (upwelling long- and short-wave radiation,latent, sensible and soil heat fluxes and surface temperature). A combination of complementary techniques (linear/spline regressions, a multivariate analogue method, artificial neural networks and recursive gap filling) was used to reconstruct missing meteorological data. The complete surface energy budget was then obtained for two dominant land cover types, fallow bush and millet, by applying the meteorological forcing data set to a finely field-calibrated land surface model. Uncertainty in reconstructed data was expressed by means of a stochastic ensemble of plausible historical time series. Climatological statistics were computed at sub-daily to decadal timescales and compared with local, regional and global data sets such as CRU and ERA-Interim. The reconstructed precipitation statistics, ˜1°C increase in mean annual temperature from 1950 to 2009, and mean diurnal and annual cycles for all variables were in good agreement with previous studies. The new data set, denoted NAD (Niamey Airport-derived set) and publicly available, can be used to investigate the water and energy cycles in Central Sahel, while the methodology can be applied to reconstruct series at other stations. The study has been published in Int. J. Climatol. (2016), DOI: 10.1002/joc.4874
Zhu, Yanzhong; Song, Yonghui; Yu, Huibin; Liu, Ruixia; Liu, Lusan; Lv, Chunjian
2017-08-08
UV-visible absorption spectroscopy coupled with principal component analysis (PCA) and hierarchical cluster analysis (HCA) was applied to characterize spectroscopic components, detect latent factors, and investigate spatial variations of dissolved organic matter (DOM) in a large-scale lake. Twelve surface water samples were collected from Dongjianghu Lake in China. DOM contained lignin and quinine moieties, carboxylic acid, microbial products, and aromatic and alkyl groups, which in the northern part of the lake was largely different from the southern part. Fifteen spectroscopic indices were deduced from the absorption spectra to indicate molecular weight or humification degree of DOM. The northern part of the lake presented the smaller molecular weight or the lower humification degree of DOM than the southern part. E 2/4 , E 3/4 , E 2/3 , and S 2 were latent factors of characterizing the molecular weight of DOM, while E 2/5 , E 3/5 , E 2/6 , E 4/5 , E 3/6 , and A 2/1 were latent factors of evaluating the humification degree of DOM. The UV-visible absorption spectroscopy combined with PCA and HCA may not only characterize DOM fractions of lakes, but may be transferred to other types of waterscape.
LATENT SPACE MODELS FOR MULTIVIEW NETWORK DATA
Salter-Townshend, Michael; McCormick, Tyler H.
2018-01-01
Social relationships consist of interactions along multiple dimensions. In social networks, this means that individuals form multiple types of relationships with the same person (e.g., an individual will not trust all of his/her acquaintances). Statistical models for these data require understanding two related types of dependence structure: (i) structure within each relationship type, or network view, and (ii) the association between views. In this paper, we propose a statistical framework that parsimoniously represents dependence between relationship types while also maintaining enough flexibility to allow individuals to serve different roles in different relationship types. Our approach builds on work on latent space models for networks [see, e.g., J. Amer. Statist. Assoc. 97 (2002) 1090–1098]. These models represent the propensity for two individuals to form edges as conditionally independent given the distance between the individuals in an unobserved social space. Our work departs from previous work in this area by representing dependence structure between network views through a multivariate Bernoulli likelihood, providing a representation of between-view association. This approach infers correlations between views not explained by the latent space model. Using our method, we explore 6 multiview network structures across 75 villages in rural southern Karnataka, India [Banerjee et al. (2013)]. PMID:29721127
LATENT SPACE MODELS FOR MULTIVIEW NETWORK DATA.
Salter-Townshend, Michael; McCormick, Tyler H
2017-09-01
Social relationships consist of interactions along multiple dimensions. In social networks, this means that individuals form multiple types of relationships with the same person (e.g., an individual will not trust all of his/her acquaintances). Statistical models for these data require understanding two related types of dependence structure: (i) structure within each relationship type, or network view, and (ii) the association between views. In this paper, we propose a statistical framework that parsimoniously represents dependence between relationship types while also maintaining enough flexibility to allow individuals to serve different roles in different relationship types. Our approach builds on work on latent space models for networks [see, e.g., J. Amer. Statist. Assoc. 97 (2002) 1090-1098]. These models represent the propensity for two individuals to form edges as conditionally independent given the distance between the individuals in an unobserved social space. Our work departs from previous work in this area by representing dependence structure between network views through a multivariate Bernoulli likelihood, providing a representation of between-view association. This approach infers correlations between views not explained by the latent space model. Using our method, we explore 6 multiview network structures across 75 villages in rural southern Karnataka, India [Banerjee et al. (2013)].
Helle, Samuli
2018-03-01
Revealing causal effects from correlative data is very challenging and a contemporary problem in human life history research owing to the lack of experimental approach. Problems with causal inference arising from measurement error in independent variables, whether related either to inaccurate measurement technique or validity of measurements, seem not well-known in this field. The aim of this study is to show how structural equation modeling (SEM) with latent variables can be applied to account for measurement error in independent variables when the researcher has recorded several indicators of a hypothesized latent construct. As a simple example of this approach, measurement error in lifetime allocation of resources to reproduction in Finnish preindustrial women is modelled in the context of the survival cost of reproduction. In humans, lifetime energetic resources allocated in reproduction are almost impossible to quantify with precision and, thus, typically used measures of lifetime reproductive effort (e.g., lifetime reproductive success and parity) are likely to be plagued by measurement error. These results are contrasted with those obtained from a traditional regression approach where the single best proxy of lifetime reproductive effort available in the data is used for inference. As expected, the inability to account for measurement error in women's lifetime reproductive effort resulted in the underestimation of its underlying effect size on post-reproductive survival. This article emphasizes the advantages that the SEM framework can provide in handling measurement error via multiple-indicator latent variables in human life history studies. © 2017 Wiley Periodicals, Inc.
Physician communication in the operating room.
Kirschbaum, Kristin A; Rask, John P; Fortner, Sally A; Kulesher, Robert; Nelson, Michael T; Yen, Tony; Brennan, Matthew
2015-01-01
In this study, communication research was conducted with multidisciplinary groups of operating-room physicians. Theoretical frameworks from intercultural communication and rhetoric were used to (a) measure latent cultural communication variables and (b) conduct communication training with the physicians. A six-step protocol guided the research with teams of physicians from different surgical specialties: anesthesiologists, general surgeons, and obstetrician-gynecologists (n = 85). Latent cultural communication variables were measured by surveys administered to physicians before and after completion of the protocol. The centerpiece of the 2-hour research protocol was an instructional session that informed the surgical physicians about rhetorical choices that support participatory communication. Post-training results demonstrated scores increased on communication variables that contribute to collaborative communication and teamwork among the physicians. This study expands health communication research through application of combined intercultural and rhetorical frameworks, and establishes new ways communication theory can contribute to medical education.
Jung, Kwanghee; Takane, Yoshio; Hwang, Heungsun; Woodward, Todd S
2016-06-01
We extend dynamic generalized structured component analysis (GSCA) to enhance its data-analytic capability in structural equation modeling of multi-subject time series data. Time series data of multiple subjects are typically hierarchically structured, where time points are nested within subjects who are in turn nested within a group. The proposed approach, named multilevel dynamic GSCA, accommodates the nested structure in time series data. Explicitly taking the nested structure into account, the proposed method allows investigating subject-wise variability of the loadings and path coefficients by looking at the variance estimates of the corresponding random effects, as well as fixed loadings between observed and latent variables and fixed path coefficients between latent variables. We demonstrate the effectiveness of the proposed approach by applying the method to the multi-subject functional neuroimaging data for brain connectivity analysis, where time series data-level measurements are nested within subjects.
Molgaard Nielsen, Anne; Hestbaek, Lise; Vach, Werner; Kent, Peter; Kongsted, Alice
2017-08-09
Heterogeneity in patients with low back pain is well recognised and different approaches to subgrouping have been proposed. One statistical technique that is increasingly being used is Latent Class Analysis as it performs subgrouping based on pattern recognition with high accuracy. Previously, we developed two novel suggestions for subgrouping patients with low back pain based on Latent Class Analysis of patient baseline characteristics (patient history and physical examination), which resulted in 7 subgroups when using a single-stage analysis, and 9 subgroups when using a two-stage approach. However, their prognostic capacity was unexplored. This study (i) determined whether the subgrouping approaches were associated with the future outcomes of pain intensity, pain frequency and disability, (ii) assessed whether one of these two approaches was more strongly or more consistently associated with these outcomes, and (iii) assessed the performance of the novel subgroupings as compared to the following variables: two existing subgrouping tools (STarT Back Tool and Quebec Task Force classification), four baseline characteristics and a group of previously identified domain-specific patient categorisations (collectively, the 'comparator variables'). This was a longitudinal cohort study of 928 patients consulting for low back pain in primary care. The associations between each subgroup approach and outcomes at 2 weeks, 3 and 12 months, and with weekly SMS responses were tested in linear regression models, and their prognostic capacity (variance explained) was compared to that of the comparator variables listed above. The two previously identified subgroupings were similarly associated with all outcomes. The prognostic capacity of both subgroupings was better than that of the comparator variables, except for participants' recovery beliefs and the domain-specific categorisations, but was still limited. The explained variance ranged from 4.3%-6.9% for pain intensity and from 6.8%-20.3% for disability, and highest at the 2 weeks follow-up. Latent Class-derived subgroups provided additional prognostic information when compared to a range of variables, but the improvements were not substantial enough to warrant further development into a new prognostic tool. Further research could investigate if these novel subgrouping approaches may help to improve existing tools that subgroup low back pain patients.
Robust Measurement via A Fused Latent and Graphical Item Response Theory Model.
Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen; Ying, Zhiliang
2018-03-12
Item response theory (IRT) plays an important role in psychological and educational measurement. Unlike the classical testing theory, IRT models aggregate the item level information, yielding more accurate measurements. Most IRT models assume local independence, an assumption not likely to be satisfied in practice, especially when the number of items is large. Results in the literature and simulation studies in this paper reveal that misspecifying the local independence assumption may result in inaccurate measurements and differential item functioning. To provide more robust measurements, we propose an integrated approach by adding a graphical component to a multidimensional IRT model that can offset the effect of unknown local dependence. The new model contains a confirmatory latent variable component, which measures the targeted latent traits, and a graphical component, which captures the local dependence. An efficient proximal algorithm is proposed for the parameter estimation and structure learning of the local dependence. This approach can substantially improve the measurement, given no prior information on the local dependence structure. The model can be applied to measure both a unidimensional latent trait and multidimensional latent traits.
Cook, Thomas B; Brenner, Lisa A; Cloninger, C Robert; Langenberg, Patricia; Igbide, Ajirioghene; Giegling, Ina; Hartmann, Annette M; Konte, Bettina; Friedl, Marion; Brundin, Lena; Groer, Maureen W; Can, Adem; Rujescu, Dan; Postolache, Teodor T
2015-01-01
Latent chronic infection with Toxoplasma gondii (T. gondii), a common neurotropic pathogen, has been previously linked with suicidal self-directed violence (SSDV). We sought to determine if latent infection with T. gondii is associated with trait aggression and impulsivity, intermediate phenotypes for suicidal behavior, in psychiatrically healthy adults. Traits of aggression and impulsivity were analyzed in relationship to IgG antibody seropositivity for T. gondii and two other latent neurotropic infections, herpes simplex virus 1 (HSV1) and cytomegalovirus (CMV). One thousand community-residing adults residing in the Munich metropolitan area with no Axis I or II conditions by SCID for DSM-IV (510 men, 490 women, mean age 53.6 ± 15.8, range 20-74). Plasma samples were tested for IgG antibodies to T. gondii, HSV-1 and CMV by ELISA. Self-reported ratings of trait aggression scores (Questionnaire for Measuring Factors of Aggression [FAF]) and trait impulsivity (Sensation-Seeking Scale-V [SSS-V]) were analyzed using linear multivariate methods. T. gondii IgG seropositivity was significantly associated with higher trait reactive aggression scores among women (p < .01), but not among men. T. gondii-positivity was also associated with higher impulsive sensation-seeking (SSS-V Disinhibition) among younger men (p < .01) aged 20-59 years old (median age = 60). All associations with HSV-1 and CMV were not significant. Aggression and impulsivity, personality traits considered as endophenotypes for SSDV, are associated with latent T. gondii infection in a gender and age-specific manner, and could be further investigated as prognostic and treatment targets in T. gondii-positive individuals at risk for SSDV. Published by Elsevier Ltd.
Parametric Cost Models for Space Telescopes
NASA Technical Reports Server (NTRS)
Stahl, H. Philip
2010-01-01
A study is in-process to develop a multivariable parametric cost model for space telescopes. Cost and engineering parametric data has been collected on 30 different space telescopes. Statistical correlations have been developed between 19 variables of 59 variables sampled. Single Variable and Multi-Variable Cost Estimating Relationships have been developed. Results are being published.
Preliminary Multi-Variable Parametric Cost Model for Space Telescopes
NASA Technical Reports Server (NTRS)
Stahl, H. Philip; Hendrichs, Todd
2010-01-01
This slide presentation reviews creating a preliminary multi-variable cost model for the contract costs of making a space telescope. There is discussion of the methodology for collecting the data, definition of the statistical analysis methodology, single variable model results, testing of historical models and an introduction of the multi variable models.
The choice of product indicators in latent variable interaction models: post hoc analyses.
Foldnes, Njål; Hagtvet, Knut Arne
2014-09-01
The unconstrained product indicator (PI) approach is a simple and popular approach for modeling nonlinear effects among latent variables. This approach leaves the practitioner to choose the PIs to be included in the model, introducing arbitrariness into the modeling. In contrast to previous Monte Carlo studies, we evaluated the PI approach by 3 post hoc analyses applied to a real-world case adopted from a research effort in social psychology. The measurement design applied 3 and 4 indicators for the 2 latent 1st-order variables, leaving the researcher with a choice among more than 4,000 possible PI configurations. Sixty so-called matched-pair configurations that have been recommended in previous literature are of special interest. In the 1st post hoc analysis we estimated the interaction effect for all PI configurations, keeping the real-world sample fixed. The estimated interaction effect was substantially affected by the choice of PIs, also across matched-pair configurations. Subsequently, a post hoc Monte Carlo study was conducted, with varying sample sizes and data distributions. Convergence, bias, Type I error and power of the interaction test were investigated for each matched-pair configuration and the all-pairs configuration. Variation in estimates across matched-pair configurations for a typical sample was substantial. The choice of specific configuration significantly affected convergence and the interaction test's outcome. The all-pairs configuration performed overall better than the matched-pair configurations. A further advantage of the all-pairs over the matched-pairs approach is its unambiguity. The final study evaluates the all-pairs configuration for small sample sizes and compares it to the non-PI approach of latent moderated structural equations. PsycINFO Database Record (c) 2014 APA, all rights reserved.
Bowden, Stephen C; Lissner, Dianne; McCarthy, Kerri A L; Weiss, Lawrence G; Holdnack, James A
2007-10-01
Equivalence of the psychological model underlying Wechsler Adult Intelligence Scale-Third Edition (WAIS-III) scores obtained in the United States and Australia was examined in this study. Examination of metric invariance involves testing the hypothesis that all components of the measurement model relating observed scores to latent variables are numerically equal in different samples. The assumption of metric invariance is necessary for interpretation of scores derived from research studies that seek to generalize patterns of convergent and divergent validity and patterns of deficit or disability. An Australian community volunteer sample was compared to the US standardization data. A pattern of strict metric invariance was observed across samples. In addition, when the effects of different demographic characteristics of the US and Australian samples were included, structural parameters reflecting values of the latent cognitive variables were found not to differ. These results provide important evidence for the equivalence of measurement of core cognitive abilities with the WAIS-III and suggest that latent cognitive abilities in the US and Australia do not differ.
Discriminative latent models for recognizing contextual group activities.
Lan, Tian; Wang, Yang; Yang, Weilong; Robinovitch, Stephen N; Mori, Greg
2012-08-01
In this paper, we go beyond recognizing the actions of individuals and focus on group activities. This is motivated from the observation that human actions are rarely performed in isolation; the contextual information of what other people in the scene are doing provides a useful cue for understanding high-level activities. We propose a novel framework for recognizing group activities which jointly captures the group activity, the individual person actions, and the interactions among them. Two types of contextual information, group-person interaction and person-person interaction, are explored in a latent variable framework. In particular, we propose three different approaches to model the person-person interaction. One approach is to explore the structures of person-person interaction. Differently from most of the previous latent structured models, which assume a predefined structure for the hidden layer, e.g., a tree structure, we treat the structure of the hidden layer as a latent variable and implicitly infer it during learning and inference. The second approach explores person-person interaction in the feature level. We introduce a new feature representation called the action context (AC) descriptor. The AC descriptor encodes information about not only the action of an individual person in the video, but also the behavior of other people nearby. The third approach combines the above two. Our experimental results demonstrate the benefit of using contextual information for disambiguating group activities.
Discriminative Latent Models for Recognizing Contextual Group Activities
Lan, Tian; Wang, Yang; Yang, Weilong; Robinovitch, Stephen N.; Mori, Greg
2012-01-01
In this paper, we go beyond recognizing the actions of individuals and focus on group activities. This is motivated from the observation that human actions are rarely performed in isolation; the contextual information of what other people in the scene are doing provides a useful cue for understanding high-level activities. We propose a novel framework for recognizing group activities which jointly captures the group activity, the individual person actions, and the interactions among them. Two types of contextual information, group-person interaction and person-person interaction, are explored in a latent variable framework. In particular, we propose three different approaches to model the person-person interaction. One approach is to explore the structures of person-person interaction. Differently from most of the previous latent structured models, which assume a predefined structure for the hidden layer, e.g., a tree structure, we treat the structure of the hidden layer as a latent variable and implicitly infer it during learning and inference. The second approach explores person-person interaction in the feature level. We introduce a new feature representation called the action context (AC) descriptor. The AC descriptor encodes information about not only the action of an individual person in the video, but also the behavior of other people nearby. The third approach combines the above two. Our experimental results demonstrate the benefit of using contextual information for disambiguating group activities. PMID:22144516
Olatunji, Bunmi O; Ebesutani, Chad; Kim, Jingu; Riemann, Bradley C; Jacobi, David M
2017-04-15
Although studies have linked disgust proneness to the etiology and maintenance of obsessive-compulsive disorder (OCD) in adults, there remains a paucity of research examining the specificity of this association among youth. The present study employed structural equation modeling to examine the association between disgust proneness, negative affect, and OCD symptom severity in a clinical sample of youth admitted to a residential treatment facility (N =471). Results indicate that disgust proneness and negative affect latent factors independently predicted an OCD symptom severity latent factor. However, when both variables were modeled as predictors simultaneously, latent disgust proneness remained significantly associated with OCD symptom severity, whereas the association between latent negative affect and OCD symptom severity became nonsignificant. Tests of mediation converged in support of disgust proneness as a significant intervening variable between negative affect and OCD symptom severity. Subsequent analysis showed that the path from disgust proneness to OCD symptom severity in the structural model was significantly stronger among those without a primary diagnosis of OCD compared to those with a primary diagnosis of OCD. Given the cross-sectional design, the causal inferences that can be made are limited. The present study is also limited by the exclusive reliance on self-report measures. Disgust proneness may play a uniquely important role in OCD among youth. Copyright © 2017 Elsevier B.V. All rights reserved.
Biostatistics Series Module 10: Brief Overview of Multivariate Methods.
Hazra, Avijit; Gogtay, Nithya
2017-01-01
Multivariate analysis refers to statistical techniques that simultaneously look at three or more variables in relation to the subjects under investigation with the aim of identifying or clarifying the relationships between them. These techniques have been broadly classified as dependence techniques, which explore the relationship between one or more dependent variables and their independent predictors, and interdependence techniques, that make no such distinction but treat all variables equally in a search for underlying relationships. Multiple linear regression models a situation where a single numerical dependent variable is to be predicted from multiple numerical independent variables. Logistic regression is used when the outcome variable is dichotomous in nature. The log-linear technique models count type of data and can be used to analyze cross-tabulations where more than two variables are included. Analysis of covariance is an extension of analysis of variance (ANOVA), in which an additional independent variable of interest, the covariate, is brought into the analysis. It tries to examine whether a difference persists after "controlling" for the effect of the covariate that can impact the numerical dependent variable of interest. Multivariate analysis of variance (MANOVA) is a multivariate extension of ANOVA used when multiple numerical dependent variables have to be incorporated in the analysis. Interdependence techniques are more commonly applied to psychometrics, social sciences and market research. Exploratory factor analysis and principal component analysis are related techniques that seek to extract from a larger number of metric variables, a smaller number of composite factors or components, which are linearly related to the original variables. Cluster analysis aims to identify, in a large number of cases, relatively homogeneous groups called clusters, without prior information about the groups. The calculation intensive nature of multivariate analysis has so far precluded most researchers from using these techniques routinely. The situation is now changing with wider availability, and increasing sophistication of statistical software and researchers should no longer shy away from exploring the applications of multivariate methods to real-life data sets.
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2015-01-01
A latent variable modeling procedure that can be used to evaluate intraclass correlation coefficients in two-level settings with discrete response variables is discussed. The approach is readily applied when the purpose is to furnish confidence intervals at prespecified confidence levels for these coefficients in setups with binary or ordinal…
ERIC Educational Resources Information Center
Teachman, Jay D.
1995-01-01
Argues that data on siblings provide a way to account for the impact of unmeasured, omitted variables on relationships of interest because families form a sort of natural experiment, with similar experiences and common genetic heritage. Proposes a latent-variable structural equation approach to the problem, which provides estimates of both within-…
Yasuda, Akihito; Onuki, Yoshinori; Obata, Yasuko; Yamamoto, Rie; Takayama, Kozo
2013-01-01
The "quality by design" concept in pharmaceutical formulation development requires the establishment of a science-based rationale and a design space. We integrated thin-plate spline (TPS) interpolation and Kohonen's self-organizing map (SOM) to visualize the latent structure underlying causal factors and pharmaceutical responses. As a model pharmaceutical product, theophylline tablets were prepared based on a standard formulation. The tensile strength, disintegration time, and stability of these variables were measured as response variables. These responses were predicted quantitatively based on nonlinear TPS. A large amount of data on these tablets was generated and classified into several clusters using an SOM. The experimental values of the responses were predicted with high accuracy, and the data generated for the tablets were classified into several distinct clusters. The SOM feature map allowed us to analyze the global and local correlations between causal factors and tablet characteristics. The results of this study suggest that increasing the proportion of microcrystalline cellulose (MCC) improved the tensile strength and the stability of tensile strength of these theophylline tablets. In addition, the proportion of MCC has an optimum value for disintegration time and stability of disintegration. Increasing the proportion of magnesium stearate extended disintegration time. Increasing the compression force improved tensile strength, but degraded the stability of disintegration. This technique provides a better understanding of the relationships between causal factors and pharmaceutical responses in theophylline tablet formulations.
Dorland, Heleen F; Abma, Femke I; Roelen, Corné A M; Stewart, Roy E; Amick, Benjamin C; Ranchor, Adelita V; Bültmann, Ute
2017-11-01
More than 60% of cancer patients are able to work after cancer diagnosis. However, little is known about their functioning at work. Therefore, the aims of this study were to (1) identify work functioning trajectories in the year following return to work (RTW) in cancer patients and (2) examine baseline sociodemographic, health-related and work-related variables associated with work functioning trajectories. This longitudinal cohort study included 384 cancer patients who have returned to work after cancer diagnosis. Work functioning was measured at baseline, 3, 6, 9 and 12 months follow-up. Latent class growth modeling (LCGM) was used to identify work functioning trajectories. Associations of baseline variables with work functioning trajectories were examined using univariate and multivariate analyses. LCGM analyses with cancer patients who completed on at least three time points the Work Role Functioning Questionnaire (n = 324) identified three work functioning trajectories: "persistently high" (16% of the sample), "moderate to high" (54%) and "persistently low" work functioning (32%). Cancer patients with persistently high work functioning had less time between diagnosis and RTW and had less often a changed meaning of work, while cancer patients with persistently low work functioning reported more baseline cognitive symptoms compared to cancer patients in the other trajectories. This knowledge has implications for cancer care and guidance of cancer patients at work. © 2017 UICC.
Measurement Models for Reasoned Action Theory
Hennessy, Michael; Bleakley, Amy; Fishbein, Martin
2012-01-01
Quantitative researchers distinguish between causal and effect indicators. What are the analytic problems when both types of measures are present in a quantitative reasoned action analysis? To answer this question, we use data from a longitudinal study to estimate the association between two constructs central to reasoned action theory: behavioral beliefs and attitudes toward the behavior. The belief items are causal indicators that define a latent variable index while the attitude items are effect indicators that reflect the operation of a latent variable scale. We identify the issues when effect and causal indicators are present in a single analysis and conclude that both types of indicators can be incorporated in the analysis of data based on the reasoned action approach. PMID:23243315
Which kind of psychometrics is adequate for patient satisfaction questionnaires?
Konerding, Uwe
2016-01-01
The construction and psychometric analysis of patient satisfaction questionnaires are discussed. The discussion is based upon the classification of multi-item questionnaires into scales or indices. Scales consist of items that describe the effects of the latent psychological variable to be measured, and indices consist of items that describe the causes of this variable. Whether patient satisfaction questionnaires should be constructed and analyzed as scales or as indices depends upon the purpose for which these questionnaires are required. If the final aim is improving care with regard to patients' preferences, then these questionnaires should be constructed and analyzed as indices. This implies two requirements: 1) items for patient satisfaction questionnaires should be selected in such a way that the universe of possible causes of patient satisfaction is covered optimally and 2) Cronbach's alpha, principal component analysis, exploratory factor analysis, confirmatory factor analysis, and analyses with models from item response theory, such as the Rasch Model, should not be applied for psychometric analyses. Instead, multivariate regression analyses with a direct rating of patient satisfaction as the dependent variable and the individual questionnaire items as independent variables should be performed. The coefficients produced by such an analysis can be applied for selecting the best items and for weighting the selected items when a sum score is determined. The lower boundaries of the validity of the unweighted and the weighted sum scores can be estimated by their correlations with the direct satisfaction rating. While the first requirement is fulfilled in the majority of the previous patient satisfaction questionnaires, the second one deviates from previous practice. Hence, if patient satisfaction is actually measured with the final aim of improving care with regard to patients' preferences, then future practice should be changed so that the second requirement is also fulfilled.
Yousefzadeh, Gholamreza; Gozashti, Mohammadhossein; Najafipour, Hamid; Gholamhosseinian, Najar Ahmad; Bahramnejad, Abbas; Shokouhi, Mostafa
2016-01-01
Latent autoimmune diabetes in adults (LADA) is autoimmune diabetes with a slow progression characterized by the presence of antibodies associated with Type I diabetes. The present study aimed to assess autoimmune characteristics in patients with LADA in Iran. We attempted to obtain a clear view of autoimmune conditions in LADA among our population. This study was sourced from the population-based survey of KERCARDS aiming assessment of cardiovascular risk factors among a great sample of Iranian population who were resident in Kerman, a great province in southern Iran. Among all diabetic patients who were negative for Anti Glutamic Acid Decarboxylase (GAD) antibody test, 120 were selected as the controls and among 80 patients who were positive for this test diagnosed as LADA, the recorded files of 57 patients were complete considered as the cases. The level of thyroxin is significantly lower in patients with LADA compared with the controls so 73.7% and 45% of patients had normal level of thyroxin, respectively. Also, those with LADA had considerably lower levels of both thyroid peroxydaseantibody (TPO-Ab) and C-peptide when compared with non-LADA group. Using multivariate analyses and with the presence of baseline variables including gender, age, and duration of disease, the diagnosis of LADA was associated with lower serum levels of Anti-TPO, C-peptide, and thyroxin, but not associated with the level of Anti-TTG in serum. LADA patients may face with lower serum levels of C-peptide and thyroid-specific antibodies indicating insulin therapy requirement and authoimmune fundaments of the disease, respectively. Copyright © 2016. Published by Elsevier Ltd.
Latino cigarette smoking patterns by gender in a US national sample
Kristman-Valente, Allison; Flaherty, Brian P.
2015-01-01
Background Latino smokers are a rising public health concern who experience elevated tobacco related health disparities. Purpose Additional information on Latino smoking is needed to inform screening and treatment. Analysis Latent class analysis using smoking frequency, cigarette preferences, onset, smoking duration, cigarettes per day and minutes to first cigarette were used to create multivariate latent smoking profiles for Latino men and women. Results Final models found seven classes for Latinas and nine classes for Latinos. Despite a common finding in the literature that Latino smokers are more likely to be low-risk, intermittent smokers, the majority of classes, for both males and females, described patterns of high-risk, daily smoking. Gender variations in smoking classes were noted. Conclusions Several markers of smoking risk were identified among both male and female Latino smokers including long durations of smoking, daily smoking and preference for specialty cigarettes, all factors associated with long-term health consequences. PMID:26304857
A Two-Step Approach to Analyze Satisfaction Data
ERIC Educational Resources Information Center
Ferrari, Pier Alda; Pagani, Laura; Fiorio, Carlo V.
2011-01-01
In this paper a two-step procedure based on Nonlinear Principal Component Analysis (NLPCA) and Multilevel models (MLM) for the analysis of satisfaction data is proposed. The basic hypothesis is that observed ordinal variables describe different aspects of a latent continuous variable, which depends on covariates connected with individual and…
Factorial versus Typological Models: A Comparison of Methods for Personality Data
ERIC Educational Resources Information Center
von Davier, Matthias; Naemi, Bobby; Roberts, Richard D.
2012-01-01
This article describes an exploration of the distinction between typological and factorial latent variables in the domain of personality theory. Traditionally, many personality variables have been considered to be factorial in nature, even though there are examples of typological constructs dating back to Hippocrates. Recently, some…
A Latent-Variable Causal Model of Faculty Reputational Ratings.
ERIC Educational Resources Information Center
King, Suzanne; Wolfle, Lee M.
A reanalysis was conducted of Saunier's research (1985) on sources of variation in the National Research Council (NRC) reputational ratings of university faculty. Saunier conducted a stepwise regression analysis using 12 predictor variables. Due to problems with multicollinearity and because of the atheoretical nature of stepwise regression,…
Undergraduate Nurse Variables that Predict Academic Achievement and Clinical Competence in Nursing
ERIC Educational Resources Information Center
Blackman, Ian; Hall, Margaret; Darmawan, I Gusti Ngurah.
2007-01-01
A hypothetical model was formulated to explore factors that influenced academic and clinical achievement for undergraduate nursing students. Sixteen latent variables were considered including the students' background, gender, type of first language, age, their previous successes with their undergraduate nursing studies and status given for…
Paths to tobacco abstinence: A repeated-measures latent class analysis.
McCarthy, Danielle E; Ebssa, Lemma; Witkiewitz, Katie; Shiffman, Saul
2015-08-01
Knowledge of smoking change processes may be enhanced by identifying pathways to stable abstinence. We sought to identify latent classes of smokers based on their day-to-day smoking status in the first weeks of a cessation attempt. We examined treatment effects on class membership and compared classes on baseline individual differences and 6-month abstinence rates. In this secondary analysis of a double-blind randomized placebo-controlled clinical trial (N = 1,433) of 5 smoking cessation pharmacotherapies (nicotine patch, nicotine lozenge, bupropion SR, patch and lozenge, or bupropion SR and lozenge), we conducted repeated-measures latent class analysis of daily smoking status (any smoking vs. none) for the first 27 days of a quit attempt. Treatment and covariate relations with latent class membership were examined. Distal outcome analysis compared confirmed 6-month abstinence rates among the latent classes. A 5-class solution was selected. Three-quarters of smokers were in stable smoking or abstinent classes, but 25% were in classes with unstable abstinence probabilities over time. Active treatment (compared to placebo), and particularly the patch and lozenge combination, promoted early quitting. Latent classes differed in 6-month abstinence rates and on several baseline variables, including nicotine dependence, quitting history, self-efficacy, sleep disturbance, and minority status. Repeated-measures latent class analysis identified latent classes of smoking change patterns affected by treatment, related to known risk factors, and predictive of distal outcomes. Tracking behavior early in a change attempt may identify prognostic patterns of change and facilitate adaptive treatment planning. (c) 2015 APA, all rights reserved).
Vacca, G M; Paschino, P; Dettori, M L; Bergamaschi, M; Cipolat-Gotet, C; Bittante, G; Pazzola, M
2016-09-01
Dairy goat farming is practiced worldwide, within a range of different farming systems. Here we investigated the effects of environmental factors and morphology on milk traits of the Sardinian goat population. Sardinian goats are currently reared in Sardinia (Italy) in a low-input context, similar to many goat farming systems, especially in developing countries. Milk and morphological traits from 1,050 Sardinian goats from 42 farms were recorded. We observed a high variability regarding morphological traits, such as coat color, ear length and direction, horn presence, and udder shape. Such variability derived partly from the unplanned repeated crossbreeding of the native Sardinian goats with exotic breeds, especially Maltese goats. The farms located in the mountains were characterized by the traditional farming system and the lowest percentage of crossbred goats. Explanatory factors analysis was used to summarize the interrelated measured milk variables. The explanatory factor related to fat, protein, and energy content of milk (the "Quality" latent variable) explained about 30% of the variance of the whole data set of measured milk traits followed by the "Hygiene" (19%), "Production" (19%), and "Acidity" (11%) factors. The "Quality" and "Hygiene" factors were not affected by any of the farm classification items, whereas "Production" and "Acidity" were affected only by altitude and size of herds, respectively, indicating the adaptation of the local goat population to different environmental conditions. The use of latent explanatory factor analysis allowed us to clearly explain the large variability of milk traits, revealing that the Sardinian goat population cannot be divided into subpopulations based on milk attitude The factors, properly integrated with genetic data, may be useful tools in future selection programs.
Individual Differences in Childhood Sleep Problems Predict Later Cognitive Executive Control
Friedman, Naomi P.; Corley, Robin P.; Hewitt, John K.; Wright, Kenneth P.
2009-01-01
Study Objective: To determine whether individual differences in developmental patterns of general sleep problems are associated with 3 executive function abilities—inhibiting, updating working memory, and task shifting—in late adolescence. Participants: 916 twins (465 female, 451 male) and parents from the Colorado Longitudinal Twin Study. Measurements and Results: Parents reported their children's sleep problems at ages 4 years, 5 y, 7 y, and 9–16 y based on a 7-item scale from the Child-Behavior Checklist; a subset of children (n = 568) completed laboratory assessments of executive functions at age 17. Latent variable growth curve analyses were used to model individual differences in longitudinal trajectories of childhood sleep problems. Sleep problems declined over time, with ~70% of children having ≥ 1 problem at age 4 and ~33% of children at age 16. However, significant individual differences in both the initial levels of problems (intercept) and changes across time (slope) were observed. When executive function latent variables were added to the model, the intercept did not significantly correlate with the later executive function latent variables; however, the slope variable significantly (P < 0.05) negatively correlated with inhibiting (r = −0.27) and updating (r = −0.21), but not shifting (r = −0.10) abilities. Further analyses suggested that the slope variable predicted the variance common to the 3 executive functions (r = −0.29). Conclusions: Early levels of sleep problems do not seem to have appreciable implications for later executive functioning. However, individuals whose sleep problems decrease more across time show better general executive control in late adolescence. Citation: Friedman NP; Corley RP; Hewitt JK; Wright KP. Individual differences in childhood sleep problems predict later cognitive executive control. SLEEP 2009;32(3):323-333. PMID:19294952
Donnellan, M Brent; Kenny, David A; Trzesniewski, Kali H; Lucas, Richard E; Conger, Rand D
2012-12-01
The present research used a latent variable trait-state model to evaluate the longitudinal consistency of self-esteem during the transition from adolescence to adulthood. Analyses were based on ten administrations of the Rosenberg Self-Esteem scale (Rosenberg, 1965) spanning the ages of approximately 13 to 32 for a sample of 451 participants. Results indicated that a completely stable trait factor and an autoregressive trait factor accounted for the majority of the variance in latent self-esteem assessments, whereas state factors accounted for about 16% of the variance in repeated assessments of latent self-esteem. The stability of individual differences in self-esteem increased with age consistent with the cumulative continuity principle of personality development.
Donnellan, M. Brent; Kenny, David A.; Trzesniewski, Kali H.; Lucas, Richard E.; Conger, Rand D.
2012-01-01
The present research used a latent variable trait-state model to evaluate the longitudinal consistency of self-esteem during the transition from adolescence to adulthood. Analyses were based on ten administrations of the Rosenberg Self-Esteem scale (Rosenberg, 1965) spanning the ages of approximately 13 to 32 for a sample of 451 participants. Results indicated that a completely stable trait factor and an autoregressive trait factor accounted for the majority of the variance in latent self-esteem assessments, whereas state factors accounted for about 16% of the variance in repeated assessments of latent self-esteem. The stability of individual differences in self-esteem increased with age consistent with the cumulative continuity principle of personality development. PMID:23180899
Symptom Cluster Research With Biomarkers and Genetics Using Latent Class Analysis.
Conley, Samantha
2017-12-01
The purpose of this article is to provide an overview of latent class analysis (LCA) and examples from symptom cluster research that includes biomarkers and genetics. A review of LCA with genetics and biomarkers was conducted using Medline, Embase, PubMed, and Google Scholar. LCA is a robust latent variable model used to cluster categorical data and allows for the determination of empirically determined symptom clusters. Researchers should consider using LCA to link empirically determined symptom clusters to biomarkers and genetics to better understand the underlying etiology of symptom clusters. The full potential of LCA in symptom cluster research has not yet been realized because it has been used in limited populations, and researchers have explored limited biologic pathways.
On measures of association among genetic variables
Gianola, Daniel; Manfredi, Eduardo; Simianer, Henner
2012-01-01
Summary Systems involving many variables are important in population and quantitative genetics, for example, in multi-trait prediction of breeding values and in exploration of multi-locus associations. We studied departures of the joint distribution of sets of genetic variables from independence. New measures of association based on notions of statistical distance between distributions are presented. These are more general than correlations, which are pairwise measures, and lack a clear interpretation beyond the bivariate normal distribution. Our measures are based on logarithmic (Kullback-Leibler) and on relative ‘distances’ between distributions. Indexes of association are developed and illustrated for quantitative genetics settings in which the joint distribution of the variables is either multivariate normal or multivariate-t, and we show how the indexes can be used to study linkage disequilibrium in a two-locus system with multiple alleles and present applications to systems of correlated beta distributions. Two multivariate beta and multivariate beta-binomial processes are examined, and new distributions are introduced: the GMS-Sarmanov multivariate beta and its beta-binomial counterpart. PMID:22742500
Application of two tests of multivariate discordancy to fisheries data sets
Stapanian, M.A.; Kocovsky, P.M.; Garner, F.C.
2008-01-01
The generalized (Mahalanobis) distance and multivariate kurtosis are two powerful tests of multivariate discordancies (outliers). Unlike the generalized distance test, the multivariate kurtosis test has not been applied as a test of discordancy to fisheries data heretofore. We applied both tests, along with published algorithms for identifying suspected causal variable(s) of discordant observations, to two fisheries data sets from Lake Erie: total length, mass, and age from 1,234 burbot, Lota lota; and 22 combinations of unique subsets of 10 morphometrics taken from 119 yellow perch, Perca flavescens. For the burbot data set, the generalized distance test identified six discordant observations and the multivariate kurtosis test identified 24 discordant observations. In contrast with the multivariate tests, the univariate generalized distance test identified no discordancies when applied separately to each variable. Removing discordancies had a substantial effect on length-versus-mass regression equations. For 500-mm burbot, the percent difference in estimated mass after removing discordancies in our study was greater than the percent difference in masses estimated for burbot of the same length in lakes that differed substantially in productivity. The number of discordant yellow perch detected ranged from 0 to 2 with the multivariate generalized distance test and from 6 to 11 with the multivariate kurtosis test. With the kurtosis test, 108 yellow perch (90.7%) were identified as discordant in zero to two combinations, and five (4.2%) were identified as discordant in either all or 21 of the 22 combinations. The relationship among the variables included in each combination determined which variables were identified as causal. The generalized distance test identified between zero and six discordancies when applied separately to each variable. Removing the discordancies found in at least one-half of the combinations (k=5) had a marked effect on a principal components analysis. In particular, the percent of the total variation explained by second and third principal components, which explain shape, increased by 52 and 44% respectively when the discordancies were removed. Multivariate applications of the tests have numerous ecological advantages over univariate applications, including improved management of fish stocks and interpretation of multivariate morphometric data. ?? 2007 Springer Science+Business Media B.V.
A Multivariate Model of Parent-Adolescent Relationship Variables in Early Adolescence
ERIC Educational Resources Information Center
McKinney, Cliff; Renk, Kimberly
2011-01-01
Given the importance of predicting outcomes for early adolescents, this study examines a multivariate model of parent-adolescent relationship variables, including parenting, family environment, and conflict. Participants, who completed measures assessing these variables, included 710 culturally diverse 11-14-year-olds who were attending a middle…
Hu, Chuanpu; Zhou, Honghui
2016-02-01
Improving the quality of exposure-response modeling is important in clinical drug development. The general joint modeling of multiple endpoints is made possible in part by recent progress on the latent variable indirect response (IDR) modeling for ordered categorical endpoints. This manuscript aims to investigate, when modeling a continuous and a categorical clinical endpoint, the level of improvement achievable by joint modeling in the latent variable IDR modeling framework through the sharing of model parameters for the individual endpoints, guided by the appropriate representation of drug and placebo mechanism. This was illustrated with data from two phase III clinical trials of intravenously administered mAb X for the treatment of rheumatoid arthritis, with the 28-joint disease activity score (DAS28) and 20, 50, and 70% improvement in the American College of Rheumatology (ACR20, ACR50, and ACR70) disease severity criteria were used as efficacy endpoints. The joint modeling framework led to a parsimonious final model with reasonable performance, evaluated by visual predictive check. The results showed that, compared with the more common approach of separately modeling the endpoints, it is possible for the joint model to be more parsimonious and yet better describe the individual endpoints. In particular, the joint model may better describe one endpoint through subject-specific random effects that would not have been estimable from data of this endpoint alone.
NASA Astrophysics Data System (ADS)
Juesas, P.; Ramasso, E.
2016-12-01
Condition monitoring aims at ensuring system safety which is a fundamental requirement for industrial applications and that has become an inescapable social demand. This objective is attained by instrumenting the system and developing data analytics methods such as statistical models able to turn data into relevant knowledge. One difficulty is to be able to correctly estimate the parameters of those methods based on time-series data. This paper suggests the use of the Weighted Distribution Theory together with the Expectation-Maximization algorithm to improve parameter estimation in statistical models with latent variables with an application to health monotonic under uncertainty. The improvement of estimates is made possible by incorporating uncertain and possibly noisy prior knowledge on latent variables in a sound manner. The latent variables are exploited to build a degradation model of dynamical system represented as a sequence of discrete states. Examples on Gaussian Mixture Models, Hidden Markov Models (HMM) with discrete and continuous outputs are presented on both simulated data and benchmarks using the turbofan engine datasets. A focus on the application of a discrete HMM to health monitoring under uncertainty allows to emphasize the interest of the proposed approach in presence of different operating conditions and fault modes. It is shown that the proposed model depicts high robustness in presence of noisy and uncertain prior.
Crawford, John R; Henry, Julie D
2003-06-01
To provide UK normative data for the Depression Anxiety and Stress Scale (DASS) and test its convergent, discriminant and construct validity. Cross-sectional, correlational and confirmatory factor analysis (CFA). The DASS was administered to a non-clinical sample, broadly representative of the general adult UK population (N = 1,771) in terms of demographic variables. Competing models of the latent structure of the DASS were derived from theoretical and empirical sources and evaluated using confirmatory factor analysis. Correlational analysis was used to determine the influence of demographic variables on DASS scores. The convergent and discriminant validity of the measure was examined through correlating the measure with two other measures of depression and anxiety (the HADS and the sAD), and a measure of positive and negative affectivity (the PANAS). The best fitting model (CFI =.93) of the latent structure of the DASS consisted of three correlated factors corresponding to the depression, anxiety and stress scales with correlated error permitted between items comprising the DASS subscales. Demographic variables had only very modest influences on DASS scores. The reliability of the DASS was excellent, and the measure possessed adequate convergent and discriminant validity Conclusions: The DASS is a reliable and valid measure of the constructs it was intended to assess. The utility of this measure for UK clinicians is enhanced by the provision of large sample normative data.
Malm, Christer B.; Khoo, Nelson S.; Granlund, Irene; Lindstedt, Emilia; Hult, Andreas
2016-01-01
The discovery of erythropoietin (EPO) simplified blood doping in sports, but improved detection methods, for EPO has forced cheating athletes to return to blood transfusion. Autologous blood transfusion with cryopreserved red blood cells (RBCs) is the method of choice, because no valid method exists to accurately detect such event. In endurance sports, it can be estimated that elite athletes improve performance by up to 3% with blood doping, regardless of method. Valid detection methods for autologous blood doping is important to maintain credibility of athletic performances. Recreational male (N = 27) and female (N = 11) athletes served as Transfusion (N = 28) and Control (N = 10) subjects in two different transfusion settings. Hematological variables and physical performance were measured before donation of 450 or 900 mL whole blood, and until four weeks after re-infusion of the cryopreserved RBC fraction. Blood was analyzed for transferrin, iron, Hb, EVF, MCV, MCHC, reticulocytes, leucocytes and EPO. Repeated measures multivariate analysis of variance (MANOVA) and pattern recognition using Principal Component Analysis (PCA) and Orthogonal Projections of Latent Structures (OPLS) discriminant analysis (DA) investigated differences between Control and Transfusion groups over time. Significant increase in performance (15 ± 8%) and VO2max (17 ± 10%) (mean ± SD) could be measured 48 h after RBC re-infusion, and remained increased for up to four weeks in some subjects. In total, 533 blood samples were included in the study (Clean = 220, Transfused = 313). In response to blood transfusion, the largest change in hematological variables occurred 48 h after blood donation, when Control and Transfused groups could be separated with OPLS-DA (R2 = 0.76/Q2 = 0.59). RBC re-infusion resulted in the best model (R2 = 0.40/Q2 = 0.10) at the first sampling point (48 h), predicting one false positive and one false negative. Over all, a 25% and 86% false positives ratio was achieved in two separate trials. In conclusions, autologous re-infusion of RBCs increased VO2max and performance as hypothesized, but hematological profiling by multivariate statistics could not reach the WADA stipulated false positive ratio of <0.001% at any time point investigated. A majority of samples remained within limits of normal individual variation at all times. PMID:27284981
2017-04-30
practices in latent variable theory, it is not surprising that effective measurement programs present methodological typing and considering of experimental ...7 3.3 Methodology ...8 Revised Enterprise Modeling Methodology ................................................................ 128 9 Conclusions
Sleep schedules and school performance in Indigenous Australian children.
Blunden, Sarah; Magee, Chris; Attard, Kelly; Clarkson, Larissa; Caputi, Peter; Skinner, Timothy
2018-04-01
Sleep duration and sleep schedule variability have been related to negative health and well-being outcomes in children, but little is known about Australian Indigenous children. Data for children aged 7-9 years came from the Australian Longitudinal Study of Indigenous Children and the National Assessment Program-Literacy and Numeracy (NAPLAN). Latent class analysis determined sleep classes taking into account sleep duration, bedtimes, waketimes, and variability in bedtimes from weekdays to weekends. Regression models tested whether the sleep classes were cross-sectionally associated with grade 3 NAPLAN scores. Latent change score modeling then examined whether the sleep classes predicted changes in NAPLAN performance from grades 3 to 5. Five sleep schedule classes were identified: normative sleep, early risers, long sleep, variable sleep, and short sleep. Overall, long sleepers performed best, with those with reduced sleep (short sleepers and early risers) performing the worse on grammar, numeracy, and writing performance. Latent change score results also showed that long sleepers performed best in spelling and writing and short sleepers and typical sleepers performed the worst over time. In this sample of Australian Indigenous children, short sleep was associated with poorer school performance compared with long sleep, with this performance worsening over time for some performance indicators. Other sleep schedules (eg, early wake times and variable sleep) also had some relationships with school performance. As sleep scheduling is modifiable, this offers opportunity for improvement in sleep and thus performance outcomes for these and potentially all children. Copyright © 2018 National Sleep Foundation. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Koskela, J. J.; Croke, B. W. F.; Koivusalo, H.; Jakeman, A. J.; Kokkonen, T.
2012-11-01
Bayesian inference is used to study the effect of precipitation and model structural uncertainty on estimates of model parameters and confidence limits of predictive variables in a conceptual rainfall-runoff model in the snow-fed Rudbäck catchment (142 ha) in southern Finland. The IHACRES model is coupled with a simple degree day model to account for snow accumulation and melt. The posterior probability distribution of the model parameters is sampled by using the Differential Evolution Adaptive Metropolis (DREAM(ZS)) algorithm and the generalized likelihood function. Precipitation uncertainty is taken into account by introducing additional latent variables that were used as multipliers for individual storm events. Results suggest that occasional snow water equivalent (SWE) observations together with daily streamflow observations do not contain enough information to simultaneously identify model parameters, precipitation uncertainty and model structural uncertainty in the Rudbäck catchment. The addition of an autoregressive component to account for model structure error and latent variables having uniform priors to account for input uncertainty lead to dubious posterior distributions of model parameters. Thus our hypothesis that informative priors for latent variables could be replaced by additional SWE data could not be confirmed. The model was found to work adequately in 1-day-ahead simulation mode, but the results were poor in the simulation batch mode. This was caused by the interaction of parameters that were used to describe different sources of uncertainty. The findings may have lessons for other cases where parameterizations are similarly high in relation to available prior information.
Medical University admission test: a confirmatory factor analysis of the results.
Luschin-Ebengreuth, Marion; Dimai, Hans P; Ithaler, Daniel; Neges, Heide M; Reibnegger, Gilbert
2016-05-01
The Graz Admission Test has been applied since the academic year 2006/2007. The validity of the Test was demonstrated by a significant improvement of study success and a significant reduction of dropout rate. The purpose of this study was a detailed analysis of the internal correlation structure of the various components of the Graz Admission Test. In particular, the question investigated was whether or not the various test parts constitute a suitable construct which might be designated as "Basic Knowledge in Natural Science." This study is an observational investigation, analyzing the results of the Graz Admission Test for the study of human medicine and dentistry. A total of 4741 applicants were included in the analysis. Principal component factor analysis (PCFA) as well as techniques from structural equation modeling, specifically confirmatory factor analysis (CFA), were employed to detect potential underlying latent variables governing the behavior of the measured variables. PCFA showed good clustering of the science test parts, including also text comprehension. A putative latent variable "Basic Knowledge in Natural Science," investigated by CFA, was indeed shown to govern the response behavior of the applicants in biology, chemistry, physics, and mathematics as well as text comprehension. The analysis of the correlation structure of the various test parts confirmed that the science test parts together with text comprehension constitute a satisfactory instrument for measuring a latent construct variable "Basic Knowledge in Natural Science." The present results suggest the fundamental importance of basic science knowledge for results obtained in the framework of the admission process for medical universities.
Piecewise multivariate modelling of sequential metabolic profiling data.
Rantalainen, Mattias; Cloarec, Olivier; Ebbels, Timothy M D; Lundstedt, Torbjörn; Nicholson, Jeremy K; Holmes, Elaine; Trygg, Johan
2008-02-19
Modelling the time-related behaviour of biological systems is essential for understanding their dynamic responses to perturbations. In metabolic profiling studies, the sampling rate and number of sampling points are often restricted due to experimental and biological constraints. A supervised multivariate modelling approach with the objective to model the time-related variation in the data for short and sparsely sampled time-series is described. A set of piecewise Orthogonal Projections to Latent Structures (OPLS) models are estimated, describing changes between successive time points. The individual OPLS models are linear, but the piecewise combination of several models accommodates modelling and prediction of changes which are non-linear with respect to the time course. We demonstrate the method on both simulated and metabolic profiling data, illustrating how time related changes are successfully modelled and predicted. The proposed method is effective for modelling and prediction of short and multivariate time series data. A key advantage of the method is model transparency, allowing easy interpretation of time-related variation in the data. The method provides a competitive complement to commonly applied multivariate methods such as OPLS and Principal Component Analysis (PCA) for modelling and analysis of short time-series data.
Many multivariate methods are used in describing and predicting relation; each has its unique usage of categorical and non-categorical data. In multivariate analysis of variance (MANOVA), many response variables (y's) are related to many independent variables that are categorical...
Delwiche, Stephen R; Reeves, James B
2010-01-01
In multivariate regression analysis of spectroscopy data, spectral preprocessing is often performed to reduce unwanted background information (offsets, sloped baselines) or accentuate absorption features in intrinsically overlapping bands. These procedures, also known as pretreatments, are commonly smoothing operations or derivatives. While such operations are often useful in reducing the number of latent variables of the actual decomposition and lowering residual error, they also run the risk of misleading the practitioner into accepting calibration equations that are poorly adapted to samples outside of the calibration. The current study developed a graphical method to examine this effect on partial least squares (PLS) regression calibrations of near-infrared (NIR) reflection spectra of ground wheat meal with two analytes, protein content and sodium dodecyl sulfate sedimentation (SDS) volume (an indicator of the quantity of the gluten proteins that contribute to strong doughs). These two properties were chosen because of their differing abilities to be modeled by NIR spectroscopy: excellent for protein content, fair for SDS sedimentation volume. To further demonstrate the potential pitfalls of preprocessing, an artificial component, a randomly generated value, was included in PLS regression trials. Savitzky-Golay (digital filter) smoothing, first-derivative, and second-derivative preprocess functions (5 to 25 centrally symmetric convolution points, derived from quadratic polynomials) were applied to PLS calibrations of 1 to 15 factors. The results demonstrated the danger of an over reliance on preprocessing when (1) the number of samples used in a multivariate calibration is low (<50), (2) the spectral response of the analyte is weak, and (3) the goodness of the calibration is based on the coefficient of determination (R(2)) rather than a term based on residual error. The graphical method has application to the evaluation of other preprocess functions and various types of spectroscopy data.
Obtaining systematic teacher reports of disruptive behavior disorders utilizing DSM-IV.
Wolraich, M L; Feurer, I D; Hannah, J N; Baumgaertel, A; Pinnock, T Y
1998-04-01
This study examines the psychometric properties of the Vanderbilt AD/HD Diagnostic Teacher Rating Scale (VADTRS) and provides preliminary normative data from a large, geographically defined population. The VADTRS consists of the complete list of DSM-IV AD/HD symptoms, a screen for other disruptive behavior disorders, anxiety and depression, and ratings of academic and classroom behavior performance. Teachers in one suburban county completed the scale for their students during 2 consecutive years. Statistical methods included (a) exploratory and confirmatory latent variable analyses of item data, (b) evaluation of the internal consistency of the latent dimensions, (c) evaluation of latent structure concordance between school year samples, and (d) preliminary evaluation of criterion-related validity. The instrument comprises four behavioral dimensions and two performance dimensions. The behavioral dimensions were concordant between school years and were consistent with a priori DSM-IV diagnostic criteria. Correlations between latent dimensions and relevant, known disorders or problems varied from .25 to .66.
An IRT Model with a Parameter-Driven Process for Change
ERIC Educational Resources Information Center
Rijmen, Frank; De Boeck, Paul; van der Maas, Han L. J.
2005-01-01
An IRT model with a parameter-driven process for change is proposed. Quantitative differences between persons are taken into account by a continuous latent variable, as in common IRT models. In addition, qualitative inter-individual differences and auto-dependencies are accounted for by assuming within-subject variability with respect to the…
ERIC Educational Resources Information Center
Danner, Daniel; Hagemann, Dirk; Schankin, Andrea; Hager, Marieke; Funke, Joachim
2011-01-01
The present study investigated cognitive performance measures beyond IQ. In particular, we investigated the psychometric properties of dynamic decision making variables and implicit learning variables and their relation with general intelligence and professional success. N = 173 employees from different companies and occupational groups completed…
Characterizing Student Expectations: A Small Empirical Study
ERIC Educational Resources Information Center
Warwick, Jonathan
2016-01-01
This paper describes the results of a small empirical study (n = 130), in which undergraduate students in the Business Faculty of a UK university were asked to express views and expectations relating to the study of a mathematics. Factor analysis is used to identify latent variables emerging from clusters of the measured variables and these are…
Adolescent Substance Use, Sleep, and Academic Achievement: Evidence of Harm Due to Caffeine
ERIC Educational Resources Information Center
James, Jack E.; Kristjansson, Alfgeir Logi; Sigfusdottir, Inga Dora
2011-01-01
Using academic achievement as the key outcome variable, 7377 Icelandic adolescents were surveyed for cigarette smoking, alcohol use, daytime sleepiness, caffeine use, and potential confounders. Structural equation modeling (SEM) was used to examine direct and indirect effects of measured and latent variables in two models: the first with caffeine…
Chakraborty, Sutirtha
2018-05-26
RNA-Seq technology has revolutionized the face of gene expression profiling by generating read count data measuring the transcript abundances for each queried gene on multiple experimental subjects. But on the downside, the underlying technical artefacts and hidden biological profiles of the samples generate a wide variety of latent effects that may potentially distort the actual transcript/gene expression signals. Standard normalization techniques fail to correct for these hidden variables and lead to flawed downstream analyses. In this work I demonstrate the use of Partial Least Squares (built as an R package 'SVAPLSseq') to correct for the traces of extraneous variability in RNA-Seq data. A novel and thorough comparative analysis of the PLS based method is presented along with some of the other popularly used approaches for latent variable correction in RNA-Seq. Overall, the method is found to achieve a substantially improved estimation of the hidden effect signatures in the RNA-Seq transcriptome expression landscape compared to other available techniques. Copyright © 2017. Published by Elsevier Inc.
Bornstein, Marc H.; Hahn, Chun-Shin; Putnick, Diane L.; Suwalsky, Joan T. D.
2014-01-01
This four-wave prospective longitudinal study evaluated stability of language in 324 children from early childhood to adolescence. Structural equation modeling supported loadings of multiple age-appropriate multi-source measures of child language on single-factor core language skills at 20 months and 4, 10, and 14 years. Large stability coefficients (standardized indirect effect = .46) were obtained between language latent variables from early childhood to adolescence and accounting for child nonverbal intelligence and social competence and maternal verbal intelligence, education, speech, and social desirability. Stability coefficients were similar for girls and boys. Stability of core language skill was stronger from 4 to 10 to 14 years than from 20 months to 4 years, so early intervention to improve lagging language is recommended. PMID:25165797
CORRECTING FOR MEASUREMENT ERROR IN LATENT VARIABLES USED AS PREDICTORS*
Schofield, Lynne Steuerle
2015-01-01
This paper represents a methodological-substantive synergy. A new model, the Mixed Effects Structural Equations (MESE) model which combines structural equations modeling and item response theory is introduced to attend to measurement error bias when using several latent variables as predictors in generalized linear models. The paper investigates racial and gender disparities in STEM retention in higher education. Using the MESE model with 1997 National Longitudinal Survey of Youth data, I find prior mathematics proficiency and personality have been previously underestimated in the STEM retention literature. Pre-college mathematics proficiency and personality explain large portions of the racial and gender gaps. The findings have implications for those who design interventions aimed at increasing the rates of STEM persistence among women and under-represented minorities. PMID:26977218
Klapper, Regina; Kochmann, Judith; O’Hara, Robert B.; Karl, Horst; Kuhn, Thomas
2016-01-01
The use of parasites as biological tags for discrimination of fish stocks has become a commonly used approach in fisheries management. Metazoan parasite community analysis and anisakid nematode population genetics based on a mitochondrial cytochrome marker were applied in order to assess the usefulness of the two parasitological methods for stock discrimination of beaked redfish Sebastes mentella of three fishing grounds in the North East Atlantic. Multivariate, model-based approaches demonstrated that the metazoan parasite fauna of beaked redfish from East Greenland differed from Tampen, northern North Sea, and Bear Island, Barents Sea. A joint model (latent variable model) was used to estimate the effects of covariates on parasite species and identified four parasite species as main source of differences among fishing grounds; namely Chondracanthus nodosus, Anisakis simplex s.s., Hysterothylacium aduncum, and Bothriocephalus scorpii. Due to its high abundance and differences between fishing grounds, Anisakis simplex s.s. was considered as a major biological tag for host stock differentiation. Whilst the sole examination of Anisakis simplex s.s. on a population genetic level is only of limited use, anisakid nematodes (in particular, A. simplex s.s.) can serve as biological tags on a parasite community level. This study confirmed the use of multivariate analyses as a tool to evaluate parasite infra-communities and to identify parasite species that might serve as biological tags. The present study suggests that S. mentella in the northern North Sea and Barents Sea is not sub-structured. PMID:27104735
Thomas, Jennifer J; Eddy, Kamryn T; Ruscio, John; Ng, King Lam; Casale, Kristen E; Becker, Anne E; Lee, Sing
2015-05-01
We examined whether empirically derived eating disorder (ED) categories in Hong Kong Chinese patients (N = 454) would be consistent with recognizable lifetime ED phenotypes derived from latent structure models of European and American samples. We performed latent profile analysis (LPA) using indicator variables from data collected during routine assessment, and then applied taxometric analysis to determine whether latent classes were qualitatively versus quantitatively distinct. Latent profile analysis identified four classes: (i) binge/purge (47%); (ii) non-fat-phobic low-weight (34%); (iii) fat-phobic low-weight (12%); and (iv) overweight disordered eating (6%). Taxometric analysis identified qualitative (categorical) distinctions between the binge/purge and non-fat-phobic low-weight classes, and also between the fat-phobic and non-fat-phobic low-weight classes. Distinctions between the fat-phobic low-weight and binge/purge classes were indeterminate. Empirically derived categories in Hong Kong showed recognizable correspondence with recognizable lifetime ED phenotypes. Although taxometric findings support two distinct classes of low weight EDs, LPA findings also support heterogeneity among non-fat-phobic individuals. Copyright © 2015 John Wiley & Sons, Ltd and Eating Disorders Association.